Chen Qian

42 issues by Chen Qian

As pointed out by a user, the TensorBoard callback is doing blocking I/O, which means training is halted until the write finishes. This creates a performance bottleneck, especially when...
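One common fix is to move the writes off the training thread. Below is a minimal, framework-agnostic sketch of that idea; `write_summaries` is a hypothetical stand-in for the blocking TensorBoard write, not the actual callback code:

```python
import queue
import threading
import time

def write_summaries(step, logs):
    # Hypothetical stand-in for the blocking TensorBoard file write.
    time.sleep(0.1)

write_queue = queue.Queue()

def _writer_loop():
    while True:
        item = write_queue.get()
        if item is None:  # Sentinel tells the writer thread to exit.
            break
        write_summaries(*item)

writer = threading.Thread(target=_writer_loop, daemon=True)
writer.start()

# In the training loop: enqueue and return immediately instead of blocking.
for step in range(10):
    write_queue.put((step, {"loss": 0.1}))

write_queue.put(None)
writer.join()
```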

`char_to_token` is a method that converts a character index to the corresponding token index; see the HuggingFace method [here](https://huggingface.co/transformers/v3.2.0/main_classes/tokenizer.html#transformers.BatchEncoding.char_to_token). This is useful in span classification tasks, such as SQuAD, as we...
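For illustration, a minimal usage sketch (the model name and character index are chosen arbitrarily; `char_to_token` requires a "fast" tokenizer, which tracks character offsets):

```python
from transformers import BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
encoding = tokenizer("Where is the answer span?")

# Character 13 is the "a" in "answer"; map it to its token index.
token_index = encoding.char_to_token(13)
print(token_index)  # Index of the "answer" token, offset by specials like [CLS].
```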

type:feature

From a high level it is just a classification task, but there are some details to handle. The whole workflow can be described as:

- Data
  1. We can use...

Currently `TransformerDecoder` must have both `encoder_inputs` and `decoder_inputs`, while models like GPT-2 require only `decoder_inputs`. We should make a change to mark `encoder_inputs` as optional.
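A sketch of the intended usage after the change (constructor arguments follow the existing layer; the decoder-only call is the proposed behavior, not the current API):

```python
import tensorflow as tf
import keras_nlp

decoder = keras_nlp.layers.TransformerDecoder(intermediate_dim=64, num_heads=2)

# Decoder-only (GPT-2 style): pass only the decoder sequence. With no
# encoder inputs, the cross-attention step is skipped.
decoder_inputs = tf.random.uniform(shape=(2, 10, 32))
outputs = decoder(decoder_inputs)
```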

Currently the BERT example writes [custom code](https://github.com/keras-team/keras-nlp/blob/master/examples/bert/create_pretraining_data.py#L392) to generate the MLM mask, which is slow. We should replace it with the [MLMMaskGenerator](https://github.com/keras-team/keras-nlp/blob/master/keras_nlp/layers/mlm_mask_generator.py) layer.
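Roughly, the replacement would look like the sketch below; the ids are toy values, and the exact output keys may differ across layer versions:

```python
import tensorflow as tf
import keras_nlp

# Toy values for illustration; a real BERT setup would use its own
# vocabulary size and special token ids.
masker = keras_nlp.layers.MLMMaskGenerator(
    vocabulary_size=100,
    mask_selection_rate=0.15,
    mask_token_id=0,
    mask_selection_length=8,
)

outputs = masker(tf.constant([[5, 6, 7, 8, 9, 10]]))
# The output dict holds the masked token ids, the selected positions,
# and the original ids at those positions.
print(outputs)
```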

good first issue
contributions welcome

This is an issue for tracking the progress of the BERT training example. The model comes in different sizes: tiny, small, base, and large. Only tiny and small fit in a common...

type:feature

The current text generation implementation relies on `tf.concat`, which is not XLA-compatible. Instead we can use a pre-allocated buffer to hold the generated tokens. As we are currently using generation...
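A minimal sketch of the buffer idea (`generate` and `next_token_fn` are hypothetical names, not the actual implementation): each new token is written into a fixed-length tensor, so every shape in the loop stays static for XLA.

```python
import tensorflow as tf

def generate(next_token_fn, prompt, max_length):
    prompt_length = prompt.shape[0]
    # Pre-allocate a fixed-length buffer; its shape never changes.
    tokens = tf.pad(prompt, [[0, max_length - prompt_length]])
    for i in range(prompt_length, max_length):
        # The model sees the full buffer plus the current index, so every
        # op in the loop has a static shape.
        next_token = next_token_fn(tokens, i)
        # Write in place instead of growing the tensor with tf.concat.
        tokens = tf.tensor_scatter_nd_update(tokens, [[i]], [next_token])
    return tokens

# Toy usage: a stub "model" that just echoes the previous token plus one.
prompt = tf.constant([1, 2, 3])
print(generate(lambda toks, i: toks[i - 1] + 1, prompt, max_length=8))
```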

bug

This is not necessarily a bug, but I find it confusing. I tried to tokenize a sequence like `"[start] have a nice day"`, but it appears that with the default...

bug