keras-nlp
Modular Natural Language Processing workflows with Keras
I'm interested in contributing scripts that let users apply data augmentation techniques directly, without relying on external libraries. I can start with techniques such as synonym replacement, random insertion, random swap,...
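For illustration, here is a minimal sketch of one such technique (random swap) operating on plain Python strings; the function name and signature are hypothetical, not an existing KerasNLP API:

```python
import random

def random_swap(text, num_swaps=1, seed=None):
    """Hypothetical helper: randomly swap pairs of words in `text`.

    A lightweight augmentation from the EDA family: pick two positions
    at random and exchange the words at those positions.
    """
    rng = random.Random(seed)
    words = text.split()
    for _ in range(num_swaps):
        if len(words) < 2:
            break
        i, j = rng.sample(range(len(words)), 2)
        words[i], words[j] = words[j], words[i]
    return " ".join(words)

print(random_swap("the quick brown fox jumps", num_swaps=2, seed=42))
```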
We should add an integration test that runs an actual, limited training job for TransformerEncoder/TransformerDecoder, and possibly also uses a tokenizer and position embedding.
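A rough sketch of what such a test could exercise, using the public keras_nlp layers; the synthetic data, dimensions, and assertion are placeholders, not a proposal for the final test:

```python
import numpy as np
from tensorflow import keras
import keras_nlp

VOCAB_SIZE, SEQ_LEN, BATCH = 1000, 16, 8

# Tiny random dataset standing in for real data.
x = np.random.randint(0, VOCAB_SIZE, size=(BATCH * 4, SEQ_LEN))
y = np.random.randint(0, 2, size=(BATCH * 4,))

inputs = keras.Input(shape=(SEQ_LEN,), dtype="int32")
# Token + position embedding, then a single encoder block.
h = keras_nlp.layers.TokenAndPositionEmbedding(
    vocabulary_size=VOCAB_SIZE,
    sequence_length=SEQ_LEN,
    embedding_dim=32,
)(inputs)
h = keras_nlp.layers.TransformerEncoder(intermediate_dim=64, num_heads=2)(h)
outputs = keras.layers.Dense(2, activation="softmax")(
    keras.layers.GlobalAveragePooling1D()(h)
)
model = keras.Model(inputs, outputs)
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")

# The "limited training job": a couple of epochs on tiny data, then
# check the training losses stayed finite.
history = model.fit(x, y, batch_size=BATCH, epochs=2, verbose=0)
assert np.isfinite(history.history["loss"]).all()
```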
**Is your feature request related to a problem? Please describe.** This idea comes from a best-paper winner at NeurIPS 2021: [MAUVE: Measuring the Gap Between Neural Text and Human Text...
NLP papers often compare against baselines, and having a prebuilt random encoder could help with that. A random encoder is similar to a simple encoder with a slight difference here...
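One plausible reading of "random encoder" is a randomly initialized, frozen encoder used as a baseline; a minimal sketch under that assumption (the class name and wrapping approach are hypothetical):

```python
from tensorflow import keras
import keras_nlp

class RandomEncoder(keras.layers.Layer):
    """Hypothetical baseline: a randomly initialized, frozen encoder.

    Weights are drawn at build time and never updated, so downstream
    scores isolate the contribution of the trained layers above it.
    """

    def __init__(self, intermediate_dim, num_heads, **kwargs):
        super().__init__(**kwargs)
        self.encoder = keras_nlp.layers.TransformerEncoder(
            intermediate_dim=intermediate_dim, num_heads=num_heads
        )
        self.encoder.trainable = False  # freeze the random weights

    def call(self, inputs):
        return self.encoder(inputs)
```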
@mattdangerw and the keras-nlp team: For standard classification metrics (AUC, F1, Precision, Recall, Accuracy, etc.), [keras.metrics](https://keras.io/api/metrics/) can be used. But there are several NLP-specific metrics that could be implemented here,...
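As one concrete example, perplexity can be written as a stateful `keras.metrics.Metric` subclass; this is a sketch of the idea, not the eventual KerasNLP API:

```python
import tensorflow as tf
from tensorflow import keras

class Perplexity(keras.metrics.Metric):
    """Sketch: perplexity as exp(mean cross-entropy) over batches."""

    def __init__(self, name="perplexity", **kwargs):
        super().__init__(name=name, **kwargs)
        self.total_ce = self.add_weight(name="total_ce", initializer="zeros")
        self.count = self.add_weight(name="count", initializer="zeros")

    def update_state(self, y_true, y_pred, sample_weight=None):
        # y_true: integer token ids; y_pred: per-token probabilities.
        ce = keras.losses.sparse_categorical_crossentropy(y_true, y_pred)
        self.total_ce.assign_add(tf.reduce_sum(ce))
        self.count.assign_add(tf.cast(tf.size(ce), self.dtype))

    def result(self):
        return tf.exp(self.total_ce / self.count)
```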
We can add a few examples:

- **Token Classification with BERT**
  - **Dataset:** CoNLL 2003
  - **What's different?** Here, we have to classify every word into its NER type. However, since BERT...
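A rough outline of the modeling piece for the token-classification example, using a plain encoder in place of BERT for brevity; all dimensions are placeholders, not the final example code:

```python
from tensorflow import keras
import keras_nlp

NUM_TAGS, SEQ_LEN, VOCAB_SIZE = 9, 128, 30522  # 9 NER tags under a BIO scheme

inputs = keras.Input(shape=(SEQ_LEN,), dtype="int32")
h = keras_nlp.layers.TokenAndPositionEmbedding(
    vocabulary_size=VOCAB_SIZE, sequence_length=SEQ_LEN, embedding_dim=128
)(inputs)
h = keras_nlp.layers.TransformerEncoder(intermediate_dim=256, num_heads=4)(h)
# One prediction per token: output shape (batch, SEQ_LEN, NUM_TAGS).
outputs = keras.layers.Dense(NUM_TAGS, activation="softmax")(h)
model = keras.Model(inputs, outputs)
```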
We would like to use type annotations in KerasNLP. We should add them to the BERT example code in https://github.com/keras-team/keras-nlp/tree/master/examples/bert
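For example, annotations in the style we might adopt (the helper below is purely illustrative, not code from examples/bert):

```python
from typing import List

def split_into_chunks(text: str, max_length: int = 512) -> List[str]:
    """Hypothetical helper showing the annotation style."""
    return [text[i : i + max_length] for i in range(0, len(text), max_length)]
```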
This PR is a rework of https://github.com/keras-team/keras-nlp/pull/303. The PR was recreated, rather than edited directly, for clearer remote-local tracking.
# Proposal

In #387 we allowed construction of a BERT model from a "preset" model architecture and weights; for example, `Bert.from_preset("bert_base_uncased_en")`. I propose doing the same with `BertPreprocessor`, automatically...
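Under this proposal, usage might look like the following sketch; the `keras_nlp.models` module paths are assumptions, and the preprocessor would pull the vocabulary and packing defaults matching the preset so it always agrees with the model weights:

```python
import keras_nlp

# Hypothetical usage of the proposed API; module paths are assumptions.
preprocessor = keras_nlp.models.BertPreprocessor.from_preset(
    "bert_base_uncased_en"
)
model = keras_nlp.models.Bert.from_preset("bert_base_uncased_en")

features = preprocessor(["The quick brown fox jumped."])
outputs = model(features)
```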