shayne-longpre comments

Results: 16 comments of shayne-longpre

Pin versions for flan/v2/requirements.txt

@takiholadi I'd recommend updating to the latest nightly versions of seqio and tfds as bugs in their older versions affected our repo. I will look into what the best stable...

Reproducing the flan_v2 results of T5-xl

@danczs Thanks for the question. A couple of thoughts: * I'm surprised that the SirNeural version did so well. We recommended Enrico's version because it properly applies the dataset...

Reproducing the flan_v2 results of T5-xl

@danczs Hmm, I'm not sure why it was so low. I noticed that a few recent papers seem to have gotten strong results with a 100k sample of the training...

[Question] What environment did you use for fetching large data set like dialog_mixture

@quq99 It is quite memory-intensive. We ran it a while ago internally on Google infrastructure, so unfortunately I don't have specific numbers, but in terms of compute it should...

[Question] What environment did you use for fetching large data set like dialog_mixture

You can also now manually download the Dialog submixture (and the others) -- see the new README! :)

How to cache my mixture

Here are some resources, and there should be more info in the documentation: https://github.com/google/seqio#optional-offline-caching. This caching applies if you are using the same vocabulary as T5. If you want...

[Question] What license is used for this FLAN dataset(not the code).

@quq99 Good question. As the Flan Collection (or P3, or Natural Instructions v2) is a compilation of hundreds of different datasets, with many different licenses, the rendered data would not...

[Question] What license is used for this FLAN dataset(not the code).

@quq99 Update: we plan to release this in the last week of May.

[Question] What license is used for this FLAN dataset(not the code).

@balachandarsv Apologies again for the wait on this. It turns out license labelling is much more complex than we had originally anticipated. It has gone from a side project into...

shayne-longpre

Pin versions for flan/v2/requirements.txt

Reproducing the flan_v2 results of T5-xl

Code for mixing Enrico sets?

[Question] What environment did you use for fetching large data set like dialog_mixture

How to cache my mixture

[Question] What license is used for this FLAN dataset(not the code).
