gmryu comments

66 comments by gmryu

how to fix the KeyError: 'prev_output_tokens'

@theamato The error does not come from "train". It happens in "preprocess". You need to check how you prepared your data.

How to run FSDP on multiple nodes?

I believe there is no option for "sharding the model within one node and training across multiple nodes with data parallelism." That said, I would say sharding the model over all nodes...

How to train a single model over multiple datasets

I believe stock fairseq does not currently provide such a feature, certainly not from the command line. As for the implementation, if your data is not that huge (size is...

How to train a single model over multiple datasets

@martianmartina I guess your solution is not bad. (Though I do not understand what you mean by "incorporating to target sentences.") At first glance, I would have a new argument for...

How to train a single model over multiple datasets

@martianmartina I do not know if you still need my help. Sorry, I am having trouble understanding your implementation. `prev_output_tokens` is the same as `target`, while `prev_output_tokens` are passed to...
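To illustrate how `prev_output_tokens` and `target` relate, here is a minimal sketch of the usual teacher-forcing convention: both hold the same tokens, but the decoder input is the target shifted right by one position, with the end-of-sentence id moved to the front. The helper name `shift_right` and the id `EOS = 2` are assumptions made for this example, not fairseq API, and a real batch would also need padding.

```python
def shift_right(target, eos):
    """Build a decoder input from a target sequence that ends with `eos`.

    The trailing EOS id is moved to the front, so position i of the result
    holds the token the decoder sees when it has to predict target[i].
    """
    if not target or target[-1] != eos:
        raise ValueError("target is expected to end with the EOS id")
    return [eos] + target[:-1]


EOS = 2  # assumed end-of-sentence id, for illustration only
target = [11, 12, 13, EOS]

# Same tokens as `target`, shifted one step to the right.
prev_output_tokens = shift_right(target, EOS)
print(prev_output_tokens)  # [2, 11, 12, 13]
```

If `prev_output_tokens` is missing from a sample, a KeyError like the one in the first thread above would be a plausible symptom, which is one reason to check how the data was prepared.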

How to train a single model over multiple datasets

@martianmartina Okay, I ran into the same problem with `IndexedCachedDataset`, and I chose to ignore it and use a list instead. It is very brave and cool of you to use those...