peter-sk
peter-sk
# What does this PR do? It adds the class GPTNeoForTokenClassification, which allows using GPT Neo models for token classification tasks. The implementation follows the one for other models (such...
# What does this PR do? It adds the class GPTNeoXForTokenClassification, which allows using GPT NeoX models for token classification tasks. The implementation follows the one for other models (such...
# What does this PR do? Fixes # (issue) ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss the other checks...
# What does this PR do? Adds GPTNeoXForQuestionAnswering. Includes #23030 and #23057. ## Before submitting - [ ] This PR fixes a typo or improves the docs (you can dismiss...
# What does this PR do? Adds QA support for GPT Neo. Includes PR #23030. ## Before submitting - [ ] This PR fixes a typo or improves the docs...
…panish, swedish I hope this can be accepted, as I would gladly use clean-text also for my non-English/German text cleaning tasks :-)
Generating synthetic data is gaining increased attention, particularly in areas such as health data where data sharing is inhibited by sharp data protection laws (for good reasons). I propose to...
Cleaned-up version of https://github.com/bloc97/DeMo for integrating efficient distributed training a la Decoupled Monentum Optimization (https://arxiv.org/abs/2411.19870)