BMTrain icon indicating copy to clipboard operation
BMTrain copied to clipboard

Efficient Training (including pre-training and fine-tuning) for Big Models

Results 24 BMTrain issues
Sort by recently updated
recently updated
newest added

(cpm) D:\GitHub\BMTrain>python setup.py install running install C:\ProgramData\Anaconda3\envs\cpm\lib\site-packages\setuptools\command\install.py:37: SetuptoolsDeprecationWarning: setup.py install is deprecated. Use build and pip and other standards-based tools. setuptools.SetuptoolsDeprecationWarning, C:\ProgramData\Anaconda3\envs\cpm\lib\site-packages\setuptools\command\easy_install.py:147: EasyInstallDeprecationWarning: easy_install command is deprecated. Use build and...

question

Existing `TransformerBlockList` cannot output the hidden states and attention scores for each transformer layer. Sometimes we want to get the hiddens and attention scores to conduct analysis and feed them...

enhancement

## Pull Request Template ### Description [WIP] Update doc and notes for BMTrain. ### Type of Change - [ ] Bug fix (non-breaking change which fixes an issue) - [...

## 1F1B Pipeline schedule ### Description We have implemented 1f1b pipeline schedule based on our pointTopoint ops. ### Type of Change - [ ] Bug fix (non-breaking change which fixes...

## Pull Request Template ### Issue Reference Please mention the issue number if applicable, or write "N/A" if it's a new feature. Issue #... ### Description Please describe your changes...

Revise Spinix docs and add some suggestions