torchscale issues

torchscale 0.3.0 does not include LongNet

Hi! torchscale 0.3.0 does not include LongNet. When will a new version with LongNet be released?

train.py: error: unrecognized arguments: --flash-attention --segment-length [2048,4096] --dilated-ratio [1,2]

(torchscale) yehuicheng@bdp-gpu04:~/torchscale/examples/fairseq$ torchrun --nproc_per_node=8 --master_port 29501 --nnodes=1 train.py /home/data/dataset/yehuicheng/LongNet_example/DNA_example/longnet_example --num-workers 0 --activation-fn gelu --share-decoder-input-output-embed --validate-interval-updates 1000 --save-interval-updates 1000 --no-epoch-checkpoints --memory-efficient-fp16 --fp16-init-scale 4 --arch transformer --task language_modeling --sample-break-mode none --tokens-per-sample 4096...

github2657529567

No module named 'sentencepiece'

I try the script :Breadcrumbs[torchscale](https://github.com/microsoft/torchscale/tree/main)/[examples](https://github.com/microsoft/torchscale/tree/main/examples) LongNet Model,but meet issue: /fairseq/(torchscale) :~/data/results/fairseq$ torchrun --nproc_per_node=8 --master_port 29501 --nnodes=1 train.py /home/data/dataset/yehuicheng/LongNet_example/DNA_example/longnet_example --num-workers 0 --activation-fn gelu --share-decoder-input-output-embed --validate-interval-updates 1000 --save-interval-updates 1000 --no-epoch-checkpoints --memory-efficient-fp16 --fp16-init-scale...

github2657529567

torchscale
torchscale copied to clipboard

Metadata

torchscale 0.3.0 does not include LongNet

train.py: error: unrecognized arguments: --flash-attention --segment-length [2048,4096] --dilated-ratio [1,2]

No module named 'sentencepiece'

← Metadata

Owner

Metadata

torchscale torchscale copied to clipboard

Metadata

torchscale 0.3.0 does not include LongNet

train.py: error: unrecognized arguments: --flash-attention --segment-length [2048,4096] --dilated-ratio [1,2]

No module named 'sentencepiece'

← Metadata

Owner

Metadata

torchscale
torchscale copied to clipboard