matmulfreellm
matmulfreellm copied to clipboard
how to train or ft?
0%| | 0/1250 [00:00<?, ?it/s]loc("/mnt/anaconda3/envs/tf2/lib/python3.10/site-packages/mmfreelm-0.1-py3.10.egg/mmfreelm/ops/hgrn/recurrent_fuse.py":105:22): error: 'arith.addf' op requires the same encoding for all operands and results
Traceback (most recent call last):
File "/mnt/jicheng/uniem-main/mmfree/match_entity_number_mmfree.py", line 325, in
Hi, it seems that the triton compiling process failed, are you using CUDA devices to run it?
@dongjicheng @ridgerchu Which python file is executed first? What is this parameter set to? Would you be so kind as to say? Because when I look at the code all I see is a built-in module, there is only a "setup" file and a "generate" file. These two files are not working. I see that you are inquiring about fine-tuning and pre-training, so I would like to ask.
@ridgerchu As this project is highly relevant to my research topic, I would like to consult as much as possible. I would like to reproduce it.