cipo lee comments

Results: 5 comments of cipo lee

xformers error: NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs

Same error during training. How can I turn off xformers?

xformers error: NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs

thanks!

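[Badcase]: With the same data, the fine-tuning loss on the qwen2.5 72B pretrained model is 3 times that on qwen2 72B. Apart from more training data, what else is different in 2.5?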

mark. When fine-tuning the 7b instruction model with LoRA, the loss is also somewhat higher than with qwen2.

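> I have been working with AI and Python for a long time, and your project has the worst technical documentation I have seen, bar none. For a project that has been around for so many years, that is hard to imagine.
>
> 1. Python itself is cross-platform, so an OCR project that needs docker or wsl is unreasonable to begin with.
> 2. It runs from the command line, yet the command-line arguments are not described in detail. You think you simplified the steps, but you did not: running it downloads a pile of assorted models onto the C drive.
> 3. You say you open-sourced a new SOTA model, yet nowhere in the documentation can I find where to configure the model location. Of all the open-source projects I know, this is the only one like that, bar none.
> 4. Everyone else uses pytorch. Only you are the exception, and you are not compatible with pytorch.
>
> I saw the publicity saying the OCR was impressive and wanted to download it and try it. The model did download, but after a long time I still could not work out where to configure it. I give up. I come from a programming background and have a reasonable command of Python, and I still could not get it working!

I want to vent just like you. deepseek ocr is based on torch, and its model configuration is `AutoModel.from_pretrained`, which is clear at a glance. Can the paddle folks learn from that?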

cipo lee

xformers error: NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs

xformers error: NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs

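[Badcase]: With the same data, the fine-tuning loss on the qwen2.5 72B pretrained model is 3 times that on qwen2 72B. Apart from more training data, what else is different in 2.5?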

support LORA-GA

How to use the PaddleOCR VL model locally on Windows