Moonlit
This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.
My error message says:
  File "/home/Moonlit/Compresso/train.py", line 235, in main()
  File "/home/Moonlit/Compresso/train.py", line 204, in main
  from transformers.integrations import AzureMLCallback, ProgressCallback
  ImportError: cannot import name 'ProgressCallback' from 'transformers.integrations' (/home/longyongliu/.local/lib/python3.10/site-packages/transformers/integrations/__init__.py)
It...
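An import error of this kind usually means a name is no longer exported from the module the script imports it from. A minimal sketch of one possible workaround follows, assuming the class is still available from another module in the installed version. The helper `import_first` is hypothetical and is not part of Compresso or `transformers`; the candidate module list in the usage note is likewise an assumption to check against the installed release.

```python
import importlib


def import_first(name, module_paths):
    """Return attribute `name` from the first module in `module_paths` that has it.

    Hypothetical helper for illustration: modules that fail to import, or that
    import but lack the attribute, are skipped and recorded for the final error.
    """
    errors = []
    for path in module_paths:
        try:
            module = importlib.import_module(path)
        except ImportError as exc:
            errors.append(f"{path}: {exc}")
            continue
        if hasattr(module, name):
            return getattr(module, name)
        errors.append(f"{path}: no attribute {name!r}")
    raise ImportError(f"cannot import {name!r}; tried " + "; ".join(errors))
```

In `train.py` this could be used as `ProgressCallback = import_first("ProgressCallback", ["transformers.integrations", "transformers.trainer_callback"])`, assuming the second module defines the class in the installed version. Pinning `transformers` to the version the repository was developed against is the other common fix.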
Bumps [idna](https://github.com/kjd/idna) from 3.4 to 3.7. Release notes Sourced from idna's releases. v3.7 What's Changed Fix issue where specially crafted inputs to encode() could take exceptionally long amount of time...
Bumps [pillow](https://github.com/python-pillow/Pillow) from 10.0.1 to 10.3.0. Release notes Sourced from pillow's releases. 10.3.0 https://pillow.readthedocs.io/en/stable/releasenotes/10.3.0.html Changes CVE-2024-28219: Use strncpy to avoid buffer overflow #7928 [@hugovk] Use functools.lru_cache for hopper() #7912 [@hugovk]...
Bumps [grpcio](https://github.com/grpc/grpc) from 1.53.0 to 1.53.2. Release notes Sourced from grpcio's releases. Release v1.53.2 This is release gRPC Core 1.53.2 (glockenspiel). For gRPC documentation, see grpc.io. For previous releases, see...
Currently, after merging the pruning masks and LoRA weights, the LLaMA-7B model size increases from 15GB to 26GB. Please add support for removing the pruned weights from the model.
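The request above amounts to turning a masked model into a physically smaller one. A minimal sketch of the idea on a toy pair of linear layers follows, in pure Python and assuming structured pruning of whole hidden units. The functions `compact_linear_pair`, `matvec`, and `count_params` are hypothetical names for illustration and do not reflect Compresso's actual code or checkpoint format.

```python
def matvec(w, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * b for a, b in zip(row, x)) for row in w]


def count_params(w):
    """Number of stored entries in a matrix given as a list of rows."""
    return sum(len(row) for row in w)


def compact_linear_pair(w1, w2, keep_mask):
    """Physically drop hidden units whose mask entry is 0.

    w1: hidden x in  (one row per hidden unit)
    w2: out x hidden (one column per hidden unit)
    keep_mask: one 0/1 entry per hidden unit
    """
    if len(keep_mask) != len(w1):
        raise ValueError("keep_mask must have one entry per row of w1")
    kept = [i for i, m in enumerate(keep_mask) if m]
    w1_small = [w1[i] for i in kept]                   # drop pruned rows
    w2_small = [[row[i] for i in kept] for row in w2]  # drop matching columns
    return w1_small, w2_small


# Toy example: 2 inputs, 3 hidden units, 2 outputs; hidden unit 1 is pruned.
w1 = [[1, 2], [3, 4], [5, 6]]
w2 = [[1, 0, 2], [0, 1, 1]]
mask = [1, 0, 1]
x = [1, 1]

# Masked (uncompacted) forward pass: zero out pruned hidden units.
hidden = [h * m for h, m in zip(matvec(w1, x), mask)]
y_masked = matvec(w2, hidden)

# Compacted forward pass gives the same output with fewer stored weights.
w1_small, w2_small = compact_linear_pair(w1, w2, mask)
y_small = matvec(w2_small, matvec(w1_small, x))
```

Here the two forward passes agree while the stored parameter count drops from 12 to 8. In a real checkpoint the same slicing would have to be applied consistently to every tensor that shares the pruned dimension.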
Hi, thank you for your nice work on ElasticViT. I'm trying to run the code following the instructions, and I hit this issue when I run the training...
Hi, I have read the Compresso paper and it is great work. I would like to know whether there are any results for datasets like wikitext2, ptb...
Hi, thank you for your nice work on Litepred. I noticed that Litepred mentions that "different versions of tflite have different inference latency", but I used different versions of tensorflow...
Hi, thank you for your work! I am running the code according to the instructions, but I found that the two variables `v_top1, v_top5` are always 0.0, like this:...
Bumps [werkzeug](https://github.com/pallets/werkzeug) from 3.0.1 to 3.0.6. Release notes Sourced from werkzeug's releases. 3.0.6 This is the Werkzeug 3.0.6 security fix release, which fixes security issues but does not otherwise change...