llCurious

Results 13 issues of llCurious

Hey @snwagh . I try to test the AlexNet and VGG16 which takes **CIFAR-10** and **ImageNet** datasets as training data. Is there data loader for these two datasets? And i...

Hey, snwagh. I have been reading your paper of Falcon and found this repo. And I am interested in how you perform the computation of **Batch Normalization**. I have the...

Hey, i tried to run the demo code in `transform_example.ipynb` and no error occurred. However, when i add `transformer.register(nn.Linear, QLinear)`, there is an error. > TypeError: empty() received an invalid...

Hi, guys. I notice that BigDL utilizes BigDL nano and ggml to accelerate int8/int4 computations. I wonder how to invoke these APIs in LLMs like LLAMA. Specifically, I want to...

user issue

Thx for providing a comprehensive graph dataset loader. I am recently working on heterogeneous graphs, which typically contains nodes and edges, both of different types. E.g., nodes has type paper,...

I try to run LLAMA using EasyLM. I follow the [README](https://github.com/young-geng/EasyLM/blob/main/docs/llama.md) for llama. The first step is conver raw LLAMA parameters. ``` sh python -m EasyLM.models.llama.convert_torch_to_easylm.py \ --checkpoint_dir='path/to/torch/llama/checkpoint' \ --output_dir='path/to/output/easylm/checkpoint'...

Hey, @stevenygd . I recently checked your code base and the paper. - One point i do not understand is the **shifting operation in activations** seems missed in this code...

Hi, @Guangxuan-Xiao . I try to test the Bloom model. You have provided the act_scales for Bloom models, could you provide the demo code for Bloom model as well?

Hey. I try to run the experiments for LeNet model. However i noticed that the convergence of LeNet is weird. The test accuracy of LeNet remains low (far from 98%)....

Hey, your work is well-presented and i just wonder one detail: **How do you ensure that the input to your quantization function is in the range [0,1]?** As you mentioned...