whcao

Results 29 issues of whcao

First of all, thank you for sharing your valuable code. As shown in fig2(c) of the paper, the softmax function is used to convert the matrix product to the dot-product...

# Usage ```diff custom_hooks = [ + dict(type=HFCheckpointHook), ] ``` The HF model will be saved at `work_dir/timestamp/hf_model` # Notes 1. As the priority of `HFCheckpointHook` is 95 which is...

![WD6cNbOe5R](https://github.com/InternLM/xtuner/assets/41630003/48e4b6e3-1bcd-4349-90f0-dbbbc0f1cee7)

## Usage 1. Draw the first type of plot which shows the absmax/absmean/max/mean/min value of a linear layer at different layers. For instance, the code below presents the visualized results...