torchchat
Run PyTorch LLMs locally on servers, desktop and mobile
### 🐛 Describe the bug ExecuTorch currently has a bug, so we need to default `max_seq_length` to 128. Once this has been fixed, remove the default here and during...
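For context, a minimal sketch of what such a defaulting workaround could look like; the helper name and plumbing are illustrative assumptions, not the repo's exact code:

```python
# ExecuTorch workaround described above: fall back to 128 when no explicit
# value is given. Remove this default once the upstream bug is fixed.
DEFAULT_MAX_SEQ_LENGTH = 128

def resolve_max_seq_length(max_seq_length=None):
    # Hypothetical helper; torchchat's real plumbing may differ.
    return max_seq_length if max_seq_length is not None else DEFAULT_MAX_SEQ_LENGTH
```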
This PR enables llava1.5 on torchchat, its first multimodal model. How to play? You can use `--prompt` as the flag for text input, and `--image-prompt` as...
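A hypothetical invocation combining both flags; the flag names come from the PR description, while the model alias and image path are illustrative assumptions:

```python
# Sketch of a text+image generation call; "llava-1.5" and the paths are
# assumptions, not confirmed aliases from the repo.
import subprocess

subprocess.run([
    "python3", "torchchat.py", "generate", "llava-1.5",
    "--prompt", "What is shown in this image?",  # text input
    "--image-prompt", "assets/example.jpg",      # image input
], check=True)
```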
### 🐛 Describe the bug
```
(pt) sunshine@raspberrypi:~/torchchat $ ./install/install_requirements.sh
+ pip3 install -r install/requirements.txt --extra-index-url https://download.pytorch.org/whl/nightly/cu121
Looking in indexes: https://pypi.org/simple, https://www.piwheels.org/simple, https://download.pytorch.org/whl/nightly/cu121
Ignoring tomli: markers 'python_version < "3.11"' don't...
```
### 🐛 Describe the bug [`convert_hf_checkpoint`](https://github.com/pytorch/torchchat/blob/main/torchchat/cli/convert_hf_checkpoint.py#L37) transforms an HF checkpoint into the torchchat format. As part of this process, `ModelArgs` is created for the newly downloaded model. Currently it constructs...
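For orientation, a simplified sketch of the conversion flow described here; the `ModelArgs` fields, the name-keyed config table, and the function body are assumptions about the general shape, not the file's exact code:

```python
# Simplified sketch: after download, a ModelArgs is built for the checkpoint
# and the HF state dict is rewritten into torchchat's expected layout.
from dataclasses import dataclass

@dataclass
class ModelArgs:
    dim: int
    n_layers: int
    n_heads: int

# Hypothetical name-keyed config table (values here are Llama-3-8B-like).
KNOWN_CONFIGS = {"Meta-Llama-3-8B": ModelArgs(dim=4096, n_layers=32, n_heads=32)}

def convert_hf_checkpoint(model_name: str, hf_state_dict: dict) -> dict:
    args = KNOWN_CONFIGS[model_name]  # ModelArgs created for the new model
    # ...key remapping from HF names to torchchat names would happen here...
    return {"model_args": args, "state_dict": hf_state_dict}
```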
This PR introduces a hook for model checkpoint remapping that removes the `model.model` prefix during model loading, for better clarity.
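A minimal sketch of such a remapping hook, assuming HF-style keys carry a redundant leading `model.` (e.g. `model.model.layers.0...`); the function name is illustrative:

```python
# Hypothetical remapping hook: strip the leading "model." so that keys like
# "model.model.layers.0.attn.weight" load as "model.layers.0.attn.weight".
def remap_state_dict_keys(state_dict: dict) -> dict:
    prefix = "model."
    return {
        (key[len(prefix):] if key.startswith(prefix + "model.") else key): value
        for key, value in state_dict.items()
    }
```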
### 🚀 The feature, motivation and pitch This is for aligning distributed's load behavior with the single-device case. Today, distributed relies on an index file containing a `param->bin` mapping to limit...
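To illustrate the index-file mechanism being referenced, a sketch of reading an HF-style `*.bin.index.json`; the filename and the set of needed parameters are assumptions:

```python
import json

# Sketch: the index maps each parameter name to the shard (.bin) that holds it,
# so a rank can open only the shard files it actually needs.
with open("pytorch_model.bin.index.json") as f:
    weight_map = json.load(f)["weight_map"]  # param name -> shard filename

needed_params = ["model.layers.0.self_attn.q_proj.weight"]  # illustrative
shards_to_load = {weight_map[p] for p in needed_params}
print(shards_to_load)
```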
### 🐛 Describe the bug I followed all the instructions in the repo and got to the point of launching the Xcode project, when I hit the "Play" button, I...
### 🐛 Describe the bug
```
torchrun --nproc-per-node 8 dist_run.py
```
```
known configs: ['13B', '30B', '34B', '70B', '7B', 'CodeLlama-7b-Python-hf', 'Mistral-7B', 'stories110M', 'stories15M', 'stories42M', 'Meta-Llama-3-70B', 'Meta-Llama-3-8B', 'Meta-Llama-3.1-70B-Tune', 'Meta-Llama-3.1-70B', 'Meta-Llama-3.1-8B-Tune', 'Meta-Llama-3.1-8B']...
```
### 🐛 Describe the bug per @kwen2501 - in the decoding step:
```
next_token = torch.tensor([decode_results[0][0]], device=device)
```
"nit: I am not sure if the use of torch.tensor...
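The quoted nit is cut off above, but for reference, one hedged alternative to building a fresh tensor from a Python scalar, assuming `decode_results[0][0]` is already a scalar tensor, would be:

```python
import torch

device = "cpu"  # placeholder for the sketch
decode_results = [(torch.tensor(42), None)]  # placeholder shape for the sketch

# Reuse the existing tensor instead of round-tripping through a Python list;
# assumes decode_results[0][0] is already a 0-dim tensor.
next_token = decode_results[0][0].reshape(1).to(device)
```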
This allows you to run the server and generate chat versions seamlessly.
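As a usage illustration, a hypothetical client call against the server, assuming it exposes an OpenAI-style `/v1/chat/completions` endpoint on localhost (the endpoint path, port, and model alias are all assumptions):

```python
import json
import urllib.request

# Hypothetical request; endpoint, port, and model alias are assumptions.
req = urllib.request.Request(
    "http://127.0.0.1:5000/v1/chat/completions",
    data=json.dumps({
        "model": "llama3.1",
        "messages": [{"role": "user", "content": "Hello!"}],
    }).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.load(resp))
```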