torchchat issues

AOTI/DSO model does not run in Linux

3

### 🐛 Describe the bug I am running an Arch Linux system with a 4090/3090 w/ and up-to-date CUDA 12.5 (`Build cuda_12.5.r12.5/compiler.34385749_0`) I have created a new mamba env for...

lhl

bug

Compile / AOTI

Cuda

[UX] We are too quiet about errors - in particular missing HF authentication...

3

The command `python3 torchchat.py where llama3` fails quietly presumably because I might not have the HF Token configured. I assumed the code was broken, though because I got a backtrace...

mikekgfb

good first issue

actionable

OpenAI API JSON formatted

2

Implement JSON formatted responses using OpenAI API types for server completion requests. Rather than giving single tokens at a time, the server will respond with a JSON following the API...

vmpuri

CLA Signed

`scripts/build_native.sh et` errors out

7

### 🐛 Describe the bug I am trying to build the llama runner natively on a rasperry pi following the torchchat description, and the post at https://dev-discuss.pytorch.org/t/run-llama3-8b-on-a-raspberry-pi-5-with-executorch/2048 I was able...

sunshinesfbay

bug

need-user-input

ExecuTorch

Memory usage is wrong (reporting 0) for non-CUDA commands

4

### 🐛 Describe the bug For example `Memory used: 0.00 GB` ``` > python3 torchchat.py generate llama3.1 --dso-path exportedModels/llama3.1.so --prompt "Hello my name is" NumExpr defaulting to 10 threads. PyTorch...

byjlw

bug

actionable

Weird model behaviour on Server/Browser: Looks like it's not using the template

2

Hi, I'm trying out the torchchat right now, started the streamlit application with llama3 model ![image](https://github.com/user-attachments/assets/3ee31c11-29ed-423a-ac29-c155bf38ebcf) I just texted Hi !! - Why is this text generation behaviour unusal ,...

akhilreddy0703

bug

actionable

Browser

Can't install requirements when using Python-3.12

4

### 🐛 Describe the bug `pip install` fails because it could not find `torch` even though, it's present in the environment: ``` % pip install git+https://github.com/pytorch/ao.git@d36de1b144b73bf753bd082109c2b5d0141abd5b Collecting git+https://github.com/pytorch/ao.git@d36de1b144b73bf753bd082109c2b5d0141abd5b Cloning https://github.com/pytorch/ao.git...

malfet

Known Gaps

actionable

deps: Add set -x for installation commands

1

Adds set -x for all installation commands in install_requirements.sh so that users can see what's actually being installed and can help debug when they run into any issues. NOTE: This...

seemethere

CLA Signed

[Llava][multimodal] enable Llava in torchchat

1

This PR aims to enable Llava in torchchat. TODOs: - [x] Create Model and ModelArgs as model definition entrances - [x] Support model definition with multiple transformers - [ ]...

Gasoonjia

CLA Signed

Leverage the HF cache for models

2

### 🚀 The feature, motivation and pitch torchchat currently uses the hf hub which has it's own model cache, torchchat copies it into it's own model directory so you end...

byjlw

enhancement

actionable

torchchat
torchchat copied to clipboard

Metadata

AOTI/DSO model does not run in Linux

[UX] We are too quiet about errors - in particular missing HF authentication...

OpenAI API JSON formatted

`scripts/build_native.sh et` errors out

Memory usage is wrong (reporting 0) for non-CUDA commands

Weird model behaviour on Server/Browser: Looks like it's not using the template

Can't install requirements when using Python-3.12

deps: Add set -x for installation commands

[Llava][multimodal] enable Llava in torchchat

Leverage the HF cache for models

← Metadata

Owner

Metadata

torchchat torchchat copied to clipboard

Metadata

← Metadata

Owner

Metadata

torchchat
torchchat copied to clipboard