h2ogpt issues

2

When interacting with h2ogpt, I regularly run into a situation where the base model is loaded, only for a validation to fail afterwards, e.g. `local_files_only` is set but a tokenizer is missing, wrong folder...

chris-aeviator
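
A minimal sketch of the kind of pre-flight check this issue asks for, run before any weights are loaded. It is not h2ogpt's implementation: the function name `preflight_errors` and the lists of expected file names are assumptions for illustration, and the files a given model folder actually needs depend on the model.

```python
from pathlib import Path

# Assumed file names, for illustration only; real model folders vary.
CONFIG_FILES = ("config.json",)
TOKENIZER_FILES = (
    "tokenizer.json",
    "tokenizer.model",
    "tokenizer_config.json",
    "vocab.json",
)


def preflight_errors(model_dir):
    """Return a list of problems found in a local model folder.

    Only the file system is inspected, so this is cheap enough to run
    before the (slow) base model load.
    """
    root = Path(model_dir)
    if not root.is_dir():
        return [f"model folder not found: {root}"]

    errors = []
    if not any((root / name).is_file() for name in CONFIG_FILES):
        errors.append(f"no model config found in {root}")
    if not any((root / name).is_file() for name in TOKENIZER_FILES):
        errors.append(f"no tokenizer files found in {root}")
    return errors
```

Calling `preflight_errors(path)` first and aborting when the returned list is non-empty would surface a missing tokenizer or a wrong folder in milliseconds, rather than after the base model has been loaded.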

connect to elastic search

https://python.langchain.com/en/latest/modules/indexes/vectorstores/examples/elasticsearch.html

pseudotensor

gradio generation of html leads to odd UI content

![image](https://github.com/h2oai/h2ogpt/assets/2249614/ad6dbdb4-718c-4da6-b5f2-7606a0896829)

pseudotensor

Better post-quantization performance

Notes about Intriguing Properties of Quantization at Scale: https://arxiv.org/pdf/2305.19268.pdf

So basically should do:

* bf16 instead of fp16 training, less sensitive to quantization later
* weight decay order 0.05, not...

pseudotensor

See if can memorize 20MB with LoRA state of ~200MB without loss of generalization

1

### 2x A6000Ada 48GB tiiuae/falcon-40b + h2ogpt-fortune2000-personalized PRETRAINING (4-bit)

`CUDA_VISIBLE_DEVICES=0,1 torchrun --nproc_per_node=2 finetune.py --data_path=h2oai/h2ogpt-fortune2000-personalized --drop_truncations=True --train_4bit=True --base_model=tiiuae/falcon-40b --micro_batch_size=1 --batch_size=128 --num_epochs=3 --run_id=9 --lora_target_modules='["query_key_value", "dense_h_to_4h", "dense_4h_to_h", "dense"]' &> log.9.txt`

` 6%|▌ |...

arnocandel
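
To make the batch settings in the command above concrete, here is a small arithmetic sketch. It assumes that `--batch_size` is the global effective batch size and that it is split across the `--nproc_per_node` processes, as is common in LoRA fine-tuning scripts. Whether `finetune.py` computes it exactly this way is an assumption, and the helper name is hypothetical.

```python
def gradient_accumulation_steps(batch_size, micro_batch_size, world_size=1):
    """Micro-batches each process accumulates before one optimizer step.

    Assumes batch_size is the global effective batch size shared by
    world_size data-parallel processes.
    """
    per_step = micro_batch_size * world_size
    if batch_size % per_step != 0:
        raise ValueError("batch_size must be divisible by micro_batch_size * world_size")
    return batch_size // per_step


# Values taken from the command: --batch_size=128 --micro_batch_size=1 --nproc_per_node=2
print(gradient_accumulation_steps(128, 1, world_size=2))  # 64
```

Under these assumptions each of the two GPUs processes one sample at a time and accumulates gradients over 64 micro-batches per optimizer step.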

Update Triton inference server Docker deployment for Falcon 40B

4

https://github.com/h2oai/h2ogpt/blob/main/docs/TRITON.md: do the same for Falcon 7B, then Falcon 40B.

arnocandel

Create WizardLM dataset with h2oGPT-falcon-40b, finetune and iterate

https://github.com/h2oai/h2o-wizardlm

arnocandel

check peft checkpointing

2

https://discord.com/channels/1097462770674438174/1100717863221870643/1113669041240944722

![image](https://github.com/h2oai/h2ogpt/assets/2249614/1f381d92-6c17-47a8-9bf7-2f90beed2798)

pseudotensor

Question: can the document query model handle questions about a table's content?

1

For example, if my **pdf file** has a table with the training and validation accuracy of a different DL architecture in each row, can I ask "what's the validation accuracy of...

zxzhijia

Use optional dependencies

validate huge files last
