Ruhollah Majdoddin

Results: 24 issues by Ruhollah Majdoddin

I am trying the online chat; sometimes it takes too long because of queuing, and sometimes it even seems to have hung. That button would be very useful then.

website
inference

I see no practical reason not to make the first user message editable. One can trivially work around the restriction by sending a `Hi` prompt first.

Instances of some cloud GPU services, like Colab and cloud.vast.ai, are themselves Docker containers, and it is not possible to run a Docker image inside them (see [here](https://github.com/googlecolab/colabtools/issues/299)). Practically, Open-Assistant can't run...

Duplicate of https://github.com/karpathy/nanoGPT/pull/27#issuecomment-1482609289, because it won't be noticed in the closed PR. @karpathy @lantiga, why is there still no autocast for CPU? (see also https://github.com/karpathy/nanoGPT/pull/27) For example, it would be better to change https://github.com/karpathy/nanoGPT/blob/a82b33b525ca9855d705656387698e13eb8e8d4b/sample.py#L32...
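For reference, a minimal sketch of what CPU autocast looks like in PyTorch: the same `torch.autocast` context manager works with `device_type="cpu"`, using `bfloat16` (the autocast dtype supported on CPU). The `model` wrapper below is illustrative, not nanoGPT's actual sampling code:

```python
import torch

def forward_autocast_cpu(model: torch.nn.Module, x: torch.Tensor) -> torch.Tensor:
    # Autocast is not CUDA-only: with device_type="cpu", eligible ops
    # (e.g. matmul, linear layers) run in bfloat16 automatically.
    with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
        return model(x)

# Standalone illustration: a matmul under CPU autocast produces bfloat16.
a = torch.randn(4, 4)
b = torch.randn(4, 4)
with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    c = a @ b
print(c.dtype)  # torch.bfloat16
```

The inputs stay `float32`; only the ops inside the context are downcast, which is why the change is a one-line context-manager tweak rather than converting the whole model.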

Each sample of OpenWebText consists of several paragraphs extracted from a single webpage. nanoGPT is trained to predict each token of a sample given its previous tokens. For the train split...
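To illustrate the training objective described above (a sketch, not nanoGPT's actual data loader): the target at each position is simply the next token of the same sample, so inputs and targets are the token sequence offset by one.

```python
def next_token_pairs(tokens: list[int]) -> tuple[list[int], list[int]]:
    """Build (input, target) sequences for next-token prediction:
    the model sees tokens[:-1] and must predict tokens[1:]."""
    return tokens[:-1], tokens[1:]

sample = [10, 42, 7, 99]          # token ids of one (tiny) sample
x, y = next_token_pairs(sample)
print(x)  # [10, 42, 7]
print(y)  # [42, 7, 99]
```

In practice nanoGPT slices fixed-length blocks out of the concatenated token stream, but the one-token offset between input and target is the same.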

## **User description** …wise ChatGPT makes an infinite stream (?). For example in 'generate ai tests', for problem 80 - optimal insertion ___ ## **Type** Enhancement ___ ## **Description** - This...

enhancement
Review effort [1-5]: 1

Adds support for Romanian by adding the Romanian phoneme-based ASR model https://huggingface.co/gigant/romanian-wav2vec2. According to its HF model card, this model has better evaluation results than https://huggingface.co/anton-l/wav2vec2-large-xlsr-53-romanian

On virtualized instances from Vast.ai (without the CLI): on an A100 40GB it consumes all the VRAM for a 10 s audio clip, with a run time of ~3 s; on an A100 80GB it consumes some...

I installed the repo without the CLI on virtualized instances from Vast.ai with A100 40GB and 80GB. `is_flash_attn_2_available()` is False. Does this mean flash-attn is not used during inference? Does...