kitops Updates to dev UI

Describe the problem you're trying to solve Dev UI should be more helpful with helping the application developers integrate with the model.

Describe the solution you'd like

Generate/show example code as the parameters and prompts are entered.
A way to see JSON communicated between the server and responses
hide or drop the list of preferences to highlight more frequently used ones
Ability to see the README.md from the ModelKit if it has one

May 07 '24 13:05 gorkem

https://build.nvidia.com/explore/discover

May 16 '24 19:05 annigro

UX Design

May 16 '24 19:05 annigro

Here is some more clarification on the requests.

POST /completion is an API specific to llama.cpp server. There is example code available for its usage which we will adjust for code generation feature.

POST /v1/chat/completions The endpoint for chat completion API is compatible with the OpenAI endpoint. We can use the existing OpenAPI libraries for code generation.

Options

The initial thought was to reduce the options to match OpenAI. However, with further thinking and considering the options are supported by both endpoints. We should continue to use all options but categorize them better.

single or multiple page

I have not really found a good reason to keep the multiple page. It is unintuitive to go back and forth for changing the parameters. I suggest we do a single page implementation.

May 21 '24 16:05 gorkem

--> We talked about how the generated code for chat mode can get really long and therefore messy. @gorkem I assume the first two lines are your solution. How would it look in the UI?

May 22 '24 00:05 annigro

@annigro no, Gorkem's first two lines are related to internal code usage, and should be transparent for the UI other than one line. We talked about doing something like this for it:

message: [{ actor: 'user', content: 'foo bar fooz' }]

and when is too long:

message: [{ actor: 'user', content: 'foo bar ...' }] <-- ellipsis but for the message's content only, and only if is too long.

but clicking on "copy code" would always copy the whole thing, regardless of the ellipsis.

May 22 '24 12:05 javisperez

@gorkem i gave a try to the open ai api but looks like both the payload and the response is different than the one we already have in llama.cpp (which makes sense). I vote to keep using the completion llama.cpp endpoint instead or redoing everything including the api layer just to support openai v1 endpoints. Thoughts?

May 22 '24 15:05 javisperez

@gorkem Figma file to review

Jun 12 '24 16:06 annigro

I did a pass left a few comments.

Jun 18 '24 16:06 gorkem

Categories and values for devmode

Jul 02 '24 16:07 annigro

This has been merged to main but will be "in prod" in the next Kit Release

Dec 04 '24 13:12 javisperez

kitops
kitops copied to clipboard

Updates to dev UI

Options

Categories

Text Generation Controls

Sampling and Diversity

Advanced Settings and Customization

Probability and Statistical Controls

single or multiple page

kitops kitops copied to clipboard

Updates to dev UI

Options

Categories

Text Generation Controls

Sampling and Diversity

Advanced Settings and Customization

Probability and Statistical Controls

single or multiple page

kitops
kitops copied to clipboard