Daniel
https://www.digitaltrends.com/computing/microsoft-nvidia-tensorrt-llm-update-ignite-2023/

Tasks
- [ ] Step-by-step docs for Jan Windows TensorRT-LLM - 1 day
- [ ] Updated code in `triton-tensorrt-llm` extension - 1 day

Reference: https://github.com/NVIDIA/trt-llm-rag-windows/blob/release/1.0/app.py#L43
WIP Spec
- Need to figure out whether BigDL and Intel Extensions are separate
- Have an extension for each inference engine
- `model.json` should have an `engine: intel-bigdl` or `engine:...`
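A minimal sketch of what such a `model.json` entry could look like. This is an assumption, not a settled schema: the `engine` value follows the `engine: intel-bigdl` idea above, and the other keys (`id`, `name`, `settings`, `ctx_len`) are illustrative placeholders.

```json
{
  "id": "llama2-7b-q4",
  "name": "Llama 2 7B (Q4)",
  "engine": "intel-bigdl",
  "settings": {
    "ctx_len": 2048
  }
}
```

Each inference-engine extension would then claim the models whose `engine` field matches its identifier.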
**Motivation:**

**WIP Spec:**
- Assistants are a way of packaging things, a framework
- Depends on the Hub and other prerequisites
- We need a separate epic to track RAG
## Objective
- Do we need a simple queue system?

### Motivation
_Null-pointer errors?_
- Currently, inference requests are handled FIFO
- We are adopting an OpenAI API, which means...
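A minimal sketch of what "a simple queue system" could mean here, assuming a single local engine that must handle one request at a time: a worker thread drains a FIFO queue so concurrent API callers never touch the engine simultaneously. The `run_inference` function is a hypothetical stand-in for the actual engine call.

```python
import queue
import threading

def run_inference(prompt: str) -> str:
    # Hypothetical placeholder for the actual inference-engine call.
    return f"response to: {prompt}"

class InferenceQueue:
    """Serializes inference requests: callers block until their turn (FIFO)."""

    def __init__(self) -> None:
        # Each item is (prompt, reply_channel); the worker thread consumes them in order.
        self._requests: queue.Queue = queue.Queue()
        worker = threading.Thread(target=self._loop, daemon=True)
        worker.start()

    def _loop(self) -> None:
        while True:
            prompt, reply = self._requests.get()
            reply.put(run_inference(prompt))
            self._requests.task_done()

    def submit(self, prompt: str) -> str:
        # Blocks until the worker has processed every earlier request.
        reply: queue.Queue = queue.Queue(maxsize=1)
        self._requests.put((prompt, reply))
        return reply.get()
```

In an HTTP server each request handler would call `submit`, so overlapping OpenAI-style API calls are answered in arrival order instead of racing on the engine.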
## Objective
- [ ] Description should be updated, including the project website
- [ ] Code signing for Ubuntu?
## Todos
- [x] Rename "Plugins" to Extensions or Modules
- [ ] Document the Extensions and Modules architecture
- [ ] Document key Extensions (e.g. Models, Inference, Threads, etc.) -...
- Allow multi-modal input to Jan
  - Requires UI support
  - Requires API support
## Objective
- As part of a larger epic, we need to autodetect the user's hardware and show recommended models
- Our long-term goal is to help the user "run best...
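A hedged sketch of what hardware autodetection plus a model recommendation could look like, using only the standard library. The detection method (`os.sysconf` for RAM, checking for `nvidia-smi` on the PATH for a GPU) and the size thresholds are assumptions for illustration, not a spec.

```python
import os
import shutil

def detect_hardware() -> dict:
    """Best-effort probe of CPU threads, total RAM, and NVIDIA GPU presence."""
    info = {"cpu_threads": os.cpu_count() or 1, "ram_gb": None}
    try:
        # POSIX-only; unavailable on Windows, hence the fallback to None.
        pages = os.sysconf("SC_PHYS_PAGES")
        page_size = os.sysconf("SC_PAGE_SIZE")
        info["ram_gb"] = round(pages * page_size / (1024 ** 3), 1)
    except (AttributeError, ValueError, OSError):
        pass
    # Crude GPU check: assumes the NVIDIA driver ships `nvidia-smi`.
    info["has_nvidia_gpu"] = shutil.which("nvidia-smi") is not None
    return info

def recommend_max_model_size(ram_gb):
    """Largest parameter count likely to fit, leaving ~50% RAM headroom.

    The file sizes are rough figures for 4-bit quantized GGUF models and
    are illustrative, not measured.
    """
    if ram_gb is None:
        return "unknown"
    usable = ram_gb * 0.5
    for params, approx_file_gb in [("70B", 40), ("13B", 8), ("7B", 4), ("3B", 2)]:
        if usable >= approx_file_gb:
            return params
    return "too little RAM for local inference"
```

The UI could then gray out or badge models above the recommended size instead of letting downloads fail at load time.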
**Problem**
- Feature requested by Sabin_Stargem from r/LocalLLaMA
- I actually think this is a great idea, especially for multi-modal AI
- Niche feature for power users with multi-GPU setups -...
## Objective
- WIP spec

## Resources
- Apple Ferret
- Need to support MLX
- Support quantization
- https://www.reddit.com/r/LocalLLaMA/comments/18oke4y/apples_mlx_framework_adds_quantization_support/