Right now https://github.com/rustformers/llm doesn't support runtime accelerator selection (rustformers/llm#386), which means the current binaries are Metal-only on macOS and CPU-only on Windows. Until that lands, maybe we should build all Windows binaries for CUDA (with the `cublas` feature).
Currently we support a warm-up prompt that lets you pre-condition the model before chatting with it. You can drop a big text summary into the model and then ask questions about the text, but this is awkward if there's a lot of text. It would be great if we supported file uploads. Let's start with .txt and .pdf. We won't be able to handle images in PDFs yet, but we should at least be able to extract the text for now.
While testing, I loaded a 2B model, then wanted to change the system prompt, so I changed it and hit start, and the new generation was much slower. When I checked, I was able to verify the issue: the app is not unloading GPU memory before loading the new model, so the new model gets loaded into shared memory instead of dedicated GPU memory.
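A possible shape for the fix, sketched with placeholder types (`Model` and `AppState` are illustrative, not the app's real loader): make sure the old model is dropped, releasing its GPU buffers, before the replacement is constructed.

```rust
// Hypothetical sketch: drop the old model *before* allocating the new one,
// so its GPU memory is freed first and the new weights can land in
// dedicated GPU memory rather than spilling into shared memory.
struct Model {
    name: String,
}

impl Drop for Model {
    fn drop(&mut self) {
        // In the real app, GPU buffers would be released here.
        println!("released {}", self.name);
    }
}

struct AppState {
    model: Option<Model>,
}

impl AppState {
    fn swap_model(&mut self, name: &str) {
        // Explicitly drop the previous model first. Assigning directly would
        // also drop it, but only *after* the new value is constructed, which
        // is exactly the window where both models are resident at once.
        drop(self.model.take());
        self.model = Some(Model { name: name.to_string() });
    }
}

fn main() {
    let mut state = AppState { model: None };
    state.swap_model("model-a");
    state.swap_model("model-b"); // "released model-a" prints before model-b is built
}
```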
Gotta say, this is one of the easiest to get going (once I ran from source due to the other ticket's issue, lol).
I did notice a few UI/UX issues I wanted to mention that would be cool to see adjusted. Not sure if they're Windows-only or affect all builds, but since it's Tauri I'd imagine it's the same across all of them...
There's no way to stop generation. I asked it a question and, wow, it was determined to give me the full 2048-token response, lol, and there doesn't seem to be any way to stop it. It would be nice if there were a button next to the spinner to kill the current generation somehow, especially if the LLM starts going down a rabbit hole that doesn't fulfill the original question's intention...
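One way a stop button could work is a shared flag that the generation loop polls between tokens; a minimal sketch, assuming the loop can check a flag per token (names here are illustrative, not the app's actual API):

```rust
use std::sync::atomic::{AtomicBool, Ordering};

// Hypothetical sketch: the generation loop checks a shared stop flag before
// each token, so a UI "stop" button can cut a 2048-token run short.
fn generate(stop: &AtomicBool, budget: usize) -> usize {
    let mut produced = 0;
    for _ in 0..budget {
        if stop.load(Ordering::Relaxed) {
            break; // the stop button was pressed; abandon the rest
        }
        produced += 1; // stand-in for decoding one token
    }
    produced
}

fn main() {
    let stop = AtomicBool::new(false);
    assert_eq!(generate(&stop, 2048), 2048); // runs to the full budget

    stop.store(true, Ordering::Relaxed); // what the stop button would do
    assert_eq!(generate(&stop, 2048), 0); // loop exits immediately
    println!("ok");
}
```

In the real app the flag would live in shared state (e.g. behind an `Arc`) so the UI thread can flip it while the worker thread generates.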
The left panel could really use a max-width attribute; having the settings take up 50% of the screen looks a little silly when the window is maximized on a desktop. In fact, after loading the model it could probably collapse to the side behind a hamburger menu to really give it a nice feel. As it is now, it looks nice when it's small.
Sending a message scrolls down, but doesn't fully scroll to show the message that was sent; oddly, it ends up focused on the middle of the first line of the sent text at the bottom of the window.
It would be really nice if the interface showed tokens/s as part of the processing animation. I noticed something odd and I'm not sure if it's an issue or my imagination: I loaded a model and it was generating really fast, then I changed the system message and started again, and this time it felt slower. Without a tokens/s readout it's hard to tell whether it actually is.
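The readout itself is cheap to compute: count the tokens emitted and divide by the wall-clock time since generation started. A rough sketch (where the timer hook goes is an assumption about the loop's shape):

```rust
use std::time::Instant;

// Tokens-per-second is just emitted tokens over elapsed wall-clock seconds.
fn tokens_per_second(tokens: u32, elapsed_secs: f64) -> f64 {
    tokens as f64 / elapsed_secs
}

fn main() {
    let start = Instant::now();
    let mut tokens = 0u32;
    for _ in 0..50 {
        tokens += 1; // stand-in for decoding one token
    }
    let tps = tokens_per_second(tokens, start.elapsed().as_secs_f64());
    println!("{tokens} tokens at {tps:.1} tok/s");
}
```

Updating this figure alongside the spinner would also make regressions like the slowdown above measurable instead of a gut feeling.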
Right now it's a bunch of manually typed text that's prone to failures. Maybe we can consider parsing out the quantization level, the quantization method, the parameter count, and a short description, and then pretty-printing that data.
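As a starting point, some of this could be parsed out of the filename itself; a minimal sketch, assuming GGML-style names like `llama-2-7b.ggmlv3.q4_0.bin` (the token patterns are an assumption about common naming conventions, not a spec):

```rust
// Hypothetical sketch: pull the parameter count and quantization level out of
// a GGML-style model filename by scanning its dot/dash-separated tokens.
fn parse_model_name(name: &str) -> (Option<String>, Option<String>) {
    let lower = name.to_lowercase();
    let mut params = None;
    let mut quant = None;
    for tok in lower.split(|c: char| c == '.' || c == '-') {
        // Parameter count: digits followed by 'b', e.g. "7b" or "13b".
        if let Some(num) = tok.strip_suffix('b') {
            if !num.is_empty() && num.chars().all(|c| c.is_ascii_digit()) {
                params = Some(format!("{}B", num));
            }
        }
        // Quantization level: 'q' followed by a digit, e.g. "q4_0", "q5_1".
        if tok.starts_with('q') && tok[1..].starts_with(|c: char| c.is_ascii_digit()) {
            quant = Some(tok.to_uppercase());
        }
    }
    (params, quant)
}

fn main() {
    let (params, quant) = parse_model_name("llama-2-7b.ggmlv3.q4_0.bin");
    println!("{:?} {:?}", params, quant); // Some("7B") Some("Q4_0")
}
```

Filenames that don't follow the convention would just yield `None`, so we'd still want a manual fallback rather than hard-failing.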
I'm just getting an "os error 3" error code ("path not found") after launch, and I can't do anything inside the app. I've tried both the .exe and the .msi installer on German Win11 22H2 Build 22621.1992.