klbr/khoj

mirror of https://github.com/khoaliber/khoj.git synced 2026-03-03 05:29:12 +00:00

Debanjum 16c6bfce8e Improve Quality and Reliability of Offline Chat (#393)

# Incoming
## Major
### Fix Prompt Size Exceeded Issue
- Fix prompt size exceeded errors by using the correct tokenizer to calculate whether the input needs to be truncated. Closes #386.
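
As a rough illustration of tokenizer-aware truncation (not Khoj's actual code), the sketch below drops the oldest messages until the prompt fits a token budget. The `truncate_to_budget` helper and the whitespace-based `whitespace_token_count` are hypothetical stand-ins; a real setup would count tokens with the tokenizer of the chat model in use.

```python
from typing import Callable, List


def truncate_to_budget(
    messages: List[str],
    max_tokens: int,
    count_tokens: Callable[[str], int],
) -> List[str]:
    """Drop the oldest messages until the total token count fits max_tokens.

    The most recent message is always kept, even if it alone exceeds the budget.
    """
    kept = list(messages)
    while len(kept) > 1 and sum(count_tokens(m) for m in kept) > max_tokens:
        kept.pop(0)  # discard the oldest message first
    return kept


def whitespace_token_count(text: str) -> int:
    """Hypothetical stand-in tokenizer: counts whitespace-separated words."""
    return len(text.split())


history = ["a b c", "d e", "f g h i"]
print(truncate_to_budget(history, 6, whitespace_token_count))
# prints ['d e', 'f g h i']
```

Passing the counting function in as a parameter keeps the truncation logic independent of any one model, which matters here because a mismatched tokenizer gives a wrong count.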

### Improve Llama 2 Model Download
- Use the correct download link for Llama V2. It should have pointed to the small model, but was pointing to the medium one
- Retry the model download if it fails. Closes #379
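
The retry behaviour can be sketched as a small wrapper around any download step. This is a minimal sketch, not the logic used in this change: the `retry` helper, its parameters, and the choice to catch `OSError` (which covers `ConnectionError` and similar I/O failures in the standard library) are all assumptions made for illustration.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry(operation: Callable[[], T], attempts: int = 3, delay_seconds: float = 0.0) -> T:
    """Call operation until it succeeds or the attempts are used up.

    Only OSError (and subclasses such as ConnectionError) triggers a retry;
    the error from the final attempt is re-raised to the caller.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    for attempt in range(1, attempts + 1):
        try:
            return operation()
        except OSError:
            if attempt == attempts:
                raise  # out of attempts, surface the last failure
            time.sleep(delay_seconds)
    raise RuntimeError("unreachable")  # keeps type checkers satisfied
```

A caller would wrap its own download function, for example `retry(lambda: download_model(url), attempts=3, delay_seconds=2.0)`, where `download_model` is a hypothetical name.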

### Fix Segmentation Fault due to Race
- Add a lock around generating chat responses from the offline model to avoid segmentation faults. Closes #367.
- Add a loading symbol to the web chat UI when the model is thinking. Closes #392
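
The locking fix for the race can be sketched as follows. This is a minimal sketch assuming the offline model exposes a single, non-thread-safe generate function; `SerializedGenerator` is a hypothetical name and not part of Khoj or of any model library.

```python
import threading
from typing import Callable


class SerializedGenerator:
    """Wrap a generate function so only one thread runs it at a time."""

    def __init__(self, generate: Callable[[str], str]) -> None:
        self._generate = generate
        self._lock = threading.Lock()

    def __call__(self, prompt: str) -> str:
        # Concurrent callers block here until the current generation finishes.
        with self._lock:
            return self._generate(prompt)
```

Serializing requests trades throughput for safety: a second chat request waits for the first to finish, which is one reason a loading indicator in the UI is useful.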

### Improve Chat Response Latency
- Improve offline chat performance by increasing the batch size (via `n_batch`) to automatically engage more cores/GPU, using a smaller model, and fixing the prompt vs response token generation numbers. Closes #363
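
To see why batch size affects time to first token, here is a back-of-the-envelope sketch. It assumes the backend evaluates the prompt in chunks of at most `n_batch` tokens, so the number of evaluation passes is the prompt length divided by `n_batch`, rounded up. The `prompt_eval_passes` helper and the numbers used are illustrative assumptions, not values taken from this change.

```python
import math


def prompt_eval_passes(prompt_tokens: int, n_batch: int) -> int:
    """Number of chunks needed to evaluate a prompt of prompt_tokens tokens."""
    if prompt_tokens < 0 or n_batch < 1:
        raise ValueError("prompt_tokens must be >= 0 and n_batch must be >= 1")
    return math.ceil(prompt_tokens / n_batch)


# Illustrative values only: a 1000-token prompt with a small and a large batch.
print(prompt_eval_passes(1000, 8))    # prints 125
print(prompt_eval_passes(1000, 256))  # prints 4
```

Fewer passes do not guarantee a proportional speed-up, since each larger pass does more work, but they give the hardware more to process in parallel per call.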

### Fix Fake Dialogue Continuation
- Fix formatting of the user query in offline chat, which was contributing to #398
- Stop Llama 2 from creating fake dialogue continuations. Closes #398
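
One common way to suppress fake dialogue continuations is to cut the response at the first marker that signals a new speaker turn. The sketch below shows that general technique; it is not necessarily how this change does it, and the `truncate_at_stop` helper and the `Human:` / `AI:` markers are hypothetical.

```python
from typing import Iterable


def truncate_at_stop(text: str, stop_markers: Iterable[str]) -> str:
    """Return text up to (not including) the earliest stop marker, if any."""
    cut = len(text)
    for marker in stop_markers:
        if not marker:
            continue  # an empty marker would match at index 0 and erase everything
        index = text.find(marker)
        if index != -1:
            cut = min(cut, index)
    return text[:cut].rstrip()


raw = "Paris is the capital.\nHuman: And Italy?\nAI: Rome."
print(truncate_at_stop(raw, ["Human:", "AI:"]))
# prints Paris is the capital.
```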

## Minor
- Improve default message for Chat window on web when it's not configured. Include hint to use offline chat.
- Add null check in `perform_chat_checks` method
- Add offline chat director unit tests

## Performance Analysis (Time to First Token)
| Query | v0.10.0 | this branch |
|-|-|-|
| Query 1 | 52s | 28s |
| Query 2 | 33s | 42s |
| Query 3 | 67s | 38s |

2023-08-01 22:07:27 -07:00

.github/workflows

Delete FUNDING.yml

2023-07-27 15:28:47 -07:00

config

Fix configure openai processor for khoj docker

2023-07-30 02:07:33 -07:00

docs

Clarify usage in telmetry.md

2023-07-30 22:37:20 -07:00

scripts

Upgrade bump_version script to handle release and post-release commit

2023-03-10 15:23:17 -06:00

src

Improve Quality and Reliability of Offline Chat (#393)

2023-08-01 22:07:27 -07:00

tests

Update local Chat Actor and Director tests expected to fail

2023-08-01 20:52:00 -07:00

.dockerignore

Use pypi khoj to fix docker builds and dockerize github workflow

2023-02-19 01:57:01 -06:00

.gitignore

Test Chat Actor Capabilities; ability to answer from notes, chat logs etc

2023-03-16 09:30:37 -06:00

.pre-commit-config.yaml

Run mypy checks in test workflow and on push (via pre-commit)

2023-02-17 16:08:56 -06:00

docker-compose.yml

Fix configure openai processor for khoj docker

2023-07-30 02:07:33 -07:00

Dockerfile

Migrate from PyQT6 to PySide6

2023-07-11 18:43:44 -07:00

Khoj.desktop

Fix Khoj subtitle in desktop entry, pyproject, cli and Obsidian Readme

2023-07-02 16:09:07 -07:00

Khoj.spec

Fix gpt4all import error in Desktop builds (#356)

2023-07-28 11:54:18 -07:00

LICENSE

Add, configure and run pre-commit locally and in test workflow

2023-02-17 13:31:36 -06:00

manifest.json

Release Khoj version 0.10.0

2023-07-28 19:27:47 -07:00

pyproject.toml

Replace Falcon 🦅 model with Llama V2 🦙 for offline chat (#352)

2023-07-27 20:51:20 -07:00

README.md

Fix link to Docs website in Khoj readme on Github

2023-07-29 12:50:39 -07:00

versions.json

Release Khoj version 0.10.0

2023-07-28 19:27:47 -07:00

README.md

An AI personal assistant for your digital brain

📜 Read Docs • 🌍 Try Khoj Cloud • 💬 Get Involved

Khoj is a desktop application to search and chat with your notes, documents and images.
It is an offline-first, open source AI personal assistant accessible from your Emacs, Obsidian or Web browser.
It works with jpeg, markdown, notion, org-mode, pdf files and github repositories.

| 🔎 Search | 💬 Chat |
|-|-|
| Quickly retrieve relevant documents using natural language | Get answers and create content from your existing knowledge base |
| Does not need internet | Can be configured to work without internet |

Languages

Python 51%

TypeScript 36.1%

CSS 4.1%

HTML 3.2%

Emacs Lisp 2.4%

Other 3.1%