klbr/khoj - khoj - Gitea: Git with a cup of tea

klbr/khoj

mirror of https://github.com/khoaliber/khoj.git synced 2026-03-02 21:19:12 +00:00

Go to file

Debanjum Singh Solanky 0c52a1169a Put context into separate user message before sending to chat model

The document, online search context are now passed as separate user
messages to chat model, instead of being added to the final user message.

This will improve

- Models ability to differentiate data from user query.
  That should improve response quality and reduce prompt injection
  probability

- Make truncation logic simpler and more robust
  When context window hit, can simply pop messages to auto truncate
  context in order of context, user, assistant message for each
  conversation turn in history until reach current user query

  The complex, brittle logic to extract user query from context in
  last user message isn't required.

Marking the context message with assistant role doesn't translate well
across chat models. E.g
- Gemini can't handle consecutive messages by role = model well
- Claude will merge consecutive messages by same role. In current
  message ordering the context message will result get merged into the
  previous assistant response. And if move context message after user
  query. The truncation logic will have to hop and skip while doing
  deletions
- GPT seems to handle consecutive roles of any type fine

Using context role = user generalizes better across chat models for
now and aligns with previous behavior.

2024-10-22 03:09:36 -07:00

.github

Remove tools cache in dockerize.yml workflow

2024-09-29 00:27:37 -07:00

documentation

Upgrade documentation website dependencies

2024-10-17 11:58:52 -07:00

scripts

Update bump version script to bump new next.js web app version too

2024-08-05 16:20:47 +05:30

src

Put context into separate user message before sending to chat model

2024-10-22 03:09:36 -07:00

tests

Fix PDFs unit test, skip OCR

2024-10-20 22:25:41 -07:00

.dockerignore

Use pypi khoj to fix docker builds and dockerize github workflow

2023-02-19 01:57:01 -06:00

.gitattributes

Exclude tests data file from programming stats on Github

2023-08-28 11:00:52 -07:00

.gitignore

Cycle through chat history in chat input on Obsidian (#861 )

2024-08-12 23:55:25 -07:00

.pre-commit-config.yaml

Add isort to the pre-commit configuration and apply it to the whole project (#595 )

2023-12-28 18:04:02 +05:30

docker-compose.yml

Intelligently initialize a decent default set of chat model options

2024-09-19 20:32:08 -07:00

Dockerfile

Reduce size of Khoj Docker images by removing layers and caches

2024-09-29 04:06:35 -07:00

gunicorn-config.py

Bump gunicorn workers per server up to 2

2024-04-18 11:32:51 +05:30

LICENSE

Change license to GNU AGPLv3 from GNU GPLv3

2023-11-16 11:14:06 -08:00

manifest.json

Release Khoj version 1.26.3

2024-10-21 08:19:05 -07:00

prod.Dockerfile

Reduce size of Khoj Docker images by removing layers and caches

2024-09-29 04:06:35 -07:00

pyproject.toml

Fix the version of pymupdf to avert build errors

2024-10-21 12:56:51 -07:00

pytest.ini

Move the django app into the src/khoj folder for better organization and functionality

2023-11-21 10:56:04 -08:00

README.md

Use Khoj icons. Add automation & improve agent text on web login page

2024-10-17 11:58:52 -07:00

versions.json

Release Khoj version 1.26.3

2024-10-21 08:19:05 -07:00

README.md

Your AI second brain

📑 Docs • 🌐 Web • 🔥 App • 💬 Discord • ✍🏽 Blog

Khoj is a personal AI app to extend your capabilities. It smoothly scales up from an on-device personal AI to a cloud-scale enterprise AI.

Chat with any local or online LLM (e.g llama3, qwen, gemma, mistral, gpt, claude, gemini).
Get answers from the internet and your docs (including image, pdf, markdown, org-mode, word, notion files).
Access it from your Browser, Obsidian, Emacs, Desktop, Phone or Whatsapp.
Create agents with custom knowledge, persona, chat model and tools to take on any role.
Automate away repetitive research. Get personal newsletters and smart notifications delivered to your inbox.
Find relevant docs quickly and easily using our advanced semantic search.
Generate images, talk out loud, play your messages.
Khoj is open-source, self-hostable. Always.
Run it privately on your computer or try it on our cloud app.

See it in action

Go to https://app.khoj.dev to see Khoj live.

Full feature list

You can see the full feature list here.

Self-Host

To get started with self-hosting Khoj, read the docs.

Contributors

Cheers to our awesome contributors! 🎉

Made with contrib.rocks.

Interested in Contributing?

We are always looking for contributors to help us build new features, improve the project documentation, or fix bugs. If you're interested, please see our Contributing Guidelines and check out our Contributors Project Board.

Languages

Python 51%

TypeScript 36.1%

CSS 4.1%

HTML 3.2%

Emacs Lisp 2.4%

Other 3.1%