klbr/khoj

mirror of https://github.com/khoaliber/khoj.git synced 2026-03-02 21:19:12 +00:00

Author	SHA1	Message	Date
Debanjum	8a16f5a2af	Reduce logical complexity of constructing context from chat history - Process chat history in default order instead of processing it in reverse. Improve legibility of context construction for minor performance hit in dropping message from front of list. - Handle multiple system messages by collating them into list - Remove logic to drop system role for gemma-2, o1 models. Better to make code more readable than support old models.	2025-08-27 13:43:10 -07:00
Debanjum	c8e07e86e4	Format server code with ruff recommendations	2025-08-01 00:28:17 -07:00
Emmanuel Ferdman	655a1b38f2	Resolve Pydantic deprecation warnings. Signed-off-by: Emmanuel Ferdman <emmanuelferdman@gmail.com>	2025-07-25 16:55:00 -07:00
Debanjum	aa081913bf	Improve truncation with tool use and Anthropic caching - Cache last Anthropic message, given research mode now uses the function calling paradigm and not the old research mode structure. - Cache tool definitions passed to Anthropic models - Stop dropping first message if by assistant, as the Anthropic API doesn't seem to complain about it any more. - Drop tool result when tool call is truncated, as that is an invalid state - Do not truncate tool use message content, just drop the whole tool use message. AI model APIs need tool use assistant message content in a specific form (e.g. with thinking etc.), so dropping content items breaks the expected tool use message content format. Handle tool use scenarios where iteration query isn't set for retry	2025-07-02 23:32:44 -07:00
Debanjum	dca17591f3	Handle parsing json from string with plain text suffix	2025-05-23 19:44:02 -07:00
Debanjum	fd591c6e6c	Upgrade tenacity to respect min time for exponential backoff. The fix for the issue is in tenacity 9.0.0, but older langchain required tenacity <9.0.0. Explicitly pin version of langchain sub packages to avoid indexing and doc parsing breakage.	2025-05-17 17:37:15 -07:00
Debanjum	2694734d22	Update truncation logic to handle multi-part message content	2025-05-17 17:37:15 -07:00
Debanjum	70b7e7c73a	Improve load of complex json objects. Use it to pick tool, run code. Gemini doesn't work well when trying to output json objects. Using it to output raw json strings with complex, multi-line structures requires more intense clean-up of raw json string for parsing.	2024-11-26 17:37:57 -08:00
Debanjum Singh Solanky	9986c183ea	Default to gpt-4o-mini instead of gpt-3.5-turbo in tests, func args. GPT-4o-mini is cheaper, smarter and can hold more context than GPT-3.5-turbo. In production, we also default to gpt-4o-mini, so it makes sense to upgrade defaults and tests to work with it.	2024-08-22 19:04:49 -07:00
Debanjum Singh Solanky	5f2442450c	Update truncation test to reduce flakiness in cloud tests. Removed dependency on faker, factory for the truncation tests as that seems to be the point of flakiness.	2024-06-07 19:42:48 +05:30
Debanjum Singh Solanky	4228965c9b	Handle msg truncation when question is larger than max prompt size. Notice and truncate the question itself at this point.	2024-03-31 15:50:06 +05:30
Debanjum Singh Solanky	ecddf98430	Handle truncation when single long non-system chat message. Previously assumed the system prompt was always passed as the first message, so expected there to be at least 2 messages in logs. This broke chat actors querying with a single long non-system message. A more robust way to extract the system prompt is via the message role instead.	2024-03-15 15:58:39 +05:30
sabaimran	79913d4c17	Add isort to the pre-commit configuration and apply it to the whole project (#595) * Apply isort to the entire repository * Fix missing import issues in text_to_entries * Fix imports in migration files	2023-12-28 18:04:02 +05:30
sabaimran	48363ec861	Add additional check for chat_messages length in UT	2023-08-01 09:25:52 -07:00
sabaimran	e55e9a7b67	Fix unit tests and truncation logic	2023-07-31 21:37:59 -07:00
Saba	3a61919344	Fix failing unit tests by hard-coding model presence of expected search types	2023-06-13 16:32:47 -07:00
Saba	5d5ebcbf7c	Rename truncate messages method and update unit tests to simplify assertion logic	2023-06-06 23:25:43 -07:00
Saba	7119ed0849	Run pre-commit script	2023-06-05 19:29:23 -07:00
Saba	948ba6ddca	Remove unused logger	2023-06-05 19:01:03 -07:00
Saba	f65ff9815d	Move message truncation logic into a separate function. Add unit tests with factory boy.	2023-06-05 18:58:29 -07:00

20 Commits