It seems to me that it would be useful to be explicit about where the
embedded database should live, as well as to log where it _does_ live
(via the info log) when no location is specified.
The extract questions flow includes chat history both in the prompt and
in the actual chat history. Only pass it in the prompt for now. Later,
update the prompts to pass chat history via the chat messages list for
better truncation flexibility.
1. Due to the interaction of two changes:
- dedupe by corpus_id, where corpus_id tracks logical content blocks
like files and org/md headings.
- return compiled, not logical, blocks, where compiled blocks track
smaller content chunks that fit within search model and LLM context
windows.
When combined, they returned only one compiled chunk per logical
block, even if multiple chunks matched within that logical content
block.
The fix is to either dedupe by compiled text or to return deduped
logical content blocks (by corpus_id) corresponding to the matched
compiled chunks. This commit fixes it via the first method.
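The first method can be sketched roughly as below. This is illustrative,
not the actual codebase implementation: the hit shape and key names are
assumptions.

```python
def deduplicate_hits(hits):
    """Dedupe search hits by their compiled chunk text, not by corpus_id.

    Deduping by corpus_id collapsed all matching chunks within a logical
    content block down to one hit; keying on the compiled text instead
    keeps every distinct matching chunk while still dropping duplicates.
    """
    seen_compiled = set()
    deduped = []
    for hit in hits:
        if hit["compiled"] not in seen_compiled:
            seen_compiled.add(hit["compiled"])
            deduped.append(hit)
    return deduped
```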
2. Due to zipping the inferred queries with the search results, only a
single search result was returned per query!
This silently dropped matching search results and went undetected.
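The failure mode above is the classic zip truncation pitfall, sketched
here with illustrative variable names (not the actual code):

```python
inferred_queries = ["what is x", "how does y work"]
search_results = ["hit1", "hit2", "hit3", "hit4", "hit5"]

# Buggy pairing: zip stops at the shorter iterable, so only the first
# len(inferred_queries) search results survive -- one result per query,
# with hit3..hit5 silently dropped.
buggy = list(zip(inferred_queries, search_results))
assert len(buggy) == 2

# One possible fix: keep all matching results for each query instead of
# zipping them away elementwise.
fixed = [(query, search_results) for query in inferred_queries]
assert all(len(results) == 5 for _, results in fixed)
```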
The Tailwind theme spacing of the scroll area surrounding chat history
on large screens was causing the large gap between the chat input box
and chat history on some screen layouts.
This change reduces the spacing to a more acceptable level.
Previously, summarizedResult would be unset when a tool call failed.
This caused research to fail due to ChatMessageModel construction
failures when building tool chat histories, and would have caused
similar errors in other constructed chat histories.
Putting a failed-iteration message in the summary prevents that while
letting the research agent continue its research.
Not all web search providers (like Jina, Searxng) return a text
snippet. Making snippet optional allows processing search results from
these web search providers without hitting validation errors.
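A minimal dataclass stand-in for whatever validation model the codebase
actually uses (the class and field names here are assumptions) shows
the shape of the fix:

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class WebSearchResult:
    title: str
    link: str
    # Some providers (e.g. Jina, Searxng) omit a text snippet; defaulting
    # to None lets their results be processed without validation errors.
    snippet: Optional[str] = None
```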
This bug was introduced in 05d4e19cb, version 1.42.2, during the
migration to save deeply typed ChatMessageModel, as ChatMessageModel
has not used the right field name for organic results (since the
start).
Previously this did not matter, as the message was stored to the DB
regardless, but now the mapping of the dictionary to ChatMessageModel
drops that field before the conversation is saved to the DB.
This resulted in organic context being lost on page reload and only
being shown on the first response.
Not sure why, but in some cases when interacting with o3 (which needs
non-streaming), stream_options seems to be set.
Cannot reproduce, but hopefully dropping stream_options explicitly
should resolve this issue.
Related: 985a98214
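The explicit drop can be sketched as below; the helper name and kwargs
shape are illustrative assumptions, not the actual code:

```python
def build_completion_kwargs(model_kwargs, stream):
    """Strip stream_options from non-streaming request kwargs.

    stream_options is only meaningful alongside stream=True; sending it
    on a non-streaming call (e.g. for o3) can trigger an API error, so
    drop it defensively regardless of how it got set.
    """
    kwargs = dict(model_kwargs)  # copy so the caller's dict is untouched
    if not stream:
        kwargs.pop("stream_options", None)
    return kwargs
```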
Older packages (like 1.84.0) seem to always pass the reasoning_effort
argument to the OpenAI API, which now seems to throw an unexpected
request argument error when used with non-reasoning models (like
4o-mini).
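One way to guard against this is to only forward reasoning_effort for
reasoning-capable models. The helper name and the prefix check below
are assumptions for illustration, not the actual fix:

```python
def build_request_args(model_name, reasoning_effort=None):
    """Only pass reasoning_effort for reasoning-capable models.

    Non-reasoning models (e.g. 4o-mini) reject the argument with an
    unexpected request argument error, so omit it entirely for them.
    The o1/o3 prefix check is an illustrative heuristic.
    """
    args = {"model": model_name}
    if reasoning_effort and model_name.startswith(("o1", "o3")):
        args["reasoning_effort"] = reasoning_effort
    return args
```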
There had been a regression that made all agents display the default
chat model instead of the actual chat model associated with the agent.
This change resolves that issue by prioritizing the agent-specific
chat model from the DB over the user or server chat model.
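The priority order reduces to a simple fallback chain; the function and
argument names here are hypothetical:

```python
def resolve_chat_model(agent_chat_model, user_chat_model, server_chat_model):
    """Pick the chat model by priority: agent-specific > user > server.

    Each argument may be None; the first non-empty value wins, so an
    agent's own model is never shadowed by the default.
    """
    return agent_chat_model or user_chat_model or server_chat_model
```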
- Fix the code context data type for validation on the server. This
  would prevent the chat message from being written to history.
- Handle null code results on the web app.
We now pass deeply typed chat messages throughout the application to
construct tool-specific chat history views since 05d4e19cb.
ChatMessageModel didn't allow intent.query to be unset, but an
interrupted research iteration's history can have an unset query. This
change makes intent.query optional.
It also uses the message-by-user entry to populate the user message in
tool chat history views. Using the query from the khoj intent was an
earlier shortcut to avoid dealing with the message-by-user entry, but
that doesn't scale to the current scenario, where turns are no longer
required to be a single user, assistant message pair.
Specifically, a chat history can now contain multiple user messages
followed by a single khoj message. The new change constructs a chat
history that handles this scenario naturally and makes the code more
readable.
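The construction can be sketched as below, with an assumed turn shape
(the key names are illustrative, not from the codebase):

```python
def build_chat_history(turns):
    """Flatten stored turns into role-tagged chat messages.

    Each turn may hold several user messages followed by at most one
    assistant (khoj) message. Iterating over the stored messages handles
    this naturally, unlike assuming a strict (user, assistant) pair per
    turn, which breaks on multi-user-message turns.
    """
    history = []
    for turn in turns:
        for user_message in turn["user_messages"]:
            history.append({"role": "user", "content": user_message})
        if turn.get("assistant_message"):
            history.append({"role": "assistant", "content": turn["assistant_message"]})
    return history
```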
Also, only previous research iterations that completed are now
populated, as incomplete iterations do not serve much purpose.