klbr/khoj - khoj - Gitea: Git with a cup of tea

klbr/khoj

mirror of https://github.com/khoaliber/khoj.git synced 2026-03-02 21:19:12 +00:00

Author	SHA1	Message	Date
Debanjum	c8e07e86e4	Format server code with ruff recommendations	2025-08-01 00:28:17 -07:00
Debanjum	f0513cbbb1	Fix to run new automation api tests in ci	2025-07-08 23:27:09 -07:00
Debanjum	721c55a37b	Rename ResponseWithThought response field to text for better naming	2025-07-02 20:48:24 -07:00
Debanjum	d2c7e5516f	Fix online chat actor tests, improve offline chat actor tests The chat actor (and director) tests haven't been looked into in a long while. They'd gone stale in how they were calling thee functions. And what was required to run them. Now the online chat actor tests work again.	2025-06-06 13:28:18 -07:00
Debanjum	2f4160e24b	Use single extract questions method across all LLMs for doc search Using model specific extract questions was an artifact from older times, with less guidable models. New changes collate and reuse logic - Rely on send_message_to_model_wrapper for model specific formatting. - Use same prompt, context for all LLMs as can handle prompt variation. - Use response schema enforcer to ensure response consistency across models. Extract questions (because of its age) was the only tool directly within each provider code. Put it into helpers to have all the (mini) tools in one place.	2025-06-06 13:28:18 -07:00
Debanjum	05d4e19cb8	Pass deep typed chat history for more ergonomic, readable, safe code The chat dictionary is an artifact from earlier non-db chat history storage. We've been ensuring new chat messages have valid type before being written to DB for more than 6 months now. Move to using the deeply typed chat history helps avoids null refs, makes code more readable and easier to reason about. Next Steps: The current update entangles chat_history written to DB with any virtual chat history message generated for intermediate steps. The chat message type written to DB should be decoupled from type that can be passed to AI model APIs (maybe?). For now we've made the ChatMessage.message type looser to allow for list[dict] type (apart from string). But later maybe a good idea to decouple the chat_history recieved by send_message_to_model from the chat_history saved to DB (which can then have its stricter type check)	2025-06-04 00:03:14 -07:00
Debanjum	c4bb92076e	Convert async create automation api endpoints to sync	2024-12-26 21:59:55 -08:00
sabaimran	6c8007e23b	Improve handling of multiple output modes - Use the generated descriptions / inferred queries to supply context to the model about what it's created and give a richer response - Stop sending the generated image in user message. This seemed to be confusing the model more than helping. - Also, rename the open ai converse method to converse_openai to follow patterns with other providers	2024-12-10 16:54:21 -08:00
Debanjum	45c623f95c	Dedupe, organize chat actor, director tests - Move Chat actor tests that were previously in chat director tests file - Dedupe online, offline io selector chat actor tests	2024-11-18 16:10:50 -08:00
Debanjum	2a76c69d0d	Run online, offine chat actor, director tests for any supported provider - Previously online chat actors, director tests only worked with openai. This change allows running them for any supported onlnie provider including Google, Anthropic and Openai. - Enable online/offline chat actor, director in two ways: 1. Explicitly setting KHOJ_TEST_CHAT_PROVIDER environment variable to google, anthropic, openai, offline 2. Implicitly by the first API key found from openai, google or anthropic. - Default offline chat provider to use Llama 3.1 3B for faster, lower compute test runs	2024-11-18 15:11:37 -08:00

10 Commits