klbr/khoj - khoj - Gitea: Git with a cup of tea

klbr/khoj

mirror of https://github.com/khoaliber/khoj.git synced 2026-04-28 00:19:25 +00:00

Author	SHA1	Message	Date
sabaimran	dd957bedd3	Remove pgserver in repo from git tracking	2025-03-31 15:07:24 -05:00
sabaimran	d53b740197	Improve online search and allow server to skip auto webpage read	2025-03-31 13:52:48 -05:00
Debanjum	177560655d	Fix and Improve Online Search and Webpage Read (#1147 ) New - Support Firecrawl as a online search provider Improve - Fallback to other enabled online search providers on failure - Speed up online search with Jina by excluding webpage content in search results Fix - Fix Jina webpage reader. Improve it to include generated alt text to each image on webpage - Truncate online query to Serper if query exceeds max supported length	2025-04-01 00:09:46 +05:30
Debanjum	d62dd4ef61	Support Firecrawl as a online search provider	2025-03-31 17:06:00 +05:30
Debanjum	3939e995e4	Fallback to enabled, lower priority online search providers on error Make serper.dev higher priority than official google serp api because it provides more detailed results with knowledge cards etc.	2025-03-31 17:05:44 +05:30
Debanjum	9b7442f28f	Truncate online query to Serper if query exceeds max supported length Previously query to serper with longer than max supported would throw error instead of returning at least some results. Truncating the onlien search query to serper to max supported length mitigates that issue.	2025-03-31 17:01:42 +05:30
Debanjum	db7eba56f6	Fix webpage read and improve web search with Jina - Improve webpage read to include image alt text - Improve Jina webpage search to not include each page content - Use POST instead of GET for web search, webpage read with Jina	2025-03-31 17:01:42 +05:30
Debanjum	db68372b81	Update code sandbox prompts to allow network access when using E2B Tell Khoj code writing chat actor that it has access to the network and can use the python requests library in the E2B code sandbox.	2025-03-31 15:33:47 +05:30
Debanjum	5b8c2989d6	Add hover text on button to unshare a conversation on web app	2025-03-31 15:32:43 +05:30
Debanjum	85d627ceb0	Simplify docs to self-host with pip since can use embedded DB now Remove postgres setup instructions from self host with pip docs. It is unnecessary if embedded postgres DB works on the operating system.	2025-03-30 00:17:24 +05:30
Debanjum	713ba06a8d	Release Khoj version 1.38.0	2025-03-29 18:30:06 +05:30
Debanjum	e9132d4fee	Support attaching programming language file types to web app for chat	2025-03-29 01:22:35 +05:30
Debanjum	bdb6e33108	Install pgserver only when `pip install khoj[local]' is enabled This avoids installing pgserver on linux arm64 docker builds, which it doesn't currently support and isn't required to support as Khoj docker images can use standard postgres server made available via our docker-compose.yml	2025-03-29 00:27:19 +05:30
Debanjum	5ee513707e	Use embedded postgres db to simplify self-hosted setup (#1141 ) Use pgserver python package as an embedded postgres db, installed directly as a khoj python package dependency. This significantly simplifies self-hosting with just a `pip install khoj'. No need to also install postgres separately. Still use standard postgres server for multi-user, production use-cases.	2025-03-29 00:03:55 +05:30
Debanjum	56b63f95ea	Suggest Google image gen model, new Anthropic chat models on first run - Update default anthropic chat models to latest good models. - Now that Google supports a good text to image model. Suggest adding that if Google AI API is setup on first run.	2025-03-28 23:07:17 +05:30
Ikko Eltociear Ashimine	1e34de69e9	Fix spelling in Automations Docs (#1140 ) Recieve -> Receive	2025-03-28 23:07:06 +05:30
Debanjum	72986c905a	Fix default agent creation to allow chat on first run Previously agent slug was not considered on create even when passed explicitly in agent creation step. This made the default agent slug different until next run when it was updated after creation. And didn't allow chat to work on first run The fix to use the agent slug when explicitly passed allows users to chat on first run.	2025-03-28 22:49:00 +05:30
Debanjum	03de2803f0	Fallback to default agent for chat when unset in get conversation API	2025-03-28 00:56:18 +05:30
Debanjum	a387f638cd	Enforce json schema on more chat actors to improve schema compliance Including infer webpage urls, gemini documents search, pick default mode tools chat actors	2025-03-28 00:56:18 +05:30
Debanjum	ccd9de7792	Improve safety settings for Gemini chat models - Align remaining harm categories to only refuse in high harm scenarios as well - Handle response for new "negligible" harm probability as well	2025-03-28 00:56:18 +05:30
Debanjum	2ec5cf3ae7	Normalize type of chat messages arg sent to Anthropic completion funcs Previously messages got Anthropic specific formatting done before being passed to Anthropic (chat) completion functions. Move the code to format messages of type list[ChatMessage] into Anthropic specific format down to the Anthropic (chat) completion functions. This allows the rest of the functionality like prompt tracing to work with normalized list[ChatMesssage] type of chat messages across AI API providers	2025-03-26 18:24:17 +05:30
Debanjum	4085c9b991	Fix infer webpage url step actor to request upto specified max urls Previously we'd always request up to 3 webpage url via the prompt but read only one of the requested webpage url. This would degrade quality of research and default mode. As model may request reading upto 3 webpage links but get only one of the requested webpages read. This change passes the number of webpages to read down to the AI model dynamically via the updated prompt. So number of webpages requested to be read should mostly be same as number of webpages actually read. Note: For now, the max webpages to read is kept same as before at 1.	2025-03-26 18:24:17 +05:30
Debanjum	c337c53452	Fix to use agent chat model for research model planning Previously the research mode planner ignored the current agent or conversation specific chat model the user was chatting with. Only the server chat settings, user default chat model, first created chat model were considered to decide the planner chat model. This change considers the agent chat model to be used for the planner as well. The actual chat model picked is decided by the existing prioritization of server > agent > user > first chat model.	2025-03-25 18:31:55 +05:30
Debanjum	df090e5226	Enable unsharing of a public conversation (#1135 ) This change enables the creator of a shared conversation to stop sharing the conversation publicly. ### Details 1. Create an API endpoint to enable the owner of the shared conversation to unshare it 2. Unshare a public conversations from the title pane of the public conversation on the web app	2025-03-25 14:24:01 +05:30
Debanjum	9dfa7757c5	Unshare public conversations from the title pane on web app Only show the unshare button on public conversations created by the currently logged in user. Otherwise hide the button Set conversation.isOwner = true only if currently logged in user shared the current conversation. This isOwner information is passed by the get shared conversation API endpoint	2025-03-25 14:05:29 +05:30
Debanjum	d9c758bcd2	Create API endpoint to unshare a public conversation Pass isOwner field from the get shared conversation API endpoint if the currently authenticated user created the requested public conversation	2025-03-25 14:05:29 +05:30
Debanjum	e3f6d241dd	Normalize chat messages sent to gemini funcs to work with prompt tracer Previously messages passed to gemini (chat) completion functions got a little of Gemini specific formatting mixed in. These functions expect a message of type list[ChatMessage] to work with prompt tracer etc. Move the code to format messages of type list[ChatMessage] into gemini specific format down to the gemini (chat) completion functions. This allows the rest of the functionality like prompt tracing to work with normalize list[ChatMesssage] type of chat messages across providers	2025-03-25 14:04:16 +05:30
Debanjum	7976aa30f8	Terminate research if query or tool is empty	2025-03-25 14:04:16 +05:30
Debanjum	39aa48738f	Set effort for openai reasoning models to pick tool in research mode This is analogous to how we enable extended thinking for claude models in research mode. Default to medium effort irrespective of deepthought for openai reasoning models as high effort is currently flaky with regular timeouts and low effort isn't great.	2025-03-25 14:04:16 +05:30
Debanjum	b4929905b2	Add costs of ai prompt cache read, write. Use for calls to Anthropic	2025-03-25 14:04:16 +05:30
Debanjum	d4b0ef5e93	Fix ability to disable code and internet providers in eval workflow Sets env vars to empty if condition not met so: - Terrarium (not e2b) used as code sandbox on release triggered eval - Internet turned off for math500 eval	2025-03-25 14:04:16 +05:30
sabaimran	a8285deed7	Release Khoj version 1.37.2	2025-03-23 11:38:25 -07:00
sabaimran	b7ac8771de	Update a few pieces of documentation around data sources.	2025-03-23 11:36:20 -07:00
sabaimran	12e7409da9	Release Khoj version 1.37.1	2025-03-23 11:10:34 -07:00
sabaimran	985f1672ed	Remove eval lists from git tracking	2025-03-23 10:59:32 -07:00
Debanjum	d1df9586ca	Standardize AI model response temperature across provider specific ranges - Anthropic expects a 0-1 range. Gemini & OpenAI expect a 0-2 range - Anneal temperature to explore reasoning trajectories but respond factually - Default send_message_to_model and extract_question temps to the same	2025-03-23 18:09:22 +05:30
Debanjum	55ae0eda7a	Upgrade package dependencies nextjs for web app and torch on server	2025-03-23 17:10:40 +05:30
Debanjum	8409e64ff0	Clean AI model API providers documentation	2025-03-23 16:26:34 +05:30
Debanjum	86a51d84ca	Access Claude and Gemini via GCP Vertex AI (#1134 ) Support accessing Claude and Gemini AI models via Vertex AI on Google Cloud. See the documentation at docs.khoj.dev for setup details	2025-03-23 16:26:02 +05:30
Debanjum	16ffebf765	Document how to configure using AI models via GCP Vertex AI	2025-03-23 16:12:46 +05:30
Debanjum	7153d27528	Cache Google AI API client for reuse	2025-03-23 16:12:46 +05:30
Debanjum	da33c7d83c	Support access to Gemini models via GCP Vertex AI	2025-03-23 16:12:46 +05:30
Debanjum	603c4bf2df	Support access to Anthropic models via GCP Vertex AI Enable configuring a Khoj AI model API for Vertex AI using GCP credentials. Specifically use the api key & api base url fields of the AI Model API associated with the current chat model to extract gcp region, gcp project id & credentials. This helps create a AnthropicVertex client. The api key field should contain the GCP service account keyfile as a base64 encoded string. The api base url field should be of the form `https://{MODEL_GCP_REGION}-aiplatform.googleapis.com/v1/projects/{YOUR_GCP_PROJECT_ID}` Accepting GCP credentials via the AI model API makes it easy to use across local and cloud environments. As it bypasses the need for a separate service account key file on the Khoj server.	2025-03-23 16:12:46 +05:30
Debanjum	8bebcd5f81	Support longer API key field in DB to store GCP service account keyfile	2025-03-23 14:55:50 +05:30
Debanjum	f2b438145f	Upgrade sentence-transformers. Avoid transformers v4.50.0 as problematic - The 3.4.1 release of sentence tranformer fixes offline load latency of sentence transformer models (and Khoj) by avoiding call to HF - The 4.50.0 release of transformers is resulting in jax error (unexpected keyword argument 'flatten_with_keys') on load.	2025-03-23 09:02:57 +05:30
Debanjum	510cbed61c	Make google auth package dependency explicit to simplify code Previously google auth library was explicitly installed only for the cloud variant of Khoj to minimize packages installed for non production use-cases. But it was being implicitly installed as a dependency of an explicit package in the default installation anyway. Making the dependency on google auth package explicit simplifies the conditional import of google auth in code while not incurring any additional cost in terms of space or complexity.	2025-03-23 09:02:57 +05:30
Debanjum	5fff05add3	Set seed for Google Gemini models using KHOJ_LLM_SEED env variable This env var was already being used to set seed for OpenAI and Offline models	2025-03-22 08:59:31 +05:30
Debanjum	6cc5a10b09	Disable SimpleQA eval on release as saturated & low signal for usecase Reaching >94% in research mode on SimpleQA. When answers can be researched online, it becomes too easy. And the FRAMES eval does a more thorough job of evaluating that use-case anyway.	2025-03-22 08:05:12 +05:30
Debanjum	45015dae27	Limit to json enforcement via json object with DeepInfra hosted models DeepInfra based models do not seem to support json schema. See https://deepinfra.com/docs/advanced/json_mode for reference	2025-03-22 08:04:09 +05:30
Debanjum	dc473015fe	Set default model, sandbox to display in eval workflow summary on release	2025-03-20 14:44:56 +05:30

1 2 3 4 5 ...

4516 Commits