Add Vision Support (#889)

klbr/khoj

mirror of https://github.com/khoaliber/khoj.git synced 2026-03-03 21:29:08 +00:00

# Summary of Changes
* New UI to show preview of image uploads
* ChatML message changes to support gpt-4o vision based responses on images
* AWS S3 image uploads for persistent image context in conversations
* Database changes to have `vision_enabled` option in server admin panel while configuring models
* Render previously uploaded images in the chat history, show uploaded images for pending msgs
* Pass the uploaded_image_url through to subqueries
* Allow image to render upon first message from the homepage
* Add rendering support for images to shared chat as well
* Fix some UI/functionality bugs in the share page
* Convert user attached images for chat to webp format before upload
* Use placeholder to attached image for data source, response mode actors
* Update all clients to call /api/chat as a POST instead of GET request
* Fix copying chat messages with images to clipboard

TLDR; Add vision support for openai models on Khoj via the web UI!

---------

Co-authored-by: sabaimran <narmiabas@gmail.com>
Co-authored-by: Debanjum Singh Solanky <debanjum@gmail.com>

This commit is contained in:

Raghav Tirumale

2024-09-09 17:22:18 -05:00

committed by

GitHub

parent b553bba1d8

commit 549686a7a4

33 changed files with 740 additions and 417 deletions

									
										2

src/interface/web/package.json
									
												View File
												
				@@ -47,7 +47,7 @@

				        "eslint": "^8",

				        "eslint-config-next": "14.2.3",

				        "input-otp": "^1.2.4",

				        "intl-tel-input": "^23.8.0",

				        "intl-tel-input": "^23.8.1",

				        "katex": "^0.16.10",

				        "libphonenumber-js": "^1.11.4",

				        "lucide-react": "^0.397.0",

Add Vision Support (#889)

2 src/interface/web/package.json Unescape Escape View File

2

src/interface/web/package.json

View File