- The CLIP model can represent images and text in the same vector space
- Enhance CLIP's image understanding by augmenting the plain image with its text-based metadata, specifically any subject and description XMP tags on the image
- Improve results by combining the plain image similarity score with the metadata similarity scores for the highest-ranked images
- Minor Fixes
  - Convert verbose from bool to integer in image_search; it is already passed as an integer from the main program entrypoint
  - Process images with ".jpeg" extensions too
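The score-combination idea above can be sketched in a few lines. This is an illustrative example, not the repository's actual code: the embeddings are hypothetical placeholders standing in for vectors the CLIP model would produce, and the unweighted sum is just one way to combine the scores.

```python
import numpy as np

def cos_sim(a: np.ndarray, b: np.ndarray) -> float:
    # Cosine similarity between two embedding vectors
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical pre-computed embeddings; in practice these come from
# encoding the query, the image, and its XMP subject/description tags
# with the CLIP model, which maps images and text into one vector space.
query_emb = np.array([0.2, 0.9, 0.1])
image_emb = np.array([0.3, 0.8, 0.2])
metadata_emb = np.array([0.1, 0.95, 0.05])

image_score = cos_sim(query_emb, image_emb)
metadata_score = cos_sim(query_emb, metadata_emb)

# Re-rank the top images by the combined score so that matching
# metadata boosts an image's plain CLIP similarity
combined_score = image_score + metadata_score
```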
Semantic Search
Allow natural language search over user content like notes, images and transactions, using transformer-based models.
All data is processed locally. Users can interface with the semantic-search app via Emacs, the API, or the command line.
Dependencies
- Python3
- Miniconda
Install
git clone https://github.com/debanjum/semantic-search && cd semantic-search
conda env create -f environment.yml
conda activate semantic-search
Run
Load ML model, generate embeddings and expose API to query specified org-mode files
python3 src/main.py -c=sample_config.yml --verbose
Use
- Semantic Search via Emacs
  - Install semantic-search.el
  - Run M-x semantic-search <user-query> or call C-c s
- Semantic Search via API
  - Query: GET http://localhost:8000/search?q="What is the meaning of life"
  - Regenerate Embeddings: GET http://localhost:8000/regenerate
  - Semantic Search API Docs
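The query endpoint above can also be called programmatically. A minimal sketch using only the Python standard library, assuming the server is running on localhost:8000 as started in the Run step:

```python
import json
import urllib.parse
import urllib.request

# URL-encode the natural language query and build the search URL
query = '"What is the meaning of life"'
url = "http://localhost:8000/search?q=" + urllib.parse.quote(query)

def search(search_url: str) -> list:
    # Fetch and decode the JSON search results
    # (requires the semantic-search server to be up)
    with urllib.request.urlopen(search_url) as response:
        return json.load(response)
```

Calling `search(url)` returns the ranked results as parsed JSON; the exact response shape is defined by the API docs linked above.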
Upgrade
cd semantic-search
git pull origin master
conda env update -f environment.yml
conda activate semantic-search
Acknowledgments
- MiniLM Model for Asymmetric Text Search. See SBert Documentation
- OpenAI CLIP Model for Image Search. See SBert Documentation
- Charles Cave for OrgNode Parser