1
0
Fork 0
Commit graph

5 commits

Author SHA1 Message Date
Matthew Holt
b10585ed3f
Try supporting MPS (Apple Metal) for GPU 2025-09-04 23:37:38 -06:00
Matthew Holt
b3376b5298
Fix pipeline bugs; rethink embeddings
Fixed several bugs introduced by the pipeline refactoring.

Updated goexif2 fork to use my latest commit which fixes not being able to find EXIF data on some JPEG images.

Embeddings now refer to the item they are for, rather than an item referring to a single embedding. This allows items to have multiple embeddings if necessary, which gives us some flexibility when models change/improve, etc.

Also reworked the Python server to use a smaller model (base siglip2 instead of so400m) so that it will fit on more GPUs, including my 4070; as well as a new "DeviceManager" that ChatGPT helped me figure out, to choose GPU when it has enough memory for it, as conditions change.
2025-09-04 21:40:50 -06:00
Matthew Holt
98069ee66b
Health check for python server
This ensures that searches and embedding jobs don't run before it has fully loaded, since loading models can take a while
2025-04-28 17:33:40 -06:00
Matthew Holt
6d231fd0c2
Improved embeddings with SigLIP2; fix semantic search bug
Still lots of room for improvement here, but I see way better results already.
2025-04-13 15:38:14 -06:00
Matt Holt
35c5a63be4
Refactor python server code, update schema, rename config dir (#68)
* WIP

* Finish updating changes
2025-02-07 11:34:42 -07:00
Renamed from ml/server.py (Browse further)