0.8.1
What's Changed
- Update local-benchmarks script to save results to a new directory during each run by @jparismorgan in #464
- [automated] Update backwards-compatibility-data for release 0.8.0 by @github-actions in #467
- Fix C++ warnings across codebase by @jparismorgan in #461
- Enable more benchmarks in ann-benchmarks script by @jparismorgan in #470
- Optimize distance computations for IVFPQ by @NikolaosPapailiou in #468
- Fix C++ incorrect or ambiguous primitive types by @jparismorgan in #455
- Add C++ sanitizers by @jparismorgan in #471
- Use single shared seed for rng generation in C++ by @jparismorgan in #469
- Make logging utils thread-safe by @jparismorgan in #474
- Add nightly CI which runs sanitizers by @jparismorgan in #476
- Add new test and cleanup C++ code by @jparismorgan in #475
- Fix
counting_sum_of_squares_distance
thread sanitizer error by @jparismorgan in #472 - Parallelize Vamana query by @jparismorgan in #463
- Fix address sanitizer error when storing metadata in destructor by @jparismorgan in #473
- Fix mutex lock error in logging utils by @jparismorgan in #478
- Add option to ann-benchmarks.py to skip benchmarks and leave instance running by @jparismorgan in #477
- Add dump() methods to logging classes by @jparismorgan in #479
- Add more testing for Vamana helpers, add option to skip top_k to improve ingestion speed by @jparismorgan in #481
- Avoid copies in query code by @jparismorgan in #483
- Add Vamana storage spec by @jparismorgan in #484
- Add more comments about IVF parameters
partitions
andnprobe
by @jparismorgan in #487 - IVF_FLAT Distance metric by @cainamisir in #451
- Update matrix to take in a
std::vector
instead ofstd::initializer_list
by @jparismorgan in #492 - Fix flaky IVF_PQ Python test and add more C++ testing around pq-encoding by @jparismorgan in #494
- Fix infinite loop in kmeans by @jparismorgan in #491
- Distance metric integration for vamana, and refactoring of distance metrics by @cainamisir in #460
- Add out-of-core query() support to IVF PQ by @jparismorgan in #485
- IVF_PQ: Remove unused pq_ivf_centroids, remove extra call to train_ivf(), add comments, cleanup code by @jparismorgan in #489
- Refactor logging helpers by @jparismorgan in #497
- Name each benchmark within
local-benchmarks.py
by @jparismorgan in #498 - Add scGPT and scvi embeddings and improve SOMA reader by @NikolaosPapailiou in #501
- Vlad/l2 sumofsquares by @cainamisir in #486
- Update ann-benchmarks.py to connect to running instance by @jparismorgan in #500
- Fix some TODOs after the cloud release by @NikolaosPapailiou in #505
- Various small tdb matrix cleanups by @jparismorgan in #510
- Distance metric small fixes: uninitialized value in IVF_PQ and pass setting during
consolidate_updates()
by @jparismorgan in #508 - Add
fixed_min_triplet_heap
by @jparismorgan in #507 - Update IVF_PQ array names by @jparismorgan in #511
- Check finite and infinite IVF_PQ queries return the same ids and distances, fix
count_intersections()
to not modify inputs, updateread_index_finite()
to return data by @jparismorgan in #509 - Add new plot to local-benchmarks.py showing results from all indexes by @jparismorgan in #512
- DirectoryReader: set text/plain as default mime type if it is not found by @NikolaosPapailiou in #513
- Add option to run local-benchmarks.py with index at a tiledb uri by @jparismorgan in #514
- IVF_PQ re-ranking by @jparismorgan in #502
- Support AWS index URI in
local-benchmarks.py
by @jparismorgan in #516 - Add
k_factor
tolocal-benchmarks.py
by @jparismorgan in #517
Full Changelog: 0.8.0...0.8.1