Vectory

Version-controlled vector indexing

Snapshots, branching, audit, and verifiable indices — designed for reproducible retrieval and enterprise constraints. Pure JVM. No native dependencies.

Contact: contact@datahike.io

Why Vectory

  • Snapshots — Immutable index snapshots in O(1) (structural sharing).
  • Scalable reads — A snapshot is a value. Hand it to any number of workers—no connection pool, no coordinator.
  • Reproducibility — Query historical index states for audits, debugging, and compliance workflows.
  • Verification — Merkle-hashed indices for verifiable, content-addressed retrieval states.
  • Storage — Pluggable persistence via Konserve with backends tailored to your environment.

Performance

Benchmarks are published with methodology and reproducible workloads. The goal is to be explicit about throughput, recall, and operational tradeoffs.

  • Reads — QPS and tail latency (p95/p99), plus recall vs. index settings.
  • Writes — Insert throughput, background compaction costs, and snapshot creation overhead.
  • Method — Workloads + datasets + exact configs are documented (no “magic numbers”).

Integrations

First-class JVM integrations ship alongside the core API.

  • Spring AI — Drop-in integration for Spring-based RAG and retrieval pipelines.
  • LangChain4j — Adapter for LangChain4j-based applications and evaluators.
  • APIs — Full Java API and full Clojure API.

Deployment

Embedded JVM library

Run Vectory where your services run. No separate cluster required unless you want one.

Pluggable storage

Choose persistence backends that match your needs (local, cloud, replicated), with a consistent model.

Commercial

We offer evaluation support, integration work, and enterprise support. If you need procurement-friendly paperwork, we’ll make it easy.

FAQ

  • Production-ready? — We’ll share a concrete readiness checklist and help you evaluate quickly.
  • Upgrades? — We optimize for explicit versioning and reproducible state transitions.
  • Security? — Offline-friendly, minimal dependencies, and transparent benchmarking methodology.