All notable changes to Antfly will be documented in this file.
Roadmap
Upcoming Features
- Multimodal Search Enhancements
  - Support for ImageBind as a multimodal embedder
  - Image-to-image similarity search (search with images as queries)
  - Extended audio file support (MP3, WAV, FLAC)
  - Video file processing and frame extraction
  - Automatic scene detection and indexing for video content
- Advanced AI Capabilities
  - Custom prompt templates for different content types
  - Model performance comparison tools
  - Automatic model selection based on content type
  - Fine-tuning support for domain-specific embeddings
Releases
[0.0.13] - 2026-03-03
Highlights
- Sparse Vector Search (SPLADE) — hybrid search combining dense and sparse vectors with weighted fusion for better relevance
- PostgreSQL CDC Replication — automatically sync data from PostgreSQL into Antfly via logical replication
- Operator Improvements — PVC lifecycle management, availability zone topology, admission webhooks, and storage resilience
- Faster Sparse Indexing — up to 3.6x faster sparse index inserts with multi-tier caching
Features
- Sparse Vector (SPLADE) Search — hybrid dense+sparse fusion with configurable per-index merge weights
- PostgreSQL CDC Replication — logical replication with automatic change capture from PostgreSQL tables
- Chunking and Summarization support in sparse embeddings index
- Operator: PVC Lifecycle management, availability zone topology, and storage resilience
- Operator: Admission Webhooks for AntflyCluster resource validation
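As a rough illustration of what weighted dense+sparse fusion can look like, the sketch below uses weighted reciprocal rank fusion (RRF). This is an assumption chosen for illustration: the changelog does not say which fusion formula Antfly uses, and the function name `weighted_rrf`, the `k=60` constant, and the weight values are all hypothetical.

```python
from collections import defaultdict


def weighted_rrf(rankings, weights, k=60):
    """Fuse several ranked lists of document IDs into one list.

    Each list contributes weight / (k + rank) to a document's score,
    where rank starts at 1. Illustrative only; not Antfly's API.
    """
    scores = defaultdict(float)
    for name, ranked_ids in rankings.items():
        weight = weights.get(name, 1.0)  # unlisted sources default to 1.0
        for rank, doc_id in enumerate(ranked_ids, start=1):
            scores[doc_id] += weight / (k + rank)
    # Highest fused score first; ties broken by ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a dense index and a sparse (SPLADE) index.
dense = ["doc_a", "doc_b", "doc_c"]
sparse = ["doc_c", "doc_a", "doc_d"]

fused = weighted_rrf(
    {"dense": dense, "sparse": sparse},
    {"dense": 0.7, "sparse": 0.3},  # hypothetical per-index merge weights
)
print(fused)
```

Raising the sparse weight favors exact term matches, while raising the dense weight favors semantic similarity.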
Performance
- Up to 3.6x faster sparse index inserts
- Multi-tier caching and batched writes for sparse indexes
- Configurable sync level for embeddings indexes
Bug Fixes
- Fix transaction correctness for cross-table OCC commits
- Fix read availability during leader elections by falling back to healthy replicas
- Fix full text index version resolution and rebuild during schema migration
- Fix search result filtering for chunked embedding indexes
- Fix graph node key decoding in fusion results
- Fix UTF-8 streaming in classification reasoning parser
Documentation
- Add SDK documentation pages for Antfly and Termite
- Add CDC replication guide with cloud provider setup instructions
[0.0.9] - 2026-02-22
Highlights
- Secrets Management — new API and dashboard page for managing secrets
- API Key & Bearer Token Auth — authenticate with API keys or bearer tokens
- AI Provider Timeouts — configurable timeout for AI provider calls
- PDF Enrichment Pipeline — zip-direct reading, parallel extraction, vision-based categorization, and Florence 2 re-OCR for low-quality pages
- Omni Edition — renamed install edition with streamlined macOS support
Features
- Secrets Management API and Antfarm dashboard page
- API Key & Bearer Token Authentication
- AI Provider Timeout configuration
- PDF Enrichment — direct zip reading, parallel page extraction, page-type categorization with vision support, and Florence 2 re-OCR for low-quality pages
- Dashboard improvements with reverse proxy support and sidebar redesign
Bug Fixes
- Fix transaction intent conflict detection
- Auto-abort stale transactions to prevent permanent blocking
- Fix out-of-memory when processing large zip archives
- Prevent rechunking all documents on every restart in ephemeral mode
Documentation
- Rename ONNX edition to Omni
- Add pod scheduling guide for operator
- Update quickstart to use built-in providers
[0.0.8] - 2026-02-18
Highlights
- Cross-Table Transactions — optimistic concurrency control (OCC) with read-modify-write support across tables
- Built-in Embedder & Reranker — bundled INT8 quantized all-MiniLM-L6-v2 embedder and reranker, no external service required
- Shared Pebble Block Cache — single block cache shared across all DB instances per process for better memory utilization
- Operator Scheduling Constraints — tolerations, nodeSelector, affinity, and topologySpreadConstraints in AntflyCluster CRD
- Cluster Hibernation — scale operator replicas to zero while retaining PVCs for cost savings
Features
- Cross-Table Transactions with OCC read-modify-write support
- Built-in Embedder — INT8 quantized all-MiniLM-L6-v2 bundled with Antfly
- Built-in Reranker — INT8 quantized reranker model bundled with Antfly
- Shared Pebble Block Cache across all DB instances per process
- Operator Scheduling Constraints — tolerations, nodeSelector, affinity, and topologySpreadConstraints added to AntflyCluster CRD
- Cluster Hibernation — allow scaling metadata and data node replicas to zero
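The general idea behind OCC read-modify-write can be sketched with a small in-memory store. This is a minimal sketch of the technique, not Antfly's implementation: the `VersionedStore` class, the `(table, key)` addressing, and the retry policy are all assumptions made for illustration.

```python
class ConflictError(Exception):
    """Raised when a key changed between read and commit."""


class VersionedStore:
    """Toy store mapping (table, key) -> (version, value). Hypothetical."""

    def __init__(self):
        self._data = {}

    def read(self, key):
        # Missing keys read as version 0 with no value.
        return self._data.get(key, (0, None))

    def commit(self, read_versions, writes):
        # Validate: every key read must still be at the version we saw.
        for key, seen_version in read_versions.items():
            if self.read(key)[0] != seen_version:
                raise ConflictError(key)
        # Apply: all writes land together and bump each key's version.
        for key, value in writes.items():
            version, _ = self.read(key)
            self._data[key] = (version + 1, value)


def transfer(store, src, dst, amount, max_retries=3):
    """Read-modify-write across two keys, retrying on conflict."""
    for _ in range(max_retries):
        src_version, src_balance = store.read(src)
        dst_version, dst_balance = store.read(dst)
        try:
            store.commit(
                {src: src_version, dst: dst_version},
                {src: src_balance - amount, dst: (dst_balance or 0) + amount},
            )
            return True
        except ConflictError:
            continue  # another writer got in first; re-read and retry
    return False


store = VersionedStore()
checking = ("accounts", "alice")  # key in one table
savings = ("savings", "alice")    # key in a different table
store.commit({}, {checking: 100, savings: 0})
ok = transfer(store, checking, savings, 30)
print(ok, store.read(checking), store.read(savings))
```

No locks are held while the transaction computes its writes; conflicts are detected only at commit time, which suits workloads where contention is rare.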
Bug Fixes
- Fix Raft "proposal dropped" errors being incorrectly treated as permanent during leader election
- Fix case-sensitive matching in transient shard error detection
- Fix SSE event mapping and generation enablement in the answer agent compatibility layer
- Fix S3 restore path to use common.ParseS3URL
- Fix publicAPI.enabled default to false and delete service when disabled
[0.0.7] - 2026-02-13
Highlights
- Retrieval & Generation Agents — new agentic architecture for retrieval-augmented generation
- MCP & A2A Protocol Support — connect Antfly to AI agents via MCP (/mcp/v1) and Agent-to-Agent protocol
- Foreign Tables — federated queries against external PostgreSQL databases
- Named Provider Registry — configure embedders, generators, rerankers, and chunkers by name
- Audio Transcription — speech-to-text support via Termite
- Ephemeral Chunks — transient chunk storage with the store_chunks config option
Features
- Retrieval Agents — tool-use agentic loop for retrieval and generation, replacing the previous answer endpoint (deprecated /agents/answer still available for backward compatibility)
- MCP Server at /mcp/v1 for AI agent integration
- A2A Protocol facade for retrieval and query-builder agents
- Foreign Tables for federated PostgreSQL queries
- Named Provider Registry for embedders, generators, chains, rerankers, and chunkers
- Audio/STT with Termite as speech-to-text provider and media chunking support
- Ephemeral Chunks mode (store_chunks config option) for transient chunk storage
- Graph Index — field-based edges, topology constraints, and summarizer
- Remote Content configuration system for web scraping
- CLAP & CLIPCLAP model support for audio embeddings
- Antfly Operator now included in the main repository with docs and install manifests
Bug Fixes
- Fix two transaction bugs found via TLA+ model checking
- Fix shard split finalizing before initialization completes
- Fix node crashes from closed-channel panics and double-close on shard shutdown
- Fix S3 endpoint configuration (AWS_ENDPOINT_URL env var now supported)
- Extensive JPEG2000 decoding improvements for PDF processing
- Fix MCP schema compatibility issues
Deprecations
- /agents/answer endpoint deprecated in favor of /agents/retrieval
[0.0.2] - 2026-01-12
Highlights
- Cross-table join support for queries spanning multiple tables
- Zero-downtime shard splitting with two-phase split
- TTS/STT audio library with OpenAI and Google Cloud providers
- CLIP model support for multimodal image indexing
Features
- Cross-Table Joins — query across multiple tables with shard-aware routing
- Zero-Downtime Shard Splitting — two-phase split for high availability
- Audio Library — TTS and STT support with OpenAI and Google Cloud providers
- Dynamic Templates — flexible field mapping with automatic schema inference
- Aggregations API — range and term aggregations (renamed from Facets)
- Chat Agent — tool execution, clarification handling, confidence scoring, and multi-turn query builder mode
- Indexes in Raft Snapshots — faster recovery with pause/resume for index operations
- ONNX Runtime GenAI bundled for local LLM generation
Bug Fixes
- Fix data loss during shard splits with SyncLevelWrite
- Fix multiple shard split race conditions and state transition issues
- Fix CoreML memory inflation with full-precision ONNX models
- Fix memory leak in store shutdown
- Fix CORS configuration handling
[0.0.1] - 2025-12-20
First official release of Antfly.
Features
- Unified ONNX + XLA build for cross-platform ML inference
- Termite downloads page with Homebrew support
- antflycli included in container images
- Document TTL support
Bug Fixes
- Fix shard split data loss and race conditions
- Fix Raft initialization resilience
- Improve shard split reliability with a two-phase approach
Documentation
- Mobile-friendly docs navigation
- Discord and GitHub community links
[0.0.0-dev13]
Features
- Graph Database — declarative graph query DSL, shortest path queries, edge expiration and TTL
- Document Transform Operations — MongoDB-style in-place updates, upserts, path-based field manipulation
- Termite — dynamic model loading, reranker support with ONNX models
- TOON Binary Encoder for efficient data serialization
- RAG — result pruning, streaming LLM responses, sync-level enrichment control
- Dashboard UI — visual Bleve query builder, table filtering, improved results display
- Encrypted Keystore for secrets management
- AVX-512 SIMD acceleration for vector operations
[0.0.0-dev3] - 2025-08-XX
Breaking Changes
- Index configuration moved into OpenAPI specification (previous metadata instances are incompatible)
- API routes namespaced under /api/v1
Features
- Dashboard UI — web interface for managing tables and indexes, hosted on metadata server