Weave Code
Code Weaver
Helps Laravel developers discover, compare, and choose open-source packages. See popularity, security, maintainers, and scores at a glance to make better decisions.
Feedback
Share your thoughts, report bugs, or suggest improvements.
Subject
Message

Ai Click House Store Laravel Package

symfony/ai-click-house-store

ClickHouse vector store integration for Symfony AI Store. Store and query embeddings in ClickHouse using distance functions and ANN/vector indexes for fast similarity search. Links to ClickHouse docs plus Symfony AI contributing and issue tracker.

View on GitHub
Deep Wiki
Context7

Product Decisions This Supports

  • AI/ML Infrastructure Modernization: Enables cost-effective, scalable vector storage for Symfony AI applications by leveraging ClickHouse’s open-source architecture, reducing reliance on proprietary vector databases (e.g., Pinecone, Weaviate). Aligns with a build-vs-buy strategy favoring open-source solutions for long-term cost and control.
  • Scalable Semantic Search: Supports high-performance, low-latency vector similarity searches (e.g., recommendation engines, chatbots, or document retrieval) using ClickHouse’s ANN indexes and distance functions, ideal for applications requiring sub-millisecond responses at scale.
  • Multi-Tenancy & Cost Optimization: Ideal for SaaS platforms or large-scale deployments where ClickHouse’s open-source model reduces cloud costs significantly compared to managed vector databases. Enables shared vector storage across tenants without per-query fees.
  • Hybrid Search Capabilities: Facilitates combining vector similarity with SQL-based filtering (e.g., metadata queries), enabling advanced use cases like hybrid search (keyword + vector) for enterprise applications or knowledge graphs.
  • Compliance & Data Sovereignty: Appeals to regulated industries (e.g., healthcare, finance) where self-hosted vector stores align better with data residency and compliance requirements than third-party services.
  • Roadmap for AI-Driven Features: Supports future-proofing for AI/ML features like Retrieval-Augmented Generation (RAG), personalized recommendations, or anomaly detection by providing a robust, scalable backend for vector operations.

When to Consider This Package

Adopt This Package If:

  • Your Symfony AI application requires a high-performance vector store for millions of embeddings (e.g., >100K vectors) with sub-millisecond latency.
  • You’re already using ClickHouse for analytics or OLAP workloads and want to consolidate infrastructure to avoid multi-database complexity.
  • Cost efficiency is a priority: ClickHouse’s open-source model eliminates per-query fees, making it significantly cheaper than managed vector databases (e.g., Pinecone at $0.006/1K vectors).
  • You need SQL-based filtering on vectors (e.g., WHERE metadata.category = 'tech' AND vector_distance(...) < 0.5), enabling complex queries beyond pure similarity search.
  • Your use case involves batch processing (e.g., ingesting millions of vectors daily), where ClickHouse’s MergeTree engine excels in write-heavy workloads.
  • You require self-hosted control over your vector store, including data sovereignty, custom indexing, or compliance with specific regulations.

Look Elsewhere If:

  • You need a managed service with auto-scaling, backups, or serverless options (e.g., Pinecone, Weaviate, Milvus).
  • Your vector dataset is small (<10K vectors) or low-dimensional (<128D), where simpler stores (e.g., SQLite, Redis) or lightweight libraries (e.g., FAISS) may suffice.
  • You require fine-tuned ANN accuracy (ClickHouse’s SCANN/LSH may not match the precision of specialized libraries like FAISS or Milvus).
  • Your team lacks ClickHouse expertise, as setup, optimization, and troubleshooting require SQL tuning and infrastructure knowledge.
  • You’re using a non-Symfony PHP stack (this package is specifically designed for Symfony AI’s Store abstraction).
  • Your application demands ACID transactions or complex relational queries, as ClickHouse is optimized for OLAP (analytics) rather than OLTP (transactions).

How to Pitch It (Stakeholders)

For Executives:

"This package allows us to use ClickHouse—a high-performance, open-source database—as our vector store for Symfony AI, delivering cost savings of up to 90% compared to proprietary solutions like Pinecone. By leveraging ClickHouse’s scalability and SQL capabilities, we can support millions of vectors for use cases like semantic search, recommendations, or AI-driven analytics—all while maintaining full control over our data and infrastructure. This aligns with our goals for cost efficiency, compliance, and long-term scalability without vendor lock-in."

Key Ask:

  • Approval to evaluate ClickHouse as a vector store alternative, including benchmarking against current solutions (e.g., Pinecone, Weaviate).
  • Budget allocation for ClickHouse infrastructure (if not already in use) and potential DevOps support for setup and optimization.

For Engineering:

"This bridge integrates ClickHouse as a vector store backend for Symfony AI, enabling:

  • High-performance ANN searches using ClickHouse’s native vector distance functions (e.g., L2, cosine) and ANN indexes (HNSW, QuantizedFlat).
  • SQL-based filtering on vectors, allowing complex queries like WHERE metadata.category = 'tech' AND vector_distance(...) < 0.5.
  • Seamless integration with Symfony AI’s StoreInterface, requiring minimal code changes.

Trade-offs:

  • Self-hosted responsibility: Requires ClickHouse setup (but leverages existing infrastructure if already in use for analytics).
  • Early-stage adoption: Low GitHub activity but backed by Symfony’s AI team.
  • Performance tuning: ANN index configuration (e.g., GRANULARITY, GRAPH_SIZE) may require benchmarking.

Next Steps:

  1. Benchmark against your current vector store (latency, cost, and scalability).
  2. Prototype a high-volume use case (e.g., 1M vectors) to validate performance.
  3. Align with DevOps on ClickHouse deployment (cluster sizing, backups, and monitoring)."

For Data Scientists/ML Teams:

"This unlocks ClickHouse’s vector capabilities for your Symfony AI models, enabling:

  • Sub-second similarity searches for embeddings (e.g., SELECT * FROM vectors ORDER BY vector_distance(...) LIMIT 10).
  • Hybrid search (combine keyword + vector queries in SQL) for richer retrieval.
  • Cost-effective scaling for large datasets (no per-query fees).

Example Use Cases:

  • Document retrieval (e.g., RAG pipelines for LLMs).
  • Product recommendations (filter vectors by category + similarity).
  • Anomaly detection (vector distance thresholds for outlier identification).

Pro Tip: Use ClickHouse’s ANN indexes to optimize recall for high-dimensional embeddings (e.g., 768D)."*


For DevOps/Infrastructure:

"This package requires:

  • A ClickHouse cluster (v22.8+) with vector/ANN support.
  • Schema design for vector storage (e.g., Array(Float32) columns + ANN indexes).
  • Driver configuration (HTTP or native) in Laravel/Symfony.

Operational Considerations:

  • Backup strategy: ClickHouse’s REPLICATED engine for high availability.
  • Monitoring: Track system.asynchronous_metrics for query performance.
  • Scaling: Horizontal scaling via sharding (if needed for >100M vectors).

Recommendation: Start with a single-node ClickHouse instance for testing, then scale based on load."*

Weaver

How can I help you explore Laravel packages today?

Conversation history is not saved when not logged in.
Prompt
Add packages to context
No packages found.
monarobase/country-list
nasirkhan/laravel-sharekit
directorytree/privacy-filter-classifier
directorytree/privacy-filter
datacore/hub-sdk
develia/commons
cuci/prototurk-sdk
cuci/prototurk-sdk-symfony
develia/geo-bundle
dreamzy/livewire-charts
touchestate-sdk/php-sdk
22h/doctrine-garbage-collection-bundle
agtp/agtp-php
agtp/mod-php
splash/sonata-admin
splash/metadata
splash/openapi
splash/scopes
splash/toolkit
testo/output-teamcity