Vespa.ai

Name: Vespa.ai
Rating: 8.3 (1 reviews)

🇳🇴

Trondheim-built open-source tensor-native search and vector database, spun out of Yahoo

8.3/10

EEAGDPREU DataOpen SourceFree Tier

Review by EuropeanStack EditorialUpdated May 2026Verified May 2026

Visit Vespa.ai

Bottom Line

8.3/10

Vespa is the most capable open-source search engine in the European software directory and arguably one of the most capable anywhere. The tensor-native architecture, hybrid retrieval, learned ranking, and production scale at Yahoo, LinkedIn, and Spotify are all genuine technical advantages over both pure vector databases and traditional search engines.

Vespa.ai is a Trondheim-based open-source tensor-native search and vector database operated by VESPA.AI AS (org.nr. 931605569). The technology originated at FAST Search & Transfer in 2001, became a core piece of Yahoo's serving infrastructure, and was spun out as an independent Norwegian AS in October 2023. Released under Apache 2.0, Vespa combines dense vector search, BM25 keyword scoring, structured filtering, and ML-driven ranking in a single engine that serves Yahoo, LinkedIn, and Spotify-scale workloads at sub-100ms latencies. The company raised USD 31 million in Series A funding from Blossom Capital in November 2023, with Yahoo retaining a minority stake and board seat.

Headquarters

Trondheim, Norway

Founded

2023

Pricing

Open Source

EU Data Hosting

Yes

Employees

11-50

Open Source

Yes

vector-databasesearch-engineopen-sourcehybrid-searchtensorrag

Ratings

Ease of Use6.5

Feature Depth9.5

Value for Money8.5

EU Compliance9.0

Support Quality7.5

Integration Ecosystem8.0

Features

Core Features

✓Dense vector search (HNSW + IVF indexes)
✓BM25 keyword and full-text search
✓Sparse vector and ColBERT-style multi-vector retrieval
✓Tensor-native query model
✓Learned ranking with TensorFlow, ONNX, and XGBoost models
✓Structured filtering and grouping
✓Horizontal partitioning and replication
✓Real-time indexing and updates
✓Application packages for declarative schema management
✓Query profiles and ranking expressions

Standout Features

★Tensor-native architecture handles multi-vector and structured workloads that pure vector DBs require separate systems for
★Learned ranking executes ML models in the serving path, enabling production-grade hybrid scoring without external orchestration
★Battle-tested at internet scale — Yahoo's production search workload alone exercised Vespa for over 20 years before the open-source spinout
★Application package model lets teams version control schemas, ranking expressions, and query profiles like code, with deploys that are atomic per application

Compliance

☖GDPR compliant (VESPA.AI AS is a Norwegian AS under EEA jurisdiction)
☖EU data residency available on Vespa Cloud
☖ISO 27001 certified (Vespa Cloud)
☖Apache 2.0 licence permits full self-hosting in any EU region or on-prem
☖Data Processing Agreement available for Vespa Cloud customers

Pricing

14-day free trial available

Open Source

Free

Apache 2.0 licensed
Self-host on any infrastructure
Full feature set including learned ranking
Community support via GitHub and Slack
No restrictions on scale, queries, or vectors

Vespa Cloud Trial

Free

Free trial environment
All managed features enabled
EU or US region
Community support

Vespa Cloud Production

Contact Sales

Managed clusters on AWS or GCP
EU data residency option
Autoscaling and managed upgrades
99.9% uptime SLA
Pricing based on compute, memory, and storage

Enterprise

Contact Sales

Dedicated infrastructure
Custom SLA
Solutions engineering
Compliance documentation
Security review

Billing: monthly, annual

Integrations & API

LangChainLlamaIndexHaystackTensorFlowONNXXGBoostHugging FaceKubernetesPyvespa client

API AvailableWebhook Support

Support

Community-forumGithubDocumentationSlackEnterprise-supportDocs: ExcellentCommunity Forum

Pros

✓Apache 2.0 open-source under VESPA.AI AS (Trondheim, Norway, org.nr. 931605569) — independent Norwegian operating entity, not a Delaware C-Corp with a European office
✓Tensor-native architecture goes beyond dense vectors: a single query can combine BM25 keyword scoring, dense vector similarity, sparse vectors, structured filters, and learned ranking models in one operation
✓Production track record at Yahoo, LinkedIn, and Spotify scale — billions of documents served at sub-100ms latencies, with 20+ years of engineering investment behind the codebase
✓Vespa Cloud is a managed offering for teams that do not want to operate the cluster themselves, but the open-source build runs the identical engine
✓Hybrid search and learned ranking are first-class capabilities, not add-ons — particularly relevant for RAG pipelines that need to combine semantic recall with lexical precision

Cons

✕Substantially steeper learning curve than Weaviate or Qdrant — Vespa is a full search platform, and the application package, schemas, and ranking expression model take real effort to learn
✕Documentation is comprehensive but written for engineers who already understand distributed search systems, not for AI engineers approaching from the LangChain side
✕Memory footprint and operational complexity at scale require genuine SRE capability — this is production-grade infrastructure, not a serverless KV store
✕Vespa Cloud pricing is custom (contact sales) with no published tier table, which makes early-stage cost modelling harder than Pinecone or Weaviate's published plans

Frequently Asked Questions

Yes. Vespa is released under the Apache 2.0 licence with the full engine, admin tooling, SDKs, and ranking framework available on GitHub at github.com/vespa-engine/vespa. The open-source build is the same engine that powers Vespa Cloud and the Yahoo production deployment. There is no proprietary feature gating between OSS and managed.

VESPA.AI AS is a Norwegian AS (org.nr. 931605569) registered in Trondheim, Norway. The company spun out of Yahoo in October 2023 and raised USD 31 million in Series A funding led by Blossom Capital in November 2023. Yahoo retains a minority stake and a board seat. The operating entity is Norwegian — the Series A was advised by DLA Piper's Norway office, consistent with a Norwegian receiving entity.

Weaviate and Qdrant are pure vector databases with strong RAG ergonomics. Vespa is a full tensor-native search engine that includes vector search as one capability — it also handles BM25 keyword search, multi-vector retrieval, structured filters, and learned ranking in a single query. Vespa has more capability and a steeper learning curve. For teams building pure vector search RAG pipelines, Weaviate or Qdrant are simpler; for teams building search and ranking systems at scale, Vespa is more capable.

Yes. VESPA.AI AS is a Norwegian AS under EEA data protection law, which mirrors GDPR. Vespa Cloud offers EU data residency. Self-hosted deployments give complete control over data location. Vespa Cloud holds ISO 27001 certification and a Data Processing Agreement is available for managed customers.

Yes. Vespa was the engine behind Yahoo's production search for years before the spinout, and is currently used at LinkedIn, Spotify, and Wayfair scale. The hybrid retrieval model — combining dense vector search, sparse vectors, BM25, structured filters, and learned ranking in a single query — is particularly well suited to RAG pipelines where pure semantic search returns insufficient precision.

What Is Vespa.ai?

The technology now branded Vespa.ai started in 2001 as the serving engine inside FAST Search & Transfer, a Norwegian search company founded out of NTNU in Trondheim. Microsoft acquired FAST in 2008. Yahoo separately acquired the AllTheWeb assets and built the engine into Yahoo's production serving infrastructure, where it ran search and advertising workloads at internet scale for over fifteen years. Generations of Yahoo engineers extended and hardened the codebase before the project was open-sourced under Apache 2.0 in 2017. In October 2023, Yahoo spun the team out as an independent company.

The legal entity is VESPA.AI AS, Norwegian organisation number 931605569, registered in Trondheim. The Series A in November 2023 raised USD 31 million from Blossom Capital, advised by DLA Piper's Norway office — a structural detail consistent with the Norwegian AS being the receiving entity rather than a Delaware holding company. Yahoo retains a minority stake and a board seat, but the company is independently operated. CEO Jon Bratseth was the original architect of Vespa at Yahoo and now leads the spin-out.

What Vespa actually is matters more than the corporate history. It is a tensor-native search engine that combines dense vector search, BM25 keyword scoring, sparse vector retrieval, structured filtering, and learned ranking models in a single query operation. Pure vector databases like Weaviate and Qdrant focus on vector similarity as the primary operation; Vespa treats vector search as one input into a richer ranking computation that can include any combination of signals.

Key Features

Tensor-Native Query Model

Vespa's defining technical decision is the tensor as the first-class data type. Documents are not just objects with vector fields — they are tensors that the engine can compute on directly. Queries combine tensor operations, BM25 scoring, structured filters, and learned ranking expressions into a single evaluation. The practical consequence is that hybrid retrieval, multi-vector models like ColBERT, and learned ranking can all be expressed natively without external orchestration.

For a RAG application, this means a single Vespa query can fetch documents matching a dense embedding similarity, filter by structured metadata (date range, user permissions, language), boost by BM25 keyword overlap, and rerank with an ONNX or XGBoost model — all in one call. Doing the same in a pure vector database typically requires two or three external services stitched together.

Hybrid Retrieval That Actually Works in Production

The combination of BM25 and dense vector search is increasingly recognised as the right default for RAG retrieval. Pure semantic search misses exact-match cases (product SKUs, function names, unusual terminology); pure keyword search misses paraphrases. Vespa's hybrid model handles both natively and lets engineers tune the weighting per query type. The reciprocal rank fusion and weighted combination methods are documented patterns, not workarounds.

For European search teams replacing legacy Elasticsearch deployments with modern AI-augmented retrieval, Vespa offers a credible upgrade path: the engine handles the legacy keyword search alongside new dense retrieval in the same cluster.

Learned Ranking in the Serving Path

Vespa runs ML ranking models — TensorFlow, ONNX, XGBoost — directly inside the serving path. A query can compute scores from a learned model using document features and query features as inputs, then return the top-K documents ranked by that model's output. This is the architecture used at LinkedIn and Spotify for personalised search and recommendations.

For teams whose ranking is genuinely ML-driven rather than rules-based, executing the model in the serving path eliminates a network hop and reduces latency meaningfully. The application package model lets teams version the ranking expression alongside the schema and deploy atomically.

Production Scale at Sub-100ms Latencies

The Yahoo deployment exercised Vespa at billions of documents and tens of thousands of queries per second for over a decade. The engine is engineered for horizontal scale: partition by document ID across content nodes, replicate for availability, scale stateless query nodes for throughput. Sub-100ms p99 latency at very large document collections is documented in case studies from Wayfair, Spotify, and Vinted.

For teams building search infrastructure that has to handle European traffic loads on EU infrastructure, this proven scale is meaningful — it is one of the few open-source search engines with production references at this magnitude.

Application Packages for Schema as Code

Vespa schemas, ranking expressions, query profiles, and component configurations are declared in an application package that the cluster deploys atomically. This is closer to a Kubernetes manifest model than to a typical database admin UI. The trade-off is real: it takes time to learn, and it does not give you a drag-and-drop schema editor. The upside is that everything is version controlled, reviewable, and reproducible across environments.

Pricing

The open-source build under Apache 2.0 is free at any scale. Teams running their own infrastructure can deploy Vespa on Kubernetes, EC2, or bare metal with no licensing cost. For organisations that already operate at search-engine scale, this is the dominant economic choice.

Vespa Cloud is the managed offering. Pricing is custom and based on the compute, memory, and storage provisioned for the customer's cluster. There is no published per-vector or per-query pricing table — the model is closer to a managed Kubernetes service than to a serverless database. A free trial environment is available without contract negotiation.

For teams that want predictable per-vector pricing at small scale, Pinecone or Weaviate Cloud are easier to model financially. For teams running at scale where the underlying compute matters more than marketing pricing tiers, Vespa Cloud's model can be more economical and is generally more transparent under scrutiny.

EU Compliance & Privacy

VESPA.AI AS is registered in Trondheim, Norway, which places the company under EEA jurisdiction. EEA data protection law mirrors GDPR, and Norwegian companies are not subject to US disclosure regimes such as the Cloud Act. The independence from a US parent company is structural rather than contractual.

Vespa Cloud offers EU data residency and holds ISO 27001 certification. A Data Processing Agreement is available for managed customers. Self-hosted deployments under Apache 2.0 give complete control over data location — common deployment targets include OVHcloud, Hetzner, on-prem Kubernetes, and EU regions of the hyperscalers.

For European search and AI infrastructure teams whose compliance requirements include both technical capability and corporate domicile, the combination of Apache 2.0 licensing, Norwegian AS structure, and EU-native deployment options is unusually well aligned.

Who It's Best For

If you are replacing a legacy Elasticsearch deployment with modern hybrid retrieval, Vespa's combination of BM25, dense vectors, and learned ranking in a single engine is one of the cleanest upgrade paths available.

If you are building search infrastructure at billions-of-documents scale and need sub-100ms serving latency, Vespa's production track record at Yahoo, LinkedIn, and Spotify scale is unmatched among open-source options.

If you are building a straightforward RAG pipeline with a few hundred thousand vectors and want the simplest possible setup, Weaviate or Qdrant have better ergonomics for that use case. Vespa is technically capable but operationally heavier than the situation requires.

If your ranking is genuinely model-driven rather than rules-based, running TensorFlow or ONNX inference inside Vespa's serving path eliminates orchestration complexity that is otherwise a recurring source of latency and bugs.

The Verdict

The trade-off is complexity. Vespa is not a serverless vector store you stand up in an afternoon. The application package model, ranking expression language, and operational footprint require real engineering investment to use well. For teams whose search problem justifies that investment, the return is substantial. For teams whose problem fits a simpler tool, the simpler tool is the right answer.

The Norwegian AS structure, Apache 2.0 licensing, and EU data residency options make Vespa one of the strongest European-headquartered infrastructure components available for AI and search workloads.

Frequently Asked Questions

Is Vespa.ai really open source?

Yes. Vespa is released under the Apache 2.0 licence with the full engine, admin tooling, SDKs, and ranking framework on GitHub at github.com/vespa-engine/vespa. The open-source build is the same engine that powers Vespa Cloud and the historical Yahoo production deployment. There is no proprietary feature gating.

Where is Vespa.ai based and who owns it?

VESPA.AI AS is a Norwegian AS (org.nr. 931605569) registered in Trondheim. The company spun out of Yahoo in October 2023 and raised USD 31 million in Series A funding from Blossom Capital in November 2023. Yahoo retains a minority stake and board seat; the operating entity is Norwegian. The Series A was advised by DLA Piper Norway, consistent with a Norwegian receiving entity.

How does Vespa compare to Weaviate or Qdrant?

Weaviate and Qdrant are pure vector databases with strong RAG ergonomics. Vespa is a full tensor-native search engine that includes vector search as one capability — it also handles BM25, multi-vector retrieval, structured filters, and learned ranking in a single query. Vespa has more capability and a steeper learning curve. For pure vector search RAG, Weaviate or Qdrant are simpler; for search and ranking systems at scale, Vespa is more capable.

Is Vespa.ai GDPR compliant?

Yes. VESPA.AI AS is a Norwegian AS under EEA data protection law, which mirrors GDPR. Vespa Cloud offers EU data residency and holds ISO 27001 certification. Self-hosted deployments give complete control over data location. A Data Processing Agreement is available for Vespa Cloud customers.

Can Vespa.ai handle production RAG workloads?

Yes. Vespa was Yahoo's production search engine before the spinout and is currently used at LinkedIn, Spotify, Wayfair, and Vinted scale. The hybrid retrieval model — combining dense vector search, sparse vectors, BM25, structured filters, and learned ranking in a single query — is particularly well suited to RAG pipelines where pure semantic search returns insufficient precision.

Vespa.ai is an EU alternative to

Pinecone Elasticsearch Milvus

Related Products

Meilisearch🇫🇷

Lightning-fast open-source search engine for apps and websites

EU-BuiltFreemiumOpen SourceEU DataReviewed

Alternative to Algolia

Visit Website

Qdrant🇩🇪

High-performance open-source vector database built in Rust

EU-BuiltFreemiumOpen SourceEU DataReviewed

Alternative to Pinecone, Weaviate

Visit Website

Weaviate🇳🇱

Amsterdam-built open-source vector database with hybrid search and generative AI modules

EU-BuiltOpen SourceEU DataReviewed

Alternative to Pinecone, Milvus, Chroma

Visit Website

What Is Vespa.ai?

Key Features

Tensor-Native Query Model

Hybrid Retrieval That Actually Works in Production

Learned Ranking in the Serving Path

Production Scale at Sub-100ms Latencies

Application Packages for Schema as Code

Pricing

EU Compliance & Privacy

Who It's Best For

The Verdict

The Norwegian AS structure, Apache 2.0 licensing, and EU data residency options make Vespa one of the strongest European-headquartered infrastructure components available for AI and search workloads.

Frequently Asked Questions

Is Vespa.ai really open source?

Where is Vespa.ai based and who owns it?

How does Vespa compare to Weaviate or Qdrant?

Weaviate and Qdrant are pure vector databases with strong RAG ergonomics. Vespa is a full tensor-native search engine that includes vector search as one capability — it also handles BM25, multi-vector retrieval, structured filters, and learned ranking in a single query. Vespa has more capability and a steeper learning curve. For pure vector search RAG, Weaviate or Qdrant are simpler; for search and ranking systems at scale, Vespa is more capable.