⚠️ Pre-alpha: This platform is in early development. Features may change.

Multimodal vector search and embeddings for developers — raggen.ai

raggen.ai

Build RAG features, not plumbing

Multimodal vector search and Gemini embeddings—upload once, query by meaning, subscribe to live index updates.

Upload images, video, audio, and documents once. Search by embedding similarity across bases—GraphQL queries, subscriptions, and MCP included.

1 million tokens & 100 MB storage included per org

Built for your workflow

GraphQL subscriptions, vector search, and org-scoped storage—primitives you call from code.

Search by what things mean—not by filenames

Run topK similarity search over HNSW-indexed Gemini vectors—filter by embed model in GraphQL, scope to a base client-side.

topK · 768d

query FilesWithinDistance(
  $vector: [Float!!]
  $topK: Int!
  $filter: EmbeddingIndexFilter!
) {
  querySimilarEmbeddingIndexByEmbedding(
    vector: $vector
    topK: $topK
    by: embedding
    filter: $filter
  ) {
    id
    model
    vector_distance
    file { id name }
  }
}

GE2-768

768d · Gemini

GE2-1536

1536d · Gemini

Storage

Org-scoped storage

Files, bases, and membership rules—treat uploads as the source of truth for your pipeline.

Realtime

Live GraphQL subscriptions

Indexes update in real time as processing completes—no polling for file status or embeddings.

Search

Vector search API

Cosine HNSW search over Gemini Embedding 2 vectors—filter by model and base at query time.

Ingest

Multimodal ingest

Images, video, audio, text, and PDF—one upload path, one embedding pipeline.

Platform specs

Verified capabilities in the current schema and processing pipeline.

Embedding models

Vector dimensions

GraphQL subscriptions

Supported media

Production APIs & integrations

GraphQL queries and subscriptions, API keys, and MCP—what you need when a side project goes live.

Data & Storage

  • Org-scoped file storage with access rules
  • Knowledge bases with scoped file membership
  • Dgraph-backed GraphQL models
  • File content extraction via getFileContent

Vector & Embeddings

  • querySimilarEmbeddingIndexByEmbedding
  • Three Gemini models (768 / 1536 / 3072)
  • Multiple vectors per file
  • Live index updates via subscriptions

APIs & Delivery

  • GraphQL + subscriptions
  • API keys and session auth
  • MCP at mcp.raggen.ai
  • GraphiQL explorer and API catalog

MCP in your editor

Point Cursor, Claude Desktop, or any MCP client at your bases—one endpoint, GraphQL-backed tools.

Cursor

Add raggen as an MCP server and pull context into the editor while you code.

Claude Desktop

Query org bases and files from Claude over the MCP transport.

Custom MCP clients

Any MCP-compatible client can call the same GraphQL-backed tool surface.

https://mcp.raggen.aiAPI & MCP docs

Frequently asked questions

Common questions about multimodal vector search, GraphQL, and raggen.ai.

What file types does raggen.ai support?

Images (PNG, JPEG, GIF, WebP), video (MP4, MOV, WebM), audio (MP3, WAV), plain text, and PDF. Each file is embedded with Gemini Embedding 2 after upload.

Which embedding models and dimensions are available?

Three Gemini Embedding 2 models: GEM_2_768 (768d), GEM_2_1536 (1536d), and GEM_2_3072 (3072d). A file can hold multiple vectors—one per model—without overwriting prior indexes.

How does vector search work?

Use the querySimilarEmbeddingIndexByEmbedding GraphQL query with a query vector, topK, and EmbeddingIndexFilter (base and model). Results include cosine distance and linked file metadata.

Does raggen.ai support GraphQL subscriptions?

Yes. File, organization, base, and related types use @withSubscription—subscribe to getOrganization or file lists and receive live updates when indexes change.

Can I use raggen.ai for RAG?

Yes, for file-level retrieval: upload corpus files to bases, run vector search to find relevant files, and use File.content (extracted text) or storage URLs as LLM context. Chunk-level RAG is not in the schema today.

How does pricing work?

raggen.ai includes a free tier: 1 million embedding tokens per org per calendar month (file indexing and vector-search queries) and 100 MB of stored media—no credit card required to start. Beyond the free tier, $0.40 per additional million tokens and $0.05 per GB-month storage. We invoice prior-month overage at the start of each month. Add a payment method to unlock 1536/3072-dimension embed models.

What is the MCP endpoint?

https://mcp.raggen.ai — connect Cursor, Claude Desktop, or custom MCP clients to query org bases and files over GraphQL-backed tools.

Is raggen.ai production-ready?

Pre-alpha: the platform is in early development and features may change. Core flows—upload, embed, search, subscribe—are available for development and evaluation.

Start on the free tier

No credit card required—monthly token and storage allowance included on every org.

No credit card required

Free tier

Included monthly allowance · overage billed only if exceeded

Start on the free tier: 1 million embedding tokens per org each month and 100 MB storage included. Overage at $0.40/1M tokens and $0.05/GB-month only when you exceed the allowance.

Free tier tokens
1 million / mo
Free tier storage
100 MB
Token overage
$0.40 / 1M
Storage overage
$0.05 / GB-mo
  • No credit card to start
  • Monthly token allowance included
  • Usage breakdown in Account
  • One invoice per org per month (overage only)
  • Prepaid balance applied before invoice due

Free tier first; overage only if you exceed the allowance

View free tier details

Ready to ship your RAG project?

Spin up a base, upload sources, and query from your app or IDE—same stack from hackathon to prod.