Skip to main content
Scout Databases are the primary way to store, search, and retrieve information for your AI agents. Every piece of text you add to a Database is automatically embedded and indexed, so agents can find the most relevant content by meaning — not just by matching exact words. Whether you’re building a customer support chatbot, a documentation assistant, or an internal knowledge base, Databases give your agents the data retrieval layer they need.

What Are Databases?

A Database is a container made up of one or more Tables. Each Table holds Documents — structured records that combine metadata fields (like title, category, or a timestamp) with a text body that gets automatically embedded for semantic search.
Database
├── Table 1
│   ├── Document 1 (metadata + text)
│   ├── Document 2 (metadata + text)
│   └── ...
├── Table 2
│   └── ...
└── Sources (sync integrations)
For example, a customer support team might have a Help Center database with two tables: FAQs and Troubleshooting Guides. Each document in FAQs could have a category column, a last_updated timestamp, and a text body containing the answer. Agents can search across both tables at once or scope their query to a single table.

Search Modes

Databases support three distinct search modes. You choose the right one based on the kind of query you expect.
ModeHow It WorksBest For
Semantic (Vector)Converts your query to an embedding and finds documents with similar vectorsNatural language questions, finding related concepts
Keyword (BM25)Matches exact keywords using traditional full-text searchProduct codes, SKUs, technical identifiers
HybridFuses semantic and keyword results using Reciprocal Rank Fusion (RRF)General-purpose production search
Semantic search finds relevant content even when the user’s words don’t appear verbatim in the document. A query for "how do I reset my password" can surface documents about "account recovery" or "login troubleshooting" because the embeddings capture meaning, not just terms. Keyword search uses BM25 to find documents containing exact keyword matches. Use it when precision matters — for example, when users search for a specific error code like ERR_CERT_AUTHORITY_INVALID. Hybrid search combines both approaches using Reciprocal Rank Fusion. Results from the semantic pass and the keyword pass are merged and re-ranked, so you get the precision of keyword matching alongside the conceptual coverage of vector search. For most production applications, hybrid search is the recommended default.

Databases vs. Drive

Scout offers two storage systems. Use this table to pick the right one for your use case.
FeatureDatabases & TablesDrive
PurposeStructured data with vector searchRaw file storage (PDFs, images, docs)
SearchSemantic, keyword, or hybridBy path or filename
Use CaseRAG, knowledge bases, CRM dataAssets, attachments, generated outputs
AI AccessAgents search by meaningAgents read and write files directly
SyncNotion, Google Sheets, web scrapingManual upload or agent writes

When to Use Databases

Choose Databases when your agents need to find information by meaning:
  • Building a chatbot that answers questions from your internal docs (RAG)
  • Creating a searchable knowledge repository for support or onboarding
  • Storing and querying customer or CRM records
  • Surfacing content by concept rather than exact path

When to Use Drive

Choose Drive when your agents need raw file access:
  • Storing PDFs, images, and other binary assets
  • Saving generated reports and workflow outputs
  • Passing files between workflow steps
  • Reading files by exact path without semantic querying

Quick Start

Get up and running with Databases in four steps.
1

Create a Database

  1. Navigate to Databases in the Scout dashboard.
  2. Click + New at the top of the page.
  3. Enter a name and optional description.
  4. Click Create.
Scout provisions and indexes your Database automatically. This takes about 30 seconds and the UI shows the current status while it’s setting up.
2

Create and Configure a Table

Every new Database comes with an Untitled table. Rename it and add columns to match your data:
  1. Click the + button in the table header row.
  2. Enter a column name and select a type: Single Line Text, Multi Line Text, Number, Checkbox, or URL.
  3. Repeat for each field your documents need.
Store your main searchable text in a column named content — Scout automatically chunks and embeds this field for semantic search.
3

Add Documents

Add documents via the REST API or Python SDK.
curl -X POST https://api.scoutos.com/v2/collections/{collection_id}/tables/{table_id}/documents \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "documents": [{
      "id": "doc_1",
      "text": "Your searchable content here...",
      "title": "Document Title",
      "category": "documentation"
    }]
  }'
4

Query Your Data

Search your database from a workflow or directly via the API.In a Workflow, add a Query Database Table block and configure it:
Search Term: "{{inputs.user_question}}"
Minimum Similarity: 0.5
Hybrid Search: true
Limit: 10
Via the API:
curl -X POST https://api.scoutos.com/v2/collections/{collection_id}/tables/{table_id}/query \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "search_term": "customer support",
    "min_similarity": 0.5,
    "limit": 10
  }'

Using Databases with Agents

Give your agents the ability to read from and write to Databases by following these two steps.

1. Enable Databases Tools

Open your agent in Scout, go to the Tools tab, and enable the Databases tools. This grants the agent permission to query tables and create or update documents.

2. Add an Instruction Snippet

Add the following to your agent’s system prompt to guide its retrieval behavior:
When a task depends on organizational knowledge or structured records:

1. Query Databases first.
2. Prefer hybrid search for broad user questions.
3. Use metadata filters when the user specifies a category, date, or status.
4. If information is missing and the user provided new facts, write the new record
   to the correct table.
5. In your reply, clearly distinguish between retrieved data and newly added data.

Prompt Examples

  • “Search our support knowledge base for account recovery steps and summarize the answer.”
  • “Find onboarding docs updated in the last 30 days and return only security-related items.”
  • “Add this meeting note to the customer_feedback table with category enterprise.”

Expected Agent Behavior

When configured correctly, your agent will:
  • Query the correct table before formulating an answer
  • Apply metadata filters when the user’s request includes constraints like category or date
  • Write records only when explicitly asked or when your instructions allow it
  • Cite which data came from Databases in its final response

Next Steps

Creating Databases

Create databases, configure table schemas, and populate data via the UI or API.

Sources

Sync data automatically from Notion, Google Sheets, websites, and more.

Querying Data

Master semantic search, hybrid search, and advanced metadata filtering.