Skip to Content
πŸŽ‰ Scout Docs 2.0 is here!
Collections & TablesOverview

Collections & Tables

Store, organize and search structured data with Scout’s Collections and Tables, your foundation for building RAG applications, knowledge bases and data-driven workflows.

What are Collections?

Collections are groups of Tables, and Tables contain Documents: structured data records that can include both metadata and text content. The text content is automatically indexed for semantic search, enabling powerful AI-driven retrieval.

Collection β”œβ”€β”€ Table 1 β”‚ β”œβ”€β”€ Document 1 (metadata + text) β”‚ β”œβ”€β”€ Document 2 (metadata + text) β”‚ └── ... β”œβ”€β”€ Table 2 β”‚ └── ... └── Sources (sync integrations)

Key Features

  • Vector Search β€” Text content is automatically embedded and indexed for semantic search
  • Hybrid Search β€” Combine vector (semantic) search with keyword (BM25) search
  • Structured Metadata β€” Filter and sort results using typed columns
  • Source Syncs β€” Automatically sync data from Notion, Google Sheets, web scrapes and more
  • Workflow Integration β€” Query and save data directly from your workflows

Collections vs. Drive

FeatureCollections & TablesDrive
PurposeStructured data & vector searchFile storage (PDFs, images)
SearchSemantic/vector searchBy path/name
Use CaseRAG, knowledge bases, CRMsAssets, attachments, media
AI AccessAgents can search semanticallyAgents can read files

When to Use Collections

Use Collections when you need:

  • RAG Applications β€” Build chatbots that answer questions from your data
  • Knowledge Bases β€” Create searchable documentation repositories
  • CRM & User Data β€” Store and query customer information
  • Semantic Search β€” Find content by meaning, not just keywords

When to Use Drive

Use Drive when you need:

  • File Storage β€” Store PDFs, images and documents
  • Asset Management β€” Manage media files for workflows
  • Simple Access β€” Read files by path without complex querying

Use with Agents

1) Enable Tools

In the agent’s Tools tab, enable the tools that let the agent read and write Collections data.

2) Add Instruction Snippet

Add this to your agent instructions:

When a task depends on organizational knowledge or structured records: 1. Query Collections first. 2. Prefer hybrid search for broad user questions. 3. Use metadata filters when the user specifies category, date or status. 4. If information is missing and the user provided new facts, write the new record to the correct table. 5. In your reply, distinguish clearly between retrieved data and newly added data.

3) Prompt Examples

  • β€œSearch our support knowledge base for account recovery steps and summarize the answer.”
  • β€œFind onboarding docs updated in the last 30 days and return only security-related items.”
  • β€œAdd this meeting note to the customer_feedback table with category enterprise.”

4) Expected Behavior

  • The agent queries the correct table before answering
  • The agent applies filters when constraints are present
  • The agent writes records only when asked or when instructions allow it
  • The final response cites what came from Collections data

Quick Start

Create a Collection

  1. Navigate to Collections in the Scout dashboard
  2. Click + New
  3. Enter a name and description
  4. Click Create

Scout provisions the underlying vector database automatically (takes about 30 seconds).

Create a Table

When you create a Collection, it automatically includes an β€œUntitled” table. Customize it:

  1. Click the + button in the table header to add columns
  2. Choose column types (Single Line Text, Multi Line Text, Number, Checkbox, URL)
  3. Add data manually or connect a Source

Add Documents

Via the API:

curl -X POST https://api.scoutos.com/v2/collections/{collection_id}/tables/{table_id}/documents \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "documents": [{ "id": "doc_1", "text": "Your searchable content here...", "title": "Document Title", "category": "documentation" }] }'

Using Python:

from scoutos import Scout client = Scout(api_key="YOUR_API_KEY") client.documents.create( collection_id="col_abc123", table_id="tab_xyz789", documents=[{ "id": "doc_1", "text": "Your searchable content here...", "title": "Document Title" }] )

Query Your Data

In a Workflow:

Use the Query Collection Table block to search your data:

Search Term: "{{inputs.user_question}}" Minimum Similarity: 0.5 Hybrid Search: true Limit: 10

Via the API:

curl -X POST https://api.scoutos.com/v2/collections/{collection_id}/tables/{table_id}/query \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "search_term": "customer support", "min_similarity": 0.5, "limit": 10 }'

Next Steps

  • Creating Collections β€” Learn how to create collections, configure tables and define schemas
  • Sources β€” Set up syncs from web, Notion and Google Sheets
  • Querying Data β€” Master semantic search, hybrid search and advanced filtering

Built with ❀️ by Scout OS

Last updated on