Collections & Tables
Store, organize and search structured data with Scoutβs Collections and Tables, your foundation for building RAG applications, knowledge bases and data-driven workflows.
What are Collections?
Collections are groups of Tables, and Tables contain Documents: structured data records that can include both metadata and text content. The text content is automatically indexed for semantic search, enabling powerful AI-driven retrieval.
Collection
βββ Table 1
β βββ Document 1 (metadata + text)
β βββ Document 2 (metadata + text)
β βββ ...
βββ Table 2
β βββ ...
βββ Sources (sync integrations)Key Features
- Vector Search β Text content is automatically embedded and indexed for semantic search
- Hybrid Search β Combine vector (semantic) search with keyword (BM25) search
- Structured Metadata β Filter and sort results using typed columns
- Source Syncs β Automatically sync data from Notion, Google Sheets, web scrapes and more
- Workflow Integration β Query and save data directly from your workflows
Collections vs. Drive
| Feature | Collections & Tables | Drive |
|---|---|---|
| Purpose | Structured data & vector search | File storage (PDFs, images) |
| Search | Semantic/vector search | By path/name |
| Use Case | RAG, knowledge bases, CRMs | Assets, attachments, media |
| AI Access | Agents can search semantically | Agents can read files |
When to Use Collections
Use Collections when you need:
- RAG Applications β Build chatbots that answer questions from your data
- Knowledge Bases β Create searchable documentation repositories
- CRM & User Data β Store and query customer information
- Semantic Search β Find content by meaning, not just keywords
When to Use Drive
Use Drive when you need:
- File Storage β Store PDFs, images and documents
- Asset Management β Manage media files for workflows
- Simple Access β Read files by path without complex querying
Use with Agents
1) Enable Tools
In the agentβs Tools tab, enable the tools that let the agent read and write Collections data.
2) Add Instruction Snippet
Add this to your agent instructions:
When a task depends on organizational knowledge or structured records:
1. Query Collections first.
2. Prefer hybrid search for broad user questions.
3. Use metadata filters when the user specifies category, date or status.
4. If information is missing and the user provided new facts, write the new record to the correct table.
5. In your reply, distinguish clearly between retrieved data and newly added data.3) Prompt Examples
- βSearch our support knowledge base for account recovery steps and summarize the answer.β
- βFind onboarding docs updated in the last 30 days and return only security-related items.β
- βAdd this meeting note to the
customer_feedbacktable with categoryenterprise.β
4) Expected Behavior
- The agent queries the correct table before answering
- The agent applies filters when constraints are present
- The agent writes records only when asked or when instructions allow it
- The final response cites what came from Collections data
Quick Start
Create a Collection
- Navigate to Collections in the Scout dashboard
- Click + New
- Enter a name and description
- Click Create
Scout provisions the underlying vector database automatically (takes about 30 seconds).
Create a Table
When you create a Collection, it automatically includes an βUntitledβ table. Customize it:
- Click the + button in the table header to add columns
- Choose column types (Single Line Text, Multi Line Text, Number, Checkbox, URL)
- Add data manually or connect a Source
Add Documents
Via the API:
curl -X POST https://api.scoutos.com/v2/collections/{collection_id}/tables/{table_id}/documents \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"documents": [{
"id": "doc_1",
"text": "Your searchable content here...",
"title": "Document Title",
"category": "documentation"
}]
}'Using Python:
from scoutos import Scout
client = Scout(api_key="YOUR_API_KEY")
client.documents.create(
collection_id="col_abc123",
table_id="tab_xyz789",
documents=[{
"id": "doc_1",
"text": "Your searchable content here...",
"title": "Document Title"
}]
)Query Your Data
In a Workflow:
Use the Query Collection Table block to search your data:
Search Term: "{{inputs.user_question}}"
Minimum Similarity: 0.5
Hybrid Search: true
Limit: 10Via the API:
curl -X POST https://api.scoutos.com/v2/collections/{collection_id}/tables/{table_id}/query \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"search_term": "customer support",
"min_similarity": 0.5,
"limit": 10
}'Next Steps
- Creating Collections β Learn how to create collections, configure tables and define schemas
- Sources β Set up syncs from web, Notion and Google Sheets
- Querying Data β Master semantic search, hybrid search and advanced filtering
Built with β€οΈ by Scout OS