Search
POST /v1/search
Primary search endpoint that provides advanced search capabilities across all document types stored in Smart Buckets. Supports recursive object search within objects, enabling nested content search like embedded images, text content, and personally identifiable information (PII).
The system supports complex queries like:
- ‘Show me documents containing credit card numbers or social security numbers’
- ‘Find images of landscapes taken during sunset’
- ‘Get documents mentioning revenue forecasts from Q4 2023’
- ‘Find me all PDF documents that contain pictures of a cat’
- ‘Find me all audio files that contain infomration about the weather in SF in 2024’
Key capabilities:
- Natural language query understanding
- Content-based search across text, images, and audio
- Automatic PII detection
- Multi-modal search (text, images, audio)
Authorizations
Request Body required
object
Natural language search query that can include complex criteria
Client-provided search session identifier. Required for pagination and result tracking. We recommend using a UUID or ULID for this value.
Optional list of specific bucket IDs to search in. If not provided, searches the latest version of all buckets
Responses
200
Initial set of 15 search results with pagination information
object
Matched results with metadata
object
Unique identifier for this text segment
chunk_123abc
The actual content of the result
This is a sample text chunk
Source document information in JSON format
{"bucket": "my-bucket", "path": "doc.pdf", "type": "application/pdf"}
Parent document identifier
doc_456def
Relevance score (0.0 to 1.0)
0.95
Content MIME type
text/plain
object
Total number of available results
100
Current page number (1-based)
1
Results per page
10
Total available pages
10
Indicates more results available
true
400
Invalid request parameters
object
Error code
INVALID_ARGUMENT
Error message
Missing required parameter: input