What is Search?
Search in Cloudglue enables you to find relevant video content across your collections using natural language queries. Whether you’re looking for specific topics discussed in videos or particular moments within video segments, Search provides powerful semantic search capabilities to help you discover exactly what you need. Search works with your Rich Transcript Collections to find content based on speech transcripts, visual descriptions, and scene text. This makes it easy to locate specific information across large video libraries without having to manually review each video.Search Scopes
Cloudglue offers two distinct search scopes, each optimized for different use cases:File-Level Search
File-level search helps you find entire videos that are relevant to your query. This scope is perfect when you want to discover which videos in your collection contain information about a particular topic. Key Features:- Searches across entire video summaries and titles
- Returns whole video files as results
- Ideal for content discovery and organization
- Requires collections with
enable_summary: true
in the transcribe configuration
- “Find all videos about machine learning”
- “Which videos discuss quarterly earnings?”
- “Show me cooking videos featuring Italian cuisine”
Segment-Level Search
Segment-level search pinpoints specific moments within videos where relevant content appears. This scope is ideal when you need to find exact timestamps where particular topics are discussed or events occur. Key Features:- Searches within individual video segments
- Returns specific time ranges with precise timestamps
- Includes relevant speech, visual descriptions, and scene text
- Provides detailed context for each result
- “When did the speaker mention the new product launch?”
- “Find moments where charts or graphs are shown”
- “Locate instances where specific technical terms are discussed”
Search Capabilities
Semantic Understanding
Search uses advanced language models to understand the meaning behind your queries, not just exact keyword matches. This means you can search using natural language and find semantically related content even when different words are used. Examples:- Searching for “revenue growth” will also find content about “income increase” or “profit expansion”
- A query about “customer satisfaction” can discover discussions about “user happiness” or “client feedback”
Multi-Modal Content
Search works across all types of rich transcript data:- Speech Transcripts: Find spoken words and conversations
- Visual Scene Descriptions: Locate visual elements and actions
- Scene Text: Discover text that appears on screen
Advanced Filtering
Enhance your searches with powerful filtering options:- Metadata Filters: Filter by custom metadata fields you’ve added to your files
- Video Properties: Filter by duration, audio presence, file size, and more
- File Attributes: Filter by filename, creation date, or file ID
- Multiple Criteria: Combine multiple filters for precise results
How Search Works
Requirements
To use Search, you need:- Rich Transcript Collections: Search only works with collections created with
collection_type: 'rich-transcripts'
- Processed Content: Videos must be fully processed with transcription complete
- Summary Configuration: For file-level search, collections must have
enable_summary: true
Search Process
- Query Processing: Your natural language query is analyzed to understand intent and context
- Content Matching: The system searches across relevant transcript data using semantic similarity
- Ranking: Results are ranked by relevance score (0-1, with higher scores indicating better matches)
- Filtering: Any specified filters are applied to narrow down results
- Response: Results are returned with detailed context and metadata
Search Results
File Search Results
When performing file-level searches, you receive:- File Information: ID, filename, and collection details
- Generated Content: AI-generated title and summary
- Relevance Score: Confidence score for the match quality
- Metadata: Any custom metadata associated with the file
Segment Search Results
For segment-level searches, results include:- Timing Information: Precise start and end timestamps
- Content Context: Relevant speech, visual descriptions, and scene text
- File Details: Information about the source video
- Relevance Score: Match quality assessment
- Segment ID: Unique identifier for the specific segment
Practical Applications
Content Management
- Video Library Organization: Quickly categorize and tag large video collections
- Duplicate Detection: Find similar content across different videos
- Content Auditing: Locate videos containing specific topics or compliance-related content
Research and Analysis
- Interview Analysis: Find specific topics or quotes across multiple interview videos
- Market Research: Locate customer feedback or product mentions
- Educational Content: Find explanations of particular concepts across training videos
Media Production
- B-Roll Discovery: Find specific visual content for editing projects
- Fact Checking: Locate original sources for claims or statements
- Content Repurposing: Identify segments suitable for creating clips or highlights
Getting Started
To start using Search:- Create a Rich Transcript Collection: Set up a collection with appropriate transcription settings
- Add Videos: Upload or add videos to your collection
- Wait for Processing: Ensure all videos are fully transcribed
- Perform Searches: Use the Search API to find content
For file-level search capabilities, make sure to set
enable_summary: true
when creating your Rich Transcript Collection.Best Practices
Query Optimization
- Be Specific: More detailed queries generally yield better results
- Use Natural Language: Write queries as you would ask a person
- Include Context: Add relevant context to help disambiguate your intent
Collection Setup
- Enable Summaries: Include summary generation for file-level search capabilities
- Rich Transcription: Enable visual descriptions and scene text for comprehensive search coverage
- Consistent Metadata: Use standardized metadata fields for effective filtering
Result Management
- Use Relevance Scores: Higher scores indicate better matches
- Apply Filters: Narrow down results using metadata and file property filters
- Combine Scopes: Use both file and segment search for comprehensive discovery
Limitations
- Collection Type: Only works with Rich Transcript Collections
- Processing Dependency: Videos must be fully processed before they become searchable
- Summary Requirement: File-level search requires summary generation to be enabled
- Language Support: Search quality depends on the language and clarity of video content
Next Steps
To learn more about implementing search in your applications:- Explore the API: Check out the Search API documentation