Back to all features

Word-Level Timestamps

Precise start and end times for every word—ideal for subtitles and video editing.

Pro plans and above include word-level timestamps in JSON exports. Each word has exact start and end times, enabling frame-accurate subtitles, precise video editing, and advanced applications like keyword highlighting or searchable video. Use them for SRT/VTT generation, caption sync, or custom tools that need timing at the word level.

Key Benefits

  • Exact start and end time for every word in JSON export
  • Frame-accurate subtitles and caption sync
  • Precise video editing and clip extraction
  • Support for advanced apps and searchable video

Use Cases

  • Professional subtitle creation
  • Video editing with word-level cuts
  • Searchable video and keyword highlights
  • Accessibility captions with precise timing

Technical Details

Included in JSON export on Pro, Studio, Agency, and Enterprise. Not available on Free tier. Compatible with standard subtitle and editing tools.

Available Plans

Monthly
Annual

Pro

$25/month

For creators & power users

  • STT: 1,000 minutes
  • TTS: 25,000 characters
  • Summaries & exports
  • Fast processing queue
  • Premium voices ≤ 10%
Most Popular

Studio

$79/month

For serious creators & small teams

  • STT: 3,000 minutes
  • TTS: 90,000 characters
  • Up to 3 users
  • Batch uploads
  • Priority queue
  • Premium voices ≤ 25%
  • Export formats (SRT, DOCX, JSON)

Agency

$149/month

For teams, SMBs & API users

  • STT: 5,000 minutes
  • TTS: 150,000 characters
  • Team workspaces (up to 10 users)
  • API access (rate-limited)
  • Premium voices ≤ 35%
  • Usage analytics

Enterprise

Custom(Starting at $300/month)

Custom ($300+ / month)

  • Custom volumes
  • BYO provider keys
  • SLAs & compliance
  • Dedicated routing & support