Turn audio into stunning visual productions

Multi-track timeline editor, AI transcription, voice synthesis, and 4K export, all running directly in your browser. No desktop software to install, no subscriptions.

★★★★★ Loved by creators · One-time $4.99 Pro

See it in action

Watch the full demo reel — from audio import to 4K export in under 3 minutes.

Everything you need to create pro-quality audio videos

From real-time visualizers to AI-powered voice tools — Spectrum Studio Pro has it all, right inside Chrome.

Feature 01

Beautiful audio-reactive waveforms

Drop in your audio and watch it come alive. Spectrum Studio renders stunning real-time visualizations synced perfectly to every beat, bass hit, and frequency.

  • 25+ visualizer types — Cosmic Web, Quantum Strings, Orbital Rings, Smooth Wave, Neon Bars and more
  • 15+ curated color palettes including Cyberpunk, Aurora, Chill Wave, Void Space
  • Custom palette builder — design your own color scheme
  • Live Tab capture — visualize Spotify, YouTube, or any web audio in real time
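For the curious, here is a minimal sketch of how a bar-style audio-reactive visualizer can be driven. It is not Spectrum Studio's actual code. It assumes the frequency data arrives as byte magnitudes (0 to 255), which is the format the Web Audio API's AnalyserNode.getByteFrequencyData fills in. The helper name binsToBars is hypothetical.

```typescript
// Hypothetical helper: reduce raw frequency bins to a fixed number of bar heights.
// `bins` holds byte magnitudes (0-255), e.g. as filled by AnalyserNode.getByteFrequencyData.
function binsToBars(bins: Uint8Array, barCount: number): number[] {
  if (barCount <= 0 || bins.length === 0) return [];
  const bars: number[] = [];
  const binsPerBar = bins.length / barCount;
  for (let i = 0; i < barCount; i++) {
    const start = Math.floor(i * binsPerBar);
    // Always cover at least one bin, even when there are more bars than bins.
    const end = Math.max(start + 1, Math.floor((i + 1) * binsPerBar));
    let sum = 0;
    for (let j = start; j < end; j++) sum += bins[j];
    // Average the group, then normalise to the range 0..1.
    bars.push(sum / (end - start) / 255);
  }
  return bars;
}
```

Calling binsToBars once per animation frame and scaling each value by the canvas height would give the bar heights to draw. Averaging neighbouring bins is only one possible choice; taking the peak per group tends to look punchier on bass hits.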
Feature 02

Rich animated cards with AI voice

Add beautiful overlay cards to your timeline with titles, descriptions, and avatars. Then bring them to life with one-click AI voice narration using ElevenLabs.

  • Liquid Glass cards — real-time WebGL refraction shader (Apple-style frosted glass)
  • Templates: Glass Mail, Notification, Article, Call, Info and more
  • Text-to-Speech: generate voice-over from card text in one click (Pro)
  • Entrance/exit animations — slide, fade, bounce, and more
Feature 03

Avatars & callout bubbles

Place character avatars and speech-bubble callouts anywhere on the canvas. Perfect for explainer videos, IVR simulations, and dialogue-driven content.

  • Upload any image as an avatar — circular frame with glow and gradient border
  • Callouts support custom text, positioning, and entrance animations
  • Text-to-Voice on callouts: generate speech audio from callout text (Pro)
  • Multiple avatar lanes for multi-character scenes
Feature 04

Cinematic zoom & pan keyframes

Add smooth, eased camera zoom movements to any moment in your timeline. Guide the viewer's eye with precision — no video editing experience required.

  • Set start/end zoom level and canvas position per block
  • Expo (fast snap) or Quad (smooth) easing curves
  • Continuous drift zoom — keep zooming after the initial animation
  • Match Previous — one click connects blocks for seamless camera motion
  • Export with Zoom tracks (Pro)
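To illustrate the difference between the two easing options, here is a small sketch using the standard textbook formulas for an exponential ease-out and a quadratic ease-in-out. This is an assumption about what "Expo" and "Quad" mean, not the extension's own code, and the names easeOutExpo, easeInOutQuad, and zoomAt are hypothetical.

```typescript
// Standard easing formulas; t is normalised progress in the range [0, 1].
function easeOutExpo(t: number): number {
  // Covers most of the distance almost immediately, then settles: a "fast snap".
  return t >= 1 ? 1 : 1 - Math.pow(2, -10 * t);
}

function easeInOutQuad(t: number): number {
  // Accelerates gently, then decelerates gently: a "smooth" move.
  return t < 0.5 ? 2 * t * t : 1 - Math.pow(-2 * t + 2, 2) / 2;
}

type Easing = "expo" | "quad";

// Interpolate the zoom level of one block at a given progress through the block.
function zoomAt(startZoom: number, endZoom: number, progress: number, easing: Easing): number {
  const t = Math.min(1, Math.max(0, progress)); // clamp to [0, 1]
  const eased = easing === "expo" ? easeOutExpo(t) : easeInOutQuad(t);
  return startZoom + (endZoom - startZoom) * eased;
}
```

Halfway through a 1x to 2x zoom, the quadratic curve sits at exactly 1.5x, while the exponential curve is already past 1.9x. That gap is what makes one feel smooth and the other feel snappy.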
Feature 05

Full 3D perspective camera control

Tilt, rotate, and warp the canvas in 3D space — like a cinematic camera operator on a rig. Animate pitch, depth of field, and focus point over time.

  • Pitch X/Y — tilt the canvas forward, back, left, right in true 3D
  • Depth of Field — blur backgrounds for a macro lens effect
  • Focus X/Y — pin a specific area in sharp focus while the rest blurs
  • Skew, rotation, and position animation between keyframes
  • Export with 3D Perspective tracks (Pro)
Feature 06

Dynamic animated backgrounds

Set the mood with stunning animated backgrounds that pulse and flow behind all your content layers. From minimal gradients to full particle systems.

  • Solid colors, linear/radial gradients, and uploaded images
  • 8 animated Vanta.js backgrounds — Waves, Fog, Birds, Net, Halo, Clouds, Cells, Dots
  • Multiple background lanes — layer and blend different backgrounds over time
  • Entrance/exit animations on background transitions
Feature 07

AI auto-generate your entire timeline

Import audio and let the AI build your entire production for you. Cards, callouts, waveforms, and avatars — all created automatically from your audio in seconds.

  • Transcribes audio with Whisper AI — 100% free, no API key needed
  • Auto-generates Cards with titles and transcribed descriptions
  • Auto-generates Callouts synced to speech segments
  • Auto-generates audio-reactive Spectrum visualizer lanes
  • Toggle each output type on/off before generating
Feature 08

Offline speech-to-text with Whisper AI

Transcribe any audio entirely inside your browser using open-source Whisper models: no API key, no cost. The model files are downloaded from a CDN, but transcription itself runs locally, so your audio never leaves your device.

  • Whisper Tiny (fast), Base, and multilingual models available
  • Runs 100% locally via WebAssembly — completely private
  • Automatically syncs transcribed text to timeline timestamps
  • Supports English and 90+ languages with multilingual models
  • Free: no Pro license required
Feature 09

Voice replacement & AI voice changer

Replace or transform voices in your audio using ElevenLabs. Keep the timing and emotion — swap the voice entirely. Perfect for dubbing, character voices, and demos.

  • ElevenLabs Voice Changer: transform voice while preserving emotion and inflection (Pro)
  • ElevenLabs Full Voice Replace: replace audio with a different AI voice (Pro)
  • Works on any media lane audio in the timeline
Feature 10

Natural text-to-speech narration

Type text on any card or callout and instantly generate a natural-sounding voice-over. Choose from hundreds of AI voices via ElevenLabs.

  • ElevenLabs TTS: ultra-realistic AI voices with emotion control (Pro)
  • Generated audio attaches directly to the card's timeline block
  • ElevenLabs credit usage shown live in the API Keys panel

Simple, honest pricing

Start free. Upgrade once. Own it forever.

Free
$0
Forever free. No credit card.
  • All timeline lanes (Media, Waves, Cards, Avatars, Callouts, BG, Spectrum)
  • Export at 720p & 1080p / 24–30fps
  • AI Transcription with Whisper (offline)
  • 25+ audio visualizer types
  • 15+ color palettes
  • Zoom & 3D Perspective (preview & edit)
  • Liquid Glass cards, VFX, animated backgrounds
  • Save & load projects
Most Popular
Pro
$4.99 USD
One-time purchase. Yours forever.
  • Everything in Free
  • Export at 1440p (QHD) & 4K (UHD)
  • Export at 60fps & max refresh rate
  • HDR10 export (HEVC / BT.2020 PQ)
  • Export with Zoom & 3D Perspective tracks
  • ElevenLabs Voice Changer & TTS
  • ElevenLabs Full Voice Replace

Frequently asked questions

Does it work on Mac, Windows, and Linux?

Yes. Spectrum Studio Pro is a Chrome extension and runs on any operating system that supports Google Chrome.

Is my audio uploaded anywhere?

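No. Audio processing, AI transcription (Whisper), and rendering all happen within your browser, and your files are never uploaded for them. The one exception is the optional ElevenLabs voice features, which contact api.elevenlabs.io only when you explicitly trigger them.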

What's the difference between Free and Pro?

Free gives you the full editing experience including AI transcription, all visual effects, and 1080p export. Pro unlocks 4K/60fps/HDR10 export, Zoom & 3D Perspective export, and ElevenLabs voice features.

Do I need an ElevenLabs account?

Only if you want to use the AI voice features. You supply your own API key — we never store it on any server. AI transcription with Whisper is fully free and requires no third-party account.

Is the Pro license a subscription?

No — it's a one-time purchase of $4.99 USD. Pay once, use forever. No recurring charges.

What export formats are supported?

Default export is H.264 MP4. With HDR10 enabled (Pro), it exports HEVC (H.265) MP4 with BT.2020/PQ for HDR displays. Multiple aspect ratios supported including 16:9, 9:16, 1:1, and ultrawide.
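As a worked example of how a resolution tier and an aspect ratio combine into pixel dimensions, here is a small sketch. It rests on two labeled assumptions that the listing does not confirm: that a tier such as 1080p refers to the shorter side of the frame, and that dimensions are rounded to even numbers, which video encoders commonly expect. The names toEven and exportSize are hypothetical.

```typescript
// Round to the nearest even integer (assumption: the encoder wants even dimensions).
function toEven(n: number): number {
  return Math.round(n / 2) * 2;
}

// Assumption: the tier (720, 1080, 1440, 2160) is the length of the SHORT side.
function exportSize(
  shortSide: number,
  ratioW: number,
  ratioH: number
): { width: number; height: number } {
  if (ratioW >= ratioH) {
    // Landscape or square: height is the short side.
    return { width: toEven((shortSide * ratioW) / ratioH), height: toEven(shortSide) };
  }
  // Portrait: width is the short side.
  return { width: toEven(shortSide), height: toEven((shortSide * ratioH) / ratioW) };
}
```

Under these assumptions, exportSize(1080, 16, 9) gives 1920 x 1080, exportSize(1080, 9, 16) gives 1080 x 1920, and exportSize(2160, 16, 9) gives 3840 x 2160.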

Privacy Policy

Your data stays with you. Here's exactly how Spectrum Studio Pro handles your information.

Last updated: May 21, 2026

Overview

Spectrum Studio Pro ("the Extension") is a browser-based audio visualizer, multi-track timeline editor, and video production tool. We are committed to protecting your privacy.

Data We Do Not Collect

Spectrum Studio Pro does not collect, transmit, or share any personal information.

  • No tracking, analytics, or telemetry
  • Audio, video, and project files are never uploaded
  • No login, account creation, or sign-up required
  • No advertising services of any kind

Local Storage

The Extension uses chrome.storage and localStorage solely to save your project settings, preferences, and API keys locally on your device. This data never leaves your browser.
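For readers who want to see what local-only storage looks like in practice, here is a minimal sketch. It is not the Extension's source code. The StorageArea interface below is a hand-written subset that the promise form of chrome.storage.local satisfies; the key name, the Preferences shape, and the in-memory stand-in are all hypothetical.

```typescript
// Hand-written subset of the storage interface; chrome.storage.local fits this shape.
interface StorageArea {
  get(key: string): Promise<Record<string, unknown>>;
  set(items: Record<string, unknown>): Promise<void>;
}

// Hypothetical preferences shape and defaults, for illustration only.
interface Preferences {
  palette: string;
  exportHeight: number;
}
const DEFAULTS: Preferences = { palette: "Aurora", exportHeight: 1080 };
const KEY = "preferences"; // hypothetical storage key

async function savePreferences(area: StorageArea, prefs: Preferences): Promise<void> {
  await area.set({ [KEY]: prefs });
}

async function loadPreferences(area: StorageArea): Promise<Preferences> {
  const stored = (await area.get(KEY))[KEY] as Partial<Preferences> | undefined;
  // Fall back to defaults for anything that was never saved.
  return { ...DEFAULTS, ...stored };
}

// In-memory stand-in so the sketch also runs outside an extension.
function memoryArea(): StorageArea {
  const data: Record<string, unknown> = {};
  return {
    async get(key) {
      return key in data ? { [key]: data[key] } : {};
    },
    async set(items) {
      Object.assign(data, items);
    },
  };
}
```

Inside an extension, passing chrome.storage.local as the area would keep the values in the browser profile; nothing in this code path makes a network request.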

Audio & Media Files

Audio and video files you import are processed entirely within your browser using the Web Audio API and WebCodecs API. Your media files are never uploaded to any external server.

AI Transcription

The optional AI transcription feature uses the open-source Whisper model loaded from jsDelivr CDN (cdn.jsdelivr.net). The model runs entirely in your browser — your audio is never sent to an external server.

Pro License Verification

If you activate a Pro license, your license key is sent to Polar.sh (api.polar.sh) to verify validity. No other personal data is transmitted. See the Polar.sh privacy policy for details.

Optional Third-Party API Integrations

These integrations are entirely optional, require your own API keys, and are only triggered by explicit user action:

Service      | Domain               | Purpose
ElevenLabs   | api.elevenlabs.io    | AI text-to-speech and voice replacement (Pro)
jsDelivr CDN | cdn.jsdelivr.net     | Loads Whisper AI model for local transcription
Google Fonts | fonts.googleapis.com | UI typography; no personal data transmitted

Permissions

  • storage / unlimitedStorage: save projects, settings, and API keys locally

Changes to This Policy

We may update this Privacy Policy from time to time. Changes will be reflected on this page with an updated date.

Contact

Questions? Contact us via the Chrome Web Store listing.