Seedance 2.0 Mini — Fast, Low-Cost AI Video
ByteDance's distilled video model generates cinematic clips with native audio from text, images, and references — ~2× faster than the Fast tier and about half the cost of Seedance 2.0, at up to 720p. Run it in Soku AI and turn it into ad creatives.
AI Video Generation
Seedance 2.0 Mini Studio
Model
Seedance 2.0 Mini (Distilled)
Up to 720p · 24fps · ~2× faster than Fast · ~half the cost · Native audio
Video Prompt
Same multimodal inputs as Seedance 2.0 — text, image, video, and audio references
Reference Inputs
Optional — reference them in your prompt as @Image1, @Video1, @Audio1.
Resolution
Need 1080p/2K? Switch to Seedance 2.0 Standard.
Generation usually takes 2–3 minutes.
Cinematic Landscape
A lone figure on an endless salt flat at dusk — text-to-video with natural depth and atmosphere
Atmospheric Night
A warmly lit cabin in a snowy pine forest — falling snow, ambient glow, native audio
Culinary ASMR
Close-up of sushi rolled by gloved hands — tactile detail with synchronized sound
Made with Seedance 2.0
Cinematic landscapes, product commercials, ASMR close-ups, action scenes: all generated from text, images, and references with native audio. Mini renders these same scenes at up to 720p, at roughly half the cost of standard Seedance 2.0 and about twice the speed of the Fast tier.
Cinematic Landscape
A lone figure on an endless salt flat at dusk — natural depth, haze, and reflection.
Editorial Portrait
A suited figure on a concrete staircase — moody architectural light, smooth dolly.
Surreal Underwater
An octopus guarding a soccer ball on the seabed — fluid motion and stable physics.
Atmospheric Night
A warmly lit cabin in a snowy pine forest — falling snow with native ambient audio.
Culinary ASMR
Sushi rolled by gloved hands in close-up — tactile detail with synchronized sound.
Aerial Scenery
Hot-air balloons drifting over misty rolling hills at dawn — sweeping camera move.
Product Commercial
An animated character interacting with a beverage — ad-style scene with synced audio.
Action Cinematic
A wuxia martial-arts confrontation in the rain — dynamic motion and sound design.
Beauty & Lifestyle
First-person ASMR close-ups with a healing ambiance — soft light, tactile sounds.
Seedance 2.0 Mini at a Glance
Mini is ByteDance's answer to the cost of cinematic AI video. It takes the full Seedance 2.0 multimodal engine — the @-reference workflow, native audio co-generation, and multi-shot storytelling — and distills it into a lighter model that needs far less compute per second. The result is the cheapest, fastest tier in the Seedance family, built for teams that generate video at volume.
Why Seedance 2.0 Mini
Mini trades nothing but resolution headroom and top-end fidelity for a dramatic drop in cost and latency — exactly the trade you want when you're testing creative at scale.
~2× Faster Than Fast
Distillation compresses the full model so it needs less compute per second of output, turning around drafts and variants in roughly half the time of the Fast tier, with quality that reviewers report as comparable or better.
~Half the Cost
Roughly half the price of standard Seedance 2.0 — about ¥0.5 (~$0.07) per second at 720p versus ~¥1 per second for standard. Generate ten variants for the price of five.
Same Multimodal Inputs
Not a stripped-down toy — Mini keeps the entire Seedance 2.0 input stack: up to 9 images, 3 videos, and 3 audio files per generation, with @-reference prompting and native synchronized audio.
Iterate Cheap, Finish Sharp
The pro workflow: explore hooks and angles on Mini, then re-render only the final approved hero shots on standard Seedance 2.0 when you need 1080p or 2K. Spend compute where it counts.
Built for Volume
Short-form social, UGC, e-commerce cutdowns, and ad variants at scale — Mini is tuned for the high-throughput work where you need dozens of clips, not one hero film.
Supersedes the Fast Tier
Mini is faster and cheaper than Seedance 2.0 Fast — and reviewers report stronger motion coherence, character stability, and prompt adherence. For new budget work, it is the tier to reach for.
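The draft-on-Mini, finish-on-standard workflow comes down to simple arithmetic. The sketch below is a rough illustration only, assuming the approximate Volcano Engine rates quoted on this page (about ¥0.5 per second for Mini at 720p and about ¥1 per second for standard) and linear per-second billing. The function names and batch sizes are hypothetical, and real bills depend on current pricing.

```python
# Approximate per-second rates quoted on this page (RMB); subject to change.
MINI_RATE_RMB = 0.5      # Mini at 720p
STANDARD_RATE_RMB = 1.0  # standard Seedance 2.0


def clip_cost(seconds: float, rate_per_second: float) -> float:
    """Cost of one clip, assuming billing is linear in output seconds."""
    return seconds * rate_per_second


def draft_then_finish_cost(drafts: int, finals: int, seconds: float) -> float:
    """Explore `drafts` variants on Mini, then re-render `finals` on standard."""
    return (drafts * clip_cost(seconds, MINI_RATE_RMB)
            + finals * clip_cost(seconds, STANDARD_RATE_RMB))


def all_standard_cost(drafts: int, seconds: float) -> float:
    """Baseline: every exploratory variant rendered on standard."""
    return drafts * clip_cost(seconds, STANDARD_RATE_RMB)


if __name__ == "__main__":
    hybrid = draft_then_finish_cost(drafts=20, finals=2, seconds=10)
    baseline = all_standard_cost(drafts=20, seconds=10)
    print(f"hybrid: ¥{hybrid:.0f}, all-standard: ¥{baseline:.0f}")
    # prints "hybrid: ¥120, all-standard: ¥200"
```

With these assumed numbers, exploring twenty 10-second variants on Mini and re-rendering two winners on standard costs about ¥120, against about ¥200 for rendering all twenty on standard.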
Full Seedance 2.0 Capabilities — at Mini Speed
Mini inherits the input stack and feature set that make Seedance 2.0 a flexible multimodal video model. What you give up is resolution headroom above 720p and some top-end fidelity.
Multimodal Input
Accept up to 9 images, 3 videos, and 3 audio files in a single generation, capped at 12 assets in total across all types. Reference any asset in natural language, for example: "Take @Image1 as the first frame, adopt camera movement from @Video1".
Native Audio Co-Generation
A dedicated audio branch generates synchronized sound effects, background music, and dialogue alongside video — not stitched on after. Toggle it off when you plan to dub voice-over in post.
Phoneme-Level Lip Sync
Phoneme embeddings drive lip articulation across 8+ languages, with prosodic guidance from audio shaping facial movement — enabling natural multilingual dubbing for global campaigns.
Multi-Shot Storytelling
Write [Shot 1] … [Shot 2] … in your prompt and Seedance plans the cuts and camera work natively, holding characters, clothing, and spatial logic consistent across shots.
Motion & Camera Replication
Upload a reference video and Mini adopts its camera work, movement, and effects — then swap characters, extend the clip, or drop in your own product.
Director-Level Controls
Specify professional cinematography: dolly-ins, lateral pans, follow shots, circular tracking. Control lighting, shadows, shot size, and angle while keeping framing consistent.
Physics & Motion
Realistic collisions, fabric dynamics, and fluid motion in action sequences — reviewers report Mini holds motion coherence better than the older Fast tier.
Style Transfer & Editing
Reference-based editing with customizable visual styles — photorealistic, anime, abstract, and more. Extend clips, replace characters, or restyle a scene while preserving motion.
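The @-reference and [Shot N] conventions described above are easy to check before a generation is submitted. The sketch below is a local pre-flight helper, not part of any Seedance or Soku AI API: `ReferenceSet`, `build_prompt`, and `unknown_tags` are hypothetical names. It assumes tags are numbered from 1 in upload order and treats the quoted 12-asset figure as a combined cap across all types.

```python
import re
from dataclasses import dataclass, field

# Limits quoted on this page. The 12-asset figure is treated here as a
# combined cap across all types, which is an assumption.
LIMITS = {"Image": 9, "Video": 3, "Audio": 3}
MAX_TOTAL_ASSETS = 12


@dataclass
class ReferenceSet:
    """Hypothetical container for the files attached to one generation."""
    images: list[str] = field(default_factory=list)
    videos: list[str] = field(default_factory=list)
    audio: list[str] = field(default_factory=list)

    def _groups(self) -> dict[str, list[str]]:
        return {"Image": self.images, "Video": self.videos, "Audio": self.audio}

    def validate(self) -> None:
        """Raise ValueError if a per-type or combined limit is exceeded."""
        groups = self._groups()
        for kind, files in groups.items():
            if len(files) > LIMITS[kind]:
                raise ValueError(
                    f"too many {kind.lower()} files: {len(files)} > {LIMITS[kind]}"
                )
        total = sum(len(files) for files in groups.values())
        if total > MAX_TOTAL_ASSETS:
            raise ValueError(f"too many assets: {total} > {MAX_TOTAL_ASSETS}")

    def tags(self) -> set[str]:
        """Tags a prompt may use, numbered from 1: @Image1, @Video1, ..."""
        return {
            f"@{kind}{i}"
            for kind, files in self._groups().items()
            for i in range(1, len(files) + 1)
        }


def build_prompt(shots: list[str]) -> str:
    """Join shot descriptions with [Shot N] markers."""
    return " ".join(f"[Shot {i}] {text}" for i, text in enumerate(shots, start=1))


def unknown_tags(prompt: str, refs: ReferenceSet) -> set[str]:
    """Return @-tags used in the prompt that have no matching uploaded file."""
    used = set(re.findall(r"@(?:Image|Video|Audio)\d+", prompt))
    return used - refs.tags()


if __name__ == "__main__":
    refs = ReferenceSet(images=["product.png"], videos=["dolly_move.mp4"])
    refs.validate()
    prompt = build_prompt([
        "Take @Image1 as the first frame, adopt camera movement from @Video1.",
        "Cut to a close-up of the product with @Audio1 as the soundtrack.",
    ])
    print(prompt)
    print("missing:", sorted(unknown_tags(prompt, refs)))
```

In this example the second shot mentions `@Audio1` although no audio file was attached, so the helper flags it before any credits are spent.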
Mini vs Fast vs Standard
All three tiers share the same multimodal inputs and native audio. The difference is the speed / cost / fidelity trade. The price row mixes Volcano Engine (RMB) and fal.ai (USD) rates because they come from separate price books, so compare figures within one source rather than across sources.
| Dimension | Mini | Fast | Standard 2.0 |
|---|---|---|---|
| Max Resolution | 480p / 720p | 480p / 720p | 1080p · up to 2K |
| Duration | 4–15s | 4–15s | 4–15s |
| Speed | Fastest (~2× Fast) | Mid | Slowest |
| Relative Quality | Beats Fast; near-Standard on short clips | Lowest of three | Highest fidelity |
| Price / sec @720p | ~¥0.5 (~$0.07) | ~$0.24 (fal) | ~¥1 (~$0.14); ~$0.30 (fal) |
| Native Audio | Yes | Yes | Yes |
| Multimodal Inputs | 9 img + 3 vid + 3 audio | 9 img + 3 vid + 3 audio | 9 img + 3 vid + 3 audio |
| Best For | High-volume drafts, social, ads | Legacy budget work | Final deliverables, hero shots |
Pricing figures are drawn from public Volcano Engine and fal.ai rates and may change; consumer-app credit pricing in Jimeng / Dreamina runs higher than raw API rates.
Built for High-Volume Ad Creative
High-Volume Ad Variants
Generate dozens of 720p video ad variants across hooks, angles, and formats in minutes — at roughly half the cost of standard Seedance.
UGC-Style Content
Talking-head and lifestyle clips with lip-synced dialogue for TikTok and Reels — no talent, no studio, cheap enough to make daily.
E-Commerce Cutdowns
Turn product shots into short, scroll-stopping motion clips for feeds and product pages, generated in bulk per SKU.
Creative Testing at Scale
Iterate on prompts and concepts cheaply on Mini to find winning creative before committing budget to a high-res final render.
Multi-Market Campaigns
One concept localized across 8+ languages with native phoneme-level lip sync — affordable enough to ship every market.
Storyboard to Video
Upload a sequence of reference images and get a coherent multi-shot draft with consistent characters to validate a concept fast.
How Soku AI Helps
Soku AI turns Seedance 2.0 Mini into an end-to-end creative testing pipeline — from low-cost batch video generation to cross-channel performance measurement.
Batch video generation
Generate dozens of video ad variants across aspect ratios, hooks, and visual styles in minutes — Mini's low cost makes large batches economical.
Soku AI builds reusable creative briefs tied to your brand guidelines, so every variant stays on-brand while testing different angles, CTAs, and formats.
Multi-platform adaptation
Automatically produce assets for every placement — 9:16 for Reels/Stories, 1:1 for feeds, 16:9 for YouTube — from a single creative brief.
Mini's aspect-ratio presets plus multi-shot consistency help your product look consistent across every format.
Performance learning loop
Connect video output to real ad performance. Learn which visual styles, camera moves, and hooks drive conversions — then re-render winners on standard 2.0.
Soku AI tracks CTR, CPA, and ROAS by creative variant, feeding insights back into the next Mini generation round.
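The three metrics named above have standard definitions: CTR is clicks divided by impressions, CPA is spend divided by conversions, and ROAS is revenue divided by spend. The sketch below shows one way to rank variants by ROAS and shortlist winners for a standard 2.0 re-render. It is not Soku AI's implementation; the `VariantStats` class, the threshold, and the sample numbers are made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class VariantStats:
    """Hypothetical per-variant ad results; all fields are illustrative."""
    name: str
    impressions: int
    clicks: int
    conversions: int
    spend: float
    revenue: float

    @property
    def ctr(self) -> float:
        """Click-through rate: clicks / impressions."""
        return self.clicks / self.impressions if self.impressions else 0.0

    @property
    def cpa(self) -> float:
        """Cost per acquisition: spend / conversions (inf if none)."""
        return self.spend / self.conversions if self.conversions else float("inf")

    @property
    def roas(self) -> float:
        """Return on ad spend: revenue / spend."""
        return self.revenue / self.spend if self.spend else 0.0


def pick_winners(variants: list[VariantStats], min_roas: float) -> list[str]:
    """Names of variants at or above `min_roas`, best first."""
    ranked = sorted(variants, key=lambda v: v.roas, reverse=True)
    return [v.name for v in ranked if v.roas >= min_roas]


if __name__ == "__main__":
    # Made-up sample data, not real campaign results.
    results = [
        VariantStats("hook_a_9x16", 10_000, 250, 20, 100.0, 320.0),
        VariantStats("hook_b_9x16", 10_000, 180, 8, 100.0, 120.0),
        VariantStats("hook_a_1x1", 8_000, 160, 15, 80.0, 200.0),
    ]
    for v in results:
        print(f"{v.name}: CTR {v.ctr:.2%}, CPA {v.cpa:.2f}, ROAS {v.roas:.2f}")
    print("re-render on standard:", pick_winners(results, min_roas=2.0))
```

Variants that clear the ROAS threshold become the candidates for a higher-resolution final render, while the rest inform the next round of Mini drafts.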
Pricing
Mini is the cheapest Seedance 2.0 tier, at about half the cost of standard 2.0. Note: some English coverage misprinted the rate as "$0.50/second"; the actual figure is ¥0.5 per second (≈ $0.07). The simplest way to use Mini for marketing is through Soku AI, where generation is credit-based.
Volcano Engine API (RMB)
| Mode | Rate | Notes |
|---|---|---|
| Text / Image-to-Video | ~¥0.023 / 1K tokens | ≈ ¥0.5 per second @720p |
| Video-to-Video | ~¥0.014 / 1K tokens | Cheapest mode |
| Standard 2.0 (ref) | ~¥1 / second | For comparison — ~2× Mini |
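The token rate and the per-second figure in the table can be reconciled with a little arithmetic. The sketch below derives the token count per second of 720p output that the two quoted numbers imply, then prices a clip from it. This is an inference from the figures on this page only: actual token counts may vary with resolution, duration, and mode, and the function names are illustrative.

```python
# Rates quoted in the table above (RMB); approximate and subject to change.
TEXT_IMAGE_RATE_RMB_PER_1K = 0.023
QUOTED_COST_PER_SECOND_RMB = 0.5  # text/image-to-video at 720p


def implied_tokens_per_second(cost_per_second: float, rate_per_1k: float) -> float:
    """Tokens per second of output implied by a per-second cost and a token rate."""
    return cost_per_second / rate_per_1k * 1000


def cost_from_tokens(tokens: float, rate_per_1k: float) -> float:
    """Cost in RMB for a given token count at a per-1K-token rate."""
    return tokens / 1000 * rate_per_1k


if __name__ == "__main__":
    tps = implied_tokens_per_second(
        QUOTED_COST_PER_SECOND_RMB, TEXT_IMAGE_RATE_RMB_PER_1K
    )
    print(f"implied tokens per second at 720p: {tps:,.0f}")

    # Cost of a 10-second clip under the same assumption.
    ten_second_cost = cost_from_tokens(10 * tps, TEXT_IMAGE_RATE_RMB_PER_1K)
    print(f"10 s clip: ¥{ten_second_cost:.2f}")
```

Under these figures a second of 720p output corresponds to roughly 21,700 tokens, so a 10-second text-to-video clip comes to about ¥5. The video-to-video rate is not converted here because the page does not give a per-second figure for that mode.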
Approximate Price per Second by Tier (USD)
| Tier | Price | Use Case |
|---|---|---|
| Mini @720p | ~$0.07 / sec | High-volume drafts & ads |
| Fast (fal) | ~$0.24 / sec | Legacy budget tier |
| Standard (fal) | ~$0.30 / sec | 1080p / 2K finals |
Availability
Seedance 2.0 Mini is available via Volcano Engine and BytePlus APIs and inside the Jimeng and Dreamina/CapCut apps. Consumer-app credit pricing runs higher than raw API rates. Through Soku AI you run it as part of an ad creative workflow — credit-based, with a starter grant to try it.
Make Video Ads at Scale — for Less
Generate Seedance 2.0 Mini variants in Soku AI, deploy them as ads across Meta, Google, and TikTok, and learn what drives ROAS.
