All Tools
Mini Tier~2× Faster~Half the CostNative AudioByteDance

Seedance 2.0 Mini — Fast, Low-Cost AI Video

ByteDance's distilled video model generates cinematic clips with native audio from text, images, and references — ~2× faster than the Fast tier and about half the cost of Seedance 2.0, at up to 720p. Run it in Soku AI and turn it into ad creatives.

AI Video Generation

Seedance 2.0 Mini Studio

Mini Tier

Model

Seedance 2.0 Mini (Distilled)

Up to 720p · 24fps · ~2× faster than Fast · ~half the cost · Native audio

Video Prompt

Same multimodal inputs as Seedance 2.0 — text, image, video, and audio references

Reference Inputs

Optional — reference them in your prompt as @Image1, @Video1, @Audio1.

Duration

Resolution

Need 1080p/2K? Switch to Seedance 2.0 Standard.

Aspect Ratio

Audio

Generation usually takes 2–3 minutes.

Cinematic Landscape

A lone figure on an endless salt flat at dusk — text-to-video with natural depth and atmosphere

Atmospheric Night

A warmly lit cabin in a snowy pine forest — falling snow, ambient glow, native audio

Culinary ASMR

Close-up of sushi rolled by gloved hands — tactile detail with synchronized sound

~2× Fastervs Fast tier
~Half the Costvs Standard 2.0
Up to 720pOutput
Credit-basedIn Soku AI
Sample Gallery

Made with Seedance 2.0

Cinematic landscapes, product commercials, ASMR close-ups, action scenes — all generated from text, images, and references with native audio. Mini renders these same scenes at up to 720p for roughly half the cost and twice the speed of the Fast tier. Hover any clip to play; tap the speaker to hear the audio.

Hover to play
Text-to-Video

Cinematic Landscape

A lone figure on an endless salt flat at dusk — natural depth, haze, and reflection.

Hover to play
Image-to-Video

Editorial Portrait

A suited figure on a concrete staircase — moody architectural light, smooth dolly.

Hover to play
Text-to-Video

Surreal Underwater

An octopus guarding a soccer ball on the seabed — fluid motion and stable physics.

Hover to play
Native Audio

Atmospheric Night

A warmly lit cabin in a snowy pine forest — falling snow with native ambient audio.

Hover to play
Native Audio

Culinary ASMR

Sushi rolled by gloved hands in close-up — tactile detail with synchronized sound.

Hover to play
Camera Motion

Aerial Scenery

Hot-air balloons drifting over misty rolling hills at dawn — sweeping camera move.

Hover to play
Reference-to-Video

Product Commercial

An animated character interacting with a beverage — ad-style scene with synced audio.

Hover to play
Multi-Shot

Action Cinematic

A wuxia martial-arts confrontation in the rain — dynamic motion and sound design.

Hover to play
Native Audio

Beauty & Lifestyle

First-person ASMR close-ups with a healing ambiance — soft light, tactile sounds.

Seedance 2.0 Mini at a Glance

Mini is ByteDance's answer to the cost of cinematic AI video. It takes the full Seedance 2.0 multimodal engine — the @-reference workflow, native audio co-generation, and multi-shot storytelling — and distills it into a lighter model that needs far less compute per second. The result is the cheapest, fastest tier in the Seedance family, built for teams that generate video at volume.

DeveloperByteDance (Seed) · Volcano Engine
ReleasedJune 15, 2026
Model TypeDistilled Seedance 2.0
Max Resolution720p (480p / 720p)
Max Duration4–15 seconds · 24fps
Speed~2× faster than Fast
Cost~½ of standard 2.0
Inputs per Generation9 images + 3 videos + 3 audio
Native AudioYes (joint A/V)

Why Seedance 2.0 Mini

Mini trades nothing but resolution headroom and top-end fidelity for a dramatic drop in cost and latency — exactly the trade you want when you're testing creative at scale.

~2× Faster Than Fast

Distillation compresses the full model so it needs less compute per second of output — turning around drafts and variants in a fraction of the time of the Fast tier, at comparable or better quality.

~Half the Cost

Roughly half the price of standard Seedance 2.0 — about ¥0.5 (~$0.07) per second at 720p versus ~¥1 per second for standard. Generate ten variants for the price of five.

Same Multimodal Inputs

Not a stripped-down toy — Mini keeps the entire Seedance 2.0 input stack: up to 9 images, 3 videos, and 3 audio files per generation, with @-reference prompting and native synchronized audio.

Iterate Cheap, Finish Sharp

The pro workflow: explore hooks and angles on Mini, then re-render only the final approved hero shots on standard Seedance 2.0 when you need 1080p or 2K. Spend compute where it counts.

Built for Volume

Short-form social, UGC, e-commerce cutdowns, and ad variants at scale — Mini is tuned for the high-throughput work where you need dozens of clips, not one hero film.

Supersedes the Fast Tier

Mini is faster and cheaper than Seedance 2.0 Fast — and reviewers report stronger motion coherence, character stability, and prompt adherence. For new budget work, it is the tier to reach for.

Full Seedance 2.0 Capabilities — at Mini Speed

Mini inherits everything that made Seedance 2.0 the most flexible multimodal video model. The only thing you give up is resolution headroom above 720p.

Multimodal Input

Accept up to 9 images, 3 videos, and 3 audio files (12 total assets) in a single generation. Reference any asset in natural language — "Take @Image1 as the first frame, adopt camera movement from @Video1".

Native Audio Co-Generation

A dedicated audio branch generates synchronized sound effects, background music, and dialogue alongside video — not stitched on after. Toggle it off when you plan to dub voice-over in post.

Phoneme-Level Lip Sync

Phoneme embeddings drive lip articulation across 8+ languages, with prosodic guidance from audio shaping facial movement — enabling natural multilingual dubbing for global campaigns.

Multi-Shot Storytelling

Write [Shot 1] … [Shot 2] … in your prompt and Seedance plans the cuts and camera work natively, holding characters, clothing, and spatial logic consistent across shots.

Motion & Camera Replication

Upload a reference video and Mini adopts its camera work, movement, and effects — then swap characters, extend the clip, or drop in your own product.

Director-Level Controls

Specify professional cinematography: dolly-ins, lateral pans, follow shots, circular tracking. Control lighting, shadows, shot size, and angle while keeping framing consistent.

Physics & Motion

Realistic collisions, fabric dynamics, and fluid motion in action sequences — reviewers report Mini holds motion coherence better than the older Fast tier.

Style Transfer & Editing

Reference-based editing with customizable visual styles — photorealistic, anime, abstract, and more. Extend clips, replace characters, or restyle a scene while preserving motion.

Mini vs Fast vs Standard

All three tiers share the same multimodal inputs and native audio. The difference is the speed / cost / fidelity trade. Resolution and price columns mix Volcano Engine (RMB) and fal.ai (USD) rates because they are separate price books.

DimensionMiniFastStandard 2.0
Max Resolution480p / 720p480p / 720p1080p · up to 2K
Duration4–15s4–15s4–15s
SpeedFastest (~2× Fast)MidSlowest
Relative QualityBeats Fast; near-Standard on short clipsLowest of threeHighest fidelity
Price / sec @720p~¥0.5 (~$0.07)~$0.24 (fal)~¥1 (~$0.14); ~$0.30 (fal)
Native AudioYesYesYes
Multimodal Inputs9 img + 3 vid + 3 audio9 img + 3 vid + 3 audio9 img + 3 vid + 3 audio
Best ForHigh-volume drafts, social, adsLegacy budget workFinal deliverables, hero shots

Pricing figures are drawn from public Volcano Engine and fal.ai rates and may change; consumer-app credit pricing in Jimeng / Dreamina runs higher than raw API rates.

Built for High-Volume Ad Creative

High-Volume Ad Variants

Generate dozens of 720p video ad variants across hooks, angles, and formats in minutes — at roughly half the cost of standard Seedance.

UGC-Style Content

Talking-head and lifestyle clips with lip-synced dialogue for TikTok and Reels — no talent, no studio, cheap enough to make daily.

E-Commerce Cutdowns

Turn product shots into short, scroll-stopping motion clips for feeds and product pages, generated in bulk per SKU.

Creative Testing at Scale

Iterate on prompts and concepts cheaply on Mini to find winning creative before committing budget to a high-res final render.

Multi-Market Campaigns

One concept localized across 8+ languages with native phoneme-level lip sync — affordable enough to ship every market.

Storyboard to Video

Upload a sequence of reference images and get a coherent multi-shot draft with consistent characters to validate a concept fast.

How Soku AI Helps

Soku AI turns Seedance 2.0 Mini into an end-to-end creative testing pipeline — from low-cost batch video generation to cross-channel performance measurement.

Batch video generation

Generate dozens of video ad variants across aspect ratios, hooks, and visual styles in minutes — Mini's low cost makes large batches economical.

Soku AI builds reusable creative briefs tied to your brand guidelines, so every variant stays on-brand while testing different angles, CTAs, and formats.

Multi-platform adaptation

Automatically produce assets for every placement — 9:16 for Reels/Stories, 1:1 for feeds, 16:9 for YouTube — from a single creative brief.

Mini's aspect-ratio presets plus multi-shot consistency mean your product looks identical across every format.

Performance learning loop

Connect video output to real ad performance. Learn which visual styles, camera moves, and hooks drive conversions — then re-render winners on standard 2.0.

Soku AI tracks CTR, CPA, and ROAS by creative variant, feeding insights back into the next Mini generation round.

Pricing

Mini is the cheapest Seedance 2.0 tier — about half the cost of standard 2.0. Note: some English coverage misprinted "$0.50/second" — that is ¥0.5 (≈ $0.07). The simplest way to use Mini for marketing is through Soku AI, where generation is credit-based.

Volcano Engine API (RMB)

ModeRateNotes
Text / Image-to-Video~¥0.023 / 1K tokens≈ ¥0.5 per second @720p
Video-to-Video~¥0.014 / 1K tokensCheapest mode
Standard 2.0 (ref)~¥1 / secondFor comparison — ~2× Mini

What It Costs Elsewhere

TierPriceUse Case
Mini @720p~$0.07 / secHigh-volume drafts & ads
Fast (fal)~$0.24 / secLegacy budget tier
Standard (fal)~$0.30 / sec1080p / 2K finals

Availability

Seedance 2.0 Mini is available via Volcano Engine and BytePlus APIs and inside the Jimeng and Dreamina/CapCut apps. Consumer-app credit pricing runs higher than raw API rates. Through Soku AI you run it as part of an ad creative workflow — credit-based, with a starter grant to try it.

Frequently Asked Questions

Seedance 2.0 Mini is a distilled, cost-optimized version of ByteDance's Seedance 2.0 video model, released in June 2026. It is the same underlying multimodal engine — text, image, video, and audio inputs with native synchronized audio — compressed to run roughly 2× faster than the Fast tier at about half the cost of standard Seedance 2.0. It outputs up to 720p, making it the cheapest way to generate Seedance video at scale. You can run it through Soku AI as part of a complete ad creative workflow.

Mini is roughly 2× faster than the Fast tier and cheaper, and reviewers report it also has better motion coherence, character stability, and prompt adherence. For new work, Mini effectively supersedes Fast as the budget tier. Both top out at 720p — step up to standard Seedance 2.0 only when you need 1080p, 2K, or cinema-grade fidelity.

Mini is the cheapest Seedance 2.0 tier — roughly half the cost of standard 2.0 (about ¥0.5 / ~$0.07 per second at 720p versus ~¥1 per second for standard). In Soku AI, generation is credit-based: a clip's credit cost scales with its length and resolution, and credits come with your plan. New accounts get a small starter credit grant to try it, then upgrade for production volume.

Seedance 2.0 Mini generates clips of 4–15 seconds at 24fps, at up to 720p (480p and 720p only). It does not output 1080p, 2K, or 4K — for higher-resolution deliverables, render the final cut on standard Seedance 2.0. Mini supports the full set of aspect ratios (16:9, 9:16, 1:1, 4:3, 21:9) so you can produce feed, Reels, and YouTube formats from one brief.

Yes — high-volume social ads, UGC, e-commerce clips, and short-form video are exactly what Mini is built for. The recommended pro workflow is to iterate cheaply on Mini to find winning hooks and angles, then re-render only the final approved hero shots on standard Seedance 2.0 for maximum fidelity. Through Soku AI you can generate Mini variants and deploy them straight to Meta, Google, and TikTok.

Yes. Like the full Seedance 2.0 model, Mini generates synchronized audio — sound effects, ambient sound, and dialogue — jointly with the video rather than stitching it on afterward. You can also toggle audio off when you plan to dub voice-over or music in post.

Mini is available via ByteDance's Volcano Engine and BytePlus APIs and inside the Jimeng and Dreamina/CapCut apps. The simplest way to put it to work for marketing is through Soku AI — instead of wiring up a raw endpoint, Soku turns Seedance Mini into a creative pipeline: generate on-brand video variants, push them live as ads, and feed performance data back into the next round. Sign up to start.

Make Video Ads at Scale — for Less

Generate Seedance 2.0 Mini variants in Soku AI, deploy them as ads across Meta, Google, and TikTok, and learn what drives ROAS.

Try Seedance 2.0 Mini in Soku AI