AIAny - Dramabox

Introduction

Most TTS systems map text to a neutral read; Dramabox treats the prompt as a theatrical director. By interpreting stage directions outside quotes and literal dialogue inside them, it produces nuanced performances (laughs, sighs, pauses, whispering) and can clone timbre from a short reference.

What Sets It Apart

Prompt-as-performance: scene descriptions and stage directions directly shape prosody and expressive behaviors, so a single text prompt can encode speaker identity, emotion, and timing without separate style tokens.
Short-reference voice cloning: an optional ~10s voice reference provides timbre cloning while keeping inference inputs compact — useful for demos and conditional generation pipelines.
Architecture and safety tradeoffs: an IC‑LoRA fine-tune of LTX‑2.3 (audio DiT + flow-matching) keeps the model footprint manageable while retaining high expressivity; outputs are automatically watermarked with Resemble Perth to enable provenance detection.
Practical resource profile: warm-server inference targets ~2.5s per generation on H100 and peaks at ~24GB VRAM for recommended configs, making it realistic for research or medium-scale on-prem setups.

Who It's For & Tradeoffs

Great fit if you need highly expressive, controllable TTS for demos, voice-forward storytelling, character dialogue, or research into prompt-driven generation and voice cloning. Also suitable for teams who want a Hugging Face-hosted model plus an interactive demo space. Look elsewhere if you require a very small on-device TTS, license-free redistribution for commercial products without LTX‑2 constraints, or deterministic low-latency streaming on CPU-only environments. The model’s fidelity and expressivity come with nontrivial VRAM and licensing considerations (LTX‑2 Community License) and automated watermarking by default.

Dramabox

Introduction

What Sets It Apart

Who It's For & Tradeoffs

Information

Categories

Tags

More Items

MOSS-VL-Realtime

unsloth/inkling-GGUF

LTX-Video 2.3 22B — IC-LoRA: CrossView Prompt v0.9