Performance
95%
39/41190s
activation → loaded1.301×
1.00× – 4.62×13.1s
10.1s – 46.5sCapabilities
Base En
Out of scopeSound Effects
Failure Modes
Out of scopeMultilingual
Out of scopeCode Switching
Out of scopeCloning
Out of scopeVoice Design
Out of scopePronunciation
Out of scopePauses
Out of scopeStreaming
Out of scopeDialogue
Out of scopeStandard harness
Base English
5 cases1.1-short
Hello, this is the first sentence.
500: {"detail":"Synthesis failed: [Errno 11] Resource temporarily unavailable"}1.2-medium
The quick brown fox jumps over the lazy dog, and afterwards goes to sleep.
1.3-question
Could you please confirm whether the deployment succeeded?
1.4-exclamation
Watch out, that is dangerous!
1.5-long-paragraph
The deployment process began at six in the morning. By half past seven, the first replicas were warm and serving traffic. Engineers checked the dashboards every few minutes, watching for the subtle latency increase that always preceded a regression. The new model had been tested for weeks in staging, but production traffic exposed edge cases that no synthetic load could simulate.
Sound effects
5 cases10.1-sfx-footsteps
footsteps on gravel, walking slowly
10.2-sfx-thunder
rolling thunder in the distance, followed by heavy rain
10.3-sfx-glass
a glass shatters on a tile floor
10.4-sfx-animals
a dog barks, then growls and whimpers
10.5-sfx-mechanical
an engine starting, idling, and revving up
Failure modes
5 cases11.1-empty
(no text)
11.2-punctuation-only
!?...?!
11.3-mixed-script
Hello 世界 مرحبا こんにちは namaste
11.4-symbols
$1,234.56 (75% off) @ 3PM EST
11.5-markdown-residue
<b>Hello</b> **world** _italic_
Multilingual
6 cases2.1-zh
你好,今天天气很好,适合出去散步。
2.2-ja
こんにちは、お元気ですか?今日もいい天気ですね。
2.3-es
Hola, ¿cómo estás hoy? Espero que muy bien.
500: {"detail":"Model worker crashed during synthesis"}2.4-fr
Bonjour, comment allez-vous aujourd'hui?
2.5-ar
مرحبا، كيف حالك اليوم؟ أتمنى أن تكون بخير.
2.6-hi
नमस्ते, आप कैसे हैं? आज मौसम बहुत अच्छा है।
Code switching
3 cases3.1-en-zh
I'll meet you at the 茶馆 at three in the afternoon.
3.2-en-es
She said hola and then waved goodbye.
3.3-en-ja
The Japanese word for thank you is ありがとう.
Voice cloning
10 cases4.1-clone-clean-5s
The quick brown fox jumps over the lazy dog.
4.2-clone-clean-15s
The quick brown fox jumps over the lazy dog.
4.3-clone-clean-30s
The quick brown fox jumps over the lazy dog.
4.4-clone-noisy
The quick brown fox jumps over the lazy dog.
4.5-clone-accented
The quick brown fox jumps over the lazy dog.
4.6-clone-whispered
The quick brown fox jumps over the lazy dog.
4.7-clone-raspy
The quick brown fox jumps over the lazy dog.
4.8-clone-reverb
The quick brown fox jumps over the lazy dog.
4.9-clone-child
The quick brown fox jumps over the lazy dog.
4.10-clone-cross-lang
你好,今天天气很好。
Pronunciation
4 cases6.1-irish-name
Her name is Saoirse Ronan.
6.2-brand-hyundai
I drive a Hyundai Ioniq.
6.3-gif-vs-jif
Save the file as a GIF and not a JPEG.
6.4-sql
We use SQL to query the database.
Pauses
3 cases7.1-pause-short
Wait, [pause 0.5s] for it.
7.2-pause-medium
She paused, [pause 1.5s] then continued.
7.3-pause-long
And then [pause 3.0s] silence.
Issues
sfx-fixed-output-length
lowopenAll synthesized outputs are exactly ~10.08s regardless of prompt complexity ("glass shatters" vs. "engine starting, idling, and revving up" both produce 10.08s). This is an upstream model behavior — moss-sfx emits a constant-length codec stream. If we need shorter SFX clips (e.g. <2s impact sounds), the frontend will need to trim or we need an upstream feature to set target length.
sfx-transient-resource-error
lowopenFirst call after activation (1.1-short) returned 'Resource temporarily unavailable'. Subsequent calls succeeded immediately. Likely a worker-process / GPU allocation race at first invocation.
sfx-worker-crash-on-2.3-es
mediumopenCase 2.3-es ('Hola, ¿cómo estás hoy? Espero que muy bien.') triggered 'Model worker crashed during synthesis'. Auto-respawn worked (case 2.4-fr completed, taking 46s vs ~13s norm due to reload). Worth investigating whether specific characters (¿, accented vowels) or a heap/codec edge case caused it — not blocking but a stability flag.
Deployment
era-tts-moss
era-core
nvidia-tesla-a100
gke-spot (preemptible)
era-tts-model-cache-moss
1800s