Armox
    Armox Academy 📚
    AI Models ReferenceAudio Models

    Audio Models

    Audio models in Armox generate music, speech, and sound effects from text descriptions or reference inputs.

    Overview

    Audio models can:

    • Music generation — Create original music from descriptions
    • Text-to-speech — Generate natural voice from text
    • Sound effects — Create ambient sounds and effects
    • Voice cloning — Generate speech in specific voices
    • Audio continuation — Extend existing audio

    Available Audio Models

    ModelProviderCostDurationBest For
    MusicGenMeta100 credits8-30sMusic generation
    Ace StepVarious100 credits60-300sLong-form music
    Dia TTSNari Labs50 creditsVariableText-to-speech
    Kokoro TTSKokoro50 creditsVariableFast TTS
    ChatterboxVarious50 creditsVariableVoice cloning

    Connection Colors

    In the Armox Canvas, audio connections use orange handles and edges:

    • Input Handle: Red circle on the left side of nodes
    • Output Handle: Red circle on the right side of nodes
    • Connection Edge: Red line connecting nodes

    Common Settings

    Duration

    Control the length of generated audio.

    Sample Rate

    • 44.1kHz — CD quality
    • 48kHz — Professional audio

    Format

    • MP3 — Compressed, smaller files
    • WAV — Uncompressed, higher quality

    Choosing the Right Model

    For Music

    • MusicGen (100 credits) — Short music clips
    • Ace Step (100 credits) — Long-form music

    For Speech

    • Dia TTS (50 credits) — Natural dialogue
    • Kokoro TTS (50 credits) — Fast generation
    • Chatterbox (50 credits) — Voice cloning

    Best Practices

    1. Be specific about genre — "jazz", "electronic", "orchestral"
    2. Describe mood — "upbeat", "melancholic", "energetic"
    3. Include instruments — "piano", "guitar", "synthesizer"
    4. Specify tempo — "slow", "moderate", "fast"
    5. For speech, use natural text — Write as you'd speak

    Next Steps

    Explore individual model documentation for detailed settings and use cases.