Audio Prompting
AI 오디오 생성은 배경 음악부터 보이스오버, 사운드 이펙트까지 새로운 창작 가능성을 열어줍니다. 이 가이드는 오디오 모델을 효과적으로 프롬프트하는 방법을 알려줍니다.
Audio Models in Armox
Music Generation
| Model | Credits | Best For |
|---|---|---|
| Music 1.5 | 48 | Quick background music |
| Lyria 2 | 120 | High-quality songs |
| MusicGen | 176 | Detailed compositions |
| Music-01 | 200 | Complex music |
Speech & Voice
| Model | Credits | Best For |
|---|---|---|
| Speech-02 Turbo | 8 | Quick voiceovers |
| XTTS-v2 | 20 | Voice cloning |
| Speech-02 HD | 40 | High-quality speech |
| Voice Cloning | 100 | Clone any voice |
Music Prompting Basics
음악 프롬프트는 다음을 설명해야 합니다:
- Genre/Style — 어떤 장르/스타일인가요?
- Mood/Emotion — 어떤 느낌이어야 하나요?
- Instruments — 어떤 악기/사운드인가요?
- Tempo — 빠른가요 느린가요?
- Purpose — 어디에 쓰이나요?
Basic Structure
Prompt Template
[Genre] music, [mood], [instruments], [tempo], [purpose/context]
Genre Keywords
음악 스타일은 구체적으로 적을수록 좋습니다:
Popular Genres
| Genre | Description |
|---|---|
| "Lo-fi hip hop" | Relaxed, study music vibe |
| "Cinematic orchestral" | Epic, movie soundtrack |
| "Upbeat pop" | Catchy, commercial |
| "Ambient electronic" | Atmospheric, background |
| "Acoustic folk" | Warm, organic |
| "Corporate" | Professional, business |
| "Epic trailer" | Dramatic, building |
| "Chill electronic" | Relaxed, modern |
Fusion Styles
장르를 조합해 독특한 사운드를 만들 수 있습니다:
- "Jazz-influenced lo-fi"
- "Orchestral with electronic elements"
- "Acoustic pop with indie vibes"
- "Cinematic ambient"
Mood and Emotion
Positive Moods
- "Uplifting and inspiring"
- "Happy and cheerful"
- "Energetic and exciting"
- "Warm and comforting"
- "Hopeful and optimistic"
Calm Moods
- "Peaceful and serene"
- "Relaxing and meditative"
- "Dreamy and ethereal"
- "Gentle and soothing"
- "Contemplative and reflective"
Dramatic Moods
- "Intense and powerful"
- "Mysterious and suspenseful"
- "Epic and triumphant"
- "Dark and moody"
- "Emotional and moving"
Instruments and Sounds
Acoustic Instruments
Piano, acoustic guitar, strings,
violin, cello, flute,
drums, bass, percussion
Electronic Sounds
Synthesizer, electronic beats,
bass drops, pads, arpeggios,
808 drums, ambient textures
Orchestral
Full orchestra, brass section,
string ensemble, timpani,
French horn, choir
Example with Instruments
Cinematic orchestral music with soaring strings,
powerful brass, and thundering timpani,
building to an epic climax,
movie trailer style
Tempo and Energy
Tempo Keywords
| Term | BPM Range | Feel |
|---|---|---|
| "Very slow" | 40-60 | Meditative |
| "Slow" | 60-80 | Relaxed |
| "Moderate" | 80-100 | Walking pace |
| "Upbeat" | 100-120 | Energetic |
| "Fast" | 120-140 | Exciting |
| "Very fast" | 140+ | Intense |
Energy Descriptions
- "Starts soft, builds gradually"
- "High energy throughout"
- "Ebb and flow dynamics"
- "Steady and consistent"
- "Explosive crescendo"
Music Prompt Templates
Background Music
Prompt Template
[Genre] background music, [mood], [tempo], suitable for [use case], [instruments], [duration note]
Example:
Lo-fi hip hop background music,
relaxing and focused, moderate tempo,
suitable for studying or working,
mellow beats with soft piano and vinyl crackle,
loopable
Video Soundtrack
Prompt Template
[Genre] music for [video type], [mood] feeling, [tempo], [instruments], syncs with [visual description]
Example:
Uplifting corporate music for product launch video,
inspiring and professional feeling, moderate upbeat tempo,
piano, light percussion, subtle strings,
builds energy toward the end
Podcast/YouTube Intro
Prompt Template
[Genre] intro music, [duration] seconds, [mood], [instruments], catchy and memorable, suitable for [content type]
Example:
Modern electronic intro music,
10-15 seconds, energetic and exciting,
punchy synths and driving beat,
catchy and memorable hook,
suitable for tech podcast
Speech and Voiceover Prompting
Text-to-Speech
Speech-02 같은 모델에서 프롬프트는 “실제로 읽힐 텍스트”입니다:
Prompt Template
[The actual words you want spoken]Voice Characteristics
일부 모델은 보이스 특성을 지정할 수 있습니다:
| Trait | Options |
|---|---|
| Gender | Male, female, neutral |
| Age | Young, middle-aged, elderly |
| Tone | Professional, friendly, authoritative |
| Accent | American, British, Australian |
| Speed | Slow, normal, fast |
Speech Examples
Professional Narration:
Welcome to our quarterly report.
This presentation covers our key achievements
and strategic initiatives for the coming year.
Friendly Explainer:
Hey there! In this video, we're going to show you
exactly how to get started with our app.
It's super easy, I promise!
Dramatic Trailer:
In a world where technology has changed everything...
one company dares to reimagine the future.
Voice Cloning
보이스를 복제하려면:
- Upload 노이즈 없는 선명한 오디오 샘플(10~30초)
- Connect Voice Cloning 노드에 연결
- Provide 읽힐 텍스트 제공
- AI가 해당 목소리로 음성을 생성합니다
Best Practices for Voice Samples
- ✅ Clear, noise-free recording
- ✅ Single speaker only
- ✅ Natural speaking pace
- ✅ 10-30 seconds of speech
- ❌ Background music or noise
- ❌ Multiple speakers
- ❌ Heavily processed audio
Combining Audio with Video
Workflow
- 먼저 비디오를 생성합니다
- 비디오 길이에 맞는 오디오를 만듭니다
- 비디오 편집기에서 합칩니다
Matching Audio to Video
다음을 고려하세요:
- Video duration — 오디오 길이를 맞추기
- Video mood — 영상 분위기와 음악의 조화
- Key moments — 컷 전환에 비트/하이라이트 맞추기
- Pacing — 빠른 영상 = 더 에너지 있는 오디오
Common Music Mistakes
❌ Too Vague
Nice music
✅ Better
Uplifting acoustic folk music,
warm and hopeful, moderate tempo,
acoustic guitar and light percussion,
suitable for lifestyle brand video
❌ Conflicting Descriptions
Sad and depressing but also happy and upbeat
✅ Better
Bittersweet and nostalgic,
melancholic melody with hopeful undertones
❌ Too Specific Technically
Music in C major at 120 BPM with
a I-IV-V-I chord progression
✅ Better
Upbeat pop music with a classic,
familiar chord progression,
catchy and singable
Use Case Examples
YouTube Video Background
Upbeat electronic background music,
energetic but not distracting,
moderate-fast tempo,
synth pads and light beats,
suitable for tech review video,
loopable for long videos
Meditation/Relaxation
Ambient meditation music,
deeply calming and peaceful,
very slow tempo,
soft pads, gentle bells, nature sounds,
suitable for yoga or sleep
Product Commercial
Modern corporate music,
confident and innovative feeling,
moderate tempo building to upbeat,
clean synths and subtle percussion,
30 seconds, ends with resolution
Social Media Reel
Trendy pop music,
catchy and fun, fast tempo,
suitable for Instagram Reels,
15-30 seconds,
hook in first 3 seconds
Iteration Strategy
- Start simple — Genre + mood + tempo
- Add instruments — 핵심 사운드 지정
- Refine energy — 다이내믹/빌드 설명
- Match purpose — 사용 목적에 맞게 조정
Next Steps
- Audio Nodes — 오디오 노드 설정 마스터하기
- Audio & Music Workflow — 오디오 제작 전체 프로세스
- Video Prompting — 오디오에 어울리는 비디오 만들기