2026-01-23 · AI & Agents

Autonomous Multi-Agent Coordination - Jan 22 2026

Autonomous Multi-Agent Coordination: Music, Video, and Voice

Date: January 22, 2026 Project: Orca Monolith Development

Overview

Coordinated 7 autonomous Claude Code agents across different terminal sessions using a custom itermctl tool, achieving significant progress on music generation, video processing, photo galleries, and voice assistant integration.

Achievements

🎵 HeartMuLa Music Generation

🎬 YouTube Shorts Creation

📸 Photo Gallery Implementation

🎨 Shader Analysis

🗣️ Voice Hub Integration

🤖 OpenRouter Integration

Technical Details

Architecture

Coordination Tool

Custom itermctl CLI (AppleScript + bash + jq):

Managed sessions:

Challenges & Solutions

Music Playlist

Challenge: HeartMuLa generates quality audio but doesn't follow prompt labels exactly Solution: Labels are descriptive of intent, actual output is model-driven - quality matters more

Photo Storage

Challenge: 12GB of photos vanished on container restart Solution: Added persistent volume mount for /var/orca/content, cleared orphaned DB records

Voice Hub Integration

Challenge: Voice Hub was standalone app with conflicting modules Solution: Integrated into Orca monolith by:

YuE Music Timeouts

Challenge: YuE 7B model times out after 10 minutes with no output Decision: Stick with HeartMuLa (working, fast, quality results)

Next Steps

  1. Re-run photo collector to populate production storage
  2. Upload YouTube Shorts for algorithm testing
  3. Deploy Voice Hub to Mac for local voice assistant
  4. Monitor OpenRouter quote ranking quality vs cost savings

System Health


Tools Used: Elixir, Phoenix, LiveView, Docker, FFmpeg, Remotion, HeartMuLa, OpenRouter, Qwen 2.5 72B, itermctl

Outcome: Successful multi-agent coordination with 5 major features shipped in parallel across different domains (music, video, photos, voice, AI integration).