A Python MCP server providing audio analysis tools for AI-driven sound recreation. It imports, separates, analyzes, and compares audio so an agent can configure hardware synthesizers to match a target sound.
A suite of MCP tools take a target track from import, through separation and analysis, to a side-by-side comparison with the synthesized result.
import_audioaudio_renderaudio_list_devicesSplit into vocals, drums, bass, other, guitar and piano with Demucs.
stem_separateMel spectrogram, spectral features, ADSR and modulation.
spectrum_analyzePolyphonic transcription via Basic Pitch — MIDI plus note-event JSON.
note_transcribenote_triagenote_isolateTarget vs. synthesized — mel spectrogram distance and per-band energy.
audio_compareRequires Python 3.11 and uv. audio_render needs PortAudio; system capture uses BlackHole.
# install uv sync --dev # macOS audio capture deps brew install portaudio # run the MCP server (spawned by your client over stdio) uv run python -m audio_analysis_mcp
audio-analysis-mcp turns a target recording into the features the agent reasons about.