songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Best use case
songsee is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Teams using songsee should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/songsee/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How songsee Compares
| Feature / Agent | songsee | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# songsee Generate spectrograms + feature panels from audio. Quick start - Spectrogram: `songsee track.mp3` - Multi-panel: `songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux` - Time slice: `songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg` - Stdin: `cat track.mp3 | songsee - --format png -o out.png` Common flags - `--viz` list (repeatable or comma-separated) - `--style` palette (classic, magma, inferno, viridis, gray) - `--width` / `--height` output size - `--window` / `--hop` FFT settings - `--min-freq` / `--max-freq` frequency range - `--start` / `--duration` time slice - `--format` jpg|png Notes - WAV/MP3 decode native; other formats use ffmpeg if available. - Multiple `--viz` renders a grid.
Related Skills
mijia-control
Control and monitor Xiaomi Mijia smart home devices. Use this skill when the user wants to: 1) Switch device status (on/off, brightness, etc.) 2) List available home devices 3) Run automation scenes 4) Check environmental statistics.
weather
Get current weather and forecasts (no API key required).
wacli
Send WhatsApp messages to other people or search/sync WhatsApp history via the wacli CLI (not for normal user chats).
voice-call
Start voice calls via the Clawdbot voice-call plugin.
video-frames
Extract frames or short clips from videos using ffmpeg.
```skill
---
things-mac
Manage Things 3 via the `things` CLI on macOS (add/update projects+todos via URL scheme; read/search/list from the local Things database). Use when a user asks Clawdbot to add a task to Things, list inbox/today/upcoming, search tasks, or inspect projects/areas/tags.
spotify-player
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
sonoscli
Control Sonos speakers (discover/status/play/volume/group).
trello
Manage Trello boards, lists, and cards via the Trello REST API.
tmux
Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.
summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).