songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Best use case
songsee is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using songsee should expect more consistent output, faster repeated runs, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/songsee/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
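The manual steps above can be scripted. The following is a minimal sketch, assuming SKILL.md has already been downloaded to a local path. The function name `install_skill` and its parameters are hypothetical and are not part of songsee or of any agent tooling; only the target path `.claude/skills/songsee/SKILL.md` comes from the steps above.

```python
import shutil
from pathlib import Path


def install_skill(downloaded_skill: Path, project_root: Path) -> Path:
    """Copy a downloaded SKILL.md into <project_root>/.claude/skills/songsee/.

    Hypothetical helper: it only mirrors the manual "place it in" step.
    """
    target_dir = project_root / ".claude" / "skills" / "songsee"
    # Create .claude/skills/songsee if any part of it is missing.
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / "SKILL.md"
    shutil.copyfile(downloaded_skill, target)
    return target
```

After copying the file, the agent still has to be restarted so that it picks up the skill.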
How songsee Compares
| Feature / Agent | songsee | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# songsee

Generate spectrograms + feature panels from audio.

## Quick start

- Spectrogram: `songsee track.mp3`
- Multi-panel: `songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux`
- Time slice: `songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg`
- Stdin: `cat track.mp3 | songsee - --format png -o out.png`

## Common flags

- `--viz` list (repeatable or comma-separated)
- `--style` palette (classic, magma, inferno, viridis, gray)
- `--width` / `--height` output size
- `--window` / `--hop` FFT settings
- `--min-freq` / `--max-freq` frequency range
- `--start` / `--duration` time slice
- `--format` jpg|png

## Notes

- WAV/MP3 decode native; other formats use ffmpeg if available.
- Multiple `--viz` renders a grid.
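For a repeatable workflow, it can help to assemble the songsee command line in code instead of retyping it. The sketch below is one possible way to do that, not an official wrapper. The helper names `build_songsee_args` and `run_songsee` are hypothetical. The flags (`--viz`, `--start`, `--duration`, `--style`, `--format`, `-o`) are the ones shown in the SKILL.md text above, and the sketch assumes they behave as described there.

```python
import shutil
import subprocess
from typing import List, Optional, Sequence


def build_songsee_args(
    audio_path: str,
    viz: Sequence[str] = (),
    start: Optional[float] = None,
    duration: Optional[float] = None,
    style: Optional[str] = None,
    fmt: Optional[str] = None,
    output: Optional[str] = None,
) -> List[str]:
    """Assemble an argv list using only flags listed in SKILL.md."""
    args = ["songsee", audio_path]
    if viz:
        # SKILL.md says --viz accepts a comma-separated list.
        args += ["--viz", ",".join(viz)]
    if start is not None:
        args += ["--start", str(start)]
    if duration is not None:
        args += ["--duration", str(duration)]
    if style is not None:
        args += ["--style", style]
    if fmt is not None:
        args += ["--format", fmt]
    if output is not None:
        args += ["-o", output]
    return args


def run_songsee(args: List[str]) -> None:
    """Run the command, failing early if no songsee binary is on PATH."""
    if shutil.which(args[0]) is None:
        raise FileNotFoundError("songsee not found on PATH")
    subprocess.run(args, check=True)


# Build (but do not run) a three-panel render of an 8-second slice.
cmd = build_songsee_args(
    "track.mp3",
    viz=["spectrogram", "mel", "chroma"],
    start=12.5,
    duration=8,
    output="slice.jpg",
)
print(" ".join(cmd))
# prints: songsee track.mp3 --viz spectrogram,mel,chroma --start 12.5 --duration 8 -o slice.jpg
```

Keeping the build step separate from the run step lets the command be logged or reviewed before anything executes, and passing an argv list to `subprocess.run` avoids shell quoting issues with file names.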
Related Skills
xurl
A CLI tool for making authenticated requests to the X (Twitter) API. Use this skill when you need to post tweets, reply, quote, search, read posts, manage followers, send DMs, upload media, or interact with any X API v2 endpoint.
weather
Get current weather and forecasts via wttr.in or Open-Meteo. Use when: user asks about weather, temperature, or forecasts for any location. NOT for: historical weather data, severe weather alerts, or detailed meteorological analysis. No API key needed.
wacli
Send WhatsApp messages to other people or search/sync WhatsApp history via the wacli CLI (not for normal user chats).
voice-call
Start voice calls via the OpenAEON voice-call plugin.
video-frames
Extract frames or short clips from videos using ffmpeg.
trello
Manage Trello boards, lists, and cards via the Trello REST API.
tmux
Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.
things-mac
Manage Things 3 via the `things` CLI on macOS (add/update projects+todos via URL scheme; read/search/list from the local Things database). Use when a user asks OpenAEON to add a task to Things, list inbox/today/upcoming, search tasks, or inspect projects/areas/tags.
summarize
Summarize or extract text/transcripts from URLs, podcasts, and local files (great fallback for “transcribe this YouTube/video”).
spotify-player
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
sonoscli
Control Sonos speakers (discover/status/play/volume/group).
slack
Use when you need to control Slack from OpenAEON via the slack tool, including reacting to messages or pinning/unpinning items in Slack channels or DMs.