songsee
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Best use case
songsee is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Teams using songsee should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/songsee/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How songsee Compares
| Feature / Agent | songsee | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# songsee Generate spectrograms + feature panels from audio. Quick start - Spectrogram: `songsee track.mp3` - Multi-panel: `songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux` - Time slice: `songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg` - Stdin: `cat track.mp3 | songsee - --format png -o out.png` Common flags - `--viz` list (repeatable or comma-separated) - `--style` palette (classic, magma, inferno, viridis, gray) - `--width` / `--height` output size - `--window` / `--hop` FFT settings - `--min-freq` / `--max-freq` frequency range - `--start` / `--duration` time slice - `--format` jpg|png Notes - WAV/MP3 decode native; other formats use ffmpeg if available. - Multiple `--viz` renders a grid.
Related Skills
xurl
A CLI tool for making authenticated requests to the X (Twitter) API. Use this skill when you need to post tweets, reply, quote, search, read posts, manage followers, send DMs, upload media, or interact with any X API v2 endpoint.
xhs-operator
小红书运营执行技能。用于调用已接入的 xhs MCP 工具完成登录检查、内容生成衔接、图文/视频发布、发布后复盘与失败重试。适用于“发笔记”“发视频”“检查登录状态”“查询笔记/搜索内容”等任务。
weather
Get current weather and forecasts via wttr.in or Open-Meteo. Use when: user asks about weather, temperature, or forecasts for any location. NOT for: historical weather data, severe weather alerts, or detailed meteorological analysis. No API key needed.
wacli
Send WhatsApp messages to other people or search/sync WhatsApp history via the wacli CLI (not for normal user chats).
voice-call
Start voice calls via the OpenClaw voice-call plugin.
video-frames
Extract frames or short clips from videos using ffmpeg.
trello
Manage Trello boards, lists, and cards via the Trello REST API.
tmux
Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.
spotify-player
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
sonoscli
Control Sonos speakers (discover/status/play/volume/group).
slack
Use when you need to control Slack from OpenClaw via the slack tool, including reacting to messages or pinning/unpinning items in Slack channels or DMs.
skill-creator
Create, edit, improve, or audit AgentSkills. Use when creating a new skill from scratch or when asked to improve, review, audit, tidy up, or clean up an existing skill or SKILL.md file. Also use when editing or restructuring a skill directory (moving files to references/ or scripts/, removing stale content, validating against the AgentSkills spec). Triggers on phrases like "create a skill", "author a skill", "tidy up a skill", "improve this skill", "review the skill", "clean up the skill", "audit the skill".