Audio spectrograms/features (mel, chroma, MFCC) via CLI.
    npx skill4agent add nousresearch/hermes-agent songsee
    go install github.com/steipete/songsee/cmd/songsee@latest

Related tool: `ffmpeg`

    # Basic spectrogram
    songsee track.mp3
    # Save to specific file
    songsee track.mp3 -o spectrogram.png
    # Multi-panel visualization grid
    songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux
    # Time slice (start at 12.5s, 8s duration)
    songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg
    # From stdin
    cat track.mp3 | songsee - --format png -o out.png

Visualization types (`--viz`):

| Type | Description |
|---|---|
| `spectrogram` | Standard frequency spectrogram |
| `mel` | Mel-scaled spectrogram |
| `chroma` | Pitch class distribution |
| `hpss` | Harmonic/percussive separation |
| `selfsim` | Self-similarity matrix |
| `loudness` | Loudness over time |
| `tempogram` | Tempo estimation |
| `mfcc` | Mel-frequency cepstral coefficients |
| `flux` | Spectral flux (onset detection) |
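To get one image per visualization type instead of a single grid, the types above can be looped over. The following is a minimal sketch, not part of songsee itself: the `render_each` helper, the `RUN` switch, and the `<name>-<type>.png` naming are all hypothetical, and only flags that appear in the examples above (`--viz`, `--format`, `-o`) are used. By default it only prints the commands it would run.

```shell
#!/bin/sh
# Type names as listed in the --viz example above.
VIZ_TYPES="spectrogram mel chroma hpss selfsim loudness tempogram mfcc flux"

# render_each INPUT OUTDIR   (hypothetical helper)
# Prints one songsee command per visualization type.
# Set RUN=1 to execute the commands instead of printing them.
render_each() {
  input=$1
  outdir=$2
  # Strip the directory and the extension: dir/track.mp3 -> track
  base=$(basename "$input")
  base=${base%.*}
  if [ "${RUN:-0}" = "1" ]; then
    mkdir -p "$outdir"
  fi
  for viz in $VIZ_TYPES; do
    out="$outdir/$base-$viz.png"   # output naming is an assumption
    if [ "${RUN:-0}" = "1" ]; then
      songsee "$input" --viz "$viz" --format png -o "$out"
    else
      echo "songsee $input --viz $viz --format png -o $out"
    fi
  done
}

# Dry run: prints nine commands, one per type.
render_each track.mp3 viz
```

Running with `RUN=1` requires `songsee` on the `PATH`; the dry run does not.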
| Flag | Description |
|---|---|
| `--viz` | Visualization types (comma-separated) |
|  | Color palette |
|  | Output image dimensions |
|  | FFT window and hop size |
|  | Frequency range filter |
| `--start`, `--duration` | Time slice of the audio |
| `--format` | Output format (e.g. `png`) |
| `-o` | Output file path |
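The time-slice flags can be combined to cover a whole track in consecutive windows. This is a rough sketch under stated assumptions: `slice_cmds` is a hypothetical helper, the caller already knows the track length, lengths are whole seconds (to keep the arithmetic in plain `sh`), and the `slice-N.jpg` names are illustrative. It only prints the commands; pipe the output to `sh` to run them.

```shell
#!/bin/sh
# slice_cmds INPUT TOTAL_SECONDS WINDOW_SECONDS   (hypothetical helper)
# Prints one songsee command per consecutive window of the track.
slice_cmds() {
  input=$1
  total=$2
  window=$3
  # Refuse a non-positive window, which would loop forever.
  [ "$window" -gt 0 ] || return 1
  start=0
  n=0
  while [ "$start" -lt "$total" ]; do
    dur=$window
    # Clamp the last window so it does not run past the end of the track.
    if [ $((start + dur)) -gt "$total" ]; then
      dur=$((total - start))
    fi
    echo "songsee $input --start $start --duration $dur -o slice-$n.jpg"
    start=$((start + window))
    n=$((n + 1))
  done
}

# A 20 s track in 8 s windows yields three commands (0-8, 8-16, 16-20).
slice_cmds track.mp3 20 8
```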
Related tools: `ffmpeg`, `vision_analyze`