songsee
Generate spectrograms and multi-panel audio feature visualizations from audio files.
Prerequisites
Requires Go:

```bash
go install github.com/steipete/songsee/cmd/songsee@latest
```

Optional: `ffmpeg` for formats beyond WAV/MP3.
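To confirm both prerequisites before a first run, a PATH check along these lines may help. This is a minimal sketch, not part of songsee; `have_tool` and `check_prereqs` are hypothetical helper names introduced here for illustration.

```bash
# Succeeds if the named command is found on PATH.
have_tool() {
  command -v "$1" >/dev/null 2>&1
}

# Print a hint for each missing tool; returns non-zero only if songsee is missing.
check_prereqs() {
  have_tool ffmpeg || echo "ffmpeg not found: only WAV/MP3 input will work" >&2
  if ! have_tool songsee; then
    echo "songsee not found: is \$(go env GOPATH)/bin on PATH?" >&2
    return 1
  fi
}
```

Calling `check_prereqs` after installation prints nothing when both tools are available.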
Quick Start
```bash
# Basic spectrogram
songsee track.mp3

# Save to specific file
songsee track.mp3 -o spectrogram.png

# Multi-panel visualization grid
songsee track.mp3 --viz spectrogram,mel,chroma,hpss,selfsim,loudness,tempogram,mfcc,flux

# Time slice (start at 12.5s, 8s duration)
songsee track.mp3 --start 12.5 --duration 8 -o slice.jpg

# From stdin
cat track.mp3 | songsee - --format png -o out.png
```
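The single-file commands above extend naturally to a whole directory. The following is a sketch under stated assumptions: `png_for` and `render_all` are hypothetical helpers, and the `SONGSEE` variable is an override invented here so the loop can be dry-run; none of these are songsee features.

```bash
# Derive an output image path from an audio path, e.g. dir/track.mp3 -> dir/track.png.
# Assumes the file name has an extension.
png_for() {
  printf '%s\n' "${1%.*}.png"
}

# Render every MP3 in directory $1 next to its source file.
# SONGSEE is a hypothetical override: SONGSEE=echo prints the arguments
# that would be passed instead of running songsee.
render_all() {
  for f in "$1"/*.mp3; do
    [ -e "$f" ] || continue   # skip the literal pattern when nothing matches
    "${SONGSEE:-songsee}" "$f" -o "$(png_for "$f")"
  done
}
```

Running `SONGSEE=echo render_all ./music` first shows what would be rendered without invoking the tool.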
Visualization Types
Use `--viz` with comma-separated values:

| Type | Description |
|---|---|
| `spectrogram` | Standard frequency spectrogram |
| `mel` | Mel-scaled spectrogram |
| `chroma` | Pitch class distribution |
| `hpss` | Harmonic/percussive separation |
| `selfsim` | Self-similarity matrix |
| `loudness` | Loudness over time |
| `tempogram` | Tempo estimation |
| `mfcc` | Mel-frequency cepstral coefficients |
| `flux` | Spectral flux (onset detection) |

Multiple `--viz` types render as a grid in a single image.
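When the panel selection is built in a script, it can be worth validating the names before calling songsee. The sketch below uses the type names from the Quick Start `--viz` example; `VIZ_TYPES`, `is_viz_type`, and `viz_list` are hypothetical names, and songsee performs its own validation independently of this.

```bash
# Type names as they appear in the Quick Start --viz example.
VIZ_TYPES="spectrogram mel chroma hpss selfsim loudness tempogram mfcc flux"

# Succeeds if $1 exactly matches one of the known type names.
is_viz_type() {
  for known in $VIZ_TYPES; do
    [ "$1" = "$known" ] && return 0
  done
  return 1
}

# Join the given type names with commas; fail on the first unknown name.
viz_list() {
  list=""
  for name in "$@"; do
    if ! is_viz_type "$name"; then
      echo "unknown viz type: $name" >&2
      return 1
    fi
    list="${list:+$list,}$name"
  done
  printf '%s\n' "$list"
}
```

A call such as `songsee track.mp3 --viz "$(viz_list mel chroma)" -o panels.png` then passes `mel,chroma` to the flag.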
Common Flags
| Flag | Description |
|---|---|
| `--viz` | Visualization types (comma-separated) |
| | Color palette |
| | Output image dimensions |
| | FFT window and hop size |
| | Frequency range filter |
| `--start`, `--duration` | Time slice of the audio |
| `--format` | Output format, e.g. `png` |
| `-o` | Output file path |
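The `--start` and `--duration` flags from the Quick Start example can be combined in a loop to render a track as a series of back-to-back windows. This is an illustrative sketch only: `slice_starts` and `render_slices` are hypothetical helpers, `SONGSEE` is an override invented here for dry runs, and how songsee treats a window that runs past the end of the track is not covered.

```bash
# Print start offsets (whole seconds) for back-to-back windows over a track.
# $1 = track length in seconds, $2 = window length in seconds (must be > 0).
slice_starts() {
  start=0
  while [ "$start" -lt "$1" ]; do
    echo "$start"
    start=$((start + $2))
  done
}

# Render one image per window: $1 = audio file, $2 = track length, $3 = window length.
# SONGSEE is a hypothetical override; SONGSEE=echo prints the arguments instead.
render_slices() {
  for s in $(slice_starts "$2" "$3"); do
    "${SONGSEE:-songsee}" "$1" --start "$s" --duration "$3" -o "slice_${s}.jpg"
  done
}
```

For a 30-second track, `render_slices track.mp3 30 8` would request windows starting at 0, 8, 16, and 24 seconds.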
Notes
- WAV and MP3 are decoded natively; other formats require `ffmpeg`
- Output images can be inspected with `vision_analyze` for automated audio analysis
- Useful for comparing audio outputs, debugging synthesis, or documenting audio processing pipelines
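The first note can be turned into a simple pre-flight check in a pipeline. The sketch below guesses from the file extension alone, which is an assumption made for illustration; songsee may detect formats differently. `needs_ffmpeg` is a hypothetical helper name.

```bash
# Succeeds if the file's extension is anything other than wav or mp3,
# i.e. the input would fall outside the natively decoded formats.
# A name without an extension is treated as needing ffmpeg.
needs_ffmpeg() {
  ext=$(printf '%s' "${1##*.}" | tr '[:upper:]' '[:lower:]')
  case "$ext" in
    wav|mp3) return 1 ;;
    *) return 0 ;;
  esac
}
```

A script might use it as `needs_ffmpeg input.flac && command -v ffmpeg >/dev/null || echo "ok without ffmpeg"`, adapting the message to its own needs.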