pytorch-model-cli

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

PyTorch Model to CLI Tool Conversion

PyTorch 模型转 CLI 工具指南

This skill provides guidance for tasks that require converting PyTorch models into standalone command-line tools, typically implemented in C/C++ for portability and independence from Python runtime.

本技能提供了将PyTorch模型转换为独立命令行工具的指导，这类工具通常用C/C++实现，以保证可移植性并摆脱Python运行时依赖。

Task Recognition

任务识别

This skill applies when the task involves:

Converting a PyTorch model to a standalone executable
Extracting model weights to a portable format (JSON, binary)
Implementing neural network inference in C/C++
Creating CLI tools that perform image classification or prediction
Building inference tools using libraries like cJSON and lodepng

当你需要完成以下任务时，可应用本技能：

将PyTorch模型转换为独立可执行文件
将模型权重提取为可移植格式（JSON、二进制）
在C/C++中实现神经网络推理
创建可执行图像分类或预测的CLI工具
使用cJSON、lodepng等库构建推理工具

Recommended Approach

推荐方法

Phase 1: Environment Analysis

阶段1：环境分析

Before writing any code, thoroughly analyze the available resources:

Identify the model architecture
- Read the model definition file (e.g.,
```
model.py
```
  ) completely
- Document all layer types, dimensions, and activation functions
- Note any default parameters (hidden dimensions, number of classes)
Examine available libraries
- Check for image loading libraries (lodepng, stb_image)
- Check for JSON parsing libraries (cJSON, nlohmann/json)
- Identify compilation requirements (headers, source files)
Understand input requirements
- Determine expected image dimensions (e.g., 28x28 for MNIST)
- Identify color format (grayscale, RGB, RGBA)
- Document normalization requirements (divide by 255, mean/std normalization)
Verify preprocessing pipeline
- If training code is available, examine data transformations
- Match inference preprocessing exactly to training preprocessing
- Common transformations: resize, grayscale conversion, normalization

在编写任何代码之前，先全面分析可用资源：

识别模型架构
- 完整阅读模型定义文件（如
```
model.py
```
  ）
- 记录所有层类型、维度和激活函数
- 注意默认参数（如隐藏层维度、类别数量）
检查可用库
- 确认图像加载库（如lodepng、stb_image）
- 确认JSON解析库（如cJSON、nlohmann/json）
- 明确编译要求（头文件、源文件）
理解输入要求
- 确定预期的图像维度（如MNIST的28x28）
- 识别颜色格式（灰度、RGB、RGBA）
- 记录归一化要求（如除以255、均值/标准差归一化）
验证预处理流程
- 如果有训练代码，检查数据转换步骤
- 确保推理阶段的预处理与训练阶段完全一致
- 常见转换：缩放、灰度转换、归一化

Phase 2: Weight Extraction

阶段2：权重提取

Extract model weights from PyTorch format to a portable format:

Load the model checkpoint

python

import torch
import json

# Load state dict
state_dict = torch.load('model.pth', map_location='cpu')

Convert tensors to lists

python

weights = {}
for key, tensor in state_dict.items():
    weights[key] = tensor.numpy().tolist()

Save to JSON

python

with open('weights.json', 'w') as f:
    json.dump(weights, f)

Verify extraction
- Check that all expected layer weights are present
- Verify dimensions match the model architecture
- For a model with layers fc1, fc2, fc3: expect fc1.weight, fc1.bias, etc.

将PyTorch格式的模型权重提取为可移植格式：

加载模型检查点

python

import torch
import json

# Load state dict
state_dict = torch.load('model.pth', map_location='cpu')

将张量转换为列表

python

weights = {}
for key, tensor in state_dict.items():
    weights[key] = tensor.numpy().tolist()

保存为JSON格式

python

with open('weights.json', 'w') as f:
    json.dump(weights, f)

验证提取结果
- 检查所有预期的层权重是否存在
- 验证权重维度是否与模型架构匹配
- 对于包含fc1、fc2、fc3层的模型，应确保存在fc1.weight、fc1.bias等权重

Phase 3: Reference Implementation

阶段3：参考实现

Before implementing in C/C++, create a reference output:

Run inference in PyTorch

python

model.eval()
with torch.no_grad():
    output = model(input_tensor)
    prediction = output.argmax().item()

Save reference outputs
- Store intermediate layer outputs for debugging
- Record the final prediction for verification
- This allows validating the C/C++ implementation

在C/C++实现之前，先生成参考输出：

在PyTorch中运行推理

python

model.eval()
with torch.no_grad():
    output = model(input_tensor)
    prediction = output.argmax().item()

保存参考输出
- 保存中间层输出用于调试
- 记录最终预测结果用于验证
- 这可以帮助验证C/C++实现的正确性

Phase 4: C/C++ Implementation

阶段4：C/C++实现

Implement the inference logic in C/C++:

Image loading and preprocessing
- Load image using the available library (lodepng for PNG)
- Handle color channel conversion (RGBA to grayscale if needed)
- Apply normalization (typically divide by 255.0)
- Flatten to 1D array in correct order (row-major)
Weight loading
- Parse JSON file containing weights
- Store weights in appropriate data structures
- Verify dimensions during loading
Forward pass implementation
- Implement matrix-vector multiplication for linear layers
- Implement activation functions (ReLU, softmax, etc.)
- Process layers in correct order
Output handling
- Find argmax for classification tasks
- Write prediction to output file
- Ensure only prediction goes to stdout (not progress/debug info)

在C/C++中实现推理逻辑：

图像加载与预处理
- 使用可用库加载图像（如用lodepng加载PNG）
- 处理颜色通道转换（如需要将RGBA转为灰度）
- 应用归一化（通常是除以255.0）
- 按正确顺序（行优先）展平为一维数组
权重加载
- 解析包含权重的JSON文件
- 将权重存储在合适的数据结构中
- 加载时验证权重维度
前向传播实现
- 为线性层实现矩阵-向量乘法
- 实现激活函数（如ReLU、softmax等）
- 按正确顺序处理各层
输出处理
- 针对分类任务找到argmax结果
- 将预测结果写入输出文件
- 确保只有预测结果输出到stdout（而非进度/调试信息）

Phase 5: Compilation and Testing

阶段5：编译与测试

Compile with appropriate flags
bash
```
g++ -o cli_tool main.cpp lodepng.cpp cJSON.c -std=c++11 -lm
```
- Double-check flag syntax (avoid concatenation errors like
```
-std=c++11-lm
```
  )
Test against reference
- Run the CLI tool on the same input used for reference
- Compare output to PyTorch reference
- Debug any discrepancies by checking intermediate values

使用合适的编译选项
bash
```
g++ -o cli_tool main.cpp lodepng.cpp cJSON.c -std=c++11 -lm
```
- 仔细检查选项语法（避免出现
```
-std=c++11-lm
```
  这类拼接错误）
与参考结果对比测试
- 在用于生成参考结果的同一输入上运行CLI工具
- 将输出与PyTorch的参考结果对比
- 通过检查中间值调试任何不一致的地方

Verification Strategies

验证策略

Before Implementation

实现前

Model architecture fully documented
All layer dimensions verified
Preprocessing requirements identified
Reference output generated from PyTorch

已完整记录模型架构
已验证所有层的维度
已明确预处理要求
已通过PyTorch生成参考输出

After Weight Extraction

权重提取后

All expected keys present in JSON
Weight dimensions match architecture
Bias terms included for all layers

JSON文件中包含所有预期的键
权重维度与架构匹配
所有层都包含偏置项

After C/C++ Implementation

C/C++实现后

Compilation succeeds without warnings
Output matches PyTorch reference exactly
CLI tool handles missing files gracefully
Only prediction output goes to stdout

编译成功且无警告
输出与PyTorch参考结果完全一致
CLI工具能优雅处理文件缺失情况
只有预测结果输出到stdout

Final Validation

最终验证

All test cases pass
Memory properly managed (no leaks)
Error messages go to stderr, not stdout

所有测试用例通过
内存管理正确（无泄漏）
错误信息输出到stderr而非stdout

Common Pitfalls

常见陷阱

Weight Extraction

权重提取

Forgetting to use
map_location='cpu'
when loading on CPU-only systems
Missing bias terms - ensure both weights and biases are extracted
Incorrect tensor ordering - PyTorch uses different conventions than some C libraries

在仅支持CPU的系统上加载模型时忘记使用
map_location='cpu'
遗漏偏置项 - 确保同时提取权重和偏置
张量顺序错误 - PyTorch的张量顺序与部分C库不同

Preprocessing Mismatches

预处理不匹配

Wrong normalization - training might use mean/std normalization, not just /255
Color channel issues - PNG might be RGBA while model expects grayscale
Dimension ordering - ensure row-major vs column-major consistency

归一化错误 - 训练阶段可能使用均值/标准差归一化，而非简单除以255
颜色通道问题 - PNG图像可能是RGBA格式，但模型预期灰度图
维度顺序错误 - 确保行优先与列优先的一致性

C/C++ Implementation

C/C++实现

Matrix multiplication order - verify (input × weights^T) vs (weights × input)
Activation function placement - apply after linear layer, before next layer
Integer vs float division - use 255.0, not 255, for normalization

矩阵乘法顺序错误 - 确认是（输入 × 权重转置）还是（权重 × 输入）
激活函数位置错误 - 应在线性层之后、下一层之前应用
整数与浮点数除法混淆 - 归一化时使用255.0而非255

Compilation Issues

编译问题

Flag concatenation - ensure spaces between compiler flags
Missing libraries - include all required source files (lodepng.cpp, cJSON.c)
Header dependencies - verify all headers are in include path

选项拼接错误 - 确保编译选项之间有空格
缺少库文件 - 包含所有必需的源文件（如lodepng.cpp、cJSON.c）
头文件依赖问题 - 确认所有头文件都在包含路径中

Output Handling

输出处理

Verbose library output - suppress or redirect debug/progress output
Newline handling - ensure consistent line endings in output files
Buffering issues - flush stdout before program exit

库的冗余输出 - 抑制或重定向调试/进度输出
换行符处理 - 确保输出文件的换行符一致
缓冲问题 - 程序退出前刷新stdout

Efficiency Guidelines

效率指南

Avoid repeatedly checking package managers; identify available tools first
Create reference outputs early to catch implementation bugs quickly
Review complete code before compilation attempts
Minimize status-only updates; batch related operations
Test with multiple inputs when possible, not just the provided test case

避免反复检查包管理器；先确定可用工具
尽早生成参考输出，快速发现实现中的bug
编译前先检查完整代码
减少仅状态更新的操作；批量处理相关任务
尽可能用多个输入测试，而非仅用提供的测试用例