Loading...
Loading...
Compare original and translation side by side
/paper-plan/paper-figure/paper-write/paper-compile/auto-review-loop/paper-plan/paper-figure/paper-write/paper-compile/auto-review-loopgpt-5.4PAPER_IMPROVEMENT_LOG.mdtruefalse💡 Override:/auto-paper-improvement-loop "paper/" — human checkpoint: true
gpt-5.4PAPER_IMPROVEMENT_LOG.mdtruefalse💡 覆盖配置:/auto-paper-improvement-loop "paper/" — human checkpoint: true
paper/main.pdf.texpaper/main.pdf.texPAPER_IMPROVEMENT_STATE.json{
"current_round": 1,
"threadId": "019ce736-...",
"last_score": 6,
"status": "in_progress",
"timestamp": "2026-03-13T21:00:00"
}PAPER_IMPROVEMENT_STATE.json"status": "in_progress"timestampPAPER_IMPROVEMENT_LOG.md"status": "completed""status": "completed"PAPER_IMPROVEMENT_STATE.json{
"current_round": 1,
"threadId": "019ce736-...",
"last_score": 6,
"status": "in_progress",
"timestamp": "2026-03-13T21:00:00"
}PAPER_IMPROVEMENT_STATE.json"status": "in_progress"timestampPAPER_IMPROVEMENT_LOG.md"status": "completed""status""completed"cp paper/main.pdf paper/main_round0_original.pdfcp paper/main.pdf paper/main_round0_original.pdfundefinedundefinedundefinedundefinedmcp__codex__codex:
model: gpt-5.4
config: {"model_reasoning_effort": "xhigh"}
prompt: |
You are reviewing a [VENUE] paper. Please provide a detailed, structured review.
## Full Paper Text:
[paste concatenated sections]
## Review Instructions
Please act as a senior ML reviewer ([VENUE] level). Provide:
1. **Overall Score** (1-10, where 6 = weak accept, 7 = accept)
2. **Summary** (2-3 sentences)
3. **Strengths** (bullet list, ranked)
4. **Weaknesses** (bullet list, ranked: CRITICAL > MAJOR > MINOR)
5. **For each CRITICAL/MAJOR weakness**: A specific, actionable fix
6. **Missing References** (if any)
7. **Verdict**: Ready for submission? Yes / Almost / No
Focus on: theoretical rigor, claims vs evidence alignment, writing clarity,
self-containedness, notation consistency.mcp__codex__codex:
model: gpt-5.4
config: {"model_reasoning_effort": "xhigh"}
prompt: |
你正在评审一篇[VENUE]会议的论文。请提供详细、结构化的评审意见。
## 完整论文文本:
[粘贴合并后的章节内容]
## 评审说明
请以资深机器学习评审专家([VENUE]会议级别)的身份进行评审。请提供:
1. **总体评分**(1-10分,6分=弱接收,7分=接收)
2. **摘要**(2-3句话)
3. **优势**(分点列出,按重要性排序)
4. **问题点**(分点列出,按严重程度排序:CRITICAL(严重)> MAJOR(主要)> MINOR(次要))
5. **针对每个严重/主要问题点**:具体、可执行的修改建议
6. **缺失的参考文献**(如果有)
7. **结论**:是否可提交?是/接近/否
重点关注:理论严谨性、表述与证据的一致性、写作清晰度、内容自洽性、符号一致性。HUMAN_CHECKPOINT = false📋 Round 1 review complete.
Score: X/10 — [verdict]
Key weaknesses (by severity):
1. [CRITICAL] ...
2. [MAJOR] ...
3. [MINOR] ...
Reply "go" to implement all fixes, give custom instructions, "skip 2" to skip specific fixes, or "stop" to end./auto-review-loopHUMAN_CHECKPOINT = false📋 第1轮评审完成。
评分:X/10 — [结论]
主要问题点(按严重程度):
1. [严重] ...
2. [主要] ...
3. [次要] ...
回复“go”执行所有修改,提供自定义指令,“skip 2”跳过特定修改,或“stop”终止流程。/auto-review-loop| Issue | Fix Pattern |
|---|---|
| Assumption-model mismatch | Rewrite assumption to match the model, add formal proposition bridging the gap |
| Overclaims | Soften language: "validate" → "demonstrate practical relevance", "comparable" → "qualitatively competitive" |
| Missing metrics | Add quantitative table with honest parameter counts and caveats |
| Theorem not self-contained | Add "Interpretation" paragraph listing all dependencies |
| Notation confusion | Rename conflicting symbols globally, add Notation paragraph |
| Missing references | Add to |
| Theory-practice gap | Explicitly frame theory as idealized; add synthetic validation subsection |
| 问题 | 修改模式 |
|---|---|
| 假设与模型不匹配 | 重写假设使其与模型匹配,添加正式命题填补差距 |
| 过度表述 | 弱化表述:将“validate”改为“demonstrate practical relevance”,“comparable”改为“qualitatively competitive” |
| 缺失指标 | 添加包含真实参数计数和说明的量化表格 |
| 定理不自洽 | 添加“解释”段落,列出所有依赖项 |
| 符号混淆 | 全局重命名冲突符号,添加符号说明段落 |
| 缺失参考文献 | 添加至 |
| 理论与实践脱节 | 明确将理论表述为理想化情况;添加合成验证小节 |
cd paper && latexmk -C && latexmk -pdf -interaction=nonstopmode -halt-on-error main.tex
cp main.pdf main_round1.pdfcd paper && latexmk -C && latexmk -pdf -interaction=nonstopmode -halt-on-error main.tex
cp main.pdf main_round1.pdfmcp__codex__codex-replymcp__codex__codex-reply:
threadId: [saved from Round 1]
model: gpt-5.4
config: {"model_reasoning_effort": "xhigh"}
prompt: |
[Round 2 update]
Since your last review, we have implemented:
1. [Fix 1]: [description]
2. [Fix 2]: [description]
...
Please re-score and re-assess. Same format:
Score, Summary, Strengths, Weaknesses, Actionable fixes, Verdict.mcp__codex__codex-replymcp__codex__codex-reply:
threadId: [第1轮保存的ID]
model: gpt-5.4
config: {"model_reasoning_effort": "xhigh"}
prompt: |
[第2轮更新]
自上次评审后,我们已完成以下修改:
1. [修改1]:[描述]
2. [修改2]:[描述]
...
请重新评分和评估。格式要求与之前相同:评分、摘要、优势、问题点、可执行修改建议、结论。HUMAN_CHECKPOINT = falseHUMAN_CHECKPOINT = falsecd paper && latexmk -C && latexmk -pdf -interaction=nonstopmode -halt-on-error main.tex
cp main.pdf main_round2.pdfcd paper && latexmk -C && latexmk -pdf -interaction=nonstopmode -halt-on-error main.tex
cp main.pdf main_round2.pdfundefinedundefined
**Auto-fix patterns:**
| Issue | Fix |
|-------|-----|
| Overfull hbox in equation | Wrap in `\resizebox` or split with `\split`/`aligned` |
| Overfull hbox in table | Reduce font (`\small`/`\footnotesize`) or use `\resizebox{\linewidth}{!}{...}` |
| Overfull hbox in text | Rephrase sentence or add `\allowbreak` / `\-` hints |
| Over page limit | Move content to appendix, compress tables, reduce figure sizes |
| Underfull hbox (loose) | Rephrase for better line filling or add `\looseness=-1` |
If any overfull hbox > 10pt is found, fix it and recompile before documenting.
**自动修正模式:**
| 问题 | 修正方法 |
|-------|-----|
| 公式中内容超出边距 | 使用`\resizebox`包裹,或用`\split`/`aligned`拆分 |
| 表格中内容超出边距 | 缩小字体(`\small`/`\footnotesize`)或使用`\resizebox{\linewidth}{!}{...}` |
| 正文中内容超出边距 | 改写句子或添加`\allowbreak`/`\-`换行提示 |
| 超过页数限制 | 将内容移至附录、压缩表格、缩小图片尺寸 |
| 间距过松 | 改写内容以优化行填充,或添加`\looseness=-1` |
如果发现任何超过10pt的内容超出边距问题,在记录前先修正并重新编译。PAPER_IMPROVEMENT_LOG.mdundefinedPAPER_IMPROVEMENT_LOG.mdundefined| Round | Score | Verdict | Key Changes |
|---|---|---|---|
| Round 0 (original) | X/10 | No/Almost/Yes | Baseline |
| Round 1 | Y/10 | No/Almost/Yes | [summary of fixes] |
| Round 2 | Z/10 | No/Almost/Yes | [summary of fixes] |
| 轮次 | 评分 | 结论 | 主要修改 |
|---|---|---|---|
| 第0轮(原始) | X/10 | 否/接近/是 | 基线版本 |
| 第1轮 | Y/10 | 否/接近/是 | [修改摘要] |
| 第2轮 | Z/10 | 否/接近/是 | [修改摘要] |
main_round0_original.pdfmain_round1.pdfmain_round2.pdfundefinedmain_round0_original.pdfmain_round1.pdfmain_round2.pdfundefined~/.claude/feishu.jsonreview_scoredpipeline_done"off"~/.claude/feishu.jsonreview_scoredpipeline_done"off"paper/
├── main_round0_original.pdf # Original
├── main_round1.pdf # After Round 1
├── main_round2.pdf # After Round 2 (final)
├── main.pdf # = main_round2.pdf
└── PAPER_IMPROVEMENT_LOG.md # Full review log with scorespaper/
├── main_round0_original.pdf # 原始版本
├── main_round1.pdf # 第1轮修改后
├── main_round2.pdf # 第2轮修改后(最终版)
├── main.pdf # = main_round2.pdf
└── PAPER_IMPROVEMENT_LOG.md # 包含评分的完整评审日志cat << 'EOF' > filemcp__codex__codex-replycat << 'EOF' > filemcp__codex__codex-reply| Round | Score | Key Improvements |
|---|---|---|
| Round 0 | 4/10 (content) | Baseline: assumption-model mismatch, overclaims, notation issues |
| Round 1 | 6/10 (content) | Fixed assumptions, softened claims, added interpretation, renamed notation |
| Round 2 | 7/10 (content) | Added synthetic validation, formal truncation proposition, stronger limitations |
| Round 3 | 5→8.5/10 (format) | Removed hero fig, appendix, compressed conclusion, fixed overfull hbox |
| 轮次 | 评分 | 主要改进 |
|---|---|---|
| 第0轮 | 4/10(内容) | 基线版本:假设与模型不匹配、过度表述、符号问题 |
| 第1轮 | 6/10(内容) | 修正假设、弱化表述、添加解释、重命名符号 |
| 第2轮 | 7/10(内容) | 添加合成验证、形式化截断命题、强化局限性章节 |
| 第3轮 | 5→8.5/10(格式) | 移除主图、移至附录、压缩结论、修正内容超出边距问题 |