tooluniverse-protein-modification-analysis
Compare original and translation side by side
🇺🇸
Original
English🇨🇳
Translation
ChineseProtein Post-Translational Modification Analysis
蛋白质翻译后修饰(PTMs)分析
Comprehensive PTM analysis using iPTMnet (primary), ProtVar (functional context), UniProt (baseline), STRING (interactions), ELM (linear motifs), and MassIVE/ProteomeXchange (experimental data).
利用iPTMnet(主要工具)、ProtVar(功能背景)、UniProt(基线数据)、STRING(相互作用)、ELM(线性基序)以及MassIVE/ProteomeXchange(实验数据)进行全面的PTM分析。
LOOK UP DON'T GUESS
查寻而非猜测
- PTM sites/enzymes:
iPTMnet_get_ptm_sites - Functional consequence: +
ProtVar_get_functioniPTMnet_get_ptm_ppi - Proteoforms:
iPTMnet_get_proteoforms - Linear motifs:
ELM_get_instances
- PTM位点/酶:
iPTMnet_get_ptm_sites - 功能影响:+
ProtVar_get_functioniPTMnet_get_ptm_ppi - 蛋白质变体:
iPTMnet_get_proteoforms - 线性基序:
ELM_get_instances
COMPUTE, DON'T DESCRIBE
计算而非描述
When analysis requires computation (statistics, data processing, scoring, enrichment), write and run Python code via Bash. Don't describe what you would do — execute it and report actual results. Use ToolUniverse tools to retrieve data, then Python (pandas, scipy, statsmodels, matplotlib) to analyze it.
当分析需要计算(统计、数据处理、评分、富集分析)时,通过Bash编写并运行Python代码。不要描述你要做什么——直接执行并报告实际结果。使用ToolUniverse工具获取数据,然后用Python(pandas、scipy、statsmodels、matplotlib)进行分析。
Domain Reasoning
领域推理
PTMs are context-dependent: same phosphorylation site can activate or inhibit depending on kinase and effectors. Always check: which enzyme, what functional consequence, in what cell context.
PTMs具有上下文依赖性:同一个磷酸化位点可能根据激酶和效应物的不同而激活或抑制蛋白质功能。始终要检查:涉及哪种酶、产生什么功能影响、处于何种细胞环境。
KEY PRINCIPLES
核心原则
- Disambiguation first -- resolve to UniProt accession before iPTMnet calls
- iPTMnet is SOAP-style -- every call requires parameter
operation - Evidence-graded -- distinguish experimental (T1) from predicted (T4)
- English-first queries
- 先消歧——在调用iPTMnet之前先解析到UniProt登录号
- iPTMnet采用SOAP风格——每次调用都需要参数
operation - 证据分级——区分实验验证(T1)和预测(T4)的结果
- 优先英文查询
Workflow
工作流程
Phase 0: Protein Disambiguation → UniProt accession
Phase 1: PTM Site Inventory → iPTMnet_get_ptm_sites
Phase 2: Proteoform Analysis → iPTMnet_get_proteoforms
Phase 3: PTM-Dependent Interactions → iPTMnet_get_ptm_ppi
Phase 4: Functional Context → ProtVar_get_function at key sites
Phase 4b: Linear Motif Context → ELM_get_instances for SLiM overlap
Phase 4c: Experimental Data → MassIVE/ProteomeXchange
Phase 5: Synthesis & ReportPhase 0: 蛋白质消歧 → UniProt登录号
Phase 1: PTM位点清单 → iPTMnet_get_ptm_sites
Phase 2: 蛋白质变体分析 → iPTMnet_get_proteoforms
Phase 3: PTM依赖的相互作用 → iPTMnet_get_ptm_ppi
Phase 4: 功能背景 → 关键位点的ProtVar_get_function
Phase 4b: 线性基序背景 → SLiM重叠区域的ELM_get_instances
Phase 4c: 实验数据 → MassIVE/ProteomeXchange
Phase 5: 综合分析与报告Phase 0: Disambiguation
Phase 0:消歧
- -- find UniProt IDs
iPTMnet_search(operation="search", search_term="TP53", role="Substrate") - If user provides UniProt accession directly, use it
- Select human entry if multiple hits
- ——查找UniProt ID
iPTMnet_search(operation="search", search_term="TP53", role="Substrate") - 如果用户直接提供UniProt登录号,直接使用该编号
- 如果有多个结果,选择人类条目
Phase 1: PTM Sites
Phase 1:PTM位点
iPTMnet_get_ptm_sites(operation="get_ptm_sites", uniprot_id="P04637")UniProt_get_entry_by_accessioniPTMnet_get_ptm_sites(operation="get_ptm_sites", uniprot_id="P04637")UniProt_get_entry_by_accessionPhase 2: Proteoforms
Phase 2:蛋白质变体
iPTMnet_get_proteoforms(operation="get_proteoforms", uniprot_id=...)iPTMnet_get_proteoforms(operation="get_proteoforms", uniprot_id=...)Phase 3: PTM-Dependent Interactions
Phase 3:PTM依赖的相互作用
iPTMnet_get_ptm_ppi(operation="get_ptm_ppi", uniprot_id=...)STRING_get_interaction_partners(identifiers=gene, species=9606, required_score=700)iPTMnet_get_ptm_ppi(operation="get_ptm_ppi", uniprot_id=...)STRING_get_interaction_partners(identifiers=gene, species=9606, required_score=700)Phase 4: Functional Context
Phase 4:功能背景
ProtVar_get_function(accession=..., position=N, variant_aa=AA)ProtVar_get_function(accession=..., position=N, variant_aa=AA)Phase 4b: Linear Motifs (ELM)
Phase 4b:线性基序(ELM)
ELM_get_instances(operation="get_instances", uniprot_id=..., motif_type="MOD")ELM_list_classes(operation="list_classes")ELM_get_instances(operation="get_instances", uniprot_id=..., motif_type="MOD")ELM_list_classes(operation="list_classes")Phase 4c: Experimental Data
Phase 4c:实验数据
MassIVE_search_datasets(species="9606")MassIVE_get_dataset(accession="MSV...")MassIVE_search_datasets(species="9606")MassIVE_get_dataset(accession="MSV...")Evidence Grading
证据分级
| Tier | Criteria |
|---|---|
| T1 | PTM at validated active/binding site with functional data |
| T2 | PTM in structured domain with ProtVar annotation |
| T3 | Correlation data only (mass spec detection) |
| T4 | Predicted, no experimental validation |
| 层级 | 标准 |
|---|---|
| T1 | 经过验证的活性/结合位点的PTM,带有功能数据 |
| T2 | 结构化结构域中的PTM,带有ProtVar注释 |
| T3 | 仅关联数据(质谱检测) |
| T4 | 预测结果,无实验验证 |
Tool Parameter Reference
工具参数参考
| Tool | Key Params |
|---|---|
| |
| |
| |
| |
| |
| |
| |
Critical: All iPTMnet and ELM tools require as first parameter (SOAP-style).
operation| 工具 | 关键参数 |
|---|---|
| |
| |
| |
| |
| |
| |
| |
重要提示:所有iPTMnet和ELM工具都需要将作为第一个参数(SOAP风格)。
operationFallbacks
备选方案
| Situation | Fallback |
|---|---|
| Not in iPTMnet | UniProt PTM/processing annotations |
| No PTM-PPI data | STRING general PPI |
| No ProtVar data | UniProt domain annotations |
| No ELM data | Proceed with iPTMnet/UniProt only |
| 场景 | 备选方案 |
|---|---|
| 不在iPTMnet中 | UniProt的PTM/加工注释 |
| 无PTM-PPI数据 | STRING的通用PPI数据 |
| 无ProtVar数据 | UniProt的结构域注释 |
| 无ELM数据 | 仅使用iPTMnet/UniProt继续分析 |
Limitations
局限性
- iPTMnet biased toward well-studied proteins
- Proteoform data covers observed combinations only
- PTM-PPI: only PTM-specific evidence; more PPIs exist in STRING
- iPTMnet偏向于研究充分的蛋白质
- 蛋白质变体数据仅涵盖已观测到的组合
- PTM-PPI:仅包含PTM特异性证据;STRING中存在更多的PPI数据