tooluniverse

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

ToolUniverse

Overview

概述

ToolUniverse is a unified ecosystem that enables AI agents to function as research scientists by providing standardized access to 600+ scientific resources. Use this skill to discover, execute, and compose scientific tools across multiple research domains including bioinformatics, cheminformatics, genomics, structural biology, proteomics, and drug discovery.

Key Capabilities:

Access 600+ scientific tools, models, datasets, and APIs
Discover tools using natural language, semantic search, or keywords
Execute tools through standardized AI-Tool Interaction Protocol
Compose multi-step workflows for complex research problems
Integration with Claude Desktop/Code via Model Context Protocol (MCP)

ToolUniverse是一个统一的生态系统，它通过提供对600+科研资源的标准化访问，让AI Agent能够像科研人员一样开展工作。使用本Skill可在生物信息学、化学信息学、基因组学、结构生物学、蛋白质组学和药物发现等多个研究领域中发现、执行并组合科研工具。

核心功能：

访问600+科研工具、模型、数据集和API
支持通过自然语言、语义搜索或关键词发现工具
通过标准化AI-Tool Interaction Protocol执行工具
为复杂研究问题构建多步骤工作流
通过Model Context Protocol (MCP)与Claude Desktop/Code集成

When to Use This Skill

适用场景

Use this skill when:

Searching for scientific tools by function or domain (e.g., "find protein structure prediction tools")
Executing computational biology workflows (e.g., disease target identification, drug discovery, genomics analysis)
Accessing scientific databases (OpenTargets, PubChem, UniProt, PDB, ChEMBL, KEGG, etc.)
Composing multi-step research pipelines (e.g., target discovery → structure prediction → virtual screening)
Working with bioinformatics, cheminformatics, or structural biology tasks
Analyzing gene expression, protein sequences, molecular structures, or clinical data
Performing literature searches, pathway enrichment, or variant annotation
Building automated scientific research workflows

当您遇到以下场景时可使用本Skill：

按功能或领域搜索科研工具（例如："寻找蛋白质结构预测工具"）
执行计算生物学工作流（例如：疾病靶点识别、药物发现、基因组学分析）
访问科研数据库（OpenTargets、PubChem、UniProt、PDB、ChEMBL、KEGG等）
构建多步骤研究管线（例如：靶点发现 → 结构预测 → 虚拟筛选）
处理生物信息学、化学信息学或结构生物学相关任务
分析基因表达、蛋白质序列、分子结构或临床数据
进行文献检索、通路富集或变异注释
构建自动化科研工作流

Quick Start

快速开始

Basic Setup

基础设置

python

from tooluniverse import ToolUniverse

python

from tooluniverse import ToolUniverse

Initialize and load tools

初始化并加载工具

tu = ToolUniverse() tu.load_tools() # Loads 600+ scientific tools

tu = ToolUniverse() tu.load_tools() # 加载600+科研工具

Discover tools

发现工具

tools = tu.run({ "name": "Tool_Finder_Keyword", "arguments": { "description": "disease target associations", "limit": 10 } })

Execute a tool

执行工具

result = tu.run({ "name": "OpenTargets_get_associated_targets_by_disease_efoId", "arguments": {"efoId": "EFO_0000537"} # Hypertension })

undefined

result = tu.run({ "name": "OpenTargets_get_associated_targets_by_disease_efoId", "arguments": {"efoId": "EFO_0000537"} # 高血压 })

undefined

Model Context Protocol (MCP)

For Claude Desktop/Code integration:

bash

tooluniverse-smcp

如需与Claude Desktop/Code集成：

bash

tooluniverse-smcp

Core Workflows

核心工作流

1. Tool Discovery

1. 工具发现

Find relevant tools for your research task:

Three discovery methods:

```
Tool_Finder
```
- Embedding-based semantic search (requires GPU)
```
Tool_Finder_LLM
```
- LLM-based semantic search (no GPU required)
```
Tool_Finder_Keyword
```
- Fast keyword search

Example:

python

undefined

为您的研究任务寻找相关工具：

三种发现方式：

```
Tool_Finder
```
- 基于嵌入的语义搜索（需要GPU）
```
Tool_Finder_LLM
```
- 基于LLM的语义搜索（无需GPU）
```
Tool_Finder_Keyword
```
- 快速关键词搜索

示例：

python

undefined

Search by natural language description

通过自然语言描述搜索工具

tools = tu.run({ "name": "Tool_Finder_LLM", "arguments": { "description": "Find tools for RNA sequencing differential expression analysis", "limit": 10 } })

Review available tools

查看可用工具

for tool in tools: print(f"{tool['name']}: {tool['description']}")


**See `references/tool-discovery.md` for:**
- Detailed discovery methods and search strategies
- Domain-specific keyword suggestions
- Best practices for finding tools

for tool in tools: print(f"{tool['name']}: {tool['description']}")


**详见`references/tool-discovery.md`获取：**
- 详细的发现方法与搜索策略
- 领域特定关键词建议
- 工具查找最佳实践

2. Tool Execution

2. 工具执行

Execute individual tools through the standardized interface:

Example:

python

undefined

通过标准化接口执行单个工具：

示例：

python

undefined

Execute disease-target lookup

执行疾病-靶点查询

targets = tu.run({ "name": "OpenTargets_get_associated_targets_by_disease_efoId", "arguments": {"efoId": "EFO_0000616"} # Breast cancer })

targets = tu.run({ "name": "OpenTargets_get_associated_targets_by_disease_efoId", "arguments": {"efoId": "EFO_0000616"} # 乳腺癌 })

Get protein structure

获取蛋白质结构

structure = tu.run({ "name": "AlphaFold_get_structure", "arguments": {"uniprot_id": "P12345"} })

Calculate molecular properties

计算分子属性

properties = tu.run({ "name": "RDKit_calculate_descriptors", "arguments": {"smiles": "CCO"} # Ethanol })


**See `references/tool-execution.md` for:**
- Real-world execution examples across domains
- Tool parameter handling and validation
- Result processing and error handling
- Best practices for production use

properties = tu.run({ "name": "RDKit_calculate_descriptors", "arguments": {"smiles": "CCO"} # 乙醇 })


**详见`references/tool-execution.md`获取：**
- 跨领域的真实执行示例
- 工具参数处理与验证
- 结果处理与错误处理
- 生产环境使用最佳实践

3. Tool Composition and Workflows

3. 工具组合与工作流

Compose multiple tools for complex research workflows:

Drug Discovery Example:

python

undefined

组合多个工具以完成复杂研究工作流：

药物发现示例：

python

undefined

1. Find disease targets

1. 查找疾病靶点

targets = tu.run({ "name": "OpenTargets_get_associated_targets_by_disease_efoId", "arguments": {"efoId": "EFO_0000616"} })

2. Get protein structures

2. 获取蛋白质结构

structures = [] for target in targets[:5]: structure = tu.run({ "name": "AlphaFold_get_structure", "arguments": {"uniprot_id": target['uniprot_id']} }) structures.append(structure)

3. Screen compounds

3. 化合物筛选

hits = [] for structure in structures: compounds = tu.run({ "name": "ZINC_virtual_screening", "arguments": { "structure": structure, "library": "lead-like", "top_n": 100 } }) hits.extend(compounds)

4. Evaluate drug-likeness

4. 评估药物相似性

drug_candidates = [] for compound in hits: props = tu.run({ "name": "RDKit_calculate_drug_properties", "arguments": {"smiles": compound['smiles']} }) if props['lipinski_pass']: drug_candidates.append(compound)


**See `references/tool-composition.md` for:**
- Complete workflow examples (drug discovery, genomics, clinical)
- Sequential and parallel tool composition patterns
- Output processing hooks
- Workflow best practices


**详见`references/tool-composition.md`获取：**
- 完整的工作流示例（药物发现、基因组学、临床研究）
- 顺序与并行工具组合模式
- 输出处理钩子
- 工作流最佳实践

Scientific Domains

科研领域

ToolUniverse supports 600+ tools across major scientific domains:

Bioinformatics:

Sequence analysis, alignment, BLAST
Gene expression (RNA-seq, DESeq2)
Pathway enrichment (KEGG, Reactome, GO)
Variant annotation (VEP, ClinVar)

Cheminformatics:

Molecular descriptors and fingerprints
Drug discovery and virtual screening
ADMET prediction and drug-likeness
Chemical databases (PubChem, ChEMBL, ZINC)

Structural Biology:

Protein structure prediction (AlphaFold)
Structure retrieval (PDB)
Binding site detection
Protein-protein interactions

Proteomics:

Mass spectrometry analysis
Protein databases (UniProt, STRING)
Post-translational modifications

Genomics:

Genome assembly and annotation
Copy number variation
Clinical genomics workflows

Medical/Clinical:

Disease databases (OpenTargets, OMIM)
Clinical trials and FDA data
Variant classification

See
references/domains.md
for:

Complete domain categorization
Tool examples by discipline
Cross-domain applications
Search strategies by domain

ToolUniverse支持600+工具，覆盖主要科研领域：

生物信息学：

序列分析、比对、BLAST
基因表达（RNA-seq、DESeq2）
通路富集（KEGG、Reactome、GO）
变异注释（VEP、ClinVar）

化学信息学：

分子描述符与指纹
药物发现与虚拟筛选
ADMET预测与药物相似性
化学数据库（PubChem、ChEMBL、ZINC）

结构生物学：

蛋白质结构预测（AlphaFold）
结构检索（PDB）
结合位点检测
蛋白质-蛋白质相互作用

蛋白质组学：

质谱分析
蛋白质数据库（UniProt、STRING）
翻译后修饰

基因组学：

基因组组装与注释
拷贝数变异
临床基因组学工作流

医学/临床：

疾病数据库（OpenTargets、OMIM）
临床试验与FDA数据
变异分类

详见
references/domains.md
获取：

完整的领域分类
各学科工具示例
跨领域应用
分领域搜索策略

Reference Documentation

参考文档

This skill includes comprehensive reference files that provide detailed information for specific aspects:

references/installation.md
- Installation, setup, MCP configuration, platform integration
references/tool-discovery.md
- Discovery methods, search strategies, listing tools
references/tool-execution.md
- Execution patterns, real-world examples, error handling
references/tool-composition.md
- Workflow composition, complex pipelines, parallel execution
references/domains.md
- Tool categorization by domain, use case examples
references/api_reference.md
- Python API documentation, hooks, protocols

Workflow: When helping with specific tasks, reference the appropriate file for detailed instructions. For example, if searching for tools, consult

references/tool-discovery.md

for search strategies.

本Skill包含全面的参考文件，提供各方面的详细信息：

references/installation.md
- 安装、设置、MCP配置、平台集成
references/tool-discovery.md
- 发现方法、搜索策略、工具列表
references/tool-execution.md
- 执行模式、真实示例、错误处理
references/tool-composition.md
- 工作流组合、复杂管线、并行执行
references/domains.md
- 工具领域分类、用例示例
references/api_reference.md
- Python API文档、钩子、协议

工作流建议： 当处理特定任务时，请查阅对应文件获取详细说明。例如，若您需要搜索工具，请参考

references/tool-discovery.md

中的搜索策略。

Example Scripts

示例脚本

Two executable example scripts demonstrate common use cases:

scripts/example_tool_search.py
- Demonstrates all three discovery methods:

Keyword-based search
LLM-based search
Domain-specific searches
Getting detailed tool information

scripts/example_workflow.py
- Complete workflow examples:

Drug discovery pipeline (disease → targets → structures → screening → candidates)
Genomics analysis (expression data → differential analysis → pathways)

Run examples to understand typical usage patterns and workflow composition.

两个可执行示例脚本展示了常见使用场景：

scripts/example_tool_search.py
- 演示三种发现方式：

基于关键词的搜索
基于LLM的搜索
领域特定搜索
获取详细工具信息

scripts/example_workflow.py
- 完整工作流示例：

药物发现管线（疾病 → 靶点 → 结构 → 筛选 → 候选药物）
基因组学分析（表达数据 → 差异分析 → 通路）

运行示例以了解典型使用模式与工作流组合方式。

Best Practices

最佳实践

Tool Discovery:
- Start with broad searches, then refine based on results
- Use
```
Tool_Finder_Keyword
```
  for fast searches with known terms
- Use
```
Tool_Finder_LLM
```
  for complex semantic queries
- Set appropriate
```
limit
```
  parameter (default: 10)
Tool Execution:
- Always verify tool parameters before execution
- Implement error handling for production workflows
- Validate input data formats (SMILES, UniProt IDs, gene symbols)
- Check result types and structures
Workflow Composition:
- Test each step individually before composing full workflows
- Implement checkpointing for long workflows
- Consider rate limits for remote APIs
- Use parallel execution when tools are independent
Integration:
- Initialize ToolUniverse once and reuse the instance
- Call
```
load_tools()
```
  once at startup
- Cache frequently used tool information
- Enable logging for debugging

工具发现：
- 从宽泛搜索开始，再根据结果细化
- 若已知术语，使用
```
Tool_Finder_Keyword
```
  进行快速搜索
- 针对复杂语义查询，使用
```
Tool_Finder_LLM
```
- 设置合适的
```
limit
```
  参数（默认值：10）
工具执行：
- 执行前务必验证工具参数
- 为生产环境工作流实现错误处理
- 验证输入数据格式（SMILES、UniProt ID、基因符号等）
- 检查结果类型与结构
工作流组合：
- 在组合完整工作流前，单独测试每个步骤
- 为长工作流实现检查点机制
- 考虑远程API的调用频率限制
- 当工具相互独立时，使用并行执行
集成：
- 初始化一次ToolUniverse并复用实例
- 在启动时仅调用一次
```
load_tools()
```
- 缓存频繁使用的工具信息
- 启用日志以方便调试

Key Terminology

关键术语

Tool: A scientific resource (model, dataset, API, package) accessible through ToolUniverse
Tool Discovery: Finding relevant tools using search methods (Finder, LLM, Keyword)
Tool Execution: Running a tool with specific arguments via
```
tu.run()
```
Tool Composition: Chaining multiple tools for multi-step workflows
MCP: Model Context Protocol for integration with Claude Desktop/Code
AI-Tool Interaction Protocol: Standardized interface for LLM-tool communication

Tool（工具）：可通过ToolUniverse访问的科研资源（模型、数据集、API、包）
Tool Discovery（工具发现）：使用搜索方法（Finder、LLM、关键词）寻找相关工具
Tool Execution（工具执行）：通过
```
tu.run()
```
运行带特定参数的工具
Tool Composition（工具组合）：将多个工具链接起来构建多步骤工作流
MCP：用于与Claude Desktop/Code集成的Model Context Protocol
AI-Tool Interaction Protocol：LLM与工具通信的标准化接口

Resources

资源

Official Website: https://aiscientist.tools
GitHub: https://github.com/mims-harvard/ToolUniverse
Documentation: https://zitniklab.hms.harvard.edu/ToolUniverse/
Installation:
```
uv uv pip install tooluniverse
```
MCP Server:
```
tooluniverse-smcp
```

官方网站：https://aiscientist.tools
GitHub仓库：https://github.com/mims-harvard/ToolUniverse
文档站点：https://zitniklab.hms.harvard.edu/ToolUniverse/
安装命令：
```
uv uv pip install tooluniverse
```
MCP服务器：
```
tooluniverse-smcp
```