asta-skill
Asta MCP — Academic Paper Search
Asta is Ai2's Scientific Corpus Tool, exposing the Semantic Scholar academic graph over MCP (streamable HTTP transport). This skill tells agents which Asta tool to call for which intent, and how to compose them into useful workflows.
- MCP endpoint: `https://asta-tools.allen.ai/mcp/v1`
- Auth: `x-api-key` header (request key at https://share.hsforms.com/1L4hUh20oT3mu8iXJQMV77w3ioxm)
- Transport: streamable HTTP
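A host agent needs the endpoint, the auth header, and the transport in its MCP server entry. The sketch below assembles them in Python. The `ASTA_API_KEY` environment variable, the `build_asta_server_config` helper, and the dict keys (`transport`, `url`, `headers`) are illustrative assumptions; the real configuration schema depends on the host agent.

```python
import os
from typing import Optional

ASTA_MCP_URL = "https://asta-tools.allen.ai/mcp/v1"


def build_asta_server_config(api_key: Optional[str] = None) -> dict:
    """Return a hypothetical MCP server entry for Asta.

    The key names in the returned dict are placeholders; adapt them to
    whatever schema your host agent expects.
    """
    # Fall back to an assumed environment variable when no key is passed in.
    key = api_key or os.environ.get("ASTA_API_KEY")
    if not key:
        raise ValueError("No Asta API key available; request one first.")
    return {
        "transport": "streamable-http",
        "url": ASTA_MCP_URL,
        "headers": {"x-api-key": key},
    }
```

Keeping the key out of the source and reading it at startup avoids committing credentials alongside the skill.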
Prerequisite Check
Before invoking any tool, verify the Asta MCP server is registered in the host agent. Tool names will be prefixed by the MCP server name chosen at install time (commonly `asta__<tool>` or `mcp__asta__<tool>`). If no Asta tools are visible, direct the user to the Installation section below.
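The prefix check can be sketched as a small lookup over the tool names the host agent exposes. The helper name `resolve_asta_tool` and the idea of receiving the visible tool names as a plain list are assumptions for illustration; only the two prefixes come from the text above.

```python
from typing import Iterable, Optional

# The two common prefixes named above; a custom server name would need adding here.
KNOWN_PREFIXES = ("mcp__asta__", "asta__")


def resolve_asta_tool(tool: str, visible_tools: Iterable[str]) -> Optional[str]:
    """Return the prefixed name under which `tool` is registered, or None."""
    visible = set(visible_tools)
    for prefix in KNOWN_PREFIXES:
        candidate = prefix + tool
        if candidate in visible:
            return candidate
    # Nothing matched: the Asta server is probably not installed.
    return None
```

A `None` result is the signal to point the user at the Installation section rather than attempting a call.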
Tool Map — Intent → Asta Tool
| User intent | Asta tool | Notes |
|---|---|---|
| Broad topic search | `search_papers_by_relevance` | Supports venue + date filters |
| Known paper title | `search_paper_by_title` | Optional venue restriction |
| Known DOI / arXiv / PMID / CorpusId / MAG / ACL / SHA / URL | `get_paper` | Single-paper lookup |
| Multiple known IDs at once | `get_paper_batch` | Batch lookup; prefer over N sequential `get_paper` calls |
| Who cited paper X | `get_citations` | Citation traversal with filters, paginated |
| Find author by name | `search_authors_by_name` | Returns profile info |
| An author's publications | `get_author_papers` | Pass author id from previous call |
| Find passages mentioning X | `snippet_search` | ~500-word excerpts from paper bodies |
Search/citation tools accept `publication_date_range` (format `YYYY-MM-DD:YYYY-MM-DD`; year shorthand like `"2021:"`, `":2015-01"`, `"2015:2020"` is also accepted) and `venues` (comma-separated) filters, plus `fields` for field selection. Pass them whenever the user's intent constrains scope (e.g., "recent", "since 2022", "at NeurIPS").
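Turning phrases like "recent" or "at NeurIPS" into filter arguments can be sketched as follows. The helper names `recent_years_range` and `build_filters` are hypothetical; the parameter names and the year-shorthand format are the ones described above.

```python
from datetime import date
from typing import Iterable, Optional


def recent_years_range(years: int, today: Optional[date] = None) -> str:
    """Open-ended year shorthand covering the last `years` years."""
    today = today or date.today()
    return f"{today.year - years}:"


def build_filters(date_range: Optional[str] = None,
                  venues: Iterable[str] = (),
                  fields: Iterable[str] = ()) -> dict:
    """Assemble only the filters the user's intent actually constrains."""
    args = {}
    if date_range:
        args["publication_date_range"] = date_range
    venue_list = list(venues)
    if venue_list:
        args["venues"] = ",".join(venue_list)  # comma-separated, per the text
    field_list = list(fields)
    if field_list:
        args["fields"] = ",".join(field_list)
    return args
```

Omitting a key entirely, rather than sending an empty value, keeps unconstrained searches unconstrained.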
⚠️ `fields` parameter — avoid context blowups
`get_paper` and `get_paper_batch` take a `fields` parameter. Avoid requesting `citations` or `references` through `fields`; use `get_citations` or `get_references` instead of `get_paper` with `fields=references`.
Safe default `fields` for `get_paper`: `title,year,authors,venue,tldr,url,abstract`. Add `journal`, `publicationDate`, `fieldsOfStudy`, `isOpenAccess` only when needed.
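One way to keep `fields` small is to build it from the safe default and refuse the two list-valued fields outright. This is a sketch under assumptions: the helper `build_fields` is hypothetical, and treating `citations` and `references` as fields to reject is this example's reading of the warning, not documented Asta behavior.

```python
SAFE_DEFAULT_FIELDS = ["title", "year", "authors", "venue", "tldr", "url", "abstract"]

# Assumed to be the fields that inflate responses; fetch them through
# get_citations / get_references instead.
HEAVY_FIELDS = {"citations", "references"}


def build_fields(extra=()):
    """Return a comma-separated `fields` value: safe default plus opt-in extras."""
    fields = list(SAFE_DEFAULT_FIELDS)
    for name in extra:
        if name in HEAVY_FIELDS:
            raise ValueError(
                f"'{name}' should not go through fields; "
                "use get_citations or get_references instead"
            )
        if name not in fields:
            fields.append(name)
    return ",".join(fields)
```

For example, `build_fields(["journal"])` appends `journal` to the default set, while `build_fields(["references"])` raises before any request is made.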
Retrieving DOI / external IDs (undocumented but supported)
Asta's official `fields` list does not include `externalIds`, but the field is transparently passed through to the underlying Semantic Scholar API and works in practice. Add `externalIds` to `fields` to retrieve `DOI`, `PubMed`, `PubMedCentral`, `ArXiv`, `MAG`, `DBLP`, `CorpusId`. Caveats:
- Not all papers have a DOI; pure arXiv preprints often only return `ArXiv` + `CorpusId`.
- `get_paper("DOI:...")` lookup is not 100% reliable; some valid DOIs return `not found`. Prefer searching by title first, then reading `externalIds` off the result.
- Since this is undocumented, treat it as best-effort and degrade gracefully if a future Asta release drops it.
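Graceful degradation here mostly means never assuming `externalIds` or any key inside it is present. A minimal sketch, assuming the paper arrives as a plain dict; the helper `best_identifier` and its preference order are hypothetical, and the sample values in the usage line are made up.

```python
from typing import Optional, Tuple

# Preference order is an assumption of this sketch: DOI first, then the
# identifiers that pure arXiv preprints usually still carry.
PREFERRED_KEYS = ("DOI", "ArXiv", "CorpusId")


def best_identifier(paper: dict) -> Optional[Tuple[str, str]]:
    """Return (kind, value) for the best available external ID, or None."""
    # `externalIds` may be missing or null if a future release drops it.
    external = paper.get("externalIds") or {}
    for key in PREFERRED_KEYS:
        value = external.get(key)
        if value:
            return key, str(value)
    return None
```

With a made-up preprint record such as `{"externalIds": {"ArXiv": "0000.00000", "CorpusId": 123}}`, the function falls through the missing DOI and returns the arXiv entry.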
Workflow Patterns
Pattern 1 — Topic Discovery
- `search_papers_by_relevance(keyword, publication_date_range="<current_year-5>:", venues=?)` → initial hits (compute the lower bound from today's date: in 2026, for example, pass `publication_date_range="2021:"`; adjust or drop the filter if the user asks for older work)
- Rank/present top N by citationCount + recency
- Offer follow-ups: `get_citations` on the most influential, or `snippet_search` for specific claims
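The ranking step leaves the exact weighting open. One simple reading, sketched below, sorts by citation count and breaks ties by year; the helper `rank_papers` is hypothetical, and the result dicts are assumed to carry `citationCount` and `year` keys.

```python
def rank_papers(papers, top_n=10):
    """Sort by citationCount (descending), using year as the tie-breaker."""
    def sort_key(paper):
        # Missing or null values sort last instead of raising.
        return (paper.get("citationCount") or 0, paper.get("year") or 0)

    return sorted(papers, key=sort_key, reverse=True)[:top_n]
```

A blended score that boosts recent papers would be an equally valid interpretation; the tuple key is just the smallest version that respects both criteria.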
Pattern 2 — Seed-Paper Expansion
- `get_paper(DOI|arXiv|...)` → verify seed
- `get_citations(paperId)` → forward expansion
- Optionally `search_papers_by_relevance` with seed title terms for sideways discovery
- Deduplicate by paperId before presenting
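The final deduplication step can be sketched as an order-preserving merge keyed on `paperId`. The helper `dedupe_by_paper_id` is hypothetical, and it assumes each result has already been flattened to a paper dict with a `paperId` key.

```python
def dedupe_by_paper_id(*result_lists):
    """Merge result lists, keeping the first occurrence of each paperId."""
    seen = set()
    unique = []
    for results in result_lists:
        for paper in results:
            paper_id = paper.get("paperId")
            if paper_id is not None:
                if paper_id in seen:
                    continue  # already collected from an earlier list
                seen.add(paper_id)
            # Entries without a paperId cannot be compared, so they are kept.
            unique.append(paper)
    return unique
```

Passing the citation results first and the sideways-search results second keeps the citation ordering intact in the merged list.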
Pattern 3 — Author Deep-Dive
- `search_authors_by_name(name)` → pick correct profile (disambiguate by affiliation)
- `get_author_papers(authorId)` → full publication list
- Filter client-side by topic keywords or date
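Client-side filtering of an author's publication list might look like the sketch below. The helper `filter_papers` is hypothetical, and it assumes each paper dict exposes `title`, `abstract`, and `year`; papers with an unknown year are excluded whenever a year bound is set.

```python
def filter_papers(papers, keywords=(), min_year=None, max_year=None):
    """Keep papers matching any keyword (case-insensitive) within the year bounds."""
    wanted = [k.lower() for k in keywords]
    kept = []
    for paper in papers:
        text = f"{paper.get('title') or ''} {paper.get('abstract') or ''}".lower()
        if wanted and not any(k in text for k in wanted):
            continue
        year = paper.get("year")
        if min_year is not None and (year is None or year < min_year):
            continue
        if max_year is not None and (year is None or year > max_year):
            continue
        kept.append(paper)
    return kept
```

Substring matching is deliberately crude; it is enough to narrow a long publication list before presenting it.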
Pattern 4 — Evidence Retrieval
- `snippet_search(claim_query)` → find passages making/supporting a claim
- For each hit, optionally `get_paper(id)` for full metadata
Output & Interaction Rules
- Always report total count and which tool was used.
- Present top 10 as a table (title, year, venue, citations), then details for the most relevant.
- If the user writes in Chinese, present summaries in Chinese; keep titles in original language.
- After results, offer: Details / Refine / Citations / Snippet / Export / Done.
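The first two output rules can be sketched as a small formatter that reports the total and tool, then renders the top 10 as a table. The helper `top_results_table` and the summary-line wording are assumptions; missing values are shown as `n/a` rather than filled in.

```python
def top_results_table(papers, total, tool, limit=10):
    """Render a summary line plus a Markdown table of the top `limit` papers."""
    lines = [
        f"Total: {total} (tool: {tool})",
        "",
        "| Title | Year | Venue | Citations |",
        "|---|---|---|---|",
    ]
    for paper in papers[:limit]:
        cells = [
            paper.get("title"),
            paper.get("year"),
            paper.get("venue"),
            paper.get("citationCount"),
        ]
        rendered = [
            # Escape pipes so a title cannot break the table layout.
            "n/a" if cell is None or cell == "" else str(cell).replace("|", "\\|")
            for cell in cells
        ]
        lines.append("| " + " | ".join(rendered) + " |")
    return "\n".join(lines)
```

A citation count of zero is printed as `0`, since it is a real value and not a missing one.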
Critical Rules
- Prefer batched intent over ping-pong. If the user's question needs two independent lookups, issue them as parallel MCP tool calls in one turn, not sequentially.
- Never guess IDs. If a user gives a fuzzy title, use `search_paper_by_title` before `get_paper`.
- Respect rate limits. An API key buys higher limits, not unlimited ones; do not expand citation graphs beyond what the user asked for.
- Do not fabricate fields. If Asta returns a null `abstract` or `venue`, say so rather than inventing one.
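For a programmatic client, issuing two independent lookups together rather than one after the other can be sketched with `asyncio.gather`. Everything about `call_tool` here is a stand-in: it is a hypothetical placeholder for whatever function your MCP client exposes, and the argument names in the usage example (`name`, `title`) are assumptions too.

```python
import asyncio


async def call_tool(name, arguments):
    """Hypothetical stand-in for the host's MCP tool-call function."""
    await asyncio.sleep(0)  # placeholder for the network round trip
    return {"tool": name, "arguments": arguments}


async def run_independent_lookups(calls):
    """Run independent (tool, arguments) pairs concurrently, preserving order."""
    return await asyncio.gather(*(call_tool(name, args) for name, args in calls))
```

Usage, with made-up arguments:

```python
results = asyncio.run(run_independent_lookups([
    ("search_authors_by_name", {"name": "Example Author"}),
    ("search_paper_by_title", {"title": "Example Title"}),
]))
```

Calls that depend on each other, such as resolving a title before `get_paper`, still have to run in sequence.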
Handling Asta responses
| Situation | What to do |
|---|---|
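| Empty `snippet_search` results | Not all corpus papers have full text; fall back to a metadata search such as `search_papers_by_relevance` |
| Author disambiguation uncertain | Inspect affiliations in `search_authors_by_name` results |
| Rate-limit response | Back off; batch with `get_paper_batch` |
| Need DOI / PubMed ID / arXiv ID | Add `externalIds` to `fields` |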
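The "back off" advice can be sketched as a retry wrapper with exponential delays. This is illustrative only: `RateLimited` is a hypothetical exception your own client would raise when Asta signals a rate limit, and the attempt count and delays are arbitrary defaults, not values documented by Asta.

```python
import time


class RateLimited(Exception):
    """Hypothetical error raised by the caller's client on a rate-limit response."""


def with_backoff(call, max_attempts=4, base_delay=1.0, sleep=time.sleep):
    """Retry `call` on RateLimited, doubling the wait after each failure."""
    for attempt in range(max_attempts):
        try:
            return call()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            sleep(base_delay * 2 ** attempt)
```

Replacing many single lookups with one `get_paper_batch` call reduces how often this wrapper has to wait at all.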