hugging-face-paper-pages
Hugging Face Paper Pages
Hugging Face Paper Pages (hf.co/papers) is a platform built on top of arXiv (arxiv.org), specifically for research papers in artificial intelligence (AI) and computer science. Hugging Face users can submit their paper at hf.co/papers/submit, which features it on the Daily Papers feed (hf.co/papers). Each day, users can upvote and comment on papers. Each paper page allows authors to:
- claim their paper (by clicking their name in the `authors` field). This makes the paper page appear on their Hugging Face profile.
- link the associated model checkpoints, datasets and Spaces by including the HF paper or arXiv URL in the model card, dataset card or README of the Space
- link the GitHub repository and/or project page URLs
- link the HF organization. This also makes the paper page appear on the Hugging Face organization page.
Whenever someone mentions an HF paper or arXiv abstract/PDF URL in a model card, dataset card or README of a Space repository, the paper is automatically indexed. Note that not all papers indexed on Hugging Face are also submitted to Daily Papers; submission is mainly a way of promoting a research paper. Papers can only be submitted to Daily Papers up to 14 days after their publication date on arXiv.
The Hugging Face team has built an easy-to-use API to interact with paper pages. The content of a paper can be fetched as markdown, or structured metadata can be returned, such as author names, linked models/datasets/Spaces, the linked GitHub repository and the project page.
When to Use
- User shares a Hugging Face paper page URL (e.g. `https://huggingface.co/papers/2602.08025`)
- User shares a Hugging Face markdown paper page URL (e.g. `https://huggingface.co/papers/2602.08025.md`)
- User shares an arXiv URL (e.g. `https://arxiv.org/abs/2602.08025` or `https://arxiv.org/pdf/2602.08025`)
- User mentions an arXiv ID (e.g. `2602.08025`)
- User asks you to summarize, explain, or analyze an AI research paper
Parsing the paper ID
It's recommended to parse the paper ID (arXiv ID) from whatever the user provides:
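| Input | Paper ID |
|---|---|
| `https://huggingface.co/papers/2602.08025` | `2602.08025` |
| `https://huggingface.co/papers/2602.08025.md` | `2602.08025` |
| `https://arxiv.org/abs/2602.08025` | `2602.08025` |
| `https://arxiv.org/pdf/2602.08025` | `2602.08025` |
| `2602.08025` | `2602.08025` |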
This allows you to pass the paper ID to any of the Hub API endpoints mentioned below.
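The parsing step can be sketched as a small shell helper. This is only an illustration: `parse_paper_id` is a hypothetical name, and dropping a trailing `vN` version suffix is an assumption of the sketch rather than documented behavior.

```bash
# Extract an arXiv-style paper ID (four digits, a dot, four or five digits)
# from a URL or a bare ID. Prints nothing if no ID is found.
# Assumption: a trailing version suffix such as "v2" is dropped.
parse_paper_id() {
  printf '%s\n' "$1" |
    grep -oE '[0-9]{4}\.[0-9]{4,5}(v[0-9]+)?' |
    head -n 1 |
    sed -E 's/v[0-9]+$//'
}
```

For example, `parse_paper_id "https://arxiv.org/pdf/2602.08025"` prints `2602.08025`, which can then be substituted for `{PAPER_ID}` in the endpoints.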
Fetch the paper page as markdown
The content of a paper can be fetched as markdown like so:
```bash
curl -s "https://huggingface.co/papers/{PAPER_ID}.md"
```
This should return the Hugging Face paper page as markdown. This relies on the HTML version of the paper at `https://arxiv.org/html/{PAPER_ID}`.
There are 2 exceptions:
- Not all arXiv papers have an HTML version. If the HTML version of the paper does not exist, then the content falls back to the HTML of the Hugging Face paper page.
- If it results in a 404, it means the paper is not yet indexed on hf.co/papers. See Error Handling for info.
Alternatively, you can request markdown from the normal paper page URL, like so:
```bash
curl -s -H "Accept: text/markdown" "https://huggingface.co/papers/{PAPER_ID}"
```
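Because a 404 here means the paper is not indexed yet, a script may want to fail loudly rather than treat an error page as content. A minimal sketch, assuming hypothetical helper names `paper_md_url` and `fetch_paper_markdown`:

```bash
# Build the markdown URL for a paper ID.
paper_md_url() {
  printf 'https://huggingface.co/papers/%s.md\n' "$1"
}

# Print the markdown on success; on an HTTP error, print a hint to stderr.
fetch_paper_markdown() {
  # --fail makes curl exit non-zero on HTTP errors such as 404.
  curl -s --fail "$(paper_md_url "$1")" || {
    echo "Paper $1 is not available as markdown (possibly not indexed yet)" >&2
    return 1
  }
}
```

Usage would look like `fetch_paper_markdown 2602.08025 > paper.md`.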
Paper Pages API Endpoints
All endpoints use the base URL `https://huggingface.co`.
Get structured metadata
Fetch the paper metadata as JSON using the Hugging Face REST API:
```bash
curl -s "https://huggingface.co/api/papers/{PAPER_ID}"
```
This returns structured metadata that can include:
- authors (names, plus Hugging Face usernames if they have claimed the paper)
- media URLs (uploaded when submitting the paper to Daily Papers)
- summary (abstract) and AI-generated summary
- project page and GitHub repository
- organization and engagement metadata (number of upvotes)
To find models linked to the paper, use:
```bash
curl -s "https://huggingface.co/api/models?filter=arxiv:{PAPER_ID}"
```
To find datasets linked to the paper, use:
```bash
curl -s "https://huggingface.co/api/datasets?filter=arxiv:{PAPER_ID}"
```
To find spaces linked to the paper, use:
```bash
curl -s "https://huggingface.co/api/spaces?filter=arxiv:{PAPER_ID}"
```
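The three filter queries above differ only in the repository type, so they can be wrapped in one helper. A minimal sketch; `linked_repos_url` and `list_linked_repos` are hypothetical names, not part of any Hugging Face tooling:

```bash
# Build the listing URL for one repo type (models, datasets or spaces).
linked_repos_url() {
  printf 'https://huggingface.co/api/%s?filter=arxiv:%s\n' "$1" "$2"
}

# Query all three repo types for a paper ID and print each raw JSON response.
list_linked_repos() {
  for kind in models datasets spaces; do
    echo "== $kind =="
    curl -s "$(linked_repos_url "$kind" "$1")"
    echo
  done
}
```

Note that the URL is always quoted when passed to curl, so the shell does not try to expand the `?` in the query string.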
Claim paper authorship
Claim authorship of a paper for a Hugging Face user:
```bash
curl "https://huggingface.co/api/settings/papers/claim" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "paperId": "{PAPER_ID}",
    "claimAuthorId": "{AUTHOR_ENTRY_ID}",
    "targetUserId": "{USER_ID}"
  }'
```
- Endpoint: `POST /api/settings/papers/claim`
- Body:
  - `paperId` (string, required): arXiv paper identifier being claimed
  - `claimAuthorId` (string): author entry on the paper being claimed, 24-char hex ID
  - `targetUserId` (string): HF user who should receive the claim, 24-char hex ID
- Response: paper authorship claim result, including the claimed paper ID
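Since two of the body fields are 24-char hex IDs, it can help to check them before sending the request. The sketch below uses hypothetical helper names; accepting uppercase hex digits and requiring both IDs are assumptions of the sketch (the endpoint itself only marks `paperId` as required).

```bash
# Succeeds only for a 24-character hexadecimal string.
is_object_id() {
  printf '%s\n' "$1" | grep -Eq '^[0-9a-fA-F]{24}$'
}

# Print the JSON body for the claim request, or fail if an ID is malformed.
# Arguments: paper ID, author entry ID, target user ID.
claim_payload() {
  is_object_id "$2" && is_object_id "$3" || return 1
  printf '{"paperId":"%s","claimAuthorId":"%s","targetUserId":"%s"}\n' "$1" "$2" "$3"
}
```

The output of `claim_payload` can be passed to curl with `--data "$(claim_payload ...)"`.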
Get daily papers
Fetch the Daily Papers feed:
```bash
curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/daily_papers?p=0&limit=20&date=2017-07-21&sort=publishedAt"
```
- Endpoint: `GET /api/daily_papers`
- Query parameters:
  - `p` (integer): page number
  - `limit` (integer): number of results, between 1 and 100
  - `date` (string): RFC 3339 full-date, for example `2017-07-21`
  - `week` (string): ISO week, for example `2024-W03`
  - `month` (string): month value, for example `2024-01`
  - `submitter` (string): filter by submitter
  - `sort` (enum): `publishedAt` or `trending`
- Response: list of daily papers
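The `date`, `week` and `month` parameters all select a time period. The sketch below builds a URL with exactly one of them; treating them as mutually exclusive is an assumption of this sketch, and `daily_papers_url` is a hypothetical helper name.

```bash
# Build a Daily Papers URL for one period filter.
# Arguments: filter name (date, week or month), filter value,
#            optional limit (default 20), optional sort (default publishedAt).
daily_papers_url() {
  case "$1" in
    date|week|month) ;;
    *) echo "unknown period filter: $1" >&2; return 1 ;;
  esac
  printf 'https://huggingface.co/api/daily_papers?%s=%s&limit=%s&sort=%s\n' \
    "$1" "$2" "${3:-20}" "${4:-publishedAt}"
}
```

For instance, `curl -s "$(daily_papers_url week 2024-W03 50 trending)"` would request up to 50 trending papers for that ISO week.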
List papers
List arXiv papers sorted by published date:
```bash
curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/papers?cursor={CURSOR}&limit=20"
```
- Endpoint: `GET /api/papers`
- Query parameters:
  - `cursor` (string): pagination cursor
  - `limit` (integer): number of results, between 1 and 100
- Response: list of papers
Search papers
Perform hybrid semantic and full-text search on papers:
```bash
curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/papers/search?q=vision+language&limit=20"
```
This searches over the paper title, authors, and content.
- Endpoint: `GET /api/papers/search`
- Query parameters:
  - `q` (string): search query, max length 250
  - `limit` (integer): number of results, between 1 and 120
- Response: matching papers
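Free-text queries need URL encoding, and the documented limits can be checked before the request is sent. A minimal sketch with hypothetical helper names; it relies on curl's `-G` and `--data-urlencode` options to encode the query string:

```bash
# Check the documented limits: q up to 250 characters, limit between 1 and 120.
valid_search_args() {
  [ "${#1}" -le 250 ] &&
    [ "$2" -ge 1 ] 2>/dev/null &&
    [ "$2" -le 120 ] 2>/dev/null
}

# Arguments: search query, optional limit (default 20).
search_papers() {
  valid_search_args "$1" "${2:-20}" || { echo "invalid query or limit" >&2; return 1; }
  # -G appends the --data-urlencode pairs to the URL as an encoded query string.
  curl -s -G "https://huggingface.co/api/papers/search" \
    --data-urlencode "q=$1" \
    --data-urlencode "limit=${2:-20}"
}
```

With this, `search_papers "vision language" 20` sends the same request as the example above without hand-encoding the space.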
Index a paper
Insert a paper from arXiv by ID. If the paper is already indexed, only its authors can re-index it:
```bash
curl "https://huggingface.co/api/papers/index" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "arxivId": "{ARXIV_ID}"
  }'
```
- Endpoint: `POST /api/papers/index`
- Body:
  - `arxivId` (string, required): arXiv ID to index, for example `2301.00001`
    - Pattern: `^\d{4}\.\d{4,5}$`
- Response: empty JSON object on success
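The documented pattern has no room for a version suffix such as `v2`, so it may be worth validating the ID locally before posting. The helper names in this sketch are hypothetical; the request itself mirrors the curl call above.

```bash
# Succeeds only if the argument matches the documented pattern ^\d{4}\.\d{4,5}$.
is_indexable_arxiv_id() {
  printf '%s\n' "$1" | grep -Eq '^[0-9]{4}\.[0-9]{4,5}$'
}

# Validate the ID, then send the index request (requires HF_TOKEN to be set).
index_paper() {
  is_indexable_arxiv_id "$1" || { echo "not a bare arXiv ID: $1" >&2; return 1; }
  curl -s "https://huggingface.co/api/papers/index" \
    --request POST \
    --header "Content-Type: application/json" \
    --header "Authorization: Bearer $HF_TOKEN" \
    --data "{\"arxivId\": \"$1\"}"
}
```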
Update paper links
Update the project page, GitHub repository, or submitting organization for a paper. The requester must be the paper author, the Daily Papers submitter, or a papers admin:
```bash
curl "https://huggingface.co/api/papers/{PAPER_OBJECT_ID}/links" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "projectPage": "https://example.com",
    "githubRepo": "https://github.com/org/repo",
    "organizationId": "{ORGANIZATION_ID}"
  }'
```
- Endpoint: `POST /api/papers/{paperId}/links`
- Path parameters:
  - `paperId` (string, required): Hugging Face paper object ID
- Body:
  - `githubRepo` (string, nullable): GitHub repository URL
  - `organizationId` (string, nullable): organization ID, 24-char hex ID
  - `projectPage` (string, nullable): project page URL
- Response: empty JSON object on success
Error Handling
- 404 on `https://huggingface.co/papers/{PAPER_ID}` or the `.md` endpoint: the paper is not indexed on Hugging Face paper pages yet.
- 404 on `/api/papers/{PAPER_ID}`: the paper may not be indexed on Hugging Face paper pages yet.
- Paper ID not found: verify the extracted arXiv ID, including any version suffix
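These cases can be told apart in a script by looking at the HTTP status code. A minimal sketch, with hypothetical helper names and messages that simply restate the guidance above:

```bash
# Translate an HTTP status code into the guidance from this section.
explain_status() {
  case "$1" in
    200) echo "ok" ;;
    404) echo "not indexed on Hugging Face paper pages yet, or the paper ID is wrong" ;;
    *)   echo "unexpected HTTP status $1" ;;
  esac
}

# Print only the status code returned by the JSON endpoint for a paper ID.
paper_status() {
  curl -s -o /dev/null -w '%{http_code}' "https://huggingface.co/api/papers/$1"
}
```

A typical call would be `explain_status "$(paper_status 2602.08025)"`.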
Fallbacks
If the Hugging Face paper page does not contain enough detail for the user's question:
- Check the regular paper page at `https://huggingface.co/papers/{PAPER_ID}`
- Fall back to the arXiv page or PDF for the original source:
  - `https://arxiv.org/abs/{PAPER_ID}`
  - `https://arxiv.org/pdf/{PAPER_ID}`
Notes
- No authentication is required for public paper pages.
- Write endpoints such as claim authorship, index paper, and update paper links require `Authorization: Bearer $HF_TOKEN`.
- Prefer the `.md` endpoint for reliable machine-readable output.
- Prefer `/api/papers/{PAPER_ID}` when you need structured JSON fields instead of page markdown.