hugging-face-paper-pages

Compare original and translation side by side

🇺🇸

Original

English

🇨🇳

Translation

Chinese

Hugging Face Paper Pages

Hugging Face论文页面

Hugging Face Paper pages (hf.co/papers) is a platform built on top of arXiv (arxiv.org), specifically for research papers in the field of artificial intelligence (AI) and computer science. Hugging Face users can submit their paper at hf.co/papers/submit, which features it on the Daily Papers feed (hf.co/papers). Each day, users can upvote papers and comment on papers. Each paper page allows authors to:

claim their paper (by clicking their name on the
```
authors
```
field). This makes the paper page appear on their Hugging Face profile.
link the associated model checkpoints, datasets and Spaces by including the HF paper or arXiv URL in the model card, dataset card or README of the Space
link the Github repository and/or project page URLs
link the HF organization. This also makes the paper page appear on the Hugging Face organization page.

Whenever someone mentions a HF paper or arXiv abstract/PDF URL in a model card, dataset card or README of a Space repository, the paper will be automatically indexed. Note that not all papers indexed on Hugging Face are also submitted to daily papers. The latter is more a manner of promoting a research paper. Papers can only be submitted to daily papers up until 14 days after their publication date on arXiv.

The Hugging Face team has built an easy-to-use API to interact with paper pages. Content of the papers can be fetched as markdown, or structured metadata can be returned such as author names, linked models/datasets/spaces, linked Github repo and project page.

Hugging Face论文页面（hf.co/papers）是基于arXiv（arxiv.org）搭建的平台，专门用于人工智能（AI）和计算机科学领域的研究论文。Hugging Face用户可通过hf.co/papers/submit提交论文，提交的论文会展示在每日论文推送（hf.co/papers）中。用户每天可以为论文点赞、发表评论。每个论文页面支持作者进行以下操作：

认领自己的论文（点击
```
authors
```
字段中的自己姓名即可）。认领后，该论文页面会显示在你的Hugging Face个人主页上。
通过在模型卡片、数据集卡片或Space的README中添加HF论文或arXiv URL，关联对应的模型 checkpoint、数据集和Spaces
关联GitHub仓库和/或项目页面URL
关联HF组织。关联后，该论文页面会显示在Hugging Face组织主页上。

每当有人在模型卡片、数据集卡片或Space仓库的README中提及HF论文或arXiv摘要/PDF URL时，该论文会被自动索引。请注意，并非所有在Hugging Face上被索引的论文都会被提交到每日论文推送中，后者更偏向于推广研究论文。论文在arXiv上发布后的14天内，才可提交到每日论文推送。

Hugging Face团队打造了易于使用的API来与论文页面交互。你可以以Markdown格式获取论文内容，或是获取结构化元数据，比如作者姓名、关联的模型/数据集/Spaces、关联的GitHub仓库和项目页面等。

When to Use

使用场景

User shares a Hugging Face paper page URL (e.g.
```
https://huggingface.co/papers/2602.08025
```
)
User shares a Hugging Face markdown paper page URL (e.g.
```
https://huggingface.co/papers/2602.08025.md
```
)

User shares an arXiv URL (e.g.

https://arxiv.org/abs/2602.08025

https://arxiv.org/pdf/2602.08025

)

User mentions a arXiv ID (e.g.
```
2602.08025
```
)
User asks you to summarize, explain, or analyze an AI research paper

用户分享Hugging Face论文页面URL（例如
```
https://huggingface.co/papers/2602.08025
```
）
用户分享Hugging Face Markdown论文页面URL（例如
```
https://huggingface.co/papers/2602.08025.md
```
）

用户分享arXiv URL（例如

https://arxiv.org/abs/2602.08025

或

https://arxiv.org/pdf/2602.08025

）

用户提及arXiv ID（例如
```
2602.08025
```
）
用户要求你总结、解释或分析一篇AI研究论文

Parsing the paper ID

解析论文ID

It's recommended to parse the paper ID (arXiv ID) from whatever the user provides:

Input	Paper ID
`https://huggingface.co/papers/2602.08025`	`2602.08025`
`https://huggingface.co/papers/2602.08025.md`	`2602.08025`
`https://arxiv.org/abs/2602.08025`	`2602.08025`
`https://arxiv.org/pdf/2602.08025`	`2602.08025`
`2602.08025v1`	`2602.08025v1`
`2602.08025`	`2602.08025`

This allows you to provide the paper ID into any of the hub API endpoints mentioned below.

建议从用户提供的内容中解析出论文ID（arXiv ID）：

输入内容	论文ID
`https://huggingface.co/papers/2602.08025`	`2602.08025`
`https://huggingface.co/papers/2602.08025.md`	`2602.08025`
`https://arxiv.org/abs/2602.08025`	`2602.08025`
`https://arxiv.org/pdf/2602.08025`	`2602.08025`
`2602.08025v1`	`2602.08025v1`
`2602.08025`	`2602.08025`

解析出论文ID后，你就可以将其用于以下任一Hub API端点。

Fetch the paper page as markdown

获取Markdown格式的论文页面

The content of a paper can be fetched as markdown like so:

bash

curl -s "https://huggingface.co/papers/{PAPER_ID}.md"

This should return the Hugging Face paper page as markdown. This relies on the HTML version of the paper at https://arxiv.org/html/{PAPER_ID}.

There are 2 exceptions:

Not all arXiv papers have an HTML version. If the HTML version of the paper does not exist, then the content falls back to the HTML of the Hugging Face paper page.
If it results in a 404, it means the paper is not yet indexed on hf.co/papers. See Error handling for info.

Alternatively, you can request markdown from the normal paper page URL, like so:

bash

curl -s -H "Accept: text/markdown" "https://huggingface.co/papers/{PAPER_ID}"

你可以通过以下方式获取Markdown格式的论文内容：

bash

curl -s "https://huggingface.co/papers/{PAPER_ID}.md"

该命令会返回Hugging Face论文页面的Markdown内容，它依赖于arXiv上的论文HTML版本（https://arxiv.org/html/{PAPER_ID}）。

存在两种例外情况：

并非所有arXiv论文都有HTML版本。如果该论文没有HTML版本，内容会 fallback 到Hugging Face论文页面的HTML版本。
如果返回404错误，说明该论文尚未在hf.co/papers上被索引。详情请查看错误处理部分。

另外，你也可以通过普通论文页面URL请求Markdown内容，命令如下：

bash

curl -s -H "Accept: text/markdown" "https://huggingface.co/papers/{PAPER_ID}"

Paper Pages API Endpoints

论文页面API端点

All endpoints use the base URL

https://huggingface.co

所有端点都使用基础URL

https://huggingface.co

。

Get structured metadata

获取结构化元数据

Fetch the paper metadata as JSON using the Hugging Face REST API:

bash

curl -s "https://huggingface.co/api/papers/{PAPER_ID}"

This returns structured metadata that can include:

authors (names and Hugging Face usernames, in case they have claimed the paper)
media URLs (uploaded when submitting the paper to Daily Papers)
summary (abstract) and AI-generated summary
project page and GitHub repository
organization and engagement metadata (number of upvotes)

To find models linked to the paper, use:

bash

curl https://huggingface.co/api/models?filter=arxiv:{PAPER_ID}

To find datasets linked to the paper, use:

bash

curl https://huggingface.co/api/datasets?filter=arxiv:{PAPER_ID}

To find spaces linked to the paper, use:

bash

curl https://huggingface.co/api/spaces?filter=arxiv:{PAPER_ID}

使用Hugging Face REST API以JSON格式获取论文元数据：

bash

curl -s "https://huggingface.co/api/papers/{PAPER_ID}"

该接口返回的结构化元数据可能包括：

作者（姓名和Hugging Face用户名，若作者已认领该论文）
媒体URL（提交论文到每日论文推送时上传的内容）
摘要（原文摘要）和AI生成的摘要
项目页面和GitHub仓库
组织和互动元数据（点赞数）

要查找与该论文关联的模型，可使用：

bash

curl https://huggingface.co/api/models?filter=arxiv:{PAPER_ID}

要查找与该论文关联的数据集，可使用：

bash

curl https://huggingface.co/api/datasets?filter=arxiv:{PAPER_ID}

要查找与该论文关联的Spaces，可使用：

bash

curl https://huggingface.co/api/spaces?filter=arxiv:{PAPER_ID}

Claim paper authorship

认领论文作者身份

Claim authorship of a paper for a Hugging Face user:

bash

curl "https://huggingface.co/api/settings/papers/claim" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "paperId": "{PAPER_ID}",
    "claimAuthorId": "{AUTHOR_ENTRY_ID}",
    "targetUserId": "{USER_ID}"
  }'

Endpoint:
```
POST /api/settings/papers/claim
```
Body:
- ```
paperId
```
  (string, required): arXiv paper identifier being claimed
- ```
claimAuthorId
```
  (string): author entry on the paper being claimed, 24-char hex ID
- ```
targetUserId
```
  (string): HF user who should receive the claim, 24-char hex ID
Response: paper authorship claim result, including the claimed paper ID

为Hugging Face用户认领论文作者身份：

bash

curl "https://huggingface.co/api/settings/papers/claim" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "paperId": "{PAPER_ID}",
    "claimAuthorId": "{AUTHOR_ENTRY_ID}",
    "targetUserId": "{USER_ID}"
  }'

端点：
```
POST /api/settings/papers/claim
```
请求体：
- ```
paperId
```
  （字符串，必填）：要认领的arXiv论文标识符
- ```
claimAuthorId
```
  （字符串）：论文中的作者条目ID，为24位十六进制ID
- ```
targetUserId
```
  （字符串）：要认领论文的HF用户ID，为24位十六进制ID
响应：论文作者身份认领结果，包含已认领的论文ID

Get daily papers

获取每日论文推送

Fetch the Daily Papers feed:

bash

curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/daily_papers?p=0&limit=20&date=2017-07-21&sort=publishedAt"

Endpoint:
```
GET /api/daily_papers
```
Query parameters:
- ```
p
```
  (integer): page number
- ```
limit
```
  (integer): number of results, between 1 and 100
- ```
date
```
  (string): RFC 3339 full-date, for example
```
2017-07-21
```
- ```
week
```
  (string): ISO week, for example
```
2024-W03
```
- ```
month
```
  (string): month value, for example
```
2024-01
```
- ```
submitter
```
  (string): filter by submitter
- ```
sort
```
  (enum):
```
publishedAt
```
  or
```
trending
```
Response: list of daily papers

获取每日论文推送内容：

bash

curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/daily_papers?p=0&limit=20&date=2017-07-21&sort=publishedAt"

端点：
```
GET /api/daily_papers
```
查询参数：
- ```
p
```
  （整数）：页码
- ```
limit
```
  （整数）：结果数量，范围1到100
- ```
date
```
  （字符串）：RFC 3339格式的完整日期，例如
```
2017-07-21
```
- ```
week
```
  （字符串）：ISO周格式，例如
```
2024-W03
```
- ```
month
```
  （字符串）：月份格式，例如
```
2024-01
```
- ```
submitter
```
  （字符串）：按提交者过滤
- ```
sort
```
  （枚举值）：
```
publishedAt
```
  或
```
trending
```
响应：每日论文列表

List papers

列出论文

List arXiv papers sorted by published date:

bash

curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/papers?cursor={CURSOR}&limit=20"

Endpoint:
```
GET /api/papers
```
Query parameters:
- ```
cursor
```
  (string): pagination cursor
- ```
limit
```
  (integer): number of results, between 1 and 100
Response: list of papers

按发布日期排序列出arXiv论文：

bash

curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/papers?cursor={CURSOR}&limit=20"

端点：
```
GET /api/papers
```
查询参数：
- ```
cursor
```
  （字符串）：分页游标
- ```
limit
```
  （整数）：结果数量，范围1到100
响应：论文列表

Search papers

搜索论文

Perform hybrid semantic and full-text search on papers:

bash

curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/papers/search?q=vision+language&limit=20"

This searches over the paper title, authors, and content.

Endpoint:
```
GET /api/papers/search
```
Query parameters:
- ```
q
```
  (string): search query, max length 250
- ```
limit
```
  (integer): number of results, between 1 and 120
Response: matching papers

对论文执行混合语义与全文搜索：

bash

curl -s -H "Authorization: Bearer $HF_TOKEN" \
  "https://huggingface.co/api/papers/search?q=vision+language&limit=20"

该搜索会覆盖论文标题、作者和内容。

端点：
```
GET /api/papers/search
```
查询参数：
- ```
q
```
  （字符串）：搜索查询，最大长度250
- ```
limit
```
  （整数）：结果数量，范围1到120
响应：匹配的论文列表

Index a paper

索引论文

Insert a paper from arXiv by ID. If the paper is already indexed, only its authors can re-index it:

bash

curl "https://huggingface.co/api/papers/index" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "arxivId": "{ARXIV_ID}"
  }'

Endpoint:
```
POST /api/papers/index
```
Body:
- ```
arxivId
```
  (string, required): arXiv ID to index, for example
```
2301.00001
```
Pattern:
```
^\d{4}\.\d{4,5}$
```
Response: empty JSON object on success

通过ID从arXiv插入一篇论文。如果该论文已被索引，只有其作者可以重新索引：

bash

curl "https://huggingface.co/api/papers/index" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "arxivId": "{ARXIV_ID}"
  }'

端点：
```
POST /api/papers/index
```
请求体：
- ```
arxivId
```
  （字符串，必填）：要索引的arXiv ID，例如
```
2301.00001
```
格式要求：
```
^\d{4}\.\d{4,5}$
```
响应：成功时返回空JSON对象

Update paper links

更新论文关联链接

Update the project page, GitHub repository, or submitting organization for a paper. The requester must be the paper author, the Daily Papers submitter, or a papers admin:

bash

curl "https://huggingface.co/api/papers/{PAPER_OBJECT_ID}/links" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "projectPage": "https://example.com",
    "githubRepo": "https://github.com/org/repo",
    "organizationId": "{ORGANIZATION_ID}"
  }'

Endpoint:
```
POST /api/papers/{paperId}/links
```
Path parameters:
- ```
paperId
```
  (string, required): Hugging Face paper object ID
Body:
- ```
githubRepo
```
  (string, nullable): GitHub repository URL
- ```
organizationId
```
  (string, nullable): organization ID, 24-char hex ID
- ```
projectPage
```
  (string, nullable): project page URL
Response: empty JSON object on success

更新论文的项目页面、GitHub仓库或提交组织。请求者必须是论文作者、每日论文推送的提交者，或是论文管理员：

bash

curl "https://huggingface.co/api/papers/{PAPER_OBJECT_ID}/links" \
  --request POST \
  --header "Content-Type: application/json" \
  --header "Authorization: Bearer $HF_TOKEN" \
  --data '{
    "projectPage": "https://example.com",
    "githubRepo": "https://github.com/org/repo",
    "organizationId": "{ORGANIZATION_ID}"
  }'

端点：
```
POST /api/papers/{paperId}/links
```
路径参数：
- ```
paperId
```
  （字符串，必填）：Hugging Face论文对象ID
请求体：
- ```
githubRepo
```
  （字符串，可为空）：GitHub仓库URL
- ```
organizationId
```
  （字符串，可为空）：组织ID，24位十六进制ID
- ```
projectPage
```
  （字符串，可为空）：项目页面URL
响应：成功时返回空JSON对象

Error Handling

错误处理

404 on
https://huggingface.co/papers/{PAPER_ID}
or
md
endpoint: the paper is not indexed on Hugging Face paper pages yet.
404 on
/api/papers/{PAPER_ID}
: the paper may not be indexed on Hugging Face paper pages yet.
Paper ID not found: verify the extracted arXiv ID, including any version suffix

https://huggingface.co/papers/{PAPER_ID}
或
md
端点返回404：该论文尚未在Hugging Face论文页面上被索引。
/api/papers/{PAPER_ID}
返回404：该论文可能尚未在Hugging Face论文页面上被索引。
论文ID未找到：请验证提取的arXiv ID，包括可能的版本后缀

Fallbacks

备选方案

If the Hugging Face paper page does not contain enough detail for the user's question:

Check the regular paper page at

https://huggingface.co/papers/{PAPER_ID}

Fall back to the arXiv page or PDF for the original source:

```
https://arxiv.org/abs/{PAPER_ID}
```
```
https://arxiv.org/pdf/{PAPER_ID}
```

如果Hugging Face论文页面的内容不足以回答用户的问题：

查看常规论文页面：

https://huggingface.co/papers/{PAPER_ID}

fallback 到arXiv页面或PDF原文：

```
https://arxiv.org/abs/{PAPER_ID}
```
```
https://arxiv.org/pdf/{PAPER_ID}
```

Notes

注意事项

No authentication is required for public paper pages.
Write endpoints such as claim authorship, index paper, and update paper links require
```
Authorization: Bearer $HF_TOKEN
```
.
Prefer the
```
.md
```
endpoint for reliable machine-readable output.
Prefer
```
/api/papers/{PAPER_ID}
```
when you need structured JSON fields instead of page markdown.

公开论文页面无需身份验证。
写入类端点（如认领作者身份、索引论文、更新论文关联链接）需要携带
```
Authorization: Bearer $HF_TOKEN
```
。
优先使用
```
.md
```
端点以获取可靠的机器可读输出。
当你需要结构化JSON字段而非页面Markdown时，优先使用
```
/api/papers/{PAPER_ID}
```
接口。