article-extractor

MQL5 Article Extractor

Extract technical trading articles from mql5.com for training data collection. Scope limited to mql5.com domain only.

When to Use This Skill

Use this skill when:
  • Extracting articles from mql5.com for reference or training data
  • Downloading MQL5 documentation and tutorials
  • Collecting trading articles from specific MQL5 users
  • Building a corpus of MQL5 programming examples

Scope Boundaries

VALID requests:
  • "Extract this mql5.com article: https://www.mql5.com/en/articles/19625"
  • "Get all articles from MQL5 user 29210372"
  • "Download trading articles from mql5.com"
  • "Extract 5 MQL5 articles for testing"
OUT OF SCOPE:
  • "Extract from yahoo.com" - NOT SUPPORTED (mql5.com only)
  • "Scrape news from reuters" - NOT SUPPORTED (mql5.com only)
  • "Get stock data from Bloomberg" - NOT SUPPORTED (mql5.com only)
If the user requests extraction from any site other than mql5.com, respond: "This skill extracts articles from mql5.com ONLY. For other sites, use different tools."
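
The scope rule above can be sketched as a small host check. This is a minimal illustration, not the skill's actual implementation: the function names `is_mql5_url` and `scope_reply` are hypothetical, and treating subdomains of mql5.com as in scope is an assumption.

```python
from typing import Optional
from urllib.parse import urlparse

# Reply text quoted from the scope rule above.
OUT_OF_SCOPE_REPLY = (
    "This skill extracts articles from mql5.com ONLY. "
    "For other sites, use different tools."
)


def is_mql5_url(url: str) -> bool:
    """Return True only for http(s) URLs hosted on mql5.com or a subdomain."""
    parsed = urlparse(url.strip())
    host = (parsed.hostname or "").lower()
    # Compare whole host labels so "evilmql5.com" or "mql5.com.evil.org" fail.
    on_domain = host == "mql5.com" or host.endswith(".mql5.com")
    return parsed.scheme in ("http", "https") and on_domain


def scope_reply(url: str) -> Optional[str]:
    """Return the out-of-scope reply for non-mql5.com URLs, else None."""
    return None if is_mql5_url(url) else OUT_OF_SCOPE_REPLY
```

Matching on the parsed hostname rather than a substring avoids accepting look-alike domains that merely contain the text "mql5.com".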

Repository Location

Working directory: $HOME/eon/mql5 (adjust path for your environment)

Always execute commands from this directory:

```bash
cd "$HOME/eon/mql5"
```

Valid Input Types

1. Article URL (Most Specific)

Format: https://www.mql5.com/en/articles/[ID]
Example: https://www.mql5.com/en/articles/19625
Action: Extract a single article
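
As a rough sketch of how the URL format above could be validated, the snippet below pulls the article ID out of a URL. The name `article_id` is hypothetical, and the pattern assumes that IDs are numeric (as in the example) and that only the `/en/` path shown above needs to match.

```python
import re
from typing import Optional

# Assumption: article IDs are digits only, as in the example URL.
ARTICLE_URL_RE = re.compile(r"^https://www\.mql5\.com/en/articles/(\d+)/?$")


def article_id(url: str) -> Optional[int]:
    """Return the numeric article ID, or None if the URL does not match."""
    match = ARTICLE_URL_RE.match(url.strip())
    return int(match.group(1)) if match else None
```

For instance, `article_id("https://www.mql5.com/en/articles/19625")` yields `19625`, while a URL on another domain or without an ID yields `None`.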

2. User ID (Numeric or Username)

Format: Numeric (e.g., 29210372) or username (e.g., jslopes)
Source: From mql5.com profile URL
Action: Auto-discover and extract all of the user's articles
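
One way to tell the two identifier forms apart is sketched below. The function name `classify_user_id` is hypothetical, and the username rule (any non-empty token without whitespace or slashes) is an assumption, since the exact characters mql5.com allows in usernames are not stated here.

```python
def classify_user_id(value: str) -> str:
    """Classify a user identifier as "numeric" or "username".

    Raises ValueError for empty input or input that looks like a path or
    a phrase rather than a single identifier.
    """
    token = value.strip()
    if token.isascii() and token.isdigit():
        return "numeric"
    # Assumption: usernames contain no whitespace and no slashes.
    if token and not any(ch.isspace() or ch == "/" for ch in token):
        return "username"
    raise ValueError(f"not a recognizable user ID: {value!r}")
```

With the examples above, `classify_user_id("29210372")` returns `"numeric"` and `classify_user_id("jslopes")` returns `"username"`.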

3. URL List File

Format: Text file with one URL per line
Action: Batch process multiple articles
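
A minimal sketch of reading such a list file follows. The helper name `read_url_list` is hypothetical; skipping blank lines, treating lines that start with `#` as comments, and dropping duplicate URLs are conveniences assumed for illustration, not documented behavior of the skill.

```python
from pathlib import Path
from typing import List


def read_url_list(path: str) -> List[str]:
    """Read one URL per line, preserving order and skipping duplicates."""
    urls: List[str] = []
    seen = set()
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        url = line.strip()
        # Assumption: blank lines and "#" comment lines are ignored.
        if not url or url.startswith("#"):
            continue
        if url not in seen:
            seen.add(url)
            urls.append(url)
    return urls
```

Keeping the original order makes it easier to resume a batch that stopped partway through.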

4. Vague Request

If the user says "extract mql5 articles" without specifics, prompt for:
  1. Article URL OR User ID
  2. Quantity limit (for testing)
  3. Output location preference

Reference Documentation

For detailed information, see:
  • Extraction Modes - Single, batch, auto-discovery, official docs modes
  • Data Sources - User collections and official documentation
  • Troubleshooting - Common issues and solutions
  • Examples - Usage examples and patterns

Troubleshooting

| Issue | Cause | Solution |
|---|---|---|
| Non-mql5.com URL | Skill only supports mql5.com | Use other tools for non-mql5.com sites |
| Article not found | Invalid article ID or removed | Verify URL exists by visiting in browser |
| User ID not recognized | Wrong user ID format | Use numeric ID from profile URL or exact username |
| Empty extraction | Rate limiting or site change | Wait and retry, check for site structure changes |
| Permission denied | Working directory mismatch | Run from $HOME/eon/mql5 directory |
| Batch too large | Too many articles requested | Limit batch size, use URL list file |
| Missing dependencies | Required tools not installed | Install curl, jq for extraction |
| Output encoding issues | Unicode in article content | Ensure UTF-8 output handling |
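
Two of the remedies above (checking for missing tools, and waiting before retrying an empty extraction) can be sketched as follows. Both helper names are hypothetical, the retry count and delays are arbitrary illustrative values, and `fetch` stands in for whatever call actually performs the extraction.

```python
import shutil
import time
from typing import Callable, Iterable, List


def missing_tools(tools: Iterable[str] = ("curl", "jq")) -> List[str]:
    """Return the tools from the list that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


def retry_until_nonempty(
    fetch: Callable[[], str],
    attempts: int = 3,
    base_delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Call fetch() until it returns non-blank text or attempts run out.

    Waits base_delay, then 2 * base_delay, and so on between attempts.
    Returns an empty string if every attempt came back empty.
    """
    for attempt in range(attempts):
        text = fetch()
        if text.strip():
            return text
        if attempt < attempts - 1:
            sleep(base_delay * 2 ** attempt)
    return ""
```

Doubling the delay between attempts gives a rate limiter time to reset. If every attempt still returns nothing, a change in site structure is the more likely cause.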