User request → wechat-article-fetch scrape → [Determine content type]
                                                         ↓
                                ┌────────────────────────┴────────────────────────┐
                                ↓                                                 ↓
                     Legal content detected                                Regular article
                                ↓                                                 ↓
                   Auto-call legal-text-format             Save original content to project root directory
                                ↓
                  Output to archive/ directory
// Scrape article (return results only)
const result = await fetchWechatArticle("https://mp.weixin.qq.com/s/xxxxx");

// Scrape article and auto-save it as a Markdown file
const saved = await fetchWechatArticle(
  "https://mp.weixin.qq.com/s/xxxxx",
  3, // Retry count (optional)
  "./output.md" // Save path (optional)
);

// Return format:
// {
//   title: "Article Title",
//   content: "Article body text...",
//   url: "Article URL"
// }

Title: Article Title
First paragraph of the article body...
Second paragraph of the article body...

Original URL: https://mp.weixin.qq.com/s/xxxxx
Scraped Time: 2026-01-21 20:30:00
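The output layout above (a title line, the body paragraphs, then the source URL and scrape time) can be sketched as a small formatter. This is a minimal sketch, not the skill's actual code: `formatArticle` and `formatTimestamp` are hypothetical names, and the ordering of sections and the blank-line placement are assumptions.

```javascript
// Hypothetical helper: format a Date as "YYYY-MM-DD HH:mm:ss" in local time.
function formatTimestamp(date) {
  const pad = (n) => String(n).padStart(2, "0");
  return (
    `${date.getFullYear()}-${pad(date.getMonth() + 1)}-${pad(date.getDate())} ` +
    `${pad(date.getHours())}:${pad(date.getMinutes())}:${pad(date.getSeconds())}`
  );
}

// Hypothetical formatter: turn a { title, content, url } result into the
// text layout shown above. Section order and spacing are assumptions.
function formatArticle(article, scrapedAt = new Date()) {
  // Split the body on line breaks and drop empty paragraphs.
  const paragraphs = article.content
    .split(/\n+/)
    .map((p) => p.trim())
    .filter((p) => p.length > 0);

  return [
    `Title: ${article.title}`,
    ...paragraphs,
    "",
    `Original URL: ${article.url}`,
    `Scraped Time: ${formatTimestamp(scrapedAt)}`,
  ].join("\n");
}
```

Passing the object returned by `fetchWechatArticle` to `formatArticle` would produce text in this shape; the real skill may format its output differently.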

Output Directory/
├── Article_Title.md          # Markdown file
└── Article_Title_assets/     # Image assets folder
    ├── image_xxx_0.jpg
    ├── image_xxx_1.jpg
    └── ...

scripts/fetch.js

const IMAGE_FILTER_CONFIG = {
  minFileSize: 15 * 1024, // Minimum file size in bytes (15 KiB)
  enabled: true // Whether filtering is enabled
};

npx playwright install chromium

#js_content.rich_media_content

cmd.exe

const urls = [
  "https://mp.weixin.qq.com/s/xxxx1",
  "https://mp.weixin.qq.com/s/xxxx2",
  "https://mp.weixin.qq.com/s/xxxx3"
];

for (const url of urls) {
  const result = await fetchWechatArticle(url, 3, "./articles/");
  console.log(`Saved: ${result.title}`);
}

Please help me scrape this WeChat Official Account article and save it as a Markdown file:
https://mp.weixin.qq.com/s/xxxxx
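
In the batch loop shown earlier, a single failed URL would end the whole run if the `await` throws. A minimal sketch that records failures and keeps going is below. It assumes `fetchWechatArticle` rejects on failure, which the source does not confirm; `fetchAll` is a hypothetical helper, and the fetch function is passed in as a parameter so the sketch stays self-contained.

```javascript
// Hypothetical helper: scrape URLs one at a time and continue after a failure.
// `fetchArticle` stands in for fetchWechatArticle(url, retries, savePath);
// that it rejects (throws) on failure is an assumption.
async function fetchAll(urls, fetchArticle, retries = 3, outputDir = "./articles/") {
  const saved = [];
  const failed = [];
  for (const url of urls) {
    try {
      const result = await fetchArticle(url, retries, outputDir);
      saved.push({ url, title: result.title });
    } catch (err) {
      // Record the failure and move on to the next URL.
      failed.push({ url, reason: err instanceof Error ? err.message : String(err) });
    }
  }
  return { saved, failed };
}
```

Calling `fetchAll(urls, fetchWechatArticle)` would return the saved titles alongside a list of URLs that failed, so the failures can be retried or reported afterwards.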