Loading...
Loading...
Scrape blog posts via RSS feeds (free, no API key) with Apify fallback for JS-heavy sites. Use when you need to monitor competitor blogs, track industry content, or aggregate blog posts by keyword.
npx skill4agent add athina-ai/goose-skills blog-scraperpip install requests# Scrape a blog's RSS feed
python3 skills/blog-scraper/scripts/scrape_blogs.py \
--urls "https://example.com/blog" --days 30
# Multiple blogs with keyword filter
python3 skills/blog-scraper/scripts/scrape_blogs.py \
--urls "https://blog1.com,https://blog2.com" --keywords "AI,marketing" --output summary
# Force Apify for JS-heavy sites
python3 skills/blog-scraper/scripts/scrape_blogs.py \
--urls "https://example.com" --mode apify<link rel="alternate">/feed/rss/atom.xml/feed.xml/rss.xml/blog/feed/index.xmljupri/rss-xml-scraper| Flag | Default | Description |
|---|---|---|
| required | Blog URL(s), comma-separated |
| none | Keywords to filter (comma-separated, OR logic) |
| 30 | Only include posts from last N days |
| 50 | Max posts to return |
| auto | |
| json | Output format: |
| env var | Apify token (only needed for Apify mode/fallback) |
| 300 | Max seconds for Apify run |
jupri/rss-xml-scraper