Loading...
Loading...
Compare original and translation side by side
references/polish-market.mdreferences/polish-market.md| Tool | Purpose | Cost |
|---|---|---|
| Search for shops, prices, offers | Free |
| Verify prices on shop pages (primary) | Free |
| Fallback for sites that block WebFetch | Free |
| 工具 | 用途 | 成本 |
|---|---|---|
| 搜索商家、价格、优惠信息 | 免费 |
| 验证商家页面上的价格(主要方式) | 免费 |
| 针对拦截WebFetch的网站的备用方案 | 免费 |
scraplinguv tool install 'scrapling[all]'uvuv run --with openpyxl python3 -c "..."scraplinguv tool install 'scrapling[all]'uvuv run --with openpyxl python3 -c "..."| Step | Action | Purpose |
|---|---|---|
| 1 | Parse query | PRODUCT, BUYER_TYPE, CATEGORY |
| 2 | Round 1: Discovery | Ceneo, Allegro, Amazon.pl, general and specialist shops |
| 3 | Gap analysis | Missing shops, unverified prices, missing data |
| 4 | Round 2: WebFetch shops | Confirm prices, stock levels, shipping |
| 5 | Round 3: Warranty and shipping | Verify warranty type, delivery costs |
| 6 | Round 4: B2B and distributors | B2B portals, official distributors, statutory warranty |
| 7 | Round 5: Final verification | Re-check cheapest, stock, coupons |
| 8 | Synthesis | Comparison table, recommendation, summary |
| 9 | Export | TXT, XLSX, HTML on request |
| 10 | Expert mode | Answers from cache, no new searches |
| 步骤 | 操作 | 目的 |
|---|---|---|
| 1 | 解析查询 | 提取产品类型、买家类型、品类 |
| 2 | 第一轮:发现 | 搜索Ceneo、Allegro、Amazon.pl、综合及垂直商家 |
| 3 | 差距分析 | 识别缺失商家、未验证价格、缺失数据 |
| 4 | 第二轮:WebFetch商家页面 | 确认价格、库存水平、运费 |
| 5 | 第三轮:保修与运费 | 验证保修类型、配送成本 |
| 6 | 第四轮:B2B与分销商 | 调研B2B平台、官方分销商、法定保修 |
| 7 | 第五轮:最终验证 | 重新核查最低价、库存、优惠券 |
| 8 | 综合分析 | 生成对比表格、购买建议、总结报告 |
| 9 | 导出 | 根据需求导出TXT、XLSX、HTML格式 |
| 10 | 专家模式 | 基于缓存数据答疑,不进行新搜索 |
WebFetch(url, prompt)scrapling extract get "URL" /tmp/scrapling-fallback.mdscrapling extract fetch "URL" /tmp/scrapling-fallback.md --network-idle --disable-resourcesscrapling extract stealthy-fetch "URL" /tmp/scrapling-fallback.md --solve-cloudflareWebFetch(url, prompt)scrapling extract get "URL" /tmp/scrapling-fallback.mdscrapling extract fetch "URL" /tmp/scrapling-fallback.md --network-idle --disable-resourcesscrapling extract stealthy-fetch "URL" /tmp/scrapling-fallback.md --solve-cloudflare| Type | Detection | Consequences |
|---|---|---|
| B2C | "for home", "personal", no indication | Statutory warranty 24 mo. + voluntary warranty |
| B2B | "for company", "VAT invoice", "business", "B2B" | Statutory warranty may be excluded, manufacturer warranty is critical |
| 类型 | 检测关键词 | 影响 |
|---|---|---|
| B2C | "家用"、"个人"、无明确标识 | 法定保修24个月 + 自愿保修 |
| B2B | "企业用"、"增值税发票"、"商务"、"B2B" | 可能不包含法定保修,厂商保修至关重要 |
references/polish-market.md| Category | Detection Patterns | Examples |
|---|---|---|
| VoIP/Telephony | Yealink, Grandstream, Fanvil, SIP, DECT, VoIP | Yealink W76P |
| IT/Networking | MikroTik, Ubiquiti, switch, router, AP, firewall | MikroTik hAP ax3 |
| Electronics | monitor, laptop, computer, printer, tablet | Dell U2723QE |
| Office | chair, desk, shredder, projector | Ergohuman Elite |
| General | (no match) | Nespresso Vertuo |
references/polish-market.md| 品类 | 检测模式 | 示例 |
|---|---|---|
| VoIP/电话系统 | Yealink、Grandstream、Fanvil、SIP、DECT、VoIP | Yealink W76P |
| IT/网络设备 | MikroTik、Ubiquiti、交换机、路由器、AP、防火墙 | MikroTik hAP ax3 |
| 电子产品 | 显示器、笔记本电脑、计算机、打印机、平板 | Dell U2723QE |
| 办公设备 | 椅子、办公桌、碎纸机、投影仪 | Ergohuman Elite |
| 综合类 | 无匹配项 | Nespresso Vertuo |
site:ceneo.pl "{PRODUCT}"site:allegro.pl "{PRODUCT}"site:amazon.pl "{PRODUCT}""{PRODUCT}" cena kupić"{PRODUCT}" sklep"{PRODUCT}" porównanie cen"{PRODUCT}" opinie recenzje"{PRODUCT}" dystrybutor hurtowniasite:ceneo.pl "{PRODUCT}"site:allegro.pl "{PRODUCT}"site:amazon.pl "{PRODUCT}""{PRODUCT}" cena kupić"{PRODUCT}" sklep"{PRODUCT}" porównanie cen"{PRODUCT}" opinie recenzje"{PRODUCT}" dystrybutor hurtowniaSHOPS_FOUND: [list of shops with prices]
SHOPS_MISSING: [from references/polish-market.md, not found]
PRICES_TO_VERIFY: [prices from snippets — need WebFetch]
WARRANTY_TO_CHECK: [shops without warranty type info]
LEADS: [URLs worth checking via WebFetch in Round 2]SHOPS_FOUND: [包含价格的商家列表]
SHOPS_MISSING: [来自polish-market.md的缺失商家]
PRICES_TO_VERIFY: [来自搜索片段的价格——需WebFetch验证]
WARRANTY_TO_CHECK: [无保修类型信息的商家]
LEADS: [第二轮中需通过WebFetch核查的URL]| Gap | Check | Action |
|---|---|---|
| No comparator data | Do we have Ceneo data? | If not → WebFetch ceneo.pl |
| No specialist shops | How many shops from category in polish-market.md? | Search for missing ones |
| Prices from snippets only | Any price confirmed via WebFetch? | Plan WebFetch for top 10 |
| No warranty data | How many shops have warranty type info? | Plan verification |
| No shipping data | How many shops have delivery costs? | Plan WebFetch of shipping pages |
| No marketplace data | Were Allegro / Amazon checked? | Additional queries |
| 差距 | 核查内容 | 操作 |
|---|---|---|
| 无对比工具数据 | 是否有Ceneo数据? | 若无 → WebFetch ceneo.pl |
| 无垂直商家数据 | polish-market.md中对应品类的商家已覆盖多少? | 搜索缺失的商家 |
| 仅搜索片段提供价格 | 是否有通过WebFetch确认的价格? | 计划对前10个结果进行WebFetch |
| 无保修数据 | 多少商家提供了保修类型信息? | 计划验证 |
| 无运费数据 | 多少商家提供了配送成本? | 计划WebFetch配送页面 |
| 无平台数据 | 是否核查了Allegro/Amazon? | 补充查询 |
WebFetch(url, "Podaj: 1) dokładną cenę brutto, 2) dostępność/stan magazynowy,
3) koszty wysyłki, 4) informacje o gwarancji (producenta czy sprzedawcy)")WebFetch(url, "请提供:1) 精确含税价格,2) 库存状态,3) 运费,4) 保修信息(厂商保修还是卖家保修)")| Source | Price Confidence |
|---|---|
| WebSearch snippet | LOW — price may be outdated |
| Ceneo listing | MEDIUM — aggregator, but delays |
| WebFetch of shop page | HIGH — directly confirmed |
| Scrapling fallback of shop page | HIGH — directly confirmed (same as WebFetch) |
| 来源 | 价格可信度 |
|---|---|
| WebSearch片段 | 低——价格可能已过期 |
| Ceneo列表 | 中——聚合平台,但存在延迟 |
| WebFetch商家页面 | 高——直接确认 |
| Scrapling备用方案获取的商家页面 | 高——直接确认(与WebFetch相同) |
site:{shop} gwarancjasite:{shop} regulaminsite:{shop} gwarancjasite:{shop} regulamin| Indicator | Type |
|---|---|
| "gwarancja producenta", "producent: [brand]" | MANUFACTURER |
| "gwarancja dystrybutora", distributor name as guarantor | DISTRIBUTOR |
| "gwarantem jest [shop name]", "gwarancja [shop]" | SELLER |
| No information | UNKNOWN |
references/warranty-guide.md| 标识 | 类型 |
|---|---|
| "gwarancja producenta"、"producent: [品牌]" | 厂商保修 |
| "gwarancja dystrybutora"、分销商名称作为担保人 | 分销商保修 |
| "gwarantem jest [商家名称]"、"gwarancja [商家]" | 卖家保修 |
| 无信息 | 未知 |
references/warranty-guide.mdWebFetch(shipping_url, "Podaj wszystkie opcje dostawy z cenami:
kurier, Paczkomat, Poczta Polska, odbiór osobisty, darmowa wysyłka (od jakiej kwoty)")WebFetch(shipping_url, "请提供所有配送选项及对应价格:快递、Paczkomat、波兰邮政、自提、免运费门槛(满额)")WebSearch: "{BRAND} dystrybutor Polska"
WebSearch: "{BRAND} importer Polska"
WebSearch: "{BRAND} autoryzowany sprzedawca"
WebSearch: "{PRODUCT}" site:ab.pl OR site:action.pl OR site:also.plWebSearch: "{BRAND} dystrybutor Polska"
WebSearch: "{BRAND} importer Polska"
WebSearch: "{BRAND} autoryzowany sprzedawca"
WebSearch: "{PRODUCT}" site:ab.pl OR site:action.pl OR site:also.plWebFetch(terms_url, "Czy regulamin wyłącza rękojmię dla przedsiębiorców?
Szukaj: Art. 558 KC, wyłączenie rękojmi, przedsiębiorca, firma")WebFetch(terms_url, "服务条款是否排除企业用户的法定保修?请查找:《波兰民法典》第558条、法定保修排除、企业、公司")"{PRODUCT}" kupon zniżka promocja"{PRODUCT}" kupon zniżka promocjaundefinedundefinedundefinedundefinedYES = confirmed manufacturer warranty (verification source)
NO = seller's own warranty (not manufacturer)
? = not verified at source
n/a = no data是 = 已确认厂商保修(验证来源)
否 = 卖家自行提供的保修(非厂商保修)
? = 未从源头验证
n/a = 无数据| Confidence | When |
|---|---|
| [HIGH] | Confirmed via WebFetch or scrapling from shop page |
| [MEDIUM] | From Ceneo/comparator or single source |
| [LOW] | From WebSearch snippets only |
| 可信度 | 适用场景 |
|---|---|
| [高] | 通过WebFetch或scrapling从商家页面确认 |
| [中] | 来自Ceneo/对比工具或单一来源 |
| [低] | 仅来自WebSearch片段 |
references/export-formats.mdreferences/export-formats.mdreferences/export-formats.mdreferences/export-formats.md| Parameter | Value |
|---|---|
| Rounds | 5 |
| Shops | 25-40 |
| WebFetch | 20-30 |
| 参数 | 数值 |
|---|---|
| 轮次 | 5 |
| 商家数量 | 25-40 |
| WebFetch次数 | 20-30 |
references/polish-market.mdreferences/polish-market.md| Error | Resolution |
|---|---|
| WebFetch 403/CAPTCHA/empty | Follow auto-escalation protocol from cli-web-scrape: HTTP tier → validate content → Dynamic tier → Stealthy tier. If scrapling unavailable or all tiers fail, skip shop and label "blocked" |
| Ceneo returns no results | Search directly in shops from polish-market.md |
| Allegro blocks scraping | Use WebSearch |
| No price on page | Label "n/a", skip in ranking |
| Conflicting prices (snippet vs WebFetch) | Always trust WebFetch (or scrapling if WebFetch was blocked) |
| 错误 | 解决方案 |
|---|---|
| WebFetch 403/验证码/空响应 | 遵循cli-web-scrape的自动升级协议:HTTP层 → 内容验证 → 动态层 → 隐身层。若未安装scrapling或所有层级均失败,跳过该商家并标记为"blocked" |
| Ceneo无结果 | 直接搜索polish-market.md中的商家 |
| Allegro拦截爬取 | 使用WebSearch |
| 页面无价格 | 标记为"n/a",不纳入排名 |
| 价格冲突(片段vs WebFetch) | 始终信任WebFetch(若WebFetch被拦截则信任scrapling) |
references/polish-market.mdreferences/warranty-guide.mdreferences/export-formats.mdreferences/polish-market.mdreferences/warranty-guide.mdreferences/export-formats.md