How do I validate product feed URLs before ingesting them?
The problem
A marketplace ingests product feeds from thousands of sellers. Each product listing contains URLs: product pages, seller storefronts, image sources, and documentation links. Dead product URLs create broken listings that frustrate buyers and erode trust in your platform.
Parked seller domains mean the seller has likely closed, but their products are still showing up in search results. Brand impersonation in seller sites signals potential counterfeit products. Without validation, your marketplace becomes a graveyard of broken links and a vector for fraud.
How Unphurl solves it
Validate product feed URLs as they arrive, before they enter your catalogue. The product feed scoring profile weights signals that indicate dead listings, defunct sellers, and potential counterfeiting. AI agents processing feeds can automatically quarantine flagged products for human review instead of publishing them directly.
Signals that matter for this use case
- – Parked means the seller's domain is a placeholder. The seller has likely closed or abandoned their storefront.
- – Chain incomplete means the product URL doesn't resolve. Dead listing.
- – Redirects 5+ flags product URLs that bounce through excessive redirects, often indicating moved or discontinued products.
- – Brand impersonation detects seller domains that mimic well-known brand names. Potential counterfeit signal.
- – Expiring soon flags seller domains about to lapse. Products from these sellers may become orphaned.
- – No MX record means the seller can't receive email. Harder to resolve disputes or communicate about orders.
- – Domain status bad (pendingDelete, serverHold) confirms the seller's domain is effectively dead.
- – SSL invalid means the seller's site has a broken certificate. Trust concern for buyers.
- – HTTP only signals a seller site without basic encryption. Not a secure shopping experience.
Suggested scoring profile
{
"name": "product-feed",
"weights": {
"parked": 25,
"chain_incomplete": 25,
"redirects_5": 20,
"brand_impersonation": 20,
"expiring_soon": 15,
"no_mx_record": 10,
"domain_status_bad": 15,
"ssl_invalid": 10,
"http_only": 10
}
} What a result looks like
Your platform ingests 5,000 product URLs from 200 sellers. Unphurl validates each one during ingestion:
Total product URLs checked: 5,000
Sellers represented: 200
Clean: 4,600
Flagged: 400
Breakdown of flagged URLs:
150 dead product links
120 parked seller domains
80 redirect chains (products moved)
50 brand impersonation concerns
The 4,600 clean URLs publish immediately. The 400 flagged URLs are quarantined for review. Your marketplace stays clean, buyers see working product pages, and counterfeit concerns get caught before they reach customers.
Cost
Daily feed ingestion is the typical pattern. Most major e-commerce domains are cached after the first validation pass, so repeat ingestion costs drop significantly. The Pro package handles ongoing validation for marketplaces processing thousands of product URLs per day.
Get started
# Validate a product feed URL
curl -X POST https://api.unphurl.com/v1/analyse \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"url": "https://seller-store.com/product/widget-123",
"profile": "product-feed"
}'