Free Sitemap URL Extractor
Paste a sitemap.xml URL or raw XML to extract every URL instantly. Supports sitemap index files with full recursive extraction, metadata columns, filter, sort, and CSV/TXT export.
ℹ️
Fetches via CORS proxy to bypass browser restrictions. If the fetch fails (server blocks proxies), use the Paste XML tab instead — download the XML from your browser and paste it directly.
Fetching sitemap…
⚠️ Extraction failed
0
Total URLs
—
Sitemaps Parsed
—
With Metadata
—
Newest Lastmod
| # URL ↕ | Last Modified ↕ | Change Freq | Priority ↕ |
|---|
This tool uses a CORS proxy to fetch public sitemaps. It cannot access password-protected sitemaps or private servers. Extracted URLs are processed entirely in your browser — nothing is stored or logged. For sitemaps that block proxies, use the Paste XML tab.
Sitemap Extractor — FAQ
What is a sitemap.xml file?
▼
A sitemap.xml is an XML file that lists all the important URLs on a website. It helps search engines like Google and Bing discover and crawl pages efficiently. Sitemaps can also include metadata like when each page was last modified (
lastmod), how often it changes (changefreq), and its relative importance (priority from 0.0 to 1.0).
What is a sitemap index file?
▼
A sitemap index file is a special sitemap that links to multiple individual sitemaps instead of listing URLs directly. Large websites use this to split URLs across multiple files (each capped at 50,000 URLs). This tool automatically detects sitemap index files and recursively fetches every child sitemap, combining all URLs into one result.
Why does fetching a sitemap fail?
▼
Browsers block cross-origin requests (CORS policy). This tool routes fetches through a CORS proxy. If the proxy also fails — because the server blocks bots, requires authentication, or returns an error — switch to the Paste XML tab. Open the sitemap URL directly in your browser, use Ctrl+S (Save as) or copy the page source, then paste the XML here.
What do lastmod, changefreq, and priority mean?
▼
lastmod — date the page was last modified (ISO 8601 format, e.g. 2024-03-15).
changefreq — hint at how often the page changes: always, hourly, daily, weekly, monthly, yearly, never.
priority — a float from 0.0 to 1.0 indicating the URL's importance relative to other URLs on the site (default 0.5). Note: Google largely ignores changefreq and priority in modern crawling.
changefreq — hint at how often the page changes: always, hourly, daily, weekly, monthly, yearly, never.
priority — a float from 0.0 to 1.0 indicating the URL's importance relative to other URLs on the site (default 0.5). Note: Google largely ignores changefreq and priority in modern crawling.
How do I download all URLs as a CSV?
▼
After extracting URLs, click Download .csv. The file includes four columns: URL, Last Modified, Change Frequency, and Priority. Perfect for importing into Google Sheets, Excel, SEO auditing tools, or processing with scripts.
How do I find my website's sitemap?
▼
Try these common locations:
https://yoursite.com/sitemap.xml, https://yoursite.com/sitemap_index.xml, or https://yoursite.com/robots.txt (look for a Sitemap: directive). WordPress sites typically use /wp-sitemap.xml. Google Search Console also shows your registered sitemaps under Coverage → Sitemaps.
Related Tools
Marketing
UTM Builder
Build UTM tracking links for campaigns in seconds.
SEO
URL Slug Generator
Convert titles to clean, SEO-friendly URL slugs.
URL Tools
URL Encoder/Decoder
Encode or decode percent-encoded URLs instantly.
Developer
Regex Tester
Test and debug regular expressions with live highlighting.
Extraction
Data Extractor
Extract emails, URLs, and patterns from any text.
SEO
XML Sitemap Generator
Generate a sitemap.xml from a list of URLs.