Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Find archived sitemap of a website that no longer exists
-
I am trying to figure out the site structure of a website and the urls of all the pages. Normally this would be easy but a couple of months ago the website went down and I don't think it will ever come back. Any help would be appreciated.
-
Use the internet archive (wayback machine) which effectdigital mentioned above, to find the /robots.txt file from the desired date. In that file you should find the referenced sitemap file (assuming the site properly included its sitemap reference in its robot.txt file). Then you can use the same process to request the sitemap file which was referenced in the robots.txt file.
-
Hi Effect,
Does your second link automatically provider the sitemaps available, or does the user still need to "know" or be able to guess where they might be e.g /sitemap.xml?
Nick
-
You can use this site to see legacy site-maps for some websites (though they may be partial or incomplete):
For example, check these sitemap results:
For smaller sites, the results are much easier to look at.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Search Intent and Relevance
Hello SEO gurus 🤓 I’m looking for the most efficient ways to analyze the search intent and relevance of competitors who are ranking for the keywords we’re targeting. While I know Google excels at assessing search intent and relevance, I’m interested in learning how we can evaluate these factors as metrics for our competitors. The goal is to understand their strategy better and find ways to outrank them. Do you have any tools or methodologies that you recommend for assessing competitor content to determine its alignment with search intent and user needs/relevance? I’d love to hear your thoughts and suggestions on this!
Competitive Research | | Cricket931 -
How to set up a competitor URL with a language slug for a campaign
Hello, I am trying to set up a competitor with language slug for my (subfolder) website with a language slug. Let's say my website is something like: websiteholding.com/de/website
Competitive Research | | Siir
My competitor is: competitor.com/de When I go to Campaign Settings > Comptetitor Sites > type in competitor.com/de > Hit Save Competitor > Then it shows the saved competitor without the language slug as competitor.com I am not sure if this is a correct method of tracking since for my DE website I would like to track the DE page of the competitor, not their global page. Please correct me if I am wrong and help me out on possible solutions? I am quite new to SEO & Moz , so any help on the topic would be appreciated.0 -
Is it allowed to buy an old website with external links en redirect it to your own site??
Hello, We noticed at MOZ that one of our competitor's websites has increased the number of external backlinks from 5k to 25k within a month. Upon investigating the links, we found that the competitor had acquired an old, unknown website with completely different content and external links in that month. The competitor then redirected that purchased website to their own website, instantly gaining 20k external links. This seems to be against Google's guidelines as it is an extremely unnatural way of link building and should be penalized by search engines. However, the website's Domain Authority (DA) has increased by 10 points in that month, and its rankings have greatly improved on Google. So it appears that acquiring an old website with many external links unrelated to your own website is highly profitable. We try to obtain links honestly, but we cannot reach 25k links naturally. Is there any way to report these practices to Google? Does anyone know how to do this?
Competitive Research | | Femamedia0 -
Interal linking
Re: I'm stuck by internal linking. What structure should a football website follow? Silo or Topic Cluster?
Competitive Research | | thanhdung0906
I need advice on my website. My website: https://tipbongda247.net/
I hope there are answers!
Thanks0 -
How do you select which keywords to push in SEO?
Hi Guys Selecting the right keywords that a website can realistically rank for is a key to gain top rankings relatively quick. I am just curious to hear how you guys do it (the methodology) when selecting which keywords to push? I mean you need to check the competition for each keyword as well so how to check this quickly to see what we realistically can rank for? Cheers John
Competitive Research | | igniterman751 -
How can I track where visitors go after exiting my website?
I don't want to track external links. I just want to know where they go when they leave. Is that possible? Can I do this with a cookie?
Competitive Research | | Vacatia_SEO0 -
I am looking to find the top pages based on traffic volume on my competitors websites, does anyone know of any good resources?
I want to know how which pages on my competitors websites are the most popular based on the traffic volume. I do not care how many links or directed to that page or any other metric. Only thing I am looking for is the traffic volume. It would also be nice to know the length of time spent on that page.
Competitive Research | | kanteenboy0 -
How to Find Another Site's robots.txt File?
An SEO report, not by SEOmoz, says my top two competitors have robots.txt files that disallows spidering. I suspect that their robots.txt file doesn't disallow all spidering. How do I find out what is in their robots.txt files?
Competitive Research | | lbohen0