MOZ Crawl help
-
Our Moz report says it crawled 1,800 pages, so it reports a lot of errors based on those pages. We don't have that many pages on our site. What is Moz crawling? I updated the profile to make sure it crawls the filtered page section of Google Analytics.
-
Hey Jessica,
I took a look at your crawl diagnostics and nothing looks too out of the ordinary. You have 415 duplicate pages, which is inflating your page count.
Additionally, when I exported your crawl diagnostics CSV I saw a mixture of static URLs and dynamic URLs. This may mean that two copies of each page are being crawled, which could be inflating your page totals. You might want to check your CMS and make sure it is not creating two copies of each page.
Cheers,
Ryan Watson
Business Development Associate | Moz
-
Hi Jessica,
Well, that's a pretty good mix.
Having a bunch of different kinds of errors like that could be due to a combination of issues. First of all, I would think you have a duplicate content issue, either because your CMS is creating it through different URLs (for things like multiple categories, archive pages and so on) or because a server setup issue is allowing access to the same pages through multiple URLs (like www vs non-www URLs). It might well be a combination of these things accounting for different errors, so you are going to need to narrow down the data to find out exactly what is going on.
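As a rough illustration of the "same page through multiple URLs" idea, here is a minimal Python sketch that groups a list of crawled URLs by a simplified key. The `normalize` and `group_duplicates` names are hypothetical, and the rule used (ignore the scheme, a leading `www.`, a trailing slash and the query string) is only an assumption about what counts as the same page. On a site where the query string selects different content, that rule would be too aggressive.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def normalize(url):
    """Reduce a URL to a rough 'same page' key (assumed rule, adjust per site)."""
    parts = urlsplit(url.strip())
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]  # treat www and non-www hosts as the same site
    path = parts.path.rstrip("/") or "/"  # ignore a trailing slash
    return host + path  # scheme and query string are deliberately dropped


def group_duplicates(urls):
    """Return only the keys that are reachable through more than one URL."""
    groups = defaultdict(list)
    for url in urls:
        groups[normalize(url)].append(url)
    return {key: found for key, found in groups.items() if len(found) > 1}


crawled = [
    "http://www.example.com/about/",
    "http://example.com/about",
    "http://example.com/contact",
]
for key, found in group_duplicates(crawled).items():
    print(key, found)
```

Any key that comes back with two or more URLs is a candidate for a redirect or a canonical tag, though each group still needs a manual check.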
Download your error report as a CSV, open it in Excel, select all, and then press the ctrl-windows-L combination. This will put your data into a filterable table. You can then go to the headers and filter for (for example) only 404 pages or only duplicate title pages. Once you have filtered for a specific error, look at the left-hand column to see which page or pages the problem appears on, and then at the far right-hand column to see which page links to them. You will then have a better idea of where the problem is and how the crawler is reaching it, and that will guide you on where to look to fix it. If you want to give a couple of example URLs, a few obvious things might stand out.
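If you would rather script the same filtering than do it in Excel, something like the Python sketch below could work. The column names (`URL`, `Referrer`, `Duplicate Page Title`) and the `rows_with_issue` helper are assumptions for illustration, as is the idea that an issue column holds a true/false style flag. Check the header row of your own export and adjust.

```python
import csv
import io


def rows_with_issue(csv_text, issue_column, url_column="URL", referrer_column="Referrer"):
    """Return (page, linking page) pairs for rows where the issue column is flagged.

    All column names are hypothetical; match them to the header row of the export.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    matches = []
    for row in reader:
        flag = (row.get(issue_column) or "").strip().lower()
        if flag in ("true", "yes", "1"):
            matches.append((row.get(url_column, ""), row.get(referrer_column, "")))
    return matches


sample = (
    "URL,Referrer,Duplicate Page Title\n"
    "http://example.com/a,http://example.com/,TRUE\n"
    "http://example.com/b,http://example.com/,FALSE\n"
)
for page, linked_from in rows_with_issue(sample, "Duplicate Page Title"):
    print(page, "is linked from", linked_from)
```

For a real export, read the file with `open(path, newline="", encoding="utf-8").read()` and pass the text in.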
Hope that helps!
-
By filtered, I meant that we have a profile that filters out employees hitting the site. Sorry, shouldn't have mentioned it because it isn't relevant.
-
Hi Lynn,
Thank you for getting back to me. Here is an image of our diagnostics with the errors.
-
Hi Jessica,
What are the errors telling you? If it is a lot of duplicate content type errors, it could be an indication that your site is linking to the same pages in multiple ways. So even though you think your site has a certain number of pages, to the crawler (and search engines) it actually has more! I'm not sure what you mean about updating the profile to crawl the filtered section of Google Analytics. If you care to give us an example of the errors and your site, we might be able to give more details.