Moz Crawler suddenly reporting 1000s of duplicates (BE.net)
-
In the last 3-4 days we've had several thousand 'duplicate content' warnings appear in our crawl report, 99% of them related to our on-site blog. The blog is BlogEngine.Net, but the pages simply don't exist. The majority seem to be Roger trying quasi-random URLs like:
/?page=410/?page=151
Etc. etc. The blog will present content for these requests, but it is of course the same empty page since there's only unique content for up to /?Page=10 or so.
Two questions:
1. Did something change recently? These blogs have been up for months, and this problem has only come up this week. Did Roger change to become more aggressive lately?
2. Suggested remediation? On one of the blogs I've put no-index no-follow for any page that has a /?page querystring, and we'll see what effect that has come next crawl next week. However, I'm not sure this will work as per:
http://moz.com/community/q/functionality-of-seomoz-crawl-page-reports
Anyone else had dynamic blogs suddenly blossom into thousands of duplicate content warnings? Google (rightly) ignores these pages completely.
-
Hate to bump my own question, but it appears I spoke too soon about no-index,no-follow solving this. The duplicate errors went away for about 5 days, but then yesterday spiked with the same problem. I've confirmed that no-index, no-follow are present on the pages being detected as bad.
As per the best practices document:
http://moz.com/learn/seo/robotstxt
Using meta robots no index no follow is the recommended option:
Block with Meta NoIndex
This tells engines they can visit, but are not allowed to display the URL in results. This is the recommended method
But it apparently isn't working, as evidenced by the new surge of duplicate errors. Is there anything else I can do? I don't want to explicitly block Roger in robots.txt as that seems rather backward. Should Roger be included the Bad Robots List?
-
Peter -
Thanks for the clarification. I understand the philosophy at hand, and I kind of even understood it before I had asked the question. I'm handling these with a mix of canonical and no-index/no-robot.
Related to that, update:
By marking the superfluous pages no-index/no-follow the error count for the site has diminished by about 10,000 and the warning count by about 28,000 so that seems to be the way to go. The pages that had content are 'low value' in this context, since that content was readily available elsewhere.
-
Hi there!
Thanks for writing in with a great question.
We definitely count those dynamic URLs as duplicate content. While we are pretty sure that search engines can figure this stuff out and know which URL to index, it's still considered best practices to canonicalize or otherwise direct crawlers to the original URL (as far as I know. I'm not a professional SEO so you might be better off asking the Pro Q&A community at www.moz.com/community/q - they are all SEOs like you).
Since some dynamic URL generators can cause problems for crawlers, we do try to be overly-inclusive of these issues rather than overly-exclusive. We want people to know about potential issues with sites, even if they're not really issues in the scheme of the site owner's specific SEO implementation plan.
In sum, we'd rather leave those judgments up to you and at the same time, provide you with the data you need to make these decisions. I hope this helps explain our thinking here! However, if you think that our crawler might be having issues, and you do not want to post your site urls here you could always send us a support ticket at help@moz.com. That way can can examine it a bit further and provide some insights into why our crawler thinks this way!
Hope this helps!
Peter
Moz Help Team.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google is reporting a server error, but there's no server error.
Google is erroneously reporting a server error and I just can't figure out the source of the issue. My links work, and GoDaddy ensures me there is no server error. This issue arose when I moved from HTTP to HTTPS and CPanel hosting, but I've got no idea how to fix it. I thought maybe I have duplicate content, but it does not appear that way. Any suggestions? I'm at a loss. www.thedishmaster.com
Reporting & Analytics | | TheDishmaster0 -
I am confuse with google analytic custom and segment report
Hi All, In google analytic when I create custom report for my ecommerce site then figures go mad. I really not able to judge peformance of device, browser and it's version, conversion, ecommerce conversion rate etc. same way if I add secondary dimension in report then also figures are not accurate. Again when I create different segment like desktop, mobile, tablet then in tablet segment mobile devices comes and in mobile device tablet appear why segment also not accurate? Is it because I am using free version? Also do we have alternative of google analytic which give same report like google analytic either device, browser, os, segment, enhance ecommerce etc? Thanks!
Reporting & Analytics | | dhisman0 -
Google Analytics Automated Reporting
HI all, I tend to do a big reporting powerpoint deck using screenshots from google analytics and tables I create year end and mid year. It's like an 80 page report for the 10 webisite swe have and then I go ahead and make annotations as I see from the data. That being said this can take a lot of time, up to a 40 hours of time to pull it all together or more which is challenging when you have daily meetings. Anyhow, I've looked into automating and tried a couple things: 1. Tableau- but it keeps crashing and seems tedious 2. Dashlane and supergrabber- seem a bit tedious to set up too. Anyone have ideas on how to better shar ereporting in the organization in this type of format for a website (websites)? Organic, paid, traffic, etc. Laura
Reporting & Analytics | | lauramrobinson322 -
Why would page views per visitor suddenly increase?
My website traffic is growing by about 1% a week. It has a fairly stable page views/visitor of about 1.69. There's normally very little variability in this As we sell an industrial product. Today page views jumped by 50% and so did page views/visitor but visitor numbers stayed the same. I dont have a useful hypothesis to explain this. Analytics shows me that the traffic source, country of origin and pages viewed are pretty much the same as normal. There's been no substantive change to the site (today we changed the text in a widget to link to a new page - and no one visited it). It doesn't look like 1 person has gone through the whole site as that would skew the distribution of page views by country So why would user behavour suddenly change? I'll look at it for the rest of the week but in 7 years of looking after this website I haven't seen anything like this before.
Reporting & Analytics | | Zippy-Bungle0 -
Google webmaster links vs Moz Reporte do follow links
A bit confused about my seo reports for a site I am tracking in Moz. Google webmaster reports i have 1836 links to my domain.
Reporting & Analytics | | KenW
Moz reports 273 external followed links.
Website Auditor reports 449 dofollow and 338 no follow.> total 787
What is important factor that I should be reporting to my client that really matters?0 -
Google Custom Report Filters are a pain in the arse
Greetings from high weed pollen count Wetherby UK 🙂 Ok ive been having extreme custom reporting filter frustration with Google analytics. My objective was to set up a "custom" google analytics report that detailed: 1. Referral traffic from external sights in one reporting tab
Reporting & Analytics | | Nightwing
2. Visitor generic traffic report including direct, search & referral traffic e.g: http://i216.photobucket.com/albums/cc53/zymurgy_bucket/reporting-tabscopy.jpg But Ive hit the provebial buffers 😞 Initailly i added an advanced filter as shown here: http://i216.photobucket.com/albums/cc53/zymurgy_bucket/custom-reporting-with-filtercopy.jpg but the advanced filters do not stick / save. As soon as you exit the custom report the advanced filter is cleared as shown here: http://i216.photobucket.com/albums/cc53/zymurgy_bucket/custom-reporting-filter-removedcopy.jpg I then added a filter under a specific custom tab but the filter cascades into all the reporting tabs see below: http://i216.photobucket.com/albums/cc53/zymurgy_bucket/custom-reporting-cascading-filterscopy.jpg So my question is please... "How do can you set up filters that effect the data in one reporting tab and does not cascade into the others" Any insights welcome 🙂0 -
Duplicate Page Title
I'm new to SEO and have just signed up to SEOMOZ to see what I can learn. I got the report back on my site and it indicates various errors, one of them being Duplicate Page Title - I have a blog on my site and a lot of pages identified as with duplicates are like this: http://www.martinspencephotography.co.uk/blog?page=2 Is it important I rectify this? Do I need to rectify it?
Reporting & Analytics | | MartinSpence460 -
On what report do I get to know where do the external links come from?
I need to get a list of the external links. I don't find that report, where do I get it?
Reporting & Analytics | | carloscontinua0