Is SeoMOZ Crawl Diagnostics wrong here?
-
We've been getting a ton of critical errors (about 80,000) in SeoMoz' Crawl Diagnostics saying we have duplicate content in our client's E-commerce site. Some of the errors are correct, but a lot of the pages are variations like:
www.example.com/productlist?page=1
www.example.com/productlist?page=2
However, in our source code we have used rel="prev" and rel="next" so in my opinion we should be alright.
Would love to hear from you if we have made a mistake or if it is an error in SeoMoz.
Here's a full paste of the script:
-
Just a minor clarification - you can use both rel=prev/next and rel=canonical, IF you have something like search filters. Then, the canonical would point to the unfiltered current page and the rel=prev/next would point to the filtered paginated pages. Yeah, I know, that made a lot of sense. Let's say your page is:
http://example.com/stuff?page=2&sort=price
...then you might have
It's more than a little confusing.
Definitely check out that JavaScript issue, though - it might be that bots aren't seeing what people are seeing, and that could be very dangerous.
-
Hi,
In regards the rel=next you are absolutely right, I must have overlooked it or just searched for the prev tag. So yes as far as proper implementation of the prev/next in that respect it is correct and please ignore that last part of my first post!
Turning of javascript is instructive to see all those tags on their individual page and helps clarify what exactly is being outputted and when without the dynamic loading, providing you don't miss a rel=next tag that is really there
-
Hi Lynn,
Thank you very much for your answer / analysis! As you said "It is a bit confusing" and I will just read your answer a couple of times...
I will grant your answer "Good answer" for you thorough analysis! I think it is spot on with the double "next/prev" and "rel=can" tags. I do have one remark. You said: When I turn off javescript, I get this:
In my opinion this is alright, because it shouldn't have a "prev" as this is the initial page.
-
Hi,
I had a look at what I assume is the site and I think you have a combination of things going on that is likely causing confusion (to you, to the moz bot, probably to google too!)
Firstly, it is not recommended to use rel prev/next and rel canonical on the same page. With that what you are effectively doing is only indexing the first page of the results since all the other pages rel canonical back to the first one. If you have a 'view all' type page then you could rel canonical all of the paginated pages back to this one and you would not need to use the prev/next tags at all. It is also possible that your use of relative canonical links in combination with the above is also causing confusion, usually best to use absolute urls if possible.
Beyond that, the site dynamically loads more products as you scroll down the page which also results in the url changing to hoeretelefon/? for ALL the pages. If that is a problem or not depends on how it is coded and how the google and seomoz bots are deciding to parse the page, but it certainly adds another potential area of complexity to the issue.
Lastly, if you browse the site with javascript turned off you can see something odd in that the initial page /elektronik/baerbar-lyd/hoeretelefon has no prev/next OR canonical tag but has a link to /elektronik/baerbar-lyd/hoeretelefon?page=1 on which you find prev/next and canonicals back to the non paginated version. So you are basically skipping the pagination setup that goes from the original to the page=1 (but also giving a canonical back to the original page).
Phew! It is a bit confusing. I would recommend deciding on if you want to go with prev/next or canonical in the first place and take it from there. I would think that if you have the ability to canonical to a 'see all products page' then this might be the best way to go since it should theoretically take care of any issues the dynamic loading is causing also.
Hope that helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
SEO on-demand crawl
what happened to the on-demand crawl you could do in PRO when they switched to the new MOZ site?
Moz Pro | | Vertz-Marketing0 -
Does SEOmoz have a Keyword Research tool similar to, say, the Google AdWords tool or the WebCEO Keyword Research Tool? And where might that be? (Sorry, I'm very new to SEOmoz Pro.)
I'm looking for an SEOmoz version of the classic WebCEO Keyword Research that would give you effective suggestions based on a keyword inquiry. I've made the switch from WebCEO, but I'm trying to find something similar to that Keyword Research tool. Am I going to just need to use the Google AdWords tool for this function or does SEOmoz have it's own version?
Moz Pro | | SmokewagonKen0 -
Using Seomoz for Site Evaluation am I up to par ?
Just wanted to see how people using the seomoz bar would rate a four month old site with Domain-Homepage Authority of 27 Mozrank of 5.08 and Moztrust of 5.65 . I've read up on all the factors but just wanted to know if Im up to par on building a great site thats search engine friendly. Inner pagers are on a PA of 20 and around the same mozrank and moztrust levels of +- 5.
Moz Pro | | NikolasNikolaou0 -
SEOMoz Software
I want to start off with stating that i am truly an advocate of SEOMoz and the great stuff they have done for the inbound community that we all know and love. I've been an active member since July 2010 and a paying pro member since December 2010. The software has always been monumental in helping my clients achieve their goals. However, in the past few months i have received nothing short of buggy unreliable software. The keyword difficulty tool never returns difficulty results. The Adwords data has been gone since i can remember. The rank tracker tool is successfull close to 1 out of 5 times. OSE is updated terribly slow compared to competitors. Plus, I have had to write emails to get my campaigns to be manually refreshed to see new ranking data. I have simply missed deadlines because my data is always delayed or missing from the software. Am i an anomaly here? does anyone have these problems? I have been researching some new tools as a replacement but i have yet to find anything as robust as the old SEOMoz. I'd love some feedback. Cheers - Kyle
Moz Pro | | kchandler0 -
Crawl Diagnostics Report
I'm a bit concerned about the results I'm getting from the Crawl Diagnostics Report. I've updated the site with canonical urls to remove duplicate content and when I check the site - it all displays the right values, but the report, which has just finished crawling is still showing a lot of pages as duplicate content. Simple example: http://www.domain.com http://www.domain.com/ Both of them are in the duplicate content section although both have canonical url set as: Does each crawl check the entire site from the beginning or just the pages it didn't have a chance to crawl the last time? This is just one of 333 duplicate content pages, which have canonical url pointing to the right page. Can someone please explain?
Moz Pro | | coremediadesign0 -
An error in the SeoMoz On page note?
Hello folks, Whenever I go the OnPage link in SeoMoz some of my links show a F ranking note. And when I click in one of them to see the detail of the page rank, it shows me as an A ranking note. Do you have seen the same problem? Which note shall I rely on? Thanks!!
Moz Pro | | jgomes0 -
Is there any way to view crawl errors historically?
One of the website's we monitor have been getting high duplicate page titles, as we work through the pages, we see changes and the number of duplicate page titles are decreasing. However, lately, it went up again and the duplicate page titles have increased. I wanted to ask if there's any way to view the new errors and the old errors separately or sorted in a way that can help me identify why we are getting new page crawl errors. Any advice would be great. Thanks!
Moz Pro | | TheNorthernOffice790 -
Initial Crawl Questions
Hello. I just joined and used the Crawl tool. I have many questions and hoping the community can offer some guidance. 1. I received an Excel file with 3k+ records. Is there a friendly online viewer for the Crawl report? Or is the Excel file the only output? 2. Assuming the Excel file is the only output, the Time Crawled is a number (i.e. 1305798581). I have tried changing the field to a date/time format but that did not work. How can I view the field as a normal date/time such as May 15, 2011 14:02? 3. I use the ™ symbol in my Title. This symbol appears in the output as a few ascii characters. Is that a concern? Should I remove the trademark symbol from my Title? 4. I am using XenForo forum software. All forum threads automatically receive a Title Tag and Meta Description as part of a template. The Crawl Test report shows my Title Tag and Meta Description as blank for many threads. I have looked at the source code of several pages and they all have clean Title tags and I don't understand why the Crawl Report doesn't show them. Any ideas? 5. In some cases the HTTP Status Code field shows a result of "3". Why does that mean? 6. For every URL in the Crawl Report there is an entry in the Referrer field. What exactly is the relationship between these fields? I thought the Crawl Tool would inspect every page on the site. If a page doesn't have a referring page is it missed? What if a page has multiple referring pages? How is that information displayed? 7. Under Google Webmaster Tools > Site Configurations > Settings > Parameter Handling I have the options set as either "Ignore" or "Let Google Decide" for various URL parameters. These are "pages" of my site which should mostly be ignored. For example a forum may have 7 headers, each on of which can be sorted in ascending or descending order. The only page that matters is the initial page. All the rest should be ignored by Google and the Crawl. Presently there are 11 records for many pages which really should only have one record due to these various sort parameters. Can I configure the crawl so it ignores parameter pages? I am anxious to get started on my site. I dove into the crawl results and it's just too messy in it's present state for me to pull out any actionable data. Any guidance would be appreciated.
Moz Pro | | RyanKent0