Unexplained Crawl Diagnostic Errors & Opencart
-
Hi,
I've been looking at the crawl diagnostics for my site and trying to fix the errors that are showing up, but Seomoz is producing some strange results.
It's saying pages are duplicated up to 16 times, but those pages don't exist. It's adding "page=3", "page=4" to the end of the product URL, but I don't see how it's finding those pages; nothing on the site (as far as I can tell) is linking to them. There is no "page=3", just the one product page.
Again on the duplicate content, it's saying under "other URLs" that there are URLs like "http:///product-a", but again I don't see where it's finding these URLs, and obviously those URLs don't work. Those three slashes aren't a typo either.
So far I've reduced the number of errors from 2,005 to 543, but the rest of them I can't make sense of.
Also, what does one do when you have two products, e.g. "product-a-white" and "product-a-black", to prevent Seomoz from seeing duplicates? Canonical links won't work because there's no parent item, just those two. Google Webmaster Tools doesn't seem to have a problem though.
Using Opencart 1.5, if it helps.
Cheers,
-
Ah, so it may well be Opencart doing something funky then. By the looks of it, it's carrying the page parameter from the category URL over into the product links in the listing. I'll have to look into that then, thanks for pointing that out!
Do you have any idea how it could be finding the "http://maggie" style links?
Cheers for the help,
-
OK, here is an example:
http://www.lustrelingerie.com/Gracya-Lingerie/safari-wild-bra-push-up?page=5
linked from
http://www.lustrelingerie.com/Gracya-Lingerie?page=5
It seems that if page= is in the catalog page URL, it is also on the product links.
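One way to confirm this on your own pages is to pull every link out of a category page's HTML and list the ones that carry a page parameter. The following is only a rough sketch in Python using the standard library; the names `LinkCollector` and `links_with_param` and the `sample_html` snippet are made up for illustration, and you would feed in the real HTML of the page you want to check.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit, urlunsplit, parse_qsl, urlencode


class LinkCollector(HTMLParser):
    """Collect the href of every <a> tag in a page."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def links_with_param(html, base_url, param="page"):
    """Return (absolute_url, url_without_param) for links that carry `param`."""
    collector = LinkCollector()
    collector.feed(html)
    found = []
    for href in collector.hrefs:
        absolute = urljoin(base_url, href)
        parts = urlsplit(absolute)
        query = parse_qsl(parts.query, keep_blank_values=True)
        if any(key == param for key, _ in query):
            # Rebuild the URL without the parameter to show the "clean" version.
            kept = [(k, v) for k, v in query if k != param]
            cleaned = urlunsplit(parts._replace(query=urlencode(kept)))
            found.append((absolute, cleaned))
    return found


# Hypothetical snippet standing in for the HTML of a category page at ?page=5.
sample_html = """
<a href="http://www.lustrelingerie.com/Gracya-Lingerie/safari-wild-bra-push-up?page=5">Bra</a>
<a href="http://www.lustrelingerie.com/Gracya-Lingerie">Category</a>
"""
result = links_with_param(
    sample_html, "http://www.lustrelingerie.com/Gracya-Lingerie?page=5"
)
for url, cleaned in result:
    print(url, "->", cleaned)
```

Any product URL that shows up in that list is a link the crawler can follow, which would explain the "duplicate" product pages.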
-
Hi Alan, thanks for the response.
Yeah, sure, there are additional pages for the categories, but I'm talking about the individual products.
Take http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red for example. Seomoz's Diagnostics is saying there's a http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red?page=2. The latter works if you go there. I don't understand why, and that's likely down to Opencart, but what I don't get is how Seomoz is finding the link to it.
And it's the same with links such as "http://maggie" (real error): I don't see where Seomoz is finding the links to those. I've checked for any stray canonical links, but they seem fine to me.
Thanks,
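For tracking down where links like "http:///product-a" or "http://maggie" come from, one option is to collect the raw href values from your templates or rendered pages and flag any absolute link whose host is empty or has no dot in it. This is only a sketch under those assumptions; `suspicious_hrefs` and `sample_hrefs` are made-up names, and the check is deliberately crude (it would also flag something like "http://localhost").

```python
from urllib.parse import urlsplit


def suspicious_hrefs(hrefs):
    """Return (href, reason) for absolute http(s) links whose host looks wrong."""
    flagged = []
    for href in hrefs:
        parts = urlsplit(href.strip())
        if parts.scheme not in ("http", "https"):
            continue  # relative links, mailto:, javascript:, etc. are skipped
        host = parts.hostname or ""
        if not host:
            flagged.append((href, "empty host"))
        elif "." not in host:
            flagged.append((href, "host has no dot"))
    return flagged


# Hypothetical href values as they might appear in the page source.
sample_hrefs = [
    "http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red",
    "http:///product-a",
    "http://maggie",
    "/Bassaya-Lingerie/camila-red",
]
flagged = suspicious_hrefs(sample_hrefs)
for href, reason in flagged:
    print(href, "->", reason)
```

The raw href is checked on purpose rather than resolving it against the page URL first, since resolving can fill in the missing host and hide the problem.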
-
Yes, they do exist.
This page: http://www.lustrelingerie.com/Everyday-Luxury-Underwear-Lingerie?page=1
is linked from this page:
http://www.lustrelingerie.com/Everyday-Luxury-Underwear-Lingerie
There are many examples.
-
The URL is http://www.lustrelingerie.com/
-
If you can give us a URL, I will tell you for sure.
-
Hi Ben, thanks for the response.
The thing is, I don't think it's a CMS issue; it seems to me that Seomoz is getting confused somewhere. My product pages are along the lines of "www.domain.com/range/product-a/". They have a canonical link pointing to "www.domain.com/product-a/", and each has only a single page, which is why I can't figure out where Seomoz is picking up these duplicates.
With regards to your latter paragraph, yeah, I was thinking that. I thought it might confuse customers though, or I was hoping there would be a more elegant solution. Going back in and editing 500+ products isn't something I was looking forward to hehe.
Cheers,
-
I'll speak to the duplicates issue, since the other appears to be a CMS issue with how it is displaying the products. Whenever I see "page=1" in the URL, I can usually find a pagination script that isn't helping my SEO efforts. But I don't know for sure in your situation, especially since you said you don't see any links on the product page.
As for the "duplicates" issue, try to make the pages as distinct as possible. With our product pages (starting with the most sold items) I have begun changing up the product name. Many of our products differ only in height, so I'm having to get a little creative and add some other aspect to the URL that stays within the product's title. I only want one page from my site competing for that exact-match product SERP anyway. It's not a good idea to have two pages on your site competing for the same SERP. In the past, when that happened, the pages always seemed to be treated with less authority by Google.