Unexplained Crawl Diagnostic Errors & Opencart
-
Hi,
I've been looking at the crawl diagnostics for my site and trying to fix the errors that are showing up but Seomoz is producing some strange results.
It's saying pages are duplicated upto 16 times but those pages dont exist. It's adding "page=3", "page=4" to the end of the product URL but I don't see how it's finding those pages, nothing on the site(as far as I can tell) is linking to them. There is no "page=3", just the one product page.
Again on the duplicate content it's saying under the "other URLs" there's URLs like "http:///product-a" but again I don't see where it's finding these URLs and obviously those URL's dont work. Those three slashes aren't a typo either.
So far I've reduced the amount of errors from 2,005 to 543 but the rest of them I can't make sense of.
Also, what does one do when you have two products, eg: "product-a-white" and "product-a-black" to prevent Seomoz from seeing duplicates? Canonical links wont work because there's no parent item, just those two. Google Webmaster tools doesn't seem to have a problem though.
Using Opencart 1.5, if it helps.
Cheers,
-
Ah, so it may well be opencart doing something funky then. It's carrying the page url over into the product listing by the looks of it. I'll have to look into that then, thanks for pointing that out!
Do you have any idea how it could be finding the "http://maggie" style links?
Cheers for the help,
-
Ok, here is a example
http://www.lustrelingerie.com/Gracya-Lingerie/safari-wild-bra-push-up?page=5linked from
http://www.lustrelingerie.com/Gracya-Lingerie?page=5
Seems like if the pages= is on the catalog page, it is on the product links
-
Hi Alan, thanks for the response.
Yea, sure there's additional pages for the categories, I'm talking about the individual products.
Take http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red for example. Seomoz's Diagnostics is saying there's a http://www.lustrelingerie.com/Bassaya-Lingerie/camila-red?page=2. The latter works if you go there, I don't understand that and that's likely down to opencart, but what I don't get is how Seomoz is finding the link to it.
And it's the same with links such as "http://maggie" (real error), I don't see where Seomoz is finding the links to those. I've checked any stray canonical links but they seem fine to me.
Thanks,
-
Yes they do exist
this page http://www.lustrelingerie.com/Everyday-Luxury-Underwear-Lingerie?page=1
is linked from this page
http://www.lustrelingerie.com/Everyday-Luxury-Underwear-Lingerie
There are many examples
-
The URL is http://www.lustrelingerie.com/
-
If you can give us a url i will tell you for sure
-
Hi Ben, thanks for the response.
The thing is I don't think it's a CMS issue, it seems to me that seomoz is getting confused somewhere. my product pages are along the lines of "www.domain.com/range/product-a/". They have a canonical link pointing to "www.domain.com/product-a/" And all only have a single page to them. Which is why I can't figure out where Seomoz is picking up these duplicates.
With regards to your latter paragraph, yea I was thinking that. I thought it might confuse customers though, or I was hoping there would be a more elegant solution. Going back in and editing 500+ products isn't something I was looking forward to hehe.
Cheers,
-
I'll speak to the duplicates issue since the other appears to be a CMS issue and how it is displaying the products. Whenever I see the "page=1" in the URL I can usually fink a pagination script that isn't helping my SEO efforts. But I don't know for sure in your situation, especially since you said you don't see any links on the product page.
As far as the "duplicates" issue. Try to get them as distinct as possible. With our product pages (starting with the most sold items) I have begun changing up the product name. We have the difference of only the height on many of our products so I'm having to get a little creative and add some other aspect to the URL that stays within the products title. I only want one page from my site competing for that exact match product SERP anyway. It's not a good idea to have two pages on your site competing for the same SERP. It seems to always be treated with less authority by Google when that happened in the past.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How to Diagnose "Crawled - Currently Not Indexed" in Google Search Console
The new Google Search Console gives a ton of information about which pages were excluded and why, but one that I'm struggling with is "crawled - currently not indexed". I have some clients that have fallen into this pit and I've identified one reason why it's occurring on some of them - they have multiple websites covering the same information (local businesses) - but others I'm completely flummoxed. Does anyone have any experience figuring this one out?
Reporting & Analytics | | brettmandoes2 -
Google Analytics - Organic Search Traffic & Queries -What caused the huge difference?
Our website traffic dropped a little bit during the last month, but it's getting better now, almost the same with previous period. But our conversion rate dropped by 50% for the last three weeks. What could cause this huge drop in conversion rate? In Google Analytics, I compared the Organic Search Traffic with previous period, the result is similar. But the Search Engine Optimization ->Queries shows that the clicks for last month is almost zero. What could be the cause of this huge differnce? e9sJNwD.png k4M8Fa5.png
Reporting & Analytics | | joony0 -
Is real google bot like "fetch"or more like "fetch & render"?
In GWT we have two options to mimic googlebot visits, "fetch" and "fetch and render", but when the real googlebot visit a page, is he behaving like the former or the latter? I can see fetch does fetch only the html, while fetch and render does fetch .js and .css as well. But what does the real googlebot does? I have checked the web server logs, and I can see the real googlebot sometimes request the .js files too, but not every time it visit a page, sometimes it does, sometimes it does not. Has anyone figured out when googlebot actually request javascript files?
Reporting & Analytics | | max.favilli0 -
Suspect Links from Yeusaigon.net Causing Server Errors
Good morning, Webmaster Tools is reporting an increase in server errors on our site due to some very suspect links from Yeusaigon.net. After taking a quick look, it appears they are some form of search engine attempting to link to our images by using incomplete URLs. For example: http://yeusaigon.net/search/images.php?q=htc%20one%20max%20phone%20cases&page=1044 Is linking to: http://www.mobilemadhouse.co.uk/caseflex-htc-one-max-real-leather-flip... As this URL is incomplete, it's throwing up a server error. There are currently 139 instances of there errors from the same domain, and is increasing by around 5-10 per day. The domain, however, is linking to some of our pages/images correctly, but I fear Google may look at these as spammy links - they certainly look that way! So, what can we do? I can't find any contact details on Yeusaigon website so I have disavowed the entire domain. Is this the right thing to do? How do I stop the ever-increasing number of sever errors due to incorrect URLs? Cheers, Lewis
Reporting & Analytics | | PeaSoupDigital0 -
Universal Analytics & Google Tag Manager - Track URLs that include hashes
Does anyone have any experience tracking URLs that include hashes (#) using Universal Analytics and Google Tag Manager? Can it be done using GTM's container for UA, using the "more settings" options? Or building another tag to work with the GTM UA container? The fallback I'm considering is implementing the UA code in GTM for every page as Custom HTML with the "ga('send', 'pageview', location.pathname + location.search + location.hash);" solution, rather than GTM's specialized UA tag. I'm not yet sure what problems may arise from that, if any. Thanks in advance.
Reporting & Analytics | | 352inc0 -
404 errors more than 1.8 lacs, Duplicate Content, Duplicate title, missing meta description increasing as site is based on regular ticket selling (CRM), kindly help
Sites error increasing i.e. 404 errors more than 1.8 lacs, Duplicate Content, Duplicate title, missing meta description increasing day by day as site is based on regular ticket selling (CRM), We have checked with webmasters for 404's, but it is not easy to delete 1.8 lac entries. How to resolve this issue for future. kindly help and suggest the solution.
Reporting & Analytics | | 1akal0 -
Webmaster tools crawl errors
Hi there, iv been tracking my webmaster tools crawl errors for a while now(6 months) and im noticing some pages that are far gone 404 are still poping out on the crawl errors. - that pages have no data for xml linking, and remote linking are from pages that are far gone 404 also. that pages have 404 error page + redirect to homepage, and google still notice them with old cache content. does someone have a clue why is this happening?
Reporting & Analytics | | Or.Shvartz0 -
Moved Up in SERPS & Traffic, Need Help Converting
Hello, After listening to the advice of many of you on this forum, I have managed to move my site up in the SERPS, close enough to where I want/need to be. My traffic has increased heavily, yet I am still not seeing a large increase in orders being placed. I am positive that I have the lowest prices on these items, and the most information available about them, yet I still can't seem to convert a lot of this traffic into sales. Can you guys please take a look at my site and provide some guidance on what I can/should do to help convert these visitors to customers? my site is : http://goo.gl/JgK1e Thanks
Reporting & Analytics | | Prime850