Feedback needed on possible solutions to resolve indexing on ecommerce site
-
I’ve included the scenario and two proposed fixes I’m considering. I’d appreciate any feedback on which fixes people feel are better and why, and/or any potential issues that could be caused by these fixes. Thank you!
Scenario of Problem
I’m working on an ecommerce website (built on Magento) that is having a problem getting product pages indexed by Google (and other search engines). Certain pages, like the one I’ve included below, aren’t being indexed. I believe this is because of how the site’s internal linking is configured. The site structure places certain pages very deep, so the only way for Googlebot to reach them is through a pagination page (such as www.acme.com/page?p=3). In addition, the link sits far down the pagination page; generally more than 125 links precede it in the source code.
One of the Pages that Google isn’t indexing: http://www.getpaper.com/find-paper/engineering-paper/bond-20-lb/430-20-lb-laser-bond-22-x-650-1-roll.html
This page is linked from http://www.getpaper.com/find-paper/engineering-paper/bond-20-lb?p=5, and it is the 147th link in the source code.
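For anyone who wants to reproduce a count like this, here is a minimal Python sketch that reports where a given link falls among the anchors in a page's source. It uses only the standard library. The sample markup and the `link_position` helper are hypothetical and for illustration only; they are not taken from the actual site.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href of every <a> tag in source order."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def link_position(html, target_href):
    """Return the 1-based position of target_href among the page's links, or None."""
    collector = LinkCollector()
    collector.feed(html)
    for position, href in enumerate(collector.hrefs, start=1):
        if href == target_href:
            return position
    return None


# Hypothetical markup: two navigation links ahead of one product link.
sample_html = """
<ul class="nav">
  <li><a href="/find-paper">Find Paper</a></li>
  <li><a href="/about">About</a></li>
</ul>
<div class="products">
  <a href="/find-paper/sample-product.html">Sample product</a>
</div>
"""

print(link_position(sample_html, "/find-paper/sample-product.html"))  # prints 3
```

Running the same function on the saved source of a pagination page would show how many links come before a given product link.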
Potential Fixes
Fix One: Add navigation tags to the template so that search engines will spend less time crawling the navigation links and will get to the deeper pages, such as the one mentioned above. Note: the navigation tags are an HTML5 feature; however, the Magento site in question does not use HTML5.
Fix Two: Revise the templates and CSS so that the main navigation and the sidebar navigation are at the bottom of the page rather than the top. This would put the links to the product pages ahead of the navigation links in the source code.
-
Thanks Matthew. While I am aware of duplicate content on this site, I wasn't aware it was specific to some of the pages that aren't being indexed. I will do more research on this!
-
Hey,
It looks like you might have a duplicate content problem contributing here. For instance, you linked to: http://www.getpaper.com/find-paper/engineering-paper/bond-20-lb/430-20-lb-laser-bond-22-x-650-1-roll.html
There is also this duplicate page, which doesn't have the category directory structure in its URL:
http://www.getpaper.com/430-20-lb-laser-bond-22-x-650-1-roll.html
That duplicate page is indexed by Google. It also looks like the duplicate page is what is listed in your XML sitemap, not the page you have linked to from the paginated pages.
In spot checking some of the other product pages, it looks like there is a similar issue going on. I'd recommend altering your XML sitemap to reference the URL you want indexed. Or, since it looks like Google has already indexed the pages on your XML sitemap (some of them, at least), you may want to use the URLs that have been indexed (the ones without the category structure) instead of the URLs with the category structure.
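As a rough way to run that spot check across more pages, here is a minimal Python sketch. It assumes a standard sitemaps.org-format sitemap and assumes the duplicate URL is simply the product filename at the site root, as in the example above. The sitemap fragment and the function names are illustrative, not taken from your actual sitemap file.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse

SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(sitemap_xml):
    """Return the set of <loc> URLs listed in a sitemap document."""
    root = ET.fromstring(sitemap_xml)
    return {
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    }


def root_level_variant(url):
    """Drop the category directories, keeping only the product filename."""
    parsed = urlparse(url)
    filename = parsed.path.rsplit("/", 1)[-1]
    return f"{parsed.scheme}://{parsed.netloc}/{filename}"


def sitemap_mismatches(linked_urls, sitemap_xml):
    """Find linked URLs whose root-level duplicate is in the sitemap instead."""
    listed = sitemap_urls(sitemap_xml)
    mismatches = []
    for url in linked_urls:
        variant = root_level_variant(url)
        if url not in listed and variant in listed:
            mismatches.append((url, variant))
    return mismatches


# Illustrative sitemap fragment (assumed shape, not the real file).
sample_sitemap = """
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>http://www.getpaper.com/430-20-lb-laser-bond-22-x-650-1-roll.html</loc></url>
</urlset>
""".strip()

linked_urls = [
    "http://www.getpaper.com/find-paper/engineering-paper/bond-20-lb/"
    "430-20-lb-laser-bond-22-x-650-1-roll.html",
]

for linked, listed in sitemap_mismatches(linked_urls, sample_sitemap):
    print(f"linked:  {linked}\nsitemap: {listed}")
```

Each pair it reports is a product where the internally linked URL and the sitemap URL disagree, so you would need to pick one version and use it consistently in both places.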
In terms of your possible fixes, I think fix one makes more sense. The more direct links you can add to deeper pages of your site, the better. On fix two, moving the sidebar and header to the bottom of the code and controlling the design with CSS can cause layout problems in some browsers. In my experience, it is usually more pain than gain.
I hope that helps. Thanks!
Matthew
Related Questions
-
Regional sites built on different platforms - will this solution for international targeting work?
We are working with our dev team on a few upcoming user stories to improve store.hp.com. We came across a question which isn’t clear in the international targeting documentation. Within http://store.hp.com, we have a number of regional stores, but those are often built on separate platforms. Therefore a story developed on the US infrastructure doesn’t carry over to Canada and so forth. The Canada Store is managed by a different team, so that story needs to get scoped, prioritized, etc. independently. In regards to helping Google understand page equivalence, will Google accept the page relationship if we include hreflang tags exclusively in the sitemap for the US site and exclusively as page-level markup for Canada site? For example: http://store.hp.com/CanadaStore (hreflang notation at page-level): http://store.hp.com/us/en" /> http://store.hp.com/CanadaStore" /> http://store.hp.com/us/en" /> http://store.hp.com/us/en (hreflang notation within sitemap file): <loc>http://store.hp.com/us/en</loc> rel="alternate" hreflang="en-ca" href=" http://store.hp.com/CanadaStore" /> rel="alternate" hreflang="en-us" href="http://store.hp.com/us/en" /> Appreciate the help anyone can give! Zach
Technical SEO | | ZachKline0 -
Ecommerce site product reviews, canonicals – which option to choose?
Recently, I discovered that only the first 4 reviews on our product pages are crawled and indexed. Example: http://www.improvementscatalog.com/eucalyptus-deep-seat-furniture-group/253432 I'm assuming it's due to the canonical on the product page, which points to http://www.improvementscatalog.com/eucalyptus-deep-seat-furniture-group/253432. When you click on page 2 of the reviews, the URL does not change, but the next batch of reviews appears on the product page. Same with page 3, etc. The problem is that the additional pages are not being crawled and indexed. We have to have the canonical on the product page because our platform creates multiple URLs for each product page by including each category where the product resides, related link parameters, etc. in the product URL (example: http://www.improvementscatalog.com/eucalyptus-deep-seat-furniture-group/patio-furniture/outdoor-furniture/253432) – trust me, it gets ugly! I've researched other Moz answers and found that there appear to be a couple of ways to fix the issue. Any ideas, help, guidance, or examples on the options below would be greatly appreciated!
Option 1: Show only 4 reviews on the first page and place the remaining reviews on a new page by themselves (similar to how Amazon does it). However, I would rather keep all of the reviews on the product page if possible.
Option 2: Add page 2, page 3, etc. parameters to the URL to display the remaining reviews, and add rel=prev/next.
If we chose option 2, would each product page have a different canonical? If so, would it create a duplicate content issue since the above-the-fold content, title tag and meta descriptions would all be the same? Also, would you include each additional page in the sitemap? We had a similar issue with our category pages and we implemented the "viewall" in the canonical. Would that work for our reviews? Thanks in advance for your help!
Technical SEO | | Improvements0 -
Why are only a few of our pages being indexed
I recently rebuilt a site for an auctioneers, but it has a problem: none of the lots and auctions are being indexed by Google on the new site, only pages like About, FAQ, home, and contact. Checking WMT shows that Google has crawled all the pages, and I've done a "Fetch as Google" on them and they load fine, so no crawling issues stand out. I've set the "URL Parameters" too, to no effect. I also built a sitemap with all the lots in it and pushed it to Google, which then crawled them all (massive spike in crawl rate for a couple of days), but it is still indexing just a handful of pages. Any clues to look into would be greatly appreciated. https://www.wilkinsons-auctioneers.co.uk/auctions/
Technical SEO | | Blue-shark0 -
Post Site Migration - thousands of indexed pages, 4 months after
Hi all, Believe me, I think I've already tried and googled every possible question that I have. This one is very frustrating. I have the following old domain: fancydiamonds dot net. We built a new site, Leibish dot com, and did everything by the book: individual 301 redirects for all the pages, a change of address via GWT, and an effort to maintain and improve the old optimization and hierarchy. 4 months after the site migration, we have yet to regain more than 50% of our original organic traffic (17,000 vs. 35,500-50,000). The thing that strikes me the most is that you can still find 2400 indexed pages on Google (they all have 301 redirects). And more than this: if you search for the old domain name on Google (fancydiamonds dot net), you'll find the old domain! Something is not right here, but I have no explanation for why these pages still exist. Any help will be highly appreciated. Thanks!
Technical SEO | | skifr0 -
Any need to worry about spammy links in Webmaster Tools from sites that no longer exist?
I own an ecommerce website that had some spammy stuff done on it by an SEO firm through SEOLinkVine a few years ago. I'm working on removing all those links, but some of the sites no longer exist. I'm assuming I don't have to worry about disavowing those in Webmaster Tools? Thanks!
Technical SEO | | CobraJones950 -
How to remove all sandbox test site link indexed by google?
While developing the site, I used a test domain, sandbox.abc.com, whose content is the same as abc.com. Now when I search site:sandbox.abc.com, I see that its content duplicates the main site abc.com. My question is how to remove all of these links from Google. P.S. I have just added a robots.txt to the sandbox that disallows all pages. Thanks,
Technical SEO | | JohnHuynh0 -
Does this content get indexed?
A lot of content on this site is displayed in pop up pages. Eg. Visit the Title page http://www.landgate.wa.gov.au/corporate.nsf/web/Certificate+of+Title To access the sample report or fee details, the info is shown in a pop up page with a strange url. Example: http://www.landgate.wa.gov.au/corporate.nsf/web/Certificate+of+Title+-+Fee+Details I can't see any of these pages being indexed in Google or other search engines when I do a site search: http://www.landgate.wa.gov.au/corporate.nsf/web/Certificate+of+Title+-+Fee+Details Is there a way to get this content indexed besides telling the client to restructure this content?
Technical SEO | | Bigheadigital0 -
Ecommerce site with currency selectors giving dupe content?
Hi everyone,
One of my ecommerce sites uses BigCommerce. They have a feature where you can add different currency buttons to change the currency that the customer shops in. This is great because if people from the UK visit our site, they can change the currency to their own rather than US. It just adds a variable on the end of the URL string to change the currency. However, in my webmaster tools I noticed that I seem to be getting a bunch of duplicate content. For example, it thinks I have duplicate title tags for the following:
domainname/pages/my-cool-widget.html
domainname/pages/my-cool-widget.html?setCurrencyId=1
domainname/pages/my-cool-widget.html?setCurrencyId=2
domainname/pages/my-cool-widget.html?setCurrencyId=3
domainname/pages/my-cool-widget.html?setCurrencyId=4
I thought about adding rel="nofollow" but unfortunately I don't have access to this file to edit the code. Any suggestions?
Technical SEO | | BeachDude0