Sitemap include all site links or just ones we want indexed?
-
Got a quick sitemap question. We have a clients site built in opencart and are getting ready to submit the sitmap. The default sitemap setting generates urls right off of the root. For example site.com/product. These urls are also accessible through the site itself. We prefer to give the site some depth and have structured the products so the urls are site.com/category/product. All of the product pages have canonicals including the category so we should not have to worry about duplicate content on the /product page vs the /category/product page. My question is both types of product pages are included in the sitemap at the moment. Since we don't want google to index the /product urls should we leave them off of the sitemap even though they are readily accessible from the frontend(though not linked)? Or just leave them and let the canonical tag be used in directing google as to which urls to index. Thanks in advance.
-
Hi again JS,
I think it's great that you continue to evaluate your platform from all perspectives and evaluate its strengths/weaknesses. Many times, a platform can do a lot of the basics well, but fall short on the details that differentiate us from our competition. For example, opencart may do the basic SEO requirements well, but not include ecommerce microdata (schema.org) which have a high impact on our search listings.
You can do a lot of harm/good with the robots.txt file - like deindex entire website (probably not a good thing) or block certain directories (your /product issue). I would gain some deeper knowledge about what you can do with the robots.txt file and how you need it to perform for your business.
-
Hey Raymond,
Thanks for the response, feel like I'm over thinking this a bit, as usually we just leave our opencart setups as is, other then a few minor tweaks. Lately I've really been scrutinizing opencart's SEO setup and how to improve it, since it seems there are a lot of gaps in he way it handles this.
I thought the robots.txt would have been a good way to block the pages, but the issue is I would need to block every single product page as opencart automatically creates a page for every product that is site.com/product and since we are adding lots of products there should be a better way to handle this. After I posted I came across this tidbit from a 6 year old google webmaster central blog post. Basically it states that 'While we can't guarantee that our algorithms will display that particular URL in search results, it's still helpful for you to indicate your preference by including that URL in your Sitemap. '. I think going this route along with the canonical should do the trick.
-
Hi JStrong,
Great question to be asking and an important topic to be doing your due diligence on, especially when dealing with an eCommerce related website.
Google uses a sitemap as a guideline for crawling your site. So, just because you put a URL in your sitemap, doesn't mean that they URL will actually be indexed. You can see those stats in your Google Webmaster Tools account, under the Sitemap area. It will display how many URLs are in the sitemap and how many out of those URLs are indexed.
If you do not want certain pages to be indexed by Google, then you would need to adjust your robots.txt file to give Google those instructions.
As long as you have the correct Canonical configurations, you should avoid any duplicate content issues from the URLs you've described above.
Good luck!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Question on site structure
My client is a nationwide company. They provide building maintenance services in 7 different cities. In each city they provide a different range of services. They currently have a single service page for each service and no mention on that page of the cities they offer the service. The service pages are getting no SERP visibility. We are running Paid Search and recommending SEO. I'm wondering whether it would be beneficial to build out specific service pages for each city so the content is more relevant to both users and search engines. What is best practice in this situation? Client wants to dominate SERPs in each market for the services they offer.
On-Page Optimization | | SEOinSunnyNelson0 -
How do I reduce the amount of internal links on my site?
Hi, Can someone help me with reducing the amount of internal links on our site please? https://www.thepresentfinder.co.uk Thanks Charlie
On-Page Optimization | | The-Present-Finder0 -
Meta Titles On Site Different From Google Index Page
This is very embarrassing but I hope someone can help. The Meta Titles on my site are not being shown correctly on the Google site index. For example, when I got directly to the page the appropriate page title is shown obviously. However when I go view that page on Google, the title in completely different. Page title: <a class="attribute-value">Web Design, SEO, PPC, Mobile Development In Philadelphia, Bucks| Infinity Digital Agency</a> Google Shows: Our Services - Infinity Digital Agency Here is the result page. I am currently running WordPress and Yoast. Any thoughts would be greatly appreciated. https://www.google.com/search?q=site%3Awww.infinitydigitalagency.com%2Fservices%2F&ie=utf-8&oe=utf-8
On-Page Optimization | | infinitydigitalagency0 -
On hover my links are with additional Parameters while links that are indexed are without additional parameters
On hover my links are with additional Parameters while links that are indexed are without additional parameters does it impact in a negative way. For ex: i have a site http://www.yoursite.com and Its internal pages that are linked to the site are in pattern of http://www.yoursite.com/jobs-in-india?xz=3_0_5 and these are the pages which are interlinked through out the site. When any user click the link they will land to the similar pages with additional parameter even on mouse hover any one can see the same link. while we have used Canonical, so pages that are getting indexed are http://www.yoursite.com/jobs-in-india. But my concern is: - To showing two different link as when Google crawler follow the site they will get the links with additional parameter while in its index its a URL without additional parameter so is there problem that we can encounter or is there any negative impact on ranking?
On-Page Optimization | | vivekrathore0 -
One Company, Two Brands with Two Blogs, but One WP Panel for Blog?
I work with a company that has 2 brands. Both brands have separate sites (currently on a WP multisite install). We want each brand to have its own blog, but for ease of content creation have ONE wp install to create the blog content and depending on what category is clicked (Brand 1 and/or Brand 2), it will publish to that sites blog. 2 questions: 1. Is one WP install for blog syndication for 2 separate sites advisable (as client is requesting)? Or should we just bite the bullet and have each site have it's separate posting through it's own WP install? 2. Sometimes one blog post will be published to BOTH blogs (i.e. category Brand 1 and Brand 2 clicked OR if we use two separate wp installs for each site, publish to both blogs). Is using a rel=canonical for the original post (we need to decide which brand takes precedence) sufficient to overcome duplicate content problem? Thanks in advance! Stephan
On-Page Optimization | | stephanwb0 -
Site Duplicated despte redirect
Buon pormeriggio from I can smell Whaler Chips Through the window Wetherby,
On-Page Optimization | | Nightwing
When you Google Thakray Medical Museum 2 sites appear in the SERPS, yikes! Now the .org site is no longer hosted & point to the .co.uk site when clicked on but in a nutshell I wantto get rid of the .org site
as illustrated here: http://s216.photobucket.com/user/zymurgy_bucket/media/two-versions-same-website-yikes_zps182e6e12.jpg.html Actions taken so far:
1: Wembaster tools re index request for the .co.uk site
2: Redirect configured to point .org site to the .co.uk What else is left apart from updating the xml site but ultimating i do not want to see the the .org site but it doesnt exist (well id did a few month back but is no longer hosted so i am told) Any insights welcome,
GRazie tanto,
David0 -
One product two audiences, two pages or one
We have a product on the site that is used by two different groups of people, who refer to it with different terms. One group refers to it as "Lace yarn" plus around another 15 similar terms and the other group refers to it as "Crewel wool" with also 15 similar terms. I am having difficultly deciding how to approach this. At the moment it is on one page (http://www.renaissancedyeing.com/en/category/threads-yarns/crewel-wool/). would it be a good idea to split this into two pages?
On-Page Optimization | | SimonLuijk0 -
Internal link to the home page
When building menus and other internal links, should the link to the home page be http://www.domain.com/ or http://www.domain.com/index.html or does it matter? Best,
On-Page Optimization | | ChristopherGlaeser
Christopher0