Sitemap include all site links or just ones we want indexed?
-
Got a quick sitemap question. We have a clients site built in opencart and are getting ready to submit the sitmap. The default sitemap setting generates urls right off of the root. For example site.com/product. These urls are also accessible through the site itself. We prefer to give the site some depth and have structured the products so the urls are site.com/category/product. All of the product pages have canonicals including the category so we should not have to worry about duplicate content on the /product page vs the /category/product page. My question is both types of product pages are included in the sitemap at the moment. Since we don't want google to index the /product urls should we leave them off of the sitemap even though they are readily accessible from the frontend(though not linked)? Or just leave them and let the canonical tag be used in directing google as to which urls to index. Thanks in advance.
-
Hi again JS,
I think it's great that you continue to evaluate your platform from all perspectives and evaluate its strengths/weaknesses. Many times, a platform can do a lot of the basics well, but fall short on the details that differentiate us from our competition. For example, opencart may do the basic SEO requirements well, but not include ecommerce microdata (schema.org) which have a high impact on our search listings.
You can do a lot of harm/good with the robots.txt file - like deindex entire website (probably not a good thing) or block certain directories (your /product issue). I would gain some deeper knowledge about what you can do with the robots.txt file and how you need it to perform for your business.
-
Hey Raymond,
Thanks for the response, feel like I'm over thinking this a bit, as usually we just leave our opencart setups as is, other then a few minor tweaks. Lately I've really been scrutinizing opencart's SEO setup and how to improve it, since it seems there are a lot of gaps in he way it handles this.
I thought the robots.txt would have been a good way to block the pages, but the issue is I would need to block every single product page as opencart automatically creates a page for every product that is site.com/product and since we are adding lots of products there should be a better way to handle this. After I posted I came across this tidbit from a 6 year old google webmaster central blog post. Basically it states that 'While we can't guarantee that our algorithms will display that particular URL in search results, it's still helpful for you to indicate your preference by including that URL in your Sitemap. '. I think going this route along with the canonical should do the trick.
-
Hi JStrong,
Great question to be asking and an important topic to be doing your due diligence on, especially when dealing with an eCommerce related website.
Google uses a sitemap as a guideline for crawling your site. So, just because you put a URL in your sitemap, doesn't mean that they URL will actually be indexed. You can see those stats in your Google Webmaster Tools account, under the Sitemap area. It will display how many URLs are in the sitemap and how many out of those URLs are indexed.
If you do not want certain pages to be indexed by Google, then you would need to adjust your robots.txt file to give Google those instructions.
As long as you have the correct Canonical configurations, you should avoid any duplicate content issues from the URLs you've described above.
Good luck!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Outbound links
Hi everyone, Just a quick question about using info/statistics from other sources in my articles. If I use a quote/piece of info from another online article do I just say where it's from and link to it? Is this acceptable or do you have to get permission? I find the whole permissions thing quite confusing! I know that outbound links are good for SEO so just wanted to check this. Thanks!
On-Page Optimization | | ClareO0 -
One Main Brand Domain - One EMD which Ranks Better
Hello SEO MOZers! We have one one main ecommerce site and I have just discovered we have an EMD sitting on a .co.uk, for one of our premium product ranges, and it actually outranks our main website. It appears on page one. My question is how do I handle the EMD? I don't want to start building authority to this site, as we are currently developing the main brand site, however I can't ignore that it is ranking. There is relatively little content. There are links on the page (it's only one page) that go through to the product section on the main site. But should I 301 redirect it? What would be your advise? I want to capitalise on any traffic we could benefit from, and ultimately I want the main site to rank better for this product's keywords. Any help would much appreciated.
On-Page Optimization | | CarlaAction1 -
Issue with old sitemap
Hi Quite new to SeoMoz so some learning curve but we have one issue on a site perhaps someone can advise on? We have many duplicate page titles / content, we think this was due to a corrupt sitemap.xml file, all of the URL's look like this with an odd page at the end http://www.totalpetsupplies.co.uk/problem.aspx?aspxerrorpath=/animals/Cat_Accessories_Pet_Collars/privacy.aspx So that XML map was removed well over a week ago, the new sitemap is an apsx generated one to get all pages correctly show as www.totalpetsuppies.co.uk/sitemap.aspx. Goolge and Bing have that. SeoMoz re did our site and the errors went up a bit!! Not sure how that process works, anyone any suggestions? Cheers Shaun
On-Page Optimization | | ShaunSizen0 -
Cannibalizing link
this is what the reports states :Cannibalizing link"Red & Color Roses", "Red roses", "Black Baccara Roses\rBlack And Red Roses\rStarting at: $ 0.72 per Rose", "Black Magic Rose\rDark Red Black Rose\rStarting at: $ 0.96 per Rose", "Forever Young Roses\rRed Rose Flowers\rVelvety Deep Red\rStarting at: $ 1.16 per Rose", "Freedom Roses\rRed Rose\rStarting at: $ 0.66 ¢ per Rose", "Madame Delbard\rRed Wedding Rose\rStarting at: $ 0.66 per Rose", "Night Fever Roses\rTraditional Roses\rRomantic Red Roses\rStarting at: $ 0.66 per Rose", and "Red Paris Roses\rDark Red Rose\rStarting at: $ 0.72 per Rose"ExplanationIt's a best practice in SEO to target each keyword with a single page on your site (sometimes two if you've already achieved high rankings and are seeking a second, indented listing). To prevent engines from potentially seeing a signal that this page is not the intended ranking target and creating additional competition for your page, we suggest staying away from linking internally to another page with the target keyword(s) as the exact anchor text. Note that using modified versions is sometimes fine (for example, if this page targeted the word 'elephants', using 'baby elephants' in anchor text would be just fine).RecommendationUnless there is intent to rank multiple pages for the target keyword, it may be wise to modify the anchor text of this link so it is not an exact match. the questions is then why having those links on that page hurts, when we need the links to take the costumer to the color of red roses they want.
On-Page Optimization | | globalrose.com0 -
Impact of nofollow links
Does anyone know what the impact of a nofollowed link is on the ranking value any given page has to distribute? For example, if I have 2 links on a page, both followed, I know those links each distribute nearly 50% of the total ranking value the current page has to offer. However, if one of those links is nofollowed, does that automatically mean the other link gets the ranking value cast off by the nofollowed link? In other words, the single followed link now distributes nearly 100% of the ranking value the page has to offer? It seems to me I remember hearing this was not the case and that the ranking value a nofollowed link would have if it were followed just evaporates. This would mean the single followed link still only passes on around 50%...not 100%. Is the effect different if the links are internal vs. external? If any citations are available to justify knowledge here, that would be great. I know a lot of people have opinions about this subject, but I'm not sure anyone knows Google's position. Thanks!
On-Page Optimization | | RyanOD0 -
How Can I Get Yahoo to Index My Site?
How Can I Get Yahoo to Index My Site? I have installed Bing webmaster tool two months ago -- is Yahoo that slow. My site has been out since May 2010 and for a year and a half, I only have 40 pages index. HELP!!!!
On-Page Optimization | | AppleCapitalGroup0 -
If i only want to rank for one specific keyword and use it in all my page titles, will it negatively affect my rankings?
If i want to rank highest for one specific keyword (virtualization management, for example) and use that keyword in all the titles on my website, will that negatively affect my search rankings? SEOmoz is telling me that i should use unique titles for my different pages to ensure that they describe each page uniquely and don't compete with each other for keyword relevance.
On-Page Optimization | | foonista0 -
Tags to index or to no index that is the Question
Sorry for the silly Shakespearean introduction but what are peoples opinions on leaving tag clouds to be indexed or no indexable by the search engines. I tend to leave them non indexable at the moment. I use wordpress for blogging as do most others.
On-Page Optimization | | onlinemediadirect0