Should the sitemap include all site links or just the ones we want indexed?
-
Got a quick sitemap question. We have a client's site built in OpenCart and are getting ready to submit the sitemap. The default sitemap setting generates URLs right off the root, for example site.com/product. These URLs are also accessible through the site itself. We prefer to give the site some depth and have structured the products so the URLs are site.com/category/product. All of the product pages have canonicals that include the category, so we should not have to worry about duplicate content on the /product page vs. the /category/product page. My question: both types of product URLs are included in the sitemap at the moment. Since we don't want Google to index the /product URLs, should we leave them off the sitemap even though they are readily accessible from the frontend (though not linked)? Or should we just leave them in and let the canonical tag direct Google as to which URLs to index? Thanks in advance.
-
Hi again JS,
I think it's great that you continue to evaluate your platform from all perspectives and weigh its strengths and weaknesses. Many times, a platform can do a lot of the basics well but fall short on the details that differentiate us from our competition. For example, OpenCart may handle the basic SEO requirements well but not include ecommerce microdata (schema.org), which has a high impact on our search listings.
You can do a lot of harm or good with the robots.txt file, like blocking an entire website from being crawled (probably not a good thing) or blocking certain directories (your /product issue). I would gain some deeper knowledge about what you can do with the robots.txt file and how you need it to perform for your business.
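To make the directory point concrete, here is a minimal Python sketch using the standard library's `urllib.robotparser`. The `site.com` URLs, the product names, and the `blocked_urls` helper are made up for illustration; it only shows how a prefix-style `Disallow` rule behaves and is not a recommended robots.txt for your store. Python's parser does plain prefix matching, which may differ in details from how Google handles wildcards.

```python
from urllib.robotparser import RobotFileParser


def blocked_urls(robots_txt, urls, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in urls if not parser.can_fetch(user_agent, u)]


# Hypothetical rules: block one root-level product URL by its path prefix.
robots_txt = """\
User-agent: *
Disallow: /widget
"""

# Hypothetical URLs: two root-level products and one category-level product.
urls = [
    "https://site.com/widget",
    "https://site.com/gadget",
    "https://site.com/tools/widget",
]

blocked = blocked_urls(robots_txt, urls)
print(blocked)  # ['https://site.com/widget']
```

The rule matches by path prefix, so `/tools/widget` stays crawlable and a root-level URL with a different name is untouched.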
-
Hey Raymond,
Thanks for the response. I feel like I'm overthinking this a bit, as usually we just leave our OpenCart setups as is, other than a few minor tweaks. Lately I've really been scrutinizing OpenCart's SEO setup and how to improve it, since it seems there are a lot of gaps in the way it handles this.
I thought the robots.txt would have been a good way to block the pages, but the issue is I would need to block every single product page, as OpenCart automatically creates a site.com/product page for every product, and since we are adding lots of products there should be a better way to handle this. After I posted, I came across this tidbit from a six-year-old Google Webmaster Central blog post. Basically it states that 'While we can't guarantee that our algorithms will display that particular URL in search results, it's still helpful for you to indicate your preference by including that URL in your Sitemap.' I think going this route (listing only the /category/product URLs in the sitemap) along with the canonical should do the trick.
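A rough sketch of that route in Python, assuming you can export each page's URL together with the canonical it declares. The `pages` list, the `site.com` URLs, and the `build_sitemap` helper are hypothetical, not something OpenCart provides. The idea is simply to list a URL in the sitemap only when it is its own canonical.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(pages):
    """Build sitemap XML from (url, canonical_url) pairs.

    Only URLs that are their own canonical are listed; duplicates are skipped.
    """
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    seen = set()
    for url, canonical in pages:
        if url != canonical or url in seen:
            continue  # skip non-canonical variants and repeats
        seen.add(url)
        ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical export: the root-level URL points at the category URL as canonical.
pages = [
    ("https://site.com/widget", "https://site.com/tools/widget"),
    ("https://site.com/tools/widget", "https://site.com/tools/widget"),
]

sitemap_xml = build_sitemap(pages)
print(sitemap_xml)
```

With this filter the /product variant never reaches the sitemap, so the sitemap and the canonical tags state the same preference.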
-
Hi JStrong,
Great question to be asking and an important topic to be doing your due diligence on, especially when dealing with an eCommerce related website.
Google uses a sitemap as a guideline for crawling your site. So just because you put a URL in your sitemap doesn't mean that the URL will actually be indexed. You can see those stats in your Google Webmaster Tools account, under the Sitemap area. It displays how many URLs are in the sitemap and how many of those URLs are indexed.
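If you do not want Google to crawl certain pages, you would need to adjust your robots.txt file to give Google those instructions. Keep in mind that robots.txt controls crawling rather than indexing, and Google cannot read the canonical tag on a page it is blocked from crawling, so for pages that must stay out of the index a noindex robots meta tag is the more direct tool.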
As long as you have the correct Canonical configurations, you should avoid any duplicate content issues from the URLs you've described above.
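If you want to spot-check the canonical setup, something like the following Python sketch could help. It uses only the standard library's `html.parser`; the sample HTML, the `site.com` URL, and the `find_canonical` helper are assumptions for illustration, and in practice you would feed it the HTML fetched from both the /product and the /category/product versions of a page.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of the first <link rel="canonical"> tag encountered."""

    def __init__(self):
        super().__init__()
        self.canonical = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        rel_values = (attr.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel_values and self.canonical is None:
            self.canonical = attr.get("href")


def find_canonical(html):
    """Return the canonical URL declared in the HTML, or None if there is none."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


# Hypothetical markup for the root-level /widget page.
sample_html = (
    '<html><head><link rel="canonical" '
    'href="https://site.com/tools/widget"></head><body></body></html>'
)

canonical = find_canonical(sample_html)
print(canonical)  # https://site.com/tools/widget
```

Both versions of a product page should report the same /category/product URL; if they differ, the duplicate content concern comes back.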
Good luck!