Sitemap include all site links or just ones we want indexed?
-
Got a quick sitemap question. We have a clients site built in opencart and are getting ready to submit the sitmap. The default sitemap setting generates urls right off of the root. For example site.com/product. These urls are also accessible through the site itself. We prefer to give the site some depth and have structured the products so the urls are site.com/category/product. All of the product pages have canonicals including the category so we should not have to worry about duplicate content on the /product page vs the /category/product page. My question is both types of product pages are included in the sitemap at the moment. Since we don't want google to index the /product urls should we leave them off of the sitemap even though they are readily accessible from the frontend(though not linked)? Or just leave them and let the canonical tag be used in directing google as to which urls to index. Thanks in advance.
-
Hi again JS,
I think it's great that you continue to evaluate your platform from all perspectives and evaluate its strengths/weaknesses. Many times, a platform can do a lot of the basics well, but fall short on the details that differentiate us from our competition. For example, opencart may do the basic SEO requirements well, but not include ecommerce microdata (schema.org) which have a high impact on our search listings.
You can do a lot of harm/good with the robots.txt file - like deindex entire website (probably not a good thing) or block certain directories (your /product issue). I would gain some deeper knowledge about what you can do with the robots.txt file and how you need it to perform for your business.
-
Hey Raymond,
Thanks for the response, feel like I'm over thinking this a bit, as usually we just leave our opencart setups as is, other then a few minor tweaks. Lately I've really been scrutinizing opencart's SEO setup and how to improve it, since it seems there are a lot of gaps in he way it handles this.
I thought the robots.txt would have been a good way to block the pages, but the issue is I would need to block every single product page as opencart automatically creates a page for every product that is site.com/product and since we are adding lots of products there should be a better way to handle this. After I posted I came across this tidbit from a 6 year old google webmaster central blog post. Basically it states that 'While we can't guarantee that our algorithms will display that particular URL in search results, it's still helpful for you to indicate your preference by including that URL in your Sitemap. '. I think going this route along with the canonical should do the trick.
-
Hi JStrong,
Great question to be asking and an important topic to be doing your due diligence on, especially when dealing with an eCommerce related website.
Google uses a sitemap as a guideline for crawling your site. So, just because you put a URL in your sitemap, doesn't mean that they URL will actually be indexed. You can see those stats in your Google Webmaster Tools account, under the Sitemap area. It will display how many URLs are in the sitemap and how many out of those URLs are indexed.
If you do not want certain pages to be indexed by Google, then you would need to adjust your robots.txt file to give Google those instructions.
As long as you have the correct Canonical configurations, you should avoid any duplicate content issues from the URLs you've described above.
Good luck!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Hreflang - Is it needed even if the site is only one language
This topic came up in a discussion I had with a fellow SEO colleague, I don't believe it makes sense to have Hreflang if you don't have a second language but according to my friend they mentioned that it is great if your only targeting one country. Any opinions out in the Moz community? It seems like overkill to me
On-Page Optimization | | JonAlonsoCNC0 -
Indexed Link Removal request in GWT, good idea?
Hello, I used a plugin to no-index a lot of pages on my website, and its been couple months and they never disappeared from serps, so i used the google webmaster tool "remove urls" to ask google to remove them. Is that a good idea? Or does it look bad in googles eyes? any thoughts would help a lot. Thanks
On-Page Optimization | | Rank-and-Grow0 -
What on-site issue could be causing Moz to not detect internal links?
Hey guys, We've done a crawl and none of our internal links are showing up. Are there any on-page factors that would prevent Moz from being able to detect our internal links? Thanks!
On-Page Optimization | | ATMOSMarketing560 -
Site restructure question
Our site was deigned years ago to target customers in specific cities, now we've grown beyond this and I believe it is time to change the site structure.
On-Page Optimization | | PM_Academy
Ignore the 302 from the root page. Current structure: (assuming you've never been to our site before) projectmanagementacademy.net 302->/select-location.php /select-location.php -> /city-name/pmp-training.php This page was meant to be a "homepage" for each city, pointless page really /city-name/pmp-training.php -> /ciy-name/product-name.php These pages are for each individual product My suggested site structure: /city-name/pmp-training.php becomes projectmanagementacademy.net no more redirect /city-name/pmp-training.php gets removed and 301 to root page. /product-name.php each product's page and you would select a location when necessary (some products are online only) would 301 each /city-name/product-name to corresponding product page /product-name/city-name.php could add these pages if we still wanted the city name in url for city specific products My thoughts here are /product-name.php would receive a higher % of link juice because there are fewer page between 2 vs 4 if you came to the root page. and 2 vs 3 if you came from the select-location page. Also instead of being split between over 50 locations, all these would be together on one page. Your thoughts? Would this change improve our SERP for those product pages? Would we see a drop off in traffic if we did this? How long, if done correctly, would it take to see the recovery of rankings and traffic? Could we 301 /select-location.php to the root page? Thanks in advance for your insights to this. Any answer is a good answer. Trenton0 -
When should I dofollow a link?
I know I should have all my affiliate links as nofollow but when should I dofollow a link? When will it HELP me to have a dofollow link? Right now - all my links are nofollow because I haven't figured out how to have nofollow and dofollow in the same blog post. Any info appreciated!
On-Page Optimization | | dealblogger0 -
The correct way to go from PHP site to HTML site?
I have a website fully coded in PHP and I am doing a re-design over to an HTML site. I searched through the Q&A and there were some conflicting answers. Some said you will need to 301 all the pages. Others said to use the .htaccess to parse all the files as html. What is the correct way I should go about this? Thanks in advance!
On-Page Optimization | | reliabox0 -
Troubleshooting Drop in Rank - could it be one of these?:
At nlpca(dot)com, we've had a drop in rank of like 13 slots for our main term "NLP". Could this be caused by?: Adding keywords to title and meta tags that are not completely relevant to page. For example, "degree programs" on this page http://www.nlpca.com/DCweb/nlp-salt-lake-city-and-bay-area The word "degree" and "programs" do not show up on that page, but "program" does. Our courses are 'certification programs' but maybe 'degree programs' is not close enough. Adding keywords to title that do not show up in page. The same keyword "degree programs" is a good example. Another example is the keyword "learn about NLP" on this page: http://www.nlpca.com/DCweb/learn-about-nlp-techniques Targeting 2 different locations (California and Utah) Having our title 2 characters too big on the home page (we used to) And lastly, Do you think our ranking drop is reversible if it's on-site SEO? Thanks!
On-Page Optimization | | BobGW0 -
Links from my homepage
I am redesigning my homepage completely. What criteria should I use to decide which pages to link to from the homepage?
On-Page Optimization | | mascotmike0