Prevent Indexing of URLs Based on Tags
-
I started my website as a blog over at Posterous, but decided to turn it into a full scale business website with a self-hosted WordPress theme. Shortly after transitioning from Posterous to WordPress, I noticed that Google was indexing not only my old blog posts, but the URLs of my blog posts based on the tags they have. Is there any reason why this is a problem? I'm sure it shouldn't qualify as duplicate content, but for some reason it just feels a bit sloppy to me to have all of these pages indexed...Is this a non-issue? Should I just be more discriminating with my use of 'tags' if it bothers me?
-
Thanks guys. Very helpful.
-
I did a similar process a couple of years ago. I had a blog on Google's blogspot, and I set up a new, self-hosted wordpress blog and transferred all my blog posts over. Both sites were getting crawled, but the new site got no rankings until I blocked Google from crawling my old blogspot site. Google saw it as duplicate content and was only giving the old site ranking value.
-
Dan Shure wrote an excellent SEOMoz blog post about optimising wordpress earlier in the year. It really goes into all the detail you could wish for, so rather than repeat it, I'd rather just post it!
Having just quickly reminded myself of the post, it provides you with the answer I think you're looking for.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Canonical tags in the body?
Hi there, Does anyone know if placing canonical tags in the body instead of the header of a page will still "take"? The system we are on means that making an editable header is no easy business and I was just wondering how big of a difference it makes to have it in a different area. Thank you in advance.
On-Page Optimization | | Whittie0 -
Paginated URLs are getting Indexed
Hi, For ex: - My site is www.abc.com and Its paginated URLs for www.abc.com/jobs-in-delhi are in the format of : www.abc.com/jobs-in-delhi-1, www.abc.com/jobs-in-delhi-2 and vice versa also i have used pagination tags rel=next and rel=prev. My concern is all the paginated URLs are getting indexed so is their any disadvantage if these URLs are getting indexed as somewhere i have read that link juice may get distributed in case of pagination. isn't it good to use Noindex, Follow so that we can make the Google to understand that paginated page are not so much important and that should not be ranked.
On-Page Optimization | | vivekrathore0 -
Canonical tag?
I have an e-commerce website and the query strings of the URL's are causing duplicate content/titles. I'm thinking of adding a site-wide canonical tag which should fix them all. Any other ideas of making it neater or better?
On-Page Optimization | | KarlBantleman0 -
Title tags for deep pages
Just pondering what is current best practice for Title tags of pages buried deep within my website? Say I have a page about 'Cheese's of the world' and from that page there is a page about 'Cheshire Cheese' how would you suggest to structure title tags Would for example this be ok - Cheshire Cheese | Cheese's of the World | Brand name Or is this better - Cheshire Cheese | Brand name Just wondering as I'm redesigning my site currently and looking at everything! Ted PS - I like cheese 🙂
On-Page Optimization | | Jon-C0 -
Google Indexing
Hi, We recently launched a new version of our site on the Magento platform. I submitted a new sitemap and on the first crawl only 7 pages out of 132 were indexed...a few days later and we now have 107 indexed (phew). My question is this....how on earth do i find out which pages are indexed and more importantly not indexed? For all i know they might be really important ones so I need to be able to identify the missing pages so i can work on getting them indexed. Nic
On-Page Optimization | | nicc19760 -
ON SITE SEARCH INDEXED BY GOOGLE - no follow or no index
Google indexes alll our internetal searches: search box is brand - clothes types - size type - and for each page it creates a page that which creates duplicate page title and unnecessary content. Should I do a nofollow on the advance search or a no index. Many thanks for the info. Sonja
On-Page Optimization | | reallyitsme0 -
Meta refresh - nojavascript url
seomox is telling me that I am getting a page that is not being indexed or crawled and since the crawl status code is 200 and there are no robots the meta-refresh url must be the problem. the meta refresh url is different than the on page report card url as it's the nojavascript url which my developer says should be ok. see his comments below. The is redirecting to http://mastermindtoys.com/store/nojavascript.html only in case if the JavaScript is disabled in the client browser. This is the right way to do it, I don’t understand why this might be a problem, otherwise MM has to implement Noscript pages that have a real content. I didn’t get what’s wrong about accessibility. The code 200 means it is accessible, and yes there is nothing to access if JavaScript is disabled on browser. I think there are no modern retail sites that would do any sensible business with the scripting disabled in browsers.The H1 is really present 2 times and second occurrence can be removed, though I highly doubt about importance of this change.Regarding duplicates – what URLs are considered duplicates? Can you please send me examples?I am not aware of canonical URL problem for MM site unless we consider old .asp links as duplicate links of the canonical product pages. I would appreciate if SEOMoz gave us an example what they mean.I suspect that the page is not getting indexed as a result of this or I'm just not getting a good score. Which is it?
On-Page Optimization | | mastermindtoys0