Noindex, nofollow on a blog since 2009
-
Just reviewed a WordPress blog that was launched in 2009 but somehow the privacy setting was to not index it, so all this time there's been a noindex, nofollow meta tag in the header. The client couldn't figure out why masses of content wasn't showing up in search results.
I've fixed the setting and assume Google will spider in short order; the blog is a subdirectory of their main site. My question is whether there is anything else I can or should do. Can Google recognize the age of the content, or that it once had a noindex meta tag? Will it "date" the blog as of today? Has the client lost out on untold benefits from the long history of content creation? I imagine that link juice from any backlinks to the blog will now flow back to the main site; think that's true?
Just curious what others might think of this scenario and whether any other action is warranted.
-
Thanks Dan. One thing I found interesting is that Google Webmaster Tools doesn't offer any alerts about pages that aren't indexed because of meta tags, only about those included in the robots.txt file.
-
Hi
Great responses Matt and Ben, thanks!! Only things I could add are;
Webmaster Tools
- Check google webmaster tools every few days for the first 2-3 weeks.
- You may turn up some 404s or other types of errors that should be corrected.
- And keep your eyes out for any other warnings
Analytics
- You're going to spike your traffic (potentially, hopefully) in analytics big time, or at least skew the data
- Use filters and advanced segments to separate blog traffic so you can still analyze things even after a potential spike in blog search traffic.
- At minimum make an annotation of the date you made it indexable.
Dates
- Regarding the dates, I did come across this recently - I have not tested, so please take it with a grain of salt - removing dates from the SERPs - I would only recommend trying it if the content was not "time sensitive" (like a cooking recipe).
Hope all this helps!
-Dan
-
Thanks for the clarification Ben. I think I'll leave older posts as is. They've been actively posting several times a week, so there should be enough fresh content. My hope is that Google recognizes the age of the blog because it's my understanding that age factors in the ranking algorithm.
-
Ahh yeah my bad, ignore that bit. I think you'd still want to make a subtle change to each post so WordPress can set the date updated flag on the sitemap to today, that way Google will put a higher priority on the content when indexing your site.
-
Thanks, the site maps are a good idea. Ben, I'm not sure what you mean about making the content different to what Google has in its index. Because of the meta tag, it doesn't have any content in its index, right?
-
You've done the most important step (removing the noindex/nofollow) tags. The only additional thing I would do is submit (or resubmit) the XML sitemap to Google. Make sure that XML sitemap is perfect and error free so that you don't create any additional errors.
Google should be smart enough to recognize the dates. I've never had a situation where it was years between publish and index. I have however had situations where it was days or weeks in between publish and index and in those situations Google has recognize the date. I'd imagine the same is true here (assuming of course, you have the date in a recognizable format and don't change the date to today).
I'd be curious to find out what happens. Definitely update this Q&A when you find out what happens!
-
I would probably re-arrange some of the paragraphs (or add some more content) to the old posts and update them in WordPress, this then makes the content different to what Google has in its index.
I would then use the Yoast WordPress SEO plugin to regenerate your sitemap. Since you've updated and added new content to the posts their last updated date would have changed so Google will probably see this as revised content. I would submit to all major search engines as your first port of call.
In terms of the "link juice", I would say that Google will still count links to the article as a ranking factor, but because you have noindex the content wont appear in search results. So the content will have a fairly good page rank (possibly) but its being held back by the exclusion of the search engine index.
Now that the setting has been changed and the sitemap / content has been updated you should start to see the results in the search results in due time.
You could also add a few new articles of content to the blog and publicise that over social media to help get back in the game a bit quicker.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Question on Pagination - /blog/ vs /blog/?page=1
Question on Pagination Because we could have /blog/ or /blog/?page=1 as page one would this be the correct way to markup the difference between these two URL? The first page of a sequence could start with either one of these URLs. Clarity around what to do on this first page would be helpful. Example… Would this be the correct way to do this as these two URLs would have the exact content? Internal links would likely link to /blog/ so signal could be muddy. URL: https://www.somedomain.com/blog/
Technical SEO | | jorgensoncompanies
<link rel="canonical" href="https://www.somedomain.com/blog/?page=1"> URL: https://www.somedomain.com/blog/?page=1
<link rel="canonical" href="https://www.somedomain.com/blog/?page=1"> Google is now saying to just use the canonical to the correct paginated URL with page number. You can read that here:
https://developers.google.com/search/docs/advanced/ecommerce/pagination-and-incremental-page-loading But they do not clarify what to do on /blog/?page=1 vs /blog/ as they are the exact same thing. Thanks for your help.0 -
Duplicate Page Titles For Paginated Topics In Blog
Hello, I've just run a site audit and it has come up with a duplicate title tag issue for the topics section of our blog. For example it is flagging that the following have the same page title. https://blog.companyname.com/topic/topic-name https://blog.companyname.com/topic/topic-name/page/2 How significant is this as an SEO issue and what are the ways we can go about fixing this? I look forward to any suggestions and guidance that can be provided. Thanks, John
Technical SEO | | SEOCT1 -
Schema for blogs
When I run a wordpress blog through the structured data testing tool I see that there is @type hentry. Is this enough for blogs etc? Is this a result of Wordpress adding in this markup? Do you recommend adding @blogposting type and if so why? What benefit to add a specific type of schema? How does it help in blogging? Thanks
Technical SEO | | AL123al4 -
Robots.txt vs. meta noindex, follow
Hi guys, I wander what your opinion is concerning exclution via the robots.txt file.
Technical SEO | | AdenaSEO
Do you advise to keep using this? For example: User-agent: *
Disallow: /sale/*
Disallow: /cart/*
Disallow: /search/
Disallow: /account/
Disallow: /wishlist/* Or do you prefer using the meta tag 'noindex, follow' instead?
I keep hearing different suggestions.
I'm just curious what your opinion / suggestion is. Regards,
Tom Vledder0 -
Job/Blog Pages and rel=canonical
Hi, I know there are several questions and articles concerning the rel=canonical on SEOmoz, but I didn't find the answer I was looking for... We have some job pages, URLs are: /jobs and then jobs/2, jobs/3 etc.. Our blog pages follow the same: /blog, /blog2, /blog/3... Our CMS is self-produced, and every job/blog-page has the same title tag. According to SEOmoz (and the Webmaster Tools), we have a lots of duplicate title tags because of this problem. If we put the rel=canonical on each page's source code, the title tag problem will be solved for google, right? Because they will just display the /job and /blog main page. That would be great because we dont want 40 blog pages in the index. My concern (a stupid question, but I am not sure): if we put the rel=canonical on the pages, does google crawl them and index our job links? We want to keep our rankings for our job offers on pages 2-xxx. More simple: will we find our job offers on jobs/2, jobs/3... in google, if these pages have the rel=canonical on them? AND ONE MORE: does the SEOmoz bot also follow the rel=canonical and then reduce the number of duplicate title-tags in the campaigns??? Thanx........
Technical SEO | | accessKellyOCG0 -
I am using SEOmoz pro software and my blog tags are bringing up 404 errors.
After checking they do bring back a 404 page, so i am wondering what to do. Do i remove all the blog tags? We use a Drupal cms system.
Technical SEO | | AITLtd0 -
Best Practice to Remove a Blog
Note: Re-posting since I accidentally marked as answered Hi, I have a blog that has thousands of URL, the blog is a part of my site. I would like to obsolete the blog, I think the best choices are 1. 404 Them: Problem is a large number of 404's. I know this is Ok, but makes me hesitant. 2. meta tag no follow no index. This would be great, but the question is they are already indexed. Thoughts? Thanks PS A 301 redirect to the main page would be flagged as a soft 404
Technical SEO | | Bucky0 -
Link juice distributed to too many pages. Will noindex,follow fix this?
We have an e-commerce store with around 4000 product pages. Although our domain authority is not very high (we launched our site in February and now have around 30 RD's) we did rank on lots of long tail terms, and generated around 8000 organic visits / month. Two weeks ago we added another 2000 products to our existing catalogue of 2000 products, and since then our organic traffic dropped significantly (more than 50%). My guess is that link juice has been distributed to too many pages, causing rankings to drop on overall. I'm thinking about noindexing 50% of the product pages (the ones not receiving any organic traffic). However, I am not sure if this will lead to more link juice for the remaining 50% of the product pages, or not. So my question is: if I noindex,follow page A, will 100% of the linkjuice go to page B INSTEAD of page A, or will just a part of the link juice flow to page B (after flowing through page A first)? Hope my question is clear 🙂 P.s. We have a Dutch store, so the traffic drop is not a Panda issue 🙂
Technical SEO | | DeptAgency0