Unnecessary pages getting indexed in Google for my blog

rahulchowdhury

I have a blog dapazze.com and I am suffering from a problem for a long time. I found out that Google have indexed hundreds of replytocom links and images attachment pages for my blog.

I had to remove these pages manually using the URL removal tool. I had used "Disallow: ?replytocom" in my robots.txt, but Google disobeyed it. After that, I removed the parameter from my blog completely using the SEO by Yoast plugin.

But now I see that Google has again started indexing these links even after they are not present in my blog (I use #comment). Google have also indexed many of my admin and plugin pages, whereas they are disallowed in my robots.txt file.

Have a look at my robots.txt file here: http://dapazze.com/robots.txt

Please help me out to solve this problem permanently?

Esaky

Me too have the same issue ! but not indexed in the Google ! but URL parameters in Google Webmasters shows there are 5K errors !

Should i use the URL Parameters settings or which one ?

Also make sure replytocom links are not blocked using Robots.txt, as it will stop Google bots from crawling and this your links won’t get deindexed. This is one mistake which I did, and later after removing replytocom parameter from robots.txt file, I was able to get most of my replytocom links deindexed. These are warning by the blogger ! http://www.shoutmeloud.com/how-to-fix-replytocom-links-issue-in-wordpress.html - he showed how to do that ! but my problem is different - It's Good that it's not indexed but i don't want to take any risk ! how to avooid them for future !

Someone else told me here that some plugins are doing/helping for you ! and not seen in your Robot.txt !

Confused confused ! so much confused ! Please help me !

rahulchowdhury

Actually previously I had removed the links manually. But I am seeing them come up again even after removing the parameter completely.

Can you please point our the problem for me?

SoftzSolutions

Please check that the comment pages are blocked by robots.txt file -

https://www.google.co.in/webhp?sourceid=chrome-instant&ion=1&ie=UTF-8#q=inurl:replytocom+site:http://dapazze.com/&hl=en&tbo=d&filter=0&bav=on.2,or.r_gc.r_pw.r_cp.r_qf.&bvm=bv.1355534169,d.bmk&fp=8d5ddd2cfb254bfd&bpcl=40096503&ion=1&biw=1366&bih=677

However, the blocked pages are now getting redirected to the main landing page of the blog posts.

Seems like it will take a while for Google to recrawl these pages and sort the issue.

In the mean time, could you please show some pages that are getting indexed by Google.

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Unnecessary pages getting indexed in Google for my blog

Got a burning SEO question?

Explore more categories

Products

Moz Solutions

Free SEO Tools

Resources

About Moz

Why Moz

Get Involved