Can I use a "no index, follow" command in a robot.txt file for a certain parameter on a domain?

PapaRelevance

I have a site that produces thousands of pages via file uploads. These pages are then linked to by users for others to download what they have uploaded.

Naturally, the client has blocked the parameter which precedes these pages in an attempt to keep them from being indexed. What they did not consider, was they these pages are attracting hundreds of thousands of links that are not passing any authority to the main domain because they're being blocked in robots.txt

Can I allow google to follow, but NOT index these pages via a robots.txt file --- or would this have to be done on a page by page basis?

NakulGoyal

Since you have those pages blocked via robots.txt, the bots would never even crawl these pages in theory...which means the Noindex,follow is not helping.

Also, if you do a report on the domain on opensiteexplorer and dig, you should be able to find tons of those links already showing up. So if my site is linking to a page on that site, that page may not be cached/indexed because of the robots.txt exclusion, but that as long as my site is follow, your domain is still getting the credit for the link.

Does that make sense ?

PapaRelevance

Answered my own question.

watch?v=ZjRGkc__FwQ

Welcome to the Q&A Forum

Browse the forum for helpful insights and fresh discussions about all things SEO.

Can I use a "no index, follow" command in a robot.txt file for a certain parameter on a domain?

Got a burning SEO question?

Browse Questions

Explore more categories

Related Questions

Our subdomain hosts content that can not be optimized (static videos) - should I de-index it?

Do I miss traffic (thus, page value) by using the GWMT Parameter Handling Tool?

Should I set up no index no follow on low quality pages?

How do i migrate from Volusion to Magento with the same domain using 301 redirect?

When you add 10.000 pages that have no real intention to rank in the SERP, should you: "follow,noindex" or disallow the whole directory through robots? What is your opinion?

Robots.txt 404 problem

Using 2 wildcards in the robots.txt file

How Can I know that a link placed is not lableld "No Follow"?