Rogerbot directives in robots.txt
-
I feel like I spend a lot of time setting false positives in my reports to ignore.
Can I prevent Rogerbot from crawling pages I don't care about with robots.txt directives? For example., I have some page types with meta noindex and it reports these to me. Theoretically, I can block Rogerbot from these with a robots,txt directive and not have to deal with false positives.
-
Yes, you can definitely use the robots.txt file to prevent Rogerbot from crawling pages that you don’t want to include in your reports. This approach can help you manage and minimize false positives effectively.
To block specific pages or directories from being crawled, you would add directives to your robots.txt file. For example, if you have certain page types that you’ve already set with meta noindex, you can specify rules like this:
User-agent: Rogerbot Disallow: /path-to-unwanted-page/ Disallow: /another-unwanted-directory/
This tells Rogerbot not to crawl the specified paths, which should reduce the number of irrelevant entries in your reports.
However, keep in mind that while robots.txt directives can prevent crawling, they do not guarantee that these pages won't show up in search results if they are linked from other sites or indexed by different bots.
Additionally, using meta noindex tags is still a good practice for pages that may occasionally be crawled but shouldn’t appear in search results. Combining both methods—robots.txt for crawling and noindex for indexing—provides a robust solution to manage your web presence more effectively.
-
Never mind, I found this. https://moz.com/help/moz-procedures/crawlers/rogerbot
-
@awilliams_kingston
Yes, you can use robots.txt directives to prevent Rogerbot from crawling certain pages or sections of your site, which can help reduce the number of false positives in your reports. By doing so, you can focus Rogerbot’s attention on the parts of your site that matter more to you and avoid reporting issues on pages you don't care about.Here’s a basic outline of how you can use robots.txt to block Rogerbot:
Locate or Create Your robots.txt File: This file should be placed in the root directory of your website (e.g., https://www.yourwebsite.com/robots.txt).
Add Directives to Block Rogerbot: You’ll need to specify the user-agent for Rogerbot and define which pages or directories to block. The User-agent directive specifies which web crawlers the rules apply to, and Disallow directives specify the URLs or directories to block.
Here’s an example of what your robots.txt file might look like if you want to block Rogerbot from crawling certain pages:
javascript
Disallow: /path-to-block/
Disallow: /another-path/
If you want to block Rogerbot from accessing pages with certain parameters or patterns, you can use wildcards:javascript
Disallow: /path-to-block/*
Disallow: /another-path/?parameter=
Verify the Changes: After updating the robots.txt file, you can use tools like Google Search Console or other site analysis tools to check if the directives are being applied as expected.Monitor and Adjust: Keep an eye on your reports and site performance to ensure that blocking these pages is achieving the desired effect without inadvertently blocking important pages.
By doing this, you should be able to reduce the number of irrelevant or false positive issues reported by Rogerbot and make your reporting more focused and useful.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved The Moz.com bot is overloading my server
0 -
Zero '0' Total Visits
Hi. One of the properties in our account has been reporting zero '0' total visits for the past few weeks. The other properties aren't affected. Is there a reason for this or is this an issue on the Moz side of things. Thanks!Moz Zero Visits.PNG
Reporting & Analytics | | rh-digi0 -
Client Dashboard Options
My company is an agency, and we manage many SEO Campaigns. I love the reports, but I'd really like to add an online dashboard that my clients can login to and see the same up to date stats as I do in Moz Pro. I can't do it with Seats because you can't limit a seat to a specific campaign (as far as I know). Has anyone found a solution for this?
Reporting & Analytics | | bizmarquee3 -
Abnormally High Direct Traffic Volume
We have abnormally high amounts of direct traffic to our site. It's comprising over half of all web traffic while organic is second with considerably less. From there the volume decreases amongst other channels. I've never seen such a huge proportion of traffic being attributed the Direct. Does anyone know how to test this or see if there is an error in Google Analytics reporting?
Reporting & Analytics | | graceflack 01 -
Direct traffic coming to URLs with /rss_feedIP#
I'm doing a site audit for an organization that has a bunch of really messy old Drupal sites. In looking at their traffic, I see that a majority of it is coming to landing pages that look like this: http://clientsdomain.com/rss_feed173.8.208.97 plus other IP addresses. The bounce rate is 100% and time on site is less than a second. It looks like something that an RSS feed tool might use, but I've never seen something like it before. It creates its own landing page, hits the site, then appears to bounce. This is making their Analytics data look a whole lot worse than the site is actually doing, since the bounce rate is 100% on all that fake traffic. I have some experience with Drupal, but I've never seen anything like this in Drupal or any other CMS. Has anyone out there ever experienced something like this, where direct traffic comes to an rss feed landing page and bounces immediately?
Reporting & Analytics | | newwhy0 -
How to check search engine and direct traffic of Mobile?
Hi Guys, I want to check search engine traffic or direct traffic of mobile, how to check in google analytic?
Reporting & Analytics | | varo0 -
Universal Analytics: Why does Google Organic appear as Direct traffic?
Hi there, When I enter the site via Google Search and follow myself via Real-Time Analytics I appear an organic visitor (which is good). When I browse and visit the site I still am an organic visitor. However, as soon as I fill in the contact form (gravity forms) and land on the "thank you page" I appear as a direct visitor with Google as the source. Since I have the thank you page set-up as a goal, Analytics incorrectly attributes these conversions to the direct medium instead of the organic medium. The tracking code has been installed on all the pages and all conversions are being recorded. What is going on?
Reporting & Analytics | | Robbern0 -
Huge Spike in Direct Traffic from IE7
Our site is seeing a huge spike in direct (none) traffic from IE 7 from July 8, 2014 - on. June 25 - July 7 showed 21 direct visits from IE 7; July 8 - July 20 is showing 5,889 (an increase of 27,943%). All traffic from the spike is going to our homepage. Other Google Analytics' stats for this direct (none) IE 7 traffic: Bounce Rate: 99.52%
Reporting & Analytics | | SJVC_Susie
Avg. Session Duration: 0:02
Pages/session: 1.01
Mostly all new users What's strange is that the traffic is from a variety of cities and networks. What could be causing this? Has anyone experienced this before?0