Robots.txt Help
-
I need help to create robots.txt file.
Please let me know what to add in the file. any real example or working example.?
-
Michael, from what i can tell, your website is built using WordPress. We typically recommend installing the Yoast SEO plugin and using that--which will help with your robots.txt file. If you need more information, take a look here: https://yoast.com/wordpress-robots-txt-example/
Generally, most of your site won't need to be disallowed in the robots.txt file, unless you're using tags and categories on your site. Yoast typically helps disallow the proper directories that you need to disallow.
One thing that you need to be aware of is the fact that you don't want to disallow your .CSS or .JS files on your site, many of the themes nowadays will put those files in your wp-admin folder--which by default typically gets disallowed.
-
This is the site I used to really get a good understanding of how to create a robots.txt file: http://www.robotstxt.org/
-
A very basic robots.txt file would look something like the below
User-agent: *
Sitemap: http://www.yourwebsite.com/sitemap.xml
Disallow: http://www.yourwebsite.com/url-you-dont-want-indexed
Disallow: http://www.yourwebsite.com/another-url-you-dont-want-indexedHope that helps
-
Include sitemaps. Disallow: Pages that you don't want indexed: search pages, login pages, core admin files.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Urgent help needed for site move with major ranking loss
URGENT HELP/ADVICE NEEDED I am so stressed and worried about my website domain change. I desperately need advice as soon as possible. I will try my best to keep this as brief as possible. I have owned and operated my punk clothing business online at the URL toofastonline.com for 15 years now. And for a long long time we ranked #1 for punk clothing on Google & life was good. However, thanks to the arrival of several cheap marketplaces and other unanticipated changes our ranking dropped considerably. The last few years have been extremely hard on us, to say the least, we came really close to losing the business altogether. But finally after lots of hard work & long hours, things started to improve. Ranking went back up, and we were busy again. I had been toying with the idea of buying the domain TooFast.com for about 10 years, but I never had the money to do it until this now, so I made the leap and as of Jan 9, toofastonline.com became toofast.com. Unfortunately, I now know that I set up the domain change hastily, without doing any of the pre-work Google suggests to do. I didn’t know it then but I did it wrong. And our site which wasranking #7 for punk clothing on Jan. 8th is now number 51 and today is only Jan 24th! I AM PANICKING. I have looked for help, posting jobs on Shopify Experts site several times now, opening accounts with MOZ and SEM Rush, spending countless hours on the phone with GoDaddy, Shopify and even long chats with Google. I have spent all day everyday for the past two weeks trying fix everything to no avail. No one can start on my site issues fast enough. And I have been given so much wrong information that I feel like I have done irreparable damage. I was (am) not qualified to make this kind of a site change alone. Too much was done too fast and without any real working knowledge Google SEO. My brother was the SEO guy and since he left the business I have just been struggling along with it, just trying to keep my head above water. So now for the big question: Should I temporarily change my Shopify stores domain back to toofastonline.com? This way I couldstart at the beginning, fix all the 404 redirects, fix the 301 redirects, clean up code, get the site in top working condition, and then, as Google suggests in theirGoogle Search Console Change of Address Toolstart to do the change of address in small sections, I can not afford to make any more reckless decisions. I have started and stopped, updated, fixed, changed and tried to fix again too many times now. I dont want Google to think I am trying something shady.. I’m not, I just don’t know what I’m doing, and I need help. Here is as much info as I can think of, I am more than willing to pay for help or do the work myself, as long as what I am doing is the right thing. Any and all help/advice/offers are welcome! Maureen CONTACT DETAILS: NAME: Maureen Keough, Owner EM:<a style="-webkit-text-size-adjust: 100%;">Maureen@TooFast.com</a> PH: 856-599-1675 (W) DETAILS OF OUR SET-UP THE APPS & SERVICES WE USE: Google Admin / G-Suite User Gmail for emails Godaddy holds our domains Shopify hosts our storefront. My Shopify store was located at TooFastOnline.com for about 5 years Our Domain Changed From toofastonline.com to toofast.com on Jan 9 In Godaddy both toofastonline.com is being forwarded to toofast.com In Shopify I added toofast.com, made it my primary domain, but left toofastonline.com in there but it is just redirecting to toofast.com. STEPS TAKEN TO CHANGE | ADD | VERIFY THE NEW DOMAIN GoDaddy DNS Records Both Sites - Updated Pointing to Shopify’s IP Address GoDaddy Subdomains For TooFastOnline.com - Redirected But Causing SSL/HTTPS/Privacy errors GoDaddy Subdomains For TooFast.com - Added But Causing SSL/HTTPS/Privacy errors Google Admin - Updated Gmail MX Records TooFast - Added and Updated Gmail MX Records TooFastOnline - Unchanged Google Merchant Center - Updated TooFastOnline is now TooFast Google Merchant Product Feed- Updated TooFastOnline is now TooFast Google Ads - Finally got the New Feed Approved and It is Working Google Search Console - Updated I Think Sitemaps - Added and Asked To Crawl Google Analytics Added TooFast As A Property Seems To Be Working Google Analytics Tag Updated in Shopify Admin Google Search Console - Requested to Move TooFastOnline.com to TooFast.com, still not done. No Redirects were made prior to the “Move” All Social Media Channels Links were Updated By Us Mailerlite MX Records For Bulk Emails - Updated/Verified
Intermediate & Advanced SEO | | TooFast130 -
Should I switch my website builder/host? Please help.
My website: www.joeborders.com is hosted with a service called jigsy: www.jigsy.com. I'm losing my mind trying to figure out if I should stay or not. Lol. I am positive I have done waaaayyy more work on my seo than many people ranking above me. I used to be on the first page, but over the last year I've slowly dropped in rankings. I've checked everything! I need to do some work on my blog, but I'm really thinking now that it might have something to do with my host. Some concerns I've identified: 1) I can't give pages individual h1 tags. The same one is blanketed across the site. 2) I'm told there are a lot of .css and JavaScript. 3) i cant redirect blog posts.....so moz is tagging me with 250 critical issues because my posts are on both www and http versions of my site .But that's all I know. I've talked with squarespace and WordPress and they have no way of transferring my site. It would probably take me a good 30 hours to set everything up....should i move? Please help 😞
Intermediate & Advanced SEO | | joebordersmft0 -
Using "nofollow" internally can help with crawl budget?
Hello everyone. I was reading this article on semrush.com, published the last year, and I'd like to know your thoughts about it: https://www.semrush.com/blog/does-google-crawl-relnofollow-at-all/ Is that really the case? I thought that Google crawls and "follows" nofollowed tagged links even though doesn't pass any PR to the destination link. If instead Google really doesn't crawl internal links tagged as "nofollow", can that really help with crawl budget?
Intermediate & Advanced SEO | | fablau0 -
HELP! How do I stop scraper sites - is there any recourse?
Our site has lots of unique content and photos and it is constantly being scraped and posted on other websites. Most of these are no-name sites that pop up and exist for adwords revenue. Aside from the fact that we don't want our content being copied, this is an SEO nightmare because they often link back to us from pages that are stuffed with keywords and have very low domain authority (it's a form of negative SEO). My question is: Does anyone have experience with fighting this phenonmenon? What have you done that is effective? Does anyone have experience with a service such as http://www.dmca.com/ProtectionPro.aspx ? Does it work/is it worth it? Any input is appreciated!
Intermediate & Advanced SEO | | YairSpolter0 -
Robots.txt Blocked Most Site URLs Because of Canonical
Had a bit of a "Gotcha" in Magento. We had Yoast Canonical Links extension which worked well , but then we installed Mageworx SEO Suite.. which broke Canonical Links. Unfortunately it started putting www.mysite.com/catalog/product/view/id/516/ as the Canonical Link - and all URLs with /catalog/productview/* is blocked in Robots.txt So unfortunately We told Google that the correct page is also a blocked page. they haven't been removed as far as I can see but traffic has certainly dropped. We have also , at the same time had some Site changes grouping some pages & having 301 redirects. Resubmitted site map & did a fetch as google. Any other ideas? And Idea how long it will take to become unblocked?
Intermediate & Advanced SEO | | s_EOgi_Bear0 -
Help - Lost Ranking - What did I screw up?!
Hi, We're working with a local service provider with a location specific keyword (not a real example: "orlando plumbing contractors"). Background:
Intermediate & Advanced SEO | | AaronHenry
In recent history the client updated a new site design and upgraded from Joomla 1.5 to Joomla 2.5. Of course there were duplicate content issues which have been resolved with the help of AceSEF. Duplicate content, title tags, and other content issues are handled as soon as they appear in GWT or MOZ. Additionally, a high number of backlinks were lost when the latest Google update hit. Many of these sites were of sites that no longer existed or were spammy and flushed out. Some were lost due to the previous SEO firm literally removing backlinks and switching them to their new client (seo firm was putting all of the work the client paid for under their name to control everything). Current Situation: The backlink loss seems to have been stopped (hopefully) because we are using a new strategy that relies solely on the quality of the links, surrounding text, varied anchor text, relevancy, etc.) However, we tried an experiment on just one of the clients keywords. That experiment seems to have blown up in our faces evidently. The landing page for the location specific keyword has dropped from the index completely (it seems), but only when searching broad. When using exact match with quotes like the example quoted above ("orlando plumbing contractors") the landing page appears, but several ranks lower. We were ranked yesterday 6/23/13, but as of today 6/24/13 are no longer ranked. On broad matches, non-relevant sites and even a site that shows only a broken server configuration is outranking the client (they appear for the broad search, but the client does not). What Was Done
We recently created a press release for and posted it on a press release site. We then created a link back to the landing page (exact match anchor text). We posted the PR article to several social sites (Google plus, folkd, delicious, twitter, stumble upon, diigo). We also created a blog article (on-site) on site for that, creating links back to the landing page (the links all had exact anchor text). We posted that blog article to social news sites (facebook, stumble, delicious) and included a ping. The PR article was manually rewritten and posted to the PR site (we had to make 2 versions of the PR; one for the blog and one for the PR site). The Result
The client ending up dropping off the broad search rankings, but only slipped a few for "exact match" (with quotes). The PR article that was created is now ranked on page 3 for the board keyword and is still beat by non working sites. We suspect that the exact anchor text could be causing this problem. Anyone else have an idea (we're scratching our heads and trying not to freak out at the same time).0 -
Need help creating sitemap
Hello, The details of my question is sitemap related. Below is the background info: we are ecommerce site with around 4000 pages, and 20000 images. we dont have a sitemap implemented on our site yet. i have checked alot of sitemap tools out there, like g-sitecrawler, xml sitemap, a1 sitemap builder etc, and i tried to create sitemaps via them, but all them give different results. the major links are all there, but the results start to vary for level 2, level 3 links and so on. plus no matter how much i read up on sitemaps, the more i am getting confused. i read lots of seomoz articles on sitemaps, and due to my limited seo and technical knowledge, the extra information on these articles gets more confusing. i also just read an article on seomoz that instead of having one sitemap, having multiple smaller sitemaps is very good idea, specially if we are adding lots of new products (which we are). Now my question: My question is having understood the immense value of sitemap (and by having it very poorly implemented before), how can i make sure that i get a very good sitemap (both xml and html sitemap). i do not want to do something again and just repeat old mistakes by having a poorly implemented sitemap for our site. I am hoping that one of the professionals out there, can help me also make and implement the sitemap. If you can please point me to the right direction.
Intermediate & Advanced SEO | | kannu10 -
Infinite Redirect Loop without trailing slash, please help
I've been searching for an answer all day, I can't seem to figure this out. When I Fetch my blog as Google(http://www.mysite.com/blog) WITHOUT a trailing slash at the end, I get this error: The page seems to redirect to itself. This may result in an infinite redirect loop **HTTP/1.1 301 Moved Permanently** When I Fetch my blog as Google WITH the trailing slash at the end(http://www.mysite.com/blog/), it is fine without errors. When I pull it up in a browser comes up fine both with and without the trailing slash. My .htaccess file in the root directory contains this: RewriteEngine On
Intermediate & Advanced SEO | | debc
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index.htm\ HTTP/
RewriteRule ^index.htm$ http://www.mysite.com/ [R=301,L]
RewriteCond %{HTTP_HOST} ^mysite.com$
RewriteRule ^(.*)$ http://www.mysite.com/$1 [R=301,L] My .htaccess file in the blog directory contains this: BEGIN WordPress <ifmodule mod_rewrite.c="">RewriteEngine On
RewriteBase /blog/
RewriteCond %{REQUEST_URI} ^./index.php/. [NC]
RewriteRule ^index.php/(.*)$ http://www.mysite.com/blog/$1 [R=301,L]
RewriteCond %{REQUEST_FILENAME} !-f
RewriteCond %{REQUEST_FILENAME} !-d
RewriteRule . /blog/index.php [L]</ifmodule> END WordPress Do I have something incorrectly coded in these .htaccess files that could be causing this? Or is there something else I should look at? Thank you for any help!!0