Still Cant Crawl My Site
-
I've removed all blocks but two from our htaccess. They are for amazonaws.com to block amazon from crawling us.
I did a fetch as google in our WM tools on our robots txt with success.
SEOMoz crawler here hit's our site and gets a 403. I've looks in our blocked request logs and amazon is the only one in there.
What is going on here?
-
Hey Joel,
Happy Friday!
Sha
-
Hi Dana,
No problem. Glad you have sorted the problem now.
Have an awesome weekend
Sha
-
Hey Dana,
We've been corresponding in email, but I just wanted to update your thread here as well.
We don't use Amazon's bot, we use Amazon Web Service to host our crawler. If you are no longer blocking AWS you should be able to crawl OK moving forward.
Thanks!
Joel. -
Wish someone would've pointed that out days ago.
Thank you soooooo much for your great answer.
I don't understand though how or why seomoz is using amazons bot...
What if I don't want amazon accessing our site ( i dont). That means we can't use seomoz then??
-
we'll see how this goes. I've removed the blocks for amazonaws...
Thanks .
-
Hi Dana,
I believe SEOmoz utilizes Amazonaws services for crawling, (or at least they did a few months ago) so that may well be your problem.
The best (and quickest) way to confirm this is to go to the SEOmoz Help Hub and click the button at the top of the page to contact the Help Team directly.
Hope that helps,
Sha
-
Whats the web address?
Issa
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved How many sites can I track with one subscription?
Hello, We are currently a MozPro medium member and we are tracking amlrightsource.com but we have other sites we'd like to track as well. Wondering if we can track more sites with this subscription?
Moz Pro | | KassandraSharr0 -
Shopify crawl issues
Hi Moz'ers, I am a total newcomer to this level of seo. Recently I transitioned to Shopify and I'm puzzled by why I'm getting 803 errors - incomplete crawl attempts due to server timing out. Wouldn't this have to do with Shopify? How would I go about fixing it? I'm also getting 804 - SSL issues, but I assume that will go away. Any advice? Thanks! Sharon
Moz Pro | | Sharon2016
www.ZeldasSong.com0 -
Abnormal crawl issues appearing in my Moz results
I have been asked to look at a site for a friend and was more than surprised to see 16,9k crawl issues appear in the dashboard... of this 6,238 are duplicate page content and 5878 are duplicated page titles. What on earth is going on? I have spoken to the web developer as it appears there is a dev site somewhere and this is his response [Can I stress that Google determines which site was in the index first and then removes other sites it sees as having duplicate content. Our dev sites appearing in the search index would not affect your ranking due to duplicate content as Google would see your site as the first site with the content] As I cannot make contact with him, I am scratching my head, surely a dev site should be no-indexed, it sounds as though he is saying that its ok because Google will take the main site as the first site with the content... Very confused! Help need MOZ community. Manythanks, Sarah
Moz Pro | | Mutatio_Digital0 -
Open Site Explorer - Image Anchor Text
Hey there. I'm researching internal links and In my report I get alot of image links with the following text in the Link Anchor Text column: (img alt)(img)[No Anchor Text]. Is it because my images don't have alt-text or something else... How to optimize correctly? See attached screenshot. Best regards K87xLN8
Moz Pro | | nosuchagency0 -
Crawl Errors from URL Parameter
Hello, I am having this issue within SEOmoz's Crawl Diagnosis report. There are a lot of crawl errors happening with pages associated with /login. I will see site.com/login?r=http://.... and have several duplicate content issues associated with those urls. Seeing this, I checked WMT to see if the Google crawler was showing this error as well. It wasn't. So what I ended doing was going to the robots.txt and disallowing rogerbot. It looks like this: User-agent: rogerbot Disallow:/login However, SEOmoz has crawled again and it still picking up on those URLs. Any ideas on how to fix? Thanks!
Moz Pro | | WrightIMC0 -
Is Rank Tracker still down?
I have been trying to use Rank Tracker but it still appears to be down. The following message is displayed which dates from 5th September and suggests it would take around a week to get it working. "Due to a server failure, we are experiencing a delay in Rank Tracker results this week. Unfortunately, it may take up to a week to get it working properly again. Thanks for your patience and understanding; our engineers are working around the clock to get this issue fixed. - Updated September 5th." It is now 17th September and I am still unable to use it. Just wondering whether this is the same situation for all or whether SEOMoz have an update on when we can expect this to be up and running. Thanks.
Moz Pro | | simon_realbuzz0 -
Open Site Explorer Update
What is taking OSE so long to update? The update schedule said the next update was going to be on Dec 28th.
Moz Pro | | Robbie8299
If you open OSE it says "Last Index Update: November 28th, 2011" Today in January 1st. Any thoughts as to why the delay?0 -
Can Open Site Explorer Do This?
Is there any way to set up Open Site Explorer to show these things for competitor external backlinks: Google Page Rank of the page the backlink is on Google Page Rank of the domain the backlink is on Whether the backlink is a follow or no follow Is this possible in OSE? If not, are there any other SEOMOZ Tools that will do this? Thanks.
Moz Pro | | N5c0