SEOmoz crawler Unicode bug
-
For the last couple of weeks, the SEOmoz crawler has crawled only my homepage and gets 4xx errors for most of the URLs.
The crawler has no issues with English URLs, only with the Unicode (Hebrew) ones.
This is what I see in the CSV export for the crawl (one sample):
http://www.funstuff.co.il/׳ž׳¡׳׳‘׳×-׳¨׳•׳•׳§׳•׳× 404 text/html; charset=utf-8
You can see that the URL is gibberish.
Please help.
-
Hey Asaf,
Thanks for writing in.
We have a known issue where our crawler doesn't parse Hebrew correctly, and it has caused problems in the past. These problems have been intermittent, but they can affect the data you see. Sorry about that. Our engineers are working on a fix for the Hebrew character set, so stay tuned.
Best,
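For anyone debugging a similar report: garbling like the sample above is what typically appears when the UTF-8 bytes of a Hebrew path are decoded with a legacy code page. The sketch below reproduces that effect in Python. The slug and the choice of Windows-1255 (`cp1255`) as the wrong encoding are assumptions for illustration; nothing here is confirmed as the cause inside the Moz crawler.

```python
from urllib.parse import quote, unquote

# Hypothetical Hebrew slug, used only for illustration.
path = "/שלום-עולם"

# Percent-encode the path as UTF-8, the form a URL takes on the wire.
encoded = quote(path)

# Decoding with the matching encoding round-trips cleanly.
correct = unquote(encoded, encoding="utf-8")

# Decoding the same bytes as Windows-1255 (an assumed wrong guess)
# produces mojibake; bytes with no mapping become U+FFFD.
garbled = unquote(encoded, encoding="cp1255", errors="replace")

print(encoded)
print(correct == path)   # the UTF-8 round trip preserves the slug
print(garbled == path)   # the mis-decoded version does not
```

If a crawler then requests the garbled form, the server is asked for a path that does not exist, which would be consistent with the 404 in the export.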
Related Questions
-
Couple of Moz's bugs
There are still some bugs in the new Moz: I can still find a lot of mentions of "SEOMoz", for example in the footer, in the Q&A form (SEOMoz Resources), and in the new question form (SEOmoz PRO Application, SEOmoz Tools, etc.). On the main page (http://moz.com/pro/home), sometimes my full name is not visible at all, and sometimes my MozPoints are hidden (in the top-left corner). There is no direct link from http://devblog.moz.com/ to the main website. Regards.
Moz Pro | | ditoroin0 -
Crawlers crawl weird long urls
I started a crawl for the first time and got many errors, but the weird thing is that the crawler follows duplicated, long, nonexistent URLs. For example, there is a page www.website.com/dogs/dog.html, but the crawler then continues with:
www.website.com/dogs/dog.html
www.website.com/dogs/dogs/dog.html
www.website.com/dogs/dogs/dogs/dog.html
www.website.com/dogs/dogs/dogs/dogs/dog.html
www.website.com/dogs/dogs/dogs/dogs/dogs/dog.html
What can I do about this? Screaming Frog gave me the same issue, so I know it's something with my website.
Moz Pro | | r.nijkamp0 -
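One common cause of ever-deepening paths like these is a relative link that is missing its leading slash, so each crawled page resolves it against its own directory. The sketch below shows that resolution with Python's standard `urljoin`. The `relative_href` value is a hypothetical link, not something taken from the site in question.

```python
from urllib.parse import urljoin

page = "http://www.website.com/dogs/dog.html"

# Hypothetical href found on the page: note there is no leading slash.
relative_href = "dogs/dog.html"

# Follow the same relative link three times, as a crawler would.
urls = []
current = page
for _ in range(3):
    current = urljoin(current, relative_href)
    urls.append(current)

for url in urls:
    print(url)

# A root-relative href resolves to the same page every time.
root_relative = urljoin(page, "/dogs/dog.html")
print(root_relative)
```

If the site also answers these deeper paths with a 200 instead of a 404, any crawler will keep following them, which would explain why Screaming Frog shows the same pattern.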
How do I get the crawler going again?
The initial crawl only hit one page. I set up another campaign for another site and it crawled 260 pages. How can I get the crawler started again, or do I really have to wait a week?
Moz Pro | | martJ0 -
Bug with seomoz level display?
Is anyone else seeing people's moz levels shown incorrectly? Spotted this yesterday, but I am getting the same today. Lots of people showing as Aspirant who shouldn't be.
Moz Pro | | matbennett1 -
How accurate is SEOMoz's keyword analysis tool?
For the most part, SEOMoz's keyword analysis tool has been in line with other tools, such as the AdWords keyword tool, with regard to competitive level. However, I have just encountered a keyword that a client may choose to compete on, and the two tools seem far apart:
Keyword phrase: online math games
AdWords competitive level: Low
SEOMoz competitive level: 80
This seems like a sizeable difference. (I know the two compare different things, all results versus first-page authority, but typically they are in line with each other.) With other related keywords for the industry in question, SEOMoz and AdWords seem to be in line. This one just got me thinking. I know the SEOMoz score reflects the strength of the top results, and that the "Low" score from AdWords may reflect much weaker results on the following pages (a larger number of weaker pages versus a few high-authority outliers).
Question: How accurate is SEOMoz's keyword analysis tool, and what other keyword analysis tools are you using that you like? I have tried others, but many provide duplicate insights.
Moz Pro | | mattylac0 -
Does SEOmoz recognize duplicate URLs blocked in robots.txt?
Hi there, just a newbie question. I found some duplicate URLs in the SEOmoz crawl diagnostics report that should not be there. They are meant to be blocked by the site's robots.txt file. Here is an example URL (Joomla + VirtueMart structure): http://www.domain.com/component/users/?view=registration and here is the blocking content in the robots.txt file:
User-agent: *
Disallow: /components/
My questions: Will this kind of duplicate-URL error be removed from the error list automatically in the future? Do I need to keep track of which errors should not really be in the list? What is the best way to handle this kind of error? Thanks and best regards, Franky
Moz Pro | | Viada0 -
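One thing worth checking in a case like this: the posted rule is `Disallow: /components/` (plural), while the example URL path starts with `/component/` (singular), and robots.txt rules match by path prefix. The sketch below tests both spellings with Python's standard-library parser. The singular rule is a hypothetical correction, and it is an assumption that the Moz crawler matches prefixes the same way this parser does.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_lines, url, agent="*"):
    """Return True if the given robots.txt lines permit fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_lines)
    return parser.can_fetch(agent, url)


url = "http://www.domain.com/component/users/?view=registration"

# The rule as posted in the question (plural "components").
as_posted = ["User-agent: *", "Disallow: /components/"]

# Hypothetical corrected rule (singular "component").
singular = ["User-agent: *", "Disallow: /component/"]

print(is_allowed(as_posted, url))
print(is_allowed(singular, url))
```

Under this parser the plural rule does not cover the example URL, so a crawler that honors robots.txt would still be free to fetch it and report it.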
SEOMoz toolbar - Anyone else have problems with Search Profiles?
(Using Firefox 7.0.1) I just downloaded the toolbar and the Custom Search Profiles do not work--clicking on any of them adds "%" and numbers to the search query. I've created a couple of specific locations and I'd really like to get this figured out. Does this function work correctly for anyone? Am I doing something wrong?
Moz Pro | | Court_LOQUA0