SEOMOZ Crawler Unicode bug

AsafY

For the last couple of weeks, the SEOMOZ crawler has been crawling only my homepage and getting 4xx errors for most of the URLs.

The crawler has no issues with English URLs, only with the Unicode (Hebrew) ones.

This is what I see in the CSV export for the crawl (one sample):

http://www.funstuff.co.il/׳ž׳¡׳׳‘׳×-׳¨׳•׳•׳§׳•׳× 404 text/html; charset=utf-8

You can see that the URL is gibberish.

Please help.

Nick_Sayers

Hey Asaf,

Thanks for writing in.

We have a known issue where our crawler doesn't parse Hebrew correctly, and it has caused problems in the past. The problems have been intermittent, but they can affect the data you see. Sorry about that. Our engineers have been working on a fix for the Hebrew character set, so stay tuned.
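For anyone who wants to check whether a garbled row in their own export matches this kind of problem, here is a minimal Python sketch. It assumes (this is a guess from the pattern in the sample row, not a confirmed diagnosis of the crawler) that the UTF-8 bytes of the Hebrew path were decoded with the Windows-1255 code page. The helper names `garble` and `repair` are made up for illustration.

```python
# Assumption: the gibberish comes from UTF-8 bytes being read as
# Windows-1255 (cp1255). This is inferred from the sample row only.

def garble(text: str, wrong_encoding: str = "cp1255") -> str:
    """Encode as UTF-8, then decode the bytes with the wrong code page."""
    return text.encode("utf-8").decode(wrong_encoding)


def repair(garbled: str, wrong_encoding: str = "cp1255") -> str:
    """Reverse the mistake: recover the raw bytes, then decode as UTF-8."""
    return garbled.encode(wrong_encoding).decode("utf-8")


if __name__ == "__main__":
    slug = "רווקות"  # a Hebrew word used here as test input
    broken = garble(slug)
    print(broken)                  # garbled form of the slug
    print(repair(broken) == slug)  # round trip recovers the original
```

Some UTF-8 bytes have no Windows-1255 mapping in Python's codec, so other Hebrew strings may raise `UnicodeDecodeError` in `garble` instead of producing gibberish; the sketch only illustrates the mechanism.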

Best,
