SEOMOZ Crawler unicode bug
-
for the last couple of weeks the SEOMOZ crawls my homepage only and gets 4xx error for most of the URL's.
the crawler have no issues with English url's only with the unicode(Hebrew) ones.
this is what is see in the csv export for the crawl (one sample) :
http://www.funstuff.co.il/׳ž׳¡׳׳‘׳×-׳¨׳•׳•׳§׳•׳× 404 text/html; charset=utf-8
you can see that the URL is Gibberish
please help.
-
Hey Asaf,
Thanks for writing in.
We have a known issue where Hebrew isn't parsed right by our crawler so it has caused issues in the past. The issues have been intermittent but they can affect the data you see. Sorry about that. Our engineers have been working to get a fix out there for the Hebrew character set, so stay tuned.
Best,
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Getting my top keywords separated out in SEOmoz reports
I am using the standard functionality to produce weekly Moz reports. There does not seem to be a setting to show rankings of my most important keywords. It would be nice to have those high-level keywords on the first page. For example, I have 200 keywords in an account. I want the report to show on a page my 10 most important keywords. Is there a way to set up a Label to my keywords in order to product a report page just for those keywords?
Moz Pro | | clicktoshop0 -
How to exclude a specific subdomain from SEOMoz Campaign?
Forgive me if I've overlooked something obvious, but how can I exclude a subdomain from a campaign? I want to crawl/analyze mywebsite.com, but not subdomain.mywebsite.com Thanks in advance...
Moz Pro | | ahockley1 -
SEOmoz Keyword Difficulty Tool been down for a few days?
Hi All, I notice the SEO moz keyword difficulty tool has been down for a few days!!! I know from support that they say it is going to be a "while" till it fixed, but some type of estimation on how long it will be will be good. Also in regards to the types of accounts, why do the top accounts have the same limitations as the 79/month tool in regards to the keyword tool reports (50 max and 5 per scan)? I mean this is probably a wider question for the SEOmoz team need to answer. Kind Regards.
Moz Pro | | ColumbusAustralia2 -
SEOmoz report vs. Google's Algo
Hello, I got an SEOmoz Report for one of our clients and the report is showing these pages and many more others as duplicate page content. Thing is the page content is not duplicate however there is very little data differentiating the contents. My question is does Google see the following pages contents as duplicate? because seomoz does. http://dallastxlofts.com/blog/2012/06/using-a-loft-for-commercial-or-office-space.html/img_9632/ http://dallastxlofts.com/blog/2012/08/newly-renovated-in-victory-park.html/3-2/ http://dallastxlofts.com/blog/2012/08/pedestrian-friendly-uptown-west-village.html/attachment/18/ http://dallastxlofts.com/blog/2012/06/using-a-loft-for-commercial-or-office-space.html/img_4322/ http://dallastxlofts.com/blog/2012/08/historic-deep-ellum-lofts.html/2012-08-18-11-24-30/ http://dallastxlofts.com/blog/2012/08/pedestrian-friendly-uptown-west-village.html/attachment/13/ http://dallastxlofts.com/blog/2012/06/using-a-loft-for-commercial-or-office-space.html/842-4/
Moz Pro | | Bryan_Loconto0 -
Does anybody really think that SEOMoz provides much value?
Crawl results lag so far behind as to be of questionable value for corrective purposes, and in the fast-paced world of SEO, the service seems right out of the horse-and-buggy era.. I corrected crawl errors two weeks ago, and yet SEOMoz' crawls are still not reflecting this. Furthermore, SEOMoz' idea of where my keywords rank has little to no bearing on reality. I am really disenchanted, and thinking of cancelling my subscription.
Moz Pro | | amadomon0 -
Do you think Seomoz is worth the monthly fee if you're not a professional SEO ?
I just want to ask the people who subscribe to Seomoz on a regular basis, I just paid for my first month subscription but to be perfectly honest I'm trying to work out whether
Moz Pro | | whitbycottages
somebody who is not rolling in cash and trying to make a living can afford to
pay the fee each month. I'm not a professional I just have two business websites and I'm learning the subject and finding it interesting. The tools do seem very good but I just wondered how people see this service on which aspect is the most important of them. I like to continue, I have been impressed the quality of the forum topics and discussions I just wonder whether I can afford to justify the fee.1 -
SEOmoz API - Links and Anchor Text Calls
Hi, I'm testing out the SEOmoz API - however I'm stuggling to understand the use of the Cols parameter within the "anchor-text" method. I've looped through increasing numbers of "Cols" for a standard query and there just seems to be no logical pattern.
Moz Pro | | AlexThomas
** - Could someone please enlighten me as to how this works?** E.g. of results for query: http://lsapi.seomoz.com/linkscape/anchor-text/www.seomoz.org/?Scope=term_to_page&Sort=domains_linking_page&Cols=1 1Array ( [0] => Array ( [aturid] => 86128451138 ) [1] => Array ( [aturid] => 86128451144 ) [2] => Array ( [aturid] => 86128451131 ) ) 2Array ( [0] => Array ( [atut] => seomoz ) [1] => Array ( [atut] => seomoz.org ) [2] => Array ( [atut] => seo ) ) 3Array ( [0] => Array ( [aturid] => 86128451138 [atut] => seomoz ) [1] => Array ( [aturid] => 86128451144 [atut] => seomoz.org ) [2] => Array ( [aturid] => 86128451131 [atut] => seo ) ) 4Array ( [0] => Array ( [atui] => 38845159274 ) [1] => Array ( [atui] => 38845159274 ) [2] => Array ( [atui] => 38845159274 ) ) 5Array ( [0] => Array ( [atui] => 38845159274 [aturid] => 86128451138 ) [1] => Array ( [atui] => 38845159274 [aturid] => 86128451144 ) [2] => Array ( [atui] => 38845159274 [aturid] => 86128451131 ) ) 6Array ( [0] => Array ( [atui] => 38845159274 [atut] => seomoz ) [1] => Array ( [atui] => 38845159274 [atut] => seomoz.org ) [2] => Array ( [atui] => 38845159274 [atut] => seo ) ) 7Array ( [0] => Array ( [atui] => 38845159274 [aturid] => 86128451138 [atut] => seomoz ) [1] => Array ( [atui] => 38845159274 [aturid] => 86128451144 [atut] => seomoz.org ) [2] => Array ( [atui] => 38845159274 [aturid] => 86128451131 [atut] => seo ) ) 8Array ( [0] => Array ( [atuiu] => 1 ) [1] => Array ( [atuiu] => 1 ) [2] => Array ( [atuiu] => 0 ) ) 9Array ( [0] => Array ( [atuiu] => 1 [aturid] => 86128451138 ) [1] => Array ( [atuiu] => 1 [aturid] => 86128451144 ) [2] => Array ( [atuiu] => 0 [aturid] => 86128451131 ) ) 10Array ( [0] => Array ( [atuiu] => 1 [atut] => seomoz ) [1] => Array ( [atuiu] => 1 [atut] => seomoz.org ) [2] => Array ( [atuiu] => 0 [atut] => seo ) ) Links API: Similar confusion here for:
"TargetCols"
"SourceCols"
"LinkCols" The description here http://apiwiki.seomoz.org/w/page/13991141/Links API - is a bit vague It appears that the links API spits out everything anyway - that one's less of an issue. So... could anyone explain how the Anchor-text API parameter Cols works?? Cheers!0 -
Seomoz Spider/Bot Details
Hi All Our website identifies a list of search engine spiders so that it does not show them the session ID's when they come to crawl, preventing the search engines thinking there is duplicate content all over the place. The Seomoz has bought a over 20k crawl errors on the dashboard due to session ID's. Could someone please give the details for the Seomoz bot so that we can add it to the list on the website so when it does come to crawl it won't show it session ID's and give all these crawl errors. Thanks
Moz Pro | | blagger1