Not getting foreign characters in crawl diagnostics .csv
-
The crawl diagnostics .csv file shows high-ASCII characters instead of the correct characters for foreign-language websites, e.g. Vietnamese, Chinese (both kinds), etc. Is there a way to get this right?
-
Glad it helped! I think the issue might be with Excel more than Moz; its handling of UTF-8 CSVs has been terrible since day 1! I think there is a way to use Excel's import data function to get the same result, but I never had much luck with it, and the OpenOffice trick seemed less painful.
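If you would rather script around Excel's encoding detection, here is a minimal Python sketch. It assumes the exported CSV is already valid UTF-8 and simply rewrites it with a byte order mark (the `utf-8-sig` codec), which Excel generally treats as the cue to read the file as UTF-8. The function name and the file names are made up for illustration; they are not anything Moz provides.

```python
import csv


def add_utf8_bom(src_path, dst_path):
    """Copy a UTF-8 CSV to a new file that starts with a UTF-8 BOM.

    Hypothetical helper: assumes src_path is valid UTF-8 (with or without BOM).
    """
    # Reading with utf-8-sig drops an existing BOM, so the output never gets two.
    with open(src_path, newline="", encoding="utf-8-sig") as src, \
            open(dst_path, "w", newline="", encoding="utf-8-sig") as dst:
        writer = csv.writer(dst)
        for row in csv.reader(src):
            writer.writerow(row)


if __name__ == "__main__":
    # File names are placeholders; point these at your own export.
    add_utf8_bom("crawl_diagnostics.csv", "crawl_diagnostics_excel.csv")
```

Like the OpenOffice route, this only fixes how titles and meta text display; it does not decode percent-encoded characters inside URLs.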
-
Open Office did the trick! Thank you. Would be nice if the Moz app could do UTF-8 natively.
-
Hi Ash,
I had this problem too and here is how I solved it (there might be better ways).
If the characters are in the page titles, meta tags, etc., you can open the CSV file in OpenOffice, choose Save As .xls, and then open the resulting file in Excel; the UTF-8 characters will display correctly. This method works great for titles etc. but does not decode foreign characters in the URLs themselves.
If the characters are in the URL, one way I have found is to download this pretty awesome Excel add-on (the site is in German; I used Google Translate to figure out what was going on). It adds some new functions to Excel, so you can create a second column next to the URL column, apply the URL-decode function to the first column, and get readable URLs in the second. This add-on saved me sooo much time and trouble! It works for Greek, which is what I need it for; I assume it will work for Chinese too. Let me know if you need more detailed instructions; it took a bit of trial and error to figure out the exact moves needed to get the results you want.
Hope that helps!
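The "decoded URL in a second column" step can also be done outside Excel. The following is a rough Python sketch, not the add-on's own method. It assumes the non-ASCII parts of the URLs are percent-encoded UTF-8 and that the URL column is headed `URL`; that header is a guess, so adjust `url_column` to match your export. The `Decoded URL` column name and the function name are invented for this example.

```python
import csv
from urllib.parse import unquote


def add_decoded_url_column(src_path, dst_path, url_column="URL"):
    """Write a copy of the CSV with an extra, human-readable URL column.

    Hypothetical helper: url_column must match the header in your export.
    """
    with open(src_path, newline="", encoding="utf-8-sig") as src, \
            open(dst_path, "w", newline="", encoding="utf-8-sig") as dst:
        reader = csv.DictReader(src)
        fieldnames = list(reader.fieldnames or []) + ["Decoded URL"]
        # extrasaction="ignore" skips stray cells in rows longer than the header.
        writer = csv.DictWriter(dst, fieldnames=fieldnames, extrasaction="ignore")
        writer.writeheader()
        for row in reader:
            # unquote() turns %CE%B5-style escapes back into characters (UTF-8 by default).
            row["Decoded URL"] = unquote(row.get(url_column) or "")
            writer.writerow(row)


if __name__ == "__main__":
    # File names are placeholders; point these at your own export.
    add_decoded_url_column("crawl_diagnostics.csv", "crawl_diagnostics_decoded.csv")
```

The output is written with a BOM (`utf-8-sig`) so the decoded column should also display correctly when the file is opened directly in Excel.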