URLs contains other language than English
-
I am in need of your advice in regards to urls of my new sites. I have got one site from gulf region site is in English and Arabic language. The issue is we are getting url from both. Some are Arabic, do you guys think it will effect the ranking result?
url example is : www.mydomain.com/بيع-بي-سيارة
-
Hey there
To Quote Google on this, with the issue of ASCII and UTF encoded characters, like Arabic:
"Yes, we can generally keep up with UTF-8 encoded URLs and we’ll generally show them to users in our search results (but link to your server with the URLs properly escaped). I would recommend that you also use escaped URLs in your links, to make sure that your site is compatible with older browsers that don’t understand straight UTF-8 URLs"
So their recommendation would be to have both URLs available (the English and the Arabic) in order to support all users. So the fact you already do this is a good thing.
The next step would be to make sure you are handling duplicate content correctly. If the Arabic and non Arabic URLs are linking to a page with the same content - Google _should_be able to recognise this as the same page and not penalise you for duplications. So if the Arabic URL and the "escaped URL" (ASCII/English equivalent) both go to the same page, you should be fine. I've experienced this quite a few times with Turkish websites, for example, that also have UTF encoded characters.
However, you can eliminate the risk further by adding a canonical tag to each page. As far as I am aware, the canonical tag will support Arabic characters and so, on each page of the site, add a canonical tag that points to that page. For example, with the URL above, you would want to place a canonical tag like:
You can read more on canonical tags here: http://moz.com/learn/seo/canonicalization
Do be aware that for XML sitemaps, the URLs in the sitemap need to be URL-escaped - that is to say, UTF encoded URLs need to be made into their ASCII equivalent. You can read more about that in this Google guide to using non-alphanumeric characters in Sitemap URLs.
Hope this helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Is it possible (or advisable) to try to rank for a keyword that is 'split' across subfolders in your url?
For example, say your keyword was 'funny hats' - ideally you'd make your url 'website.com/funny-hats/' But what if 'hats' is already a larger category in your site that you want to rank for as its own keyword? Could you then try to rank for 'funny hats' using the url 'website.com/hats/funny/' ? Basically what I'm asking is, would it be harmful to the chances of ranking for your primary keyword if it's split across the url like this, and not necessarily in the correct order?
Algorithm Updates | | rwat0 -
Flat Structure URL vs Structured Sub-directory URL
We are finally taking our classifieds site forward and moving into a much improved URL structure, however, there is some disagreement over whether to go with a Flat URL structure or a structured sub-directory. I've browsed all of the posts and Q&A's for this going back to 2011, and still don't feel like I have a real answer. Has anyone tested this yet, or is there any consensus over ranking? I am in a disagreement with another SEO manager about this for our proposed URL structure redesign who is for it because it is what our competitors are doing. Our classifieds are geographically based, and we group by state, county, and city. Most of our traffic comes from state and county based searches. We also would like to integrate categories into the URL for some of the major search terms we see. The disagreement arises around how to structure the site. I prefer the logical sub-directory style: [sitename]/[category]/[state]/[county]/
Algorithm Updates | | newspore
mysite.com/for-sale/california/kern-county/
or
[sitename]/[category]/[county]-county-[stateabb]/
mysite.com/for-sale/kern-county-ca/ I don't mind the second, except for when you look at it in the context of the whole site: Geo Landing Pages:
mysite.com/california/
mysite.com/los-angeles-ca-90210/ Actual Search Pages:
mysite.com/for-sale/orange-ca/[filters] Detail Pages:
mysite.com/widget-type/cool-product-name/productid I want to make sure this flat structure performs better before sacrificing my analytics sanity (and ordered logic). Any case studies, tests or real data around this would be most helpful, someone at Moz must've tackled this by now!0 -
How Additional Characters and Numbers in URL affect SEO
Hi fellow SEOmozers, I noticed that a lot of websites have additional characters and words at the end of the URL in addition keyword optimized URL. Mostly for E-Commerce sites For example: www.yoursite.com/category/keyword?id=12345&Keyword--Category--cm_jdkfls_dklj or wwww.yoursite.com/category/keyword#83939=-37292 My question is how does the additional characters or parameters(not necessarily tracking parameters) affect SEO? Does it matter if i have additional keywords in the additional stuff in the URL (1st url example)? If you can provide more information, that would be helpful. Thank you!
Algorithm Updates | | TommyTan0 -
Vanity URL's and http codes
We have a vanity URL that as recommended is using 301 http code, however it has been discovered the destination URL needs to be updated which creates a problem since most browsers and search engines cache 301 redirects. Is there a good way to figure out when a vanity should be a 301 vs 302/307? If all vanity URL's should use 301, what is the proper way of updating the destination URL? Is it a good rule of thumb that if the vanity URL is only going to be temporary and down the road could have a new destination URL to use 302, and all others 301? Cheers,
Algorithm Updates | | Shawn_Huber0 -
Why does Google say they have more URLs indexed for my site than they really do?
When I do a site search with Google (i.e. site:www.mysite.com), Google reports "About 7,500 results" -- but when I click through to the end of the results and choose to include omitted results, Google really has only 210 results for my site. I had an issue months back with a large # of URLs being indexed because of query strings and some other non-optimized technicalities - at that time I could see that Google really had indexed all of those URLs - but I've since implemented canonical URLs and fixed most (if not all) of my technical issues in order to get our index count down. At first I thought it would just be a matter of time for them to reconcile this, perhaps they were looking at cached data or something, but it's been months and the "About 7,500 results" just won't change even though the actual pages indexed keeps dropping! Does anyone know why Google would be still reporting a high index count, which doesn't actually reflect what is currently indexed? Thanks!
Algorithm Updates | | CassisGroup0 -
Is it OK to 301 redirect the index page to a search engine friendly url
Is it OK to 301 redirect the index page to a search engine friendly url.
Algorithm Updates | | WinningInch0 -
The related: query for one of my urls makes no sense
I'm trying to compete regarding keyword X. Currently, I'm on first page, 7-8th position. If, for each one of the urls listed in first page for such keyword, I search for related:[url], I get similar results for all of them, but mine. Mine shows inconsistent results, none of which related to the same topic as the other 9 in the top 10. Looking at them, the only hypothesis I am able to formulate is that, somehow, google is linking the url to its paid banners in big media. However, such banners go through an adserver and/or are declared as nofollow. Is there any obvious reason that could be causing this? I wonder if we are on page 1 even though we're considered pretty-much 'off-topic' regarding the keyword.
Algorithm Updates | | jleanv240 -
Is URL appearance defined by crawling or by XML sitemap
I am having a problem developing a sitemap because I have long URLs that are made by zend. They go like this: http://myagingfolks.com/professionals/20661/social-workers/pennsylvania-civi-stanger Because these URL's are long and are fed by Zend when I try to call them all up, to put on the sitemap, the system runs out of memory and crashes. Do you know what part of a search result, in google, say, comes from the URL? Would it be fine for me to submit to google only www.myagingfolks.com/professionals/20661. Does the crawler find that the URL is indeed http://myagingfolks.com/professionals/20661/social-workers/pennsylvania-civi-stanger or does it go with just what the sitemap tells it?
Algorithm Updates | | Jordanrg0