Novice Question - Can Browsers realistically distinguish words within concatenated strings, e.g. text55fun, or should one use text-55-fun? What about foreign languages, especially more obscure ones like Finnish, which Google Translate often mistranslates?
-
I am attempting to understand what is realistically possible within Google, Yahoo and Bing as they search websites for keywords. Technically, my understanding is that they should be able to distinguish common words within concatenated strings, although word boundaries can be confused when the string is ambiguous. So in the simple example of text55fun, do search engines actually distinguish text, 55 and fun separately? There are practical processing, database and algorithm limitations that might turn a technically possible solution into an unrealistic one at a commercial scale.
What about more ambiguous strings like stringsstrummingstrongly? Would that be parsed as string s strummings trongly, or strings strummings trongly, or strings strumming strongly? Does one need to use dashes or underscores to make it unambiguous to the search engine? My guess is that the engine would recognize the dash or space and better understand the word boundaries, yet ignore the dash or underscore from an overall concatenated string perspective.
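To make the ambiguity concrete, here is a minimal sketch of dictionary-based word segmentation. It is only an illustration under stated assumptions: the tiny vocabulary and the function names are made up for this example, and nothing here reflects how Google, Yahoo or Bing actually tokenize text.

```python
import re


def split_letters_and_digits(text):
    """Split at letter/digit boundaries, e.g. 'text55fun' -> ['text', '55', 'fun']."""
    return re.findall(r"[A-Za-z]+|\d+", text)


def segmentations(text, vocabulary):
    """Return every way to split `text` into words found in `vocabulary`."""
    if not text:
        return [[]]  # exactly one way to segment the empty string
    results = []
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in vocabulary:
            for rest in segmentations(text[end:], vocabulary):
                results.append([head] + rest)
    return results


# Hypothetical toy vocabulary; a real engine would rely on far larger
# language models and word-frequency data.
VOCAB = {"string", "strings", "s", "strumming", "strongly"}

for words in segmentations("stringsstrummingstrongly", VOCAB):
    print(" ".join(words))
print(split_letters_and_digits("text55fun"))
```

With this toy vocabulary the concatenated string has more than one valid split, so the software has to guess which one was intended. The hyphenated form strings-strumming-strongly leaves nothing to guess.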
Thanks in advance to whoever can provide any insight to an old coder who is new to this field.
-
Omid,
Thanks very much for the fast response.
Totally agree about Google AI - my objective is a completely white-hat, original, content-rich website, which their AI encompasses - Google is attempting to weed out folks who are avoiding the hard work associated with building these types of websites.
Not everything in business has a brand-building goal - only so many brands can rise to the top. I'm looking from a very different perspective in this case and would be very happy to find my 1,000 to 5,000 specific people or so. From that point, I would then have my work cut out from a business standpoint, nothing to do with SEO or anything related to this field. Just simple blocking and tackling - sales, customer service, marketing, delivery etc.
I'm an old-school guy (sell high-value, high-margin products and services to customers who value you and want to stay with you over long periods of time because of great customer service) and this is an experiment for me using a new-school sales tactic. In my experience it does not take a lot of those types of customers to build a very nice small business. The really hard part is actually creating the organization which genuinely delivers on those commitments consistently over long periods of time and retains those customers.
All the best
Newell
-
Paul,
Thanks very much for the prompt response - I love your part of Canada, by the way - and have driven through your town on the way to Jasper a number of times. I think that it's the most majestic part of the Rockies one can see without a plane.
Your response is at the heart of my question - the difference between what is possible and what is practical, particularly at the speed at which responses occur. I wasn't aware of the space issue, very good point!
As a variation, were the concatenated string to become part of a URL for an original content-rich website, it sounds as though both the dashed and un-dashed URL would be required to be safe (because people tend not to type dashes or forget). In that event, would it matter to the search engines which URL is 301 redirected?
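For illustration, the two-URL setup could be sketched as a lookup from the un-dashed variant to the dashed one. This assumes the dashed URL is the one kept as canonical, which the thread does not settle; the slug list and the `redirect_for` helper are hypothetical names used only for this sketch.

```python
# Hypothetical canonical (dashed) slugs for the site.
CANONICAL_SLUGS = ["text-55-fun", "strings-strumming-strongly"]

# Map each un-dashed variant to its dashed canonical form.
UNDASHED_TO_CANONICAL = {
    slug.replace("-", ""): slug for slug in CANONICAL_SLUGS
}


def redirect_for(path):
    """Return (301, location) for an un-dashed path, or None if no redirect applies."""
    slug = path.strip("/")
    target = UNDASHED_TO_CANONICAL.get(slug)
    if target is None:
        return None
    return 301, "/" + target


print(redirect_for("/text55fun"))
print(redirect_for("/text-55-fun"))
```

A visitor who forgets the dashes gets sent to the dashed page, while a request for the dashed page itself is served directly with no redirect.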
Again, thanks very much
Newell
-
Take this: for random names and brands, they may not even recognize it properly and will fall back to the next best guess...
Ever try googling a brand-new online site or brand with an abstract name? You get corrections and suggestions.
It is NOT ADVISED to make text sticktogetherlikethis in ANY language. It's just a best practice across the board, content and URL.
As Paul said, I do not like the whole "goofing" around situation with machine learning and Google's current artificial intelligence. It's not nearly perfect technology, and you can be its statistical miss...
-
The problem here, as I see it, NY60, is that there's no way of knowing for sure. The engines are "supposed" to be able to parse the text strings, but there's no way they're infallible, and in fact there's no way of knowing if they're even acceptably good at it under the conditions that are important to you.
For that reason, I always opt to make it as hard as possible for the engines to goof. In this case, that means dividing at word boundaries with a hyphen in strings like URLs and other code. Spaces are problematic because in URLs they get percent-encoded as %20, which can cause yet more havoc - though in straight content like meta-titles, meta-descriptions, alt text and page content they are fine.
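As a rough sketch of the difference, the snippet below percent-encodes a phrase containing spaces and, alongside it, builds a hyphen-separated slug from the same phrase. `quote` is the standard-library function from `urllib.parse`; `hyphen_slug` is a hypothetical helper written just for this example, not part of any SEO tool.

```python
import re
from urllib.parse import quote


def hyphen_slug(phrase):
    """Lower-case the phrase and join its alphanumeric runs with hyphens."""
    words = re.findall(r"[a-z0-9]+", phrase.lower())
    return "-".join(words)


phrase = "text 55 fun"
print(quote(phrase))        # spaces become %20
print(hyphen_slug(phrase))  # word boundaries survive as hyphens
```

For this phrase the encoded form comes out as text%2055%20fun, where the "%20" runs straight into the "55", while the slug stays readable as text-55-fun.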
There's my $0.02. Whattaya think?
Paul