Novice Question - Can Browsers realistically distinguish words within concatenated strings, e.g. text55fun, or should one use text-55-fun? What about foreign languages, especially more obscure ones like Finnish, which Google Translate often mistranslates?
-
I am attempting to understand what is realistically possible within Google, Yahoo and Bing as they search websites for keywords. Technically, my understanding is that they should be able to distinguish common words within concatenated strings, although word boundaries can be confused when a string is ambiguous. So in the simple example of text55fun, do search engines actually distinguish text, 55 and fun separately? There are practical processing, database and algorithm limitations that might turn a technically possible solution into an unrealistic one at a commercial scale.
What about more ambiguous strings like stringsstrummingstrongly: would that be parsed as string s strummings trongly, or strings strummings trongly, or strings strumming strongly? Does one need to use dashes or underscores to make it unambiguous to the search engine? My guess is that the engine would recognize the dash or space and better understand the word boundaries, yet ignore the dash or underscore when treating the string as a whole.
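To make the ambiguity concrete, here is a minimal Python sketch of dictionary-based word segmentation. It is only an illustration of the general technique, not a description of how Google, Yahoo or Bing actually parse strings; the tiny vocabulary and the function names are hypothetical.

```python
import re

# Hypothetical toy vocabulary. A real engine would use a far larger lexicon
# plus word-frequency statistics to rank the candidate splits.
VOCAB = {"string", "strings", "s", "strumming", "strummings", "strongly"}


def split_alnum(token):
    """Split a token wherever a run of letters meets a run of digits."""
    return re.findall(r"[A-Za-z]+|\d+", token)


def segmentations(text, vocab=VOCAB):
    """Return every way to split `text` into words drawn from `vocab`."""
    if not text:
        return [[]]  # one valid split of the empty string: no words
    results = []
    for end in range(1, len(text) + 1):
        head = text[:end]
        if head in vocab:
            # Keep this prefix only if the remainder can also be split.
            for tail in segmentations(text[end:], vocab):
                results.append([head] + tail)
    return results


print(split_alnum("text55fun"))
for words in segmentations("stringsstrummingstrongly"):
    print(" ".join(words))
```

With this vocabulary the letter/digit case splits cleanly, but the all-letters string yields more than one valid parse, so something extra (statistics, or an explicit separator such as a hyphen) is needed to choose between them.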
Thanks in advance to whoever can provide any insight to an old coder who is new to this field.
-
Omid,
Thanks very much for the fast response.
Totally agree about Google AI - my objective is a completely white-hat, original, content-rich website, which their AI encompasses - Google is attempting to weed out folks who are avoiding the hard work associated with building these types of websites.
Not everything in business has a brand-building goal - only so many brands can rise to the top. I'm looking from a very different perspective in this case and would be very happy to find my 1,000 to 5,000 specific people or so. From that point, I would then have my work cut out from a business standpoint, nothing to do with SEO or anything related to this field. Just simple blocking and tackling - sales, customer service, marketing, delivery etc.
I'm an old-school guy (sell high-value, high-margin products and services to customers who value you and want to stay with you over long periods of time because of great customer service) and this is an experiment for me using a new-school sales tactic. In my experience it does not take a lot of those types of customers to build a very nice small business. The really hard part is actually creating the organization which genuinely delivers on those commitments consistently over long periods of time and retains those customers.
All the best
Newell
-
Paul,
Thanks very much for the prompt response - I love your part of Canada by the way - and have driven through your town on the way to Jasper a number of times. I think that it's the most majestic part of the Rockies one can see without a plane.
Your response is at the heart of my question - the difference between what is possible and what is practical, particularly at the speed at which responses occur. I wasn't aware of the space issue, very good point!
As a variation, were the concatenated string to become part of a URL for an original content-rich website, it sounds as though both the dashed and un-dashed URLs would be required to be safe (because people tend not to type dashes, or forget them). In that event, would it matter to the search engines which URL is 301 redirected?
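For illustration only, the two-URL setup could be sketched as below in Python. This assumes the dashed form is chosen as the canonical URL and the un-dashed form permanently redirects to it; the paths and the function name are hypothetical, and the sketch does not settle which direction the search engines prefer.

```python
def redirect_for(path, canonical="/text-55-fun", aliases=("/text55fun",)):
    """Return (status, location) for a request path.

    Any alias answers with a 301 (permanent redirect) to the canonical
    path; every other path is served as-is with a 200.
    """
    if path in aliases:
        return 301, canonical
    return 200, path


print(redirect_for("/text55fun"))     # un-dashed alias: redirected
print(redirect_for("/text-55-fun"))   # canonical URL: served directly
```

Whichever form is picked as canonical, the point of the 301 is that only one of the two URLs ends up serving the content.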
Again, thanks very much
Newell
-
Take this: for random names and brands, they may not even recognize the name properly and will fall back to the next-best guess...
Ever try googling a brand-new online site or brand that has an abstract name? You get corrections and suggestions.
It is NOT ADVISED to make text sticktogetherlikethis in ANY language. It's just a best practice across the board, in content and URLs.
As Paul said, I do not like the whole "goofing" around situation with machine learning and Google's current artificial intelligence. It's not nearly perfect technology, and you can be its statistical miss...
-
The problem here, as I see it, NY60, is that there's no way of knowing for sure. The engines are "supposed" to be able to parse the text strings, but there's no way they're infallible, and in fact there's no way of knowing if they're even acceptably good at it under the conditions that are important to you.
For that reason, I always opt to make it as hard as possible for the engines to goof. In this case, that means dividing at word boundaries with a hyphen in strings like URLs and other code. Spaces are problematic because in URLs they get percent-encoded as %20, which can cause yet more havoc - though in straight content like meta-titles, meta-descriptions, alt text and page content they are fine.
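To illustrate the point about spaces in URLs, here is a small Python sketch comparing a percent-encoded phrase with a hyphenated slug. The `slugify` helper is a hypothetical name written for this example, not part of any library; `quote` is the standard-library function from `urllib.parse`.

```python
import re
from urllib.parse import quote


def slugify(phrase):
    """Lowercase the phrase and join its alphanumeric runs with hyphens."""
    return "-".join(re.findall(r"[a-z0-9]+", phrase.lower()))


# A space in a URL path is percent-encoded as %20.
print(quote("strings strumming strongly"))    # strings%20strumming%20strongly

# Hyphens keep the word boundaries visible without any encoding.
print(slugify("Strings Strumming Strongly"))  # strings-strumming-strongly
```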
There's my $0.02. Whattaya think?
Paul