Novice Question - Can Browsers realistically distinguish words within concatenated strings e.g. text55fun or should one use text-55-fun? What about foreign languages especially more obscure ones like Finnish which Google Translate often miss-translates?
-
I am attempting to understand what is realistically possible within Google, Yahoo and Bing as they search websites for KeyWords. Technically my understanding is that they should be able to distinguish common words within concatenated strings, although there can be confusion between word boundaries when ambiguity is involved. So in the simple example of text55fun, do search engines actually distinguish text, 55 and fun separately? There are practical processing, databased and algorithm limitations that might turn a technically possible solution into a unrealistic one at a commercial scale.
What about more ambiguous strings like stringsstrummingstrongly would that be parsed as string s strummings trongly or strings strummings trongly or strings strumming strongly? Does one need to use dashes or underscores to make it unambiguous to the search engine? My guess is that the engine would recognize the dash or space and better understand the word boundaries yet ignore the dash or underscore from an overall concatenated string perspective.
Thanks in advance to whoever can provide any insight to an old coder who is new to this field.
-
Omid,
Thanks very much for the fast response.
Totally agree about about Google AI - my objective is completely white hat, original content-rich website which their AI encompasses - Google is attempting to weed out folks who are avoiding the hard work associated with building these types of websites.
Not everything in business has a brand-building goal - only so many brands can rise to the top. I'm looking from a very different perspective in this case and would be very happy to find my 1,000 to 5,000 specific people or so. From that point, I would then have my work cut out from a business standpoint, nothing to do with SEO or anything related to this field. Just simple blocking and tackling - sales,customer service, marketing, delivery etc.
I'm an old-school guy (sell high-value, high-margin products and services to customers who value you and want to stay with you over long periods of time because of great customer service) and this is an experiment for me using a new-school sales tactic. In my experience it does not take a lot of those types of customers to build a very nice small business. The really hard part is actually creating the organization which genuinely delivers on those commitments consistently over long periods of time and retains those customers.
All the best
Newell
-
Paul,
Thanks very much for the prompt response - I love your part of Canada by the way - and have driven through your town on the way to Jasper a number of time. I think that It's the most majestic part of the Rockies one can see without a plane.
Your response is at the heart of my question - the difference between what is possible and practical, particularly at the speed in which response occur. I wasn't aware of the space issue, very good point!
As a variation, were the concatenated string to become part of a URL for an original content-rich website, it sounds as though both the dashed and un-dashed URL would be required to be safe (because people tend not to type dashes or forget). In that event, would it matter to the search engines which URL is 301 redirected?
Again, thanks very much
Newell
-
take this, for random names and brands, they may not even recognize it proper and find the next best guess to it...
ever try googling a brand new online site / brand that is an abstract name? you get corrections and suggestions?
NOT ADVISED to make text sticktogetherlikethis in ANY language. it's just a best practice. across the board, content and url.
As Paul said, I do not like the whole "goofing" around situation with machine learning and Google's current artificial intel. its not nearly perfect technology and you can be its statistical miss...
-
The problem here,. as I see it, NY60, is that there's no way of knowing for sure. The engines are "supposed" to be able to parse the text strings, but there's no way they're infallible and in fact there's no way of knowing if they're even acceptably good at it under the conditions that are important to you.
For that reason, I always opt to make it as hard as possible for the engines to goof. In this case, that means dividing at word boundaries with a hyphen in strings like URLs and other code. Spaces are problematic because they often get encoded into html entity %20 which can cause yet more havoc - though in straight content like meta-titles, meta-descriptions, alt text and page content they are fine.
There's my $0.02. Whattaya think?
Paul
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
For FAQ Schema markup, do we need to include every FAQ that is on the page in the markup, or can we use only selected FAQs?
The website FAQ page we are working on has more than 50 FAQs. FAQ Schema guidelines say the markup must be an exact match with the content. Does that mean all 50+ FAQs must be in the mark-up? Or does that mean the few FAQs we decided to put in the markup are an exact match?
Intermediate & Advanced SEO | | PKI_Niles0 -
How often is google pushing data ?
Hello, I know that google index quickly but how often is the data pushed into their search results ? Thank you,
Intermediate & Advanced SEO | | seoanalytics1 -
Can Google Bot View Links on a Wix Page?
Hi, The way Wix is configured you can't see any of the on-page links within the source code. Does anyone know if Google Bots still count the links on this page? Here is the page in question: https://www.ncresourcecenter.org/business-directory If you do think Google counts these links, can you please send me URL fetcher to prove that the links are crawlable? Thank you SO much for your help.
Intermediate & Advanced SEO | | Fiyyazp0 -
Does Google frown on using 3 different page titles with same content to secure the top 3 results in SERPs?
Is it frowned upon by Google to create 3 different pages with the sames content yet different titles to secure the top three results in SERPs? For example: Luxury Care Homes in Liverpool Care Homes in Liverpool Private Care Homes in Liverpool The page titles are different with slightly different meta data but the user content is exactly the same, would this be considered a cheeky win or negative to rankings?
Intermediate & Advanced SEO | | TrustedCare.co.uk1 -
My Website Has a Google Penalty, But I Can't Disavow Links
I have a client who has definitely been penalized, rankings dropped for all keywords and hundreds of malicious backlinks when checked with WebMeUp....However, when I run the backlink portfolio on Moz, or any other tool, they don't appear anyone, and all the links are dead when I click on the actual URL. That being said, I can't disavow links that don't exist, and they don't show up in Webmaster Tools, but I KNOW this site has been penalized. Also- I noticed this today (attached). Any suggestions? I've never come across this issue before. xT6JNJC.png
Intermediate & Advanced SEO | | 01023450 -
Does Google index more than three levels down if the XML sitemap is submitted via Google webmaster Tools?
We are building a very big ecommerce site. The site has 1000 products and has many categories/levels. The site is still in construccion so you cannot see it online. My objective is to get Google to rank the products (level 5) Here is an example level 1 - Homepage - http://vulcano.moldear.com.ar/ Level 2 - http://vulcano.moldear.com.ar/piscinas/ Level 3 - http://vulcano.moldear.com.ar/piscinas/electrobombas-para-piscinas/ Level 4 - http://vulcano.moldear.com.ar/piscinas/electrobombas-para-piscinas/autocebantes.html/ Level 5 - Product is on this level - http://vulcano.moldear.com.ar/piscinas/electrobombas-para-piscinas/autocebantes/autocebante-recomendada-para-filtros-vc-10.html Thanks
Intermediate & Advanced SEO | | Carla_Dawson0 -
Canonical links apparently not used by google
hi, I do have an ecommerce website (www.soundcreation.ro) which in the last 3 months had a drop in the SERP. Started to look around in GWT what is happening. Google is reporting a lot of duplicate meta-tags (and meta-titles problem). But 99% of them had already canonical links setted. I tried to optimize my product listings with the new "prev", "next" tags and introduced also the "view-all" canonical link to help Google identify the appropiate product listing pages. SeoMoz is not reporting thos duplicate meta issues. Here is an example of the same page with different links, but with the same common canonical and reported by GWT "duplicate title tag": http://www.soundcreation.ro/chitare-chitari-electroacustice-cid10-pageall/http://www.soundcreation.ro/chitare-chitari-electroacustice-cid10/http://www.soundcreation.ro/chitare-chitari-electroacustice-cid10_999/http://www.soundcreation.ro/chitare-electro-acustice-cid10_1510/What could be the issue?- only that gwt is not refreshing as should be, keeping old errors?- if so, then there is an other serious issue because of why our PR is dropping on several pages?- do we have other problem with the site, which ends up with google penalizing us? Thank you for your ideas!
Intermediate & Advanced SEO | | bjutas0 -
Is there any delay between crawling a page by google and displaying of the ratings in rich snippet of the results in google?
Is there any delay between crawling a page by google and displaying of the ratings in rich snippet of the results in google?
Intermediate & Advanced SEO | | NEWCRAFT0