Can anyone help me understand why google is "Not Selecting" a large number of my webpages to include when crawling my site.
-
When looking through my google webmaster tools, I clicked into the advanced settings under index status and was surprised to see that google has marked around 90% of my pages on my site as "Not Selected" when crawling. Please take a look and offer any suggestions.
-
Thank you. Thank you. Thank you. That makes so much sense. This is also the issue I am having with my communities and cities pages, pointing at my http://luxuryhomehunt.com/homes-for-sale page.
Does that make sense?
-
Thanks for the response. The pop up is running in java, and from what I have been told search engines can crawl pages so long as the opt in is running in java. Typically a visitor would hit one of our landing pages such as http://luxuryhomehunt.com/homes-for-sale/Longwood/alaqua-lakes.html where they can find information about the specific community they are searching for then if they click on a listing they would be prompted to opt in.
Do you think there may be any correlation to me using or not using canonical tags? Another thing I was wondering is if it has anything to do with, my handling of pages 2,3,4,5 etc of a city or community with more than ten listings.
I am not sure as to why your connection would have been refused, I am currently running a xml sitemap generator and maybe that had something to do with it. Either way, I am super grateful for your help and for you looking at this. I am very new to SEO and trying to learn my way through as much as possible.
-
Hmm, I just tried to click on a listing in Google but I was served a popup which required that I enter in my contact information before I could access the site http://luxuryhomehunt.com/view-property/40096215. Did you just add this pop up? Since there is no way for users to opt out of entering in contact information to view a listing, then it may be possible that the search engines are being blocked as well.
I also tried crawling the site with Screaming Frog SEO Spider and Xenu, but my connection was refused... not sure if my IP was blocked or if the site is blocking crawlers, but my guess is the search engines may be having some trouble accessing all of the pages on your site.
At the very least, I'd recommend removing that popup since it's bad for user experience and may be causing problems with the search engines.
EDIT - I did some more digging and looked the Google cache for one of your listings - http://webcache.googleusercontent.com/search?q=cache:L6LzTqj9gQUJ:luxuryhomehunt.com/view-property/40445850+&cd=6&hl=en&ct=clnk&gl=us&client=firefox-a. On this page, you have the rel="canonical" tag set to http://luxuryhomehunt.com/view-property so that tells the search engines that all of your property listing pages should use that canonical URL, which explains why most of your pages are "Not Selected" per Google -
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=2642366
http://support.google.com/webmasters/bin/answer.py?hl=en&answer=139066
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Where did the "Location" go, on Google SERP?
In order to emulate different locations, I've always done a Google query, then used the "Location" button under "Search Tools" at the top of the SERP to define my preferred location. It seems to have disappeared in the past few days? Anyone know where it went, or if it's gone forever? Thanks!
Technical SEO | | measurableROI0 -
SEO question: Need help on rel="alternate" hreflang="x"
Hi all, we have webcontent in 3 languages (official belgian yellow pages), we use a separate domain per language, these are also our brands.
Technical SEO | | TruvoDirectories
ex. for the restaurant Wagamamahttp://www.goudengids.be/wagamama-antwerpen-2018/ corresponds to nl-be
http://www.pagesdor.be/wagamama-antwerpen-2018/ corresponds to fr-be
http://www.pagesdor.be/wagamama-antwerpen-2018/ corresponds to en-be The trouble is that sometimes I see the incorrect urls appearing when doing a search in google, ex. when searching on google.be (dutch=nederlands=nl-be) I see the www.pagesdor.be version appearing (french) I was trying to find a fix for this within https://support.google.com/webmasters/answer/189077?hl=nl , but this only seems to apply to websites which use SUBdomains for language purposes. I'm not sure if can work for DOMAINS. Can anyone help me out? Kind regards0 -
How Google can interpret all "hreflag" links into HTML code
I've found the solution. The problem was that did not put any closing tag into the HTML code....
Technical SEO | | Red_educativa0 -
Does using data-href="" work more effectively than href="" rel="nofollow"?
I've been looking at some bigger enterprise sites and noticed some of them used HTML like this: <a <="" span="">data-href="http://www.otherodmain.com/" class="nofollow" rel="nofollow" target="_blank"></a> <a <="" span="">Instead of a regular href="" Does using data-href and some javascript help with shaping internal links, rather than just using a strict nofollow?</a>
Technical SEO | | JDatSB0 -
Google ignores Meta name="Robots"
Ciao from 24 degrees C wetherby UK, On this page http://www.perspex.co.uk/products/palopaque-cladding/ this line was added to block indexing: But it has not worked, when you google "Palopaque PVC Wall Cladding" the page appears in the SERPS. I'm going to upload a robots txt file in a second attempt to block indexing but my question is please:
Technical SEO | | Nightwing
Why is it being indexed? Grazie,
David0 -
CDN Being Crawled and Indexed by Google
I'm doing a SEO site audit, and I've discovered that the site uses a Content Delivery Network (CDN) that's being crawled and indexed by Google. There are two sub-domains from the CDN that are being crawled and indexed. A small number of organic search visitors have come through these two sub domains. So the CDN based content is out-ranking the root domain, in a small number of cases. It's a huge duplicate content issue (tens of thousands of URLs being crawled) - what's the best way to prevent the crawling and indexing of a CDN like this? Exclude via robots.txt? Additionally, the use of relative canonical tags (instead of absolute) appear to be contributing to this problem as well. As I understand it, these canonical tags are telling the SEs that each sub domain is the "home" of the content/URL. Thanks! Scott
Technical SEO | | Scott-Thomas0 -
Remove Site from Google
How can I get my website out of google? I want all pages completely gone. Thanks!
Technical SEO | | tylerfraser0 -
Will training videos available on the "members only" section of a site contribute to the sites ranking?
Hello, I got asked a question recently as to whether training videos on the deeper pages of a website (that you can only access if you are a member and log in) will help with the sites ranking. On the SEOMoz software these deeper pages have been crawled as far as I can tell with errors reported on pages from the "members only" section of the site, leading me to believe the members only pages and their content will contribute to the sites overall ranking profile. I have suggested uploading the informational videos on the main pages of the site for now, making them accessible to all visitors and putting them in a more obvious place to encourage more sharing and views, however I've also said I would check it out with some experts so any information will be greatly appreciated! Many thanks 🙂 Charlotte
Technical SEO | | CharlotteWaller0