Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Does a KML file have to be indexed by Google?
-
I'm currently using the Yoast Local SEO plugin for WordPress to generate my KML file which is linked to from the GeoSitemap. Check it out http://www.holycitycatering.com/sitemap_index.xml.
A competitor of mine just told me that this isn't correct and that the link to the KML should be a downloadable file that's indexed in Google. This is the opposite of what Yoast is saying... "He's wrong.
And the KML isn't a file, it's being rendered. You wouldn't want it to be indexed anyway, you just want Google to find the information in there.
What is the best way to create a KML? Should it be indexed?
-
There isn't really a good way that I know of currently to verify Google has indexed it...
-
Thanks for getting back! I wanted to show you a screenshot of my GWT. The geo_sitemap.xml is crawled with no errors but the locations.kml that it's linking to is never seen. That being said, how it the KML being seen by Google? Is there some way that I can verify?
-
Yeah we might as well ditch that
but yeah it's crawled as a normal XML file as it doesn't give any errors at all in GWT.
-
Thanks for chiming in on this, Joost.
I wasn't 100% certain that geo_sitemap.xml was a problem, but the xmlns reference to http://www.google.com/geo/schemas/sitemap/1.0 in line 2 I thought might be throwing Google off - I take it they'll just ignore this and crawl the doc as any other XML file?
Thanks again.
-
I'm sorry to say Mike above is wrong. He's been deceived by the file name and didn't actually look to see what it did I guess. Our geo_sitemap.xml file is a normal XML sitemap, linking to the KML file, it's not actually a geo sitemap, it's just named that way for historic reasons.
See the first question on this thread and Susan Moskwa's answer: https://plus.google.com/+SusanMoskwa/posts/CmZejMkLN4r
-
Hi Anthony,
Sorry for the delay on this. In migrating over to the new Moz.com platform, Q&A messaging for admins has been a bit spotty.
You are right - geositemap.xml is using the "geo sitemap" protocol that Google no longer supports. This may cause Google not to follow the reference to locations.kml contained therein.
Unfortunately I don't have an alternative recommendation to Yoast's SEO plugin for this. Manually creating your XML may be your best option, or using software like GSiteCrawler to speed up the process, then manually add your KML file.
If this output from Yoast's plugin can't be manually configured, and the KML file is important enough to your goals that you consider it a top priority to have it crawled, it seems a clear choice to me to move away from this plugin and find a better solution. Unfortunately, I haven't dealt with KML files for WordPress in the past. I'd probably recommend site crawling software to speed up the process, then switching to manual to add this in.
Best,
Mike -
Hi Mike,
I think I'm starting to understand where you are going with this. It sounds like I need to index the KML using a link from the footer of the site instead of from the geositemap that Yoast creates since Google won't crawl it or past it.
I read on Google Sitemap page:
"We recommmend that you tell Google about geographically-based URLs by including them in a regular Web Sitemap."
If the KML is referrenced in the sitemap_index.xml, then it's being seen by Google but if the geositemap.xml is between the sitemap_index.xml and the locations.kml, then it is hidden from Google.
All of this is being controlled by the an SEO plugin for WordPress from Yoast. I am wondering if I need to create the KML manually and upload to the sitemap or if should I let Yoast continue to render it. Mike, do you use a specific tool/plugin for KML creation for Wordpress websites?
-
Hi Anthony,
"Indexed in Google" is irrelevant here. Sitemap protocol and the searchable web index have little to do with each other directly (sitemap files are not searchable in the web index).
If you're following the instructions on this page, you're good. Geo sitemap tags are no longer supported by Google.
Note: When I click on the link to http://www.holycitycatering.com/geo_sitemap.xml your server returns a "page not found" error, so I'm not sure where your geo URLs are located...
-Mike
-
If google webmaster tools doesn't return an error on when you test the sitemap then it should be indexing it fine.
-
How do you know know if Google can see the KML? It's not been listed in any of the search results for our sites using this plugin and this competitor is telling my client I'm wrong because you can't see the file in Google Webmasters.
I guess the main question is if Google isn't indexing the KML and Webmaster Tools doesn't index it, how do we know it sees the file?
-
There's one rule in SEO, Yoast is always right
(not only because he's Dutch). But in this case he's right. By mentioning the KML file to Google it knows where it could be found. So it will trigger a visit to the file which get generated on the fly + by doing this it prevents you from being indexed.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Cache
So, when I gain a link I always check to see if the page that is linking is in the Google cache. I've noticed recently that more and more pages are actually not showing up in Google's cache, yet still appear in search results. I did read an article from someone whoo works at Google a few weeks back that there is sometimes an error with the cache and occasionally the cache will not display. This week, my own website isn't showing up in the cache yet I'm still ranking in SERP's. I'm not worried about it, mostly whitehat, but has there been any indication that Google are phasing out the ability to check cache's of websites?
Algorithm Updates | | ThorUK0 -
Very strange, inconsistent and unpredictable Google ranking
I have been searching through these forums and haven't come across someone that faces the same issue I am. The folks on the Google forums are certain this is an algorithm issue, but I just can't see the logic in that because this appears to be an issue fairly unique to me. I'll take you through what I've gone through. Sorry for it being long. Website URL: https://fenixbazaar.com 1. In early February, I made the switch to https with some small hiccups. Overall however the move was smooth, had redirects all in place, sitemap, indexing was all fine. 2. One night, my organic traffic dropped by almost 100%. All of my top-ranking articles completely disappeared from rank. Top keyword searches were no longer yielding my best performing articles on the front page of results, nor on the last page of results. My pages were still being indexed, but keyword searches weren't delivering my pages in results. I went from 70-100 active users to 0. 3. The next morning, everything was fine. Traffic back up. Top keywords yielding results for my site on the front page. All was back to normal. Traffic shot up. Only problem was the same issue happened that night, and again for the next three nights. Up and down. 4. I had a developer and SEO guy look into my backend to make sure everything was okay. He said there were some redirection issues but nothing that would cause such a significant drop. No errors in Search Console. No warnings. 5. Eventually, the issue stopped and my traffic improved back to where it was. Then everything went great: the site was accepted into Google News, I installed AMP pages perfectly and my traffic boomed for almost 2 weeks. 6. At this point numerous issues with my host provider, price increases, and incredibly outdated cpanel forced me to change hosts. I did without any issues, although I lost a number of articles albeit low-traffic ones in the move. These now deliver 404s and are no longer indexed in the sitemap. 7. After the move there were a number of AMP errors, which I resolved and now I sit at 0 errors. Perfect...or so it seems. 8. Last week I applied for hsts preload and am awaiting submission. My site was in working order and appeared set to get submitted. I applied after I changed hosts. 9. The past 5 days or so has seen good traffic, fantastic traffic to my AMP pages, great Google News tracking, linking from high-authority sites. Good performance all round. 10. I wake up this morning to find 0 active people on my site. I do a Google search and notice my site isn't even the first result whenever I do an actual search for my name. The site doesn't even rank for its own name! My site is still indexed but search results do not yield results for my actual sites. Check Search Console and realised the sitemap had been "processed" yesterday with most pages indexed, which is weird because it was submitted and processed about a week earlier. I resubmitted the sitemap and it appears to have been processed and approved immediately. No changes to search results. 11. All top-ranking content that previously placed in carousal or "Top Stories" in Google News have gone. Top-ranking keywords no longer bring back results with my site: I went through the top 10 ranking keywords for my site, my pages don't appear anywhere in the results, going as far back as page 20 (last page). The pages are still indexed when I check, but simply don't appear in search results. It's happening all over again! Is this an issue any of you have heard of before? Where a site is still being indexed, but has been completely removed from search results, only to return within a few hours? Up and down? I suspect it may be a technical issue, first with the move to https, and now with changing hosts. The fact the sitemap says processed yesterday, suggests maybe it updated and removed the 404s (there were maybe 10), and now Google is attempting to reindexed? Could this be viable? The reason I am skeptical of it being an algorithm issue is because within a matter of hours my articles are ranking again for certain keywords. And this issue has only happened after a change to the site has been applied. Any feedback would be greatly appreciated 🙂
Algorithm Updates | | fenixbazaar0 -
How long for google to de-index old pages on my site?
I launched my redesigned website 4 days ago. I submitted a new site map, as well as submitted it to index in search console (google webmasters). I see that when I google my site, My new open graph settings are coming up correct. Still, a lot of my old site pages are definitely still indexed within google. How long will it take for google to drop off or "de-index" my old pages? Due to the way I restructured my website, a lot of the items are no longer available on my site. This is on purpose. I'm a graphic designer, and with the new change, I removed many old portfolio items, as well as any references to web design since I will no longer offering that service. My site is the following:
Algorithm Updates | | rubennunez
http://studio35design.com0 -
Where can I find a breakdown of google search volume by specific industry/vertical? For example, what % of people searching in google are looking for housing? Cars? Restaurants?
I"m looking for specific breakdowns of search volume in google by: #1 Vertical (Shopping/restaurants/Services etc). For example, how many people are searching in google for information pertaining to restaurants per month? Search volume for all of 2012, 2013, 2014? #2 More granular categories within verticals, people searching for: books,apartment rentals,cellphones) Is there a breakdown of google search somewhere online that gives this type of information? Thank you MOZ community, really appreciate it!
Algorithm Updates | | AppleSauceRules0 -
Homepage Index vs Home vs Default?
Should your home page be www.yoursite.com/index.htm or home.htm or default.htm on an apache server? Someone asked me this, and I have no idea. On our wordpress site, I have never even seen this come up, but according to my friend, every homepage HAS to be one of those three. So my question is which one is best for an apache server site AND does it actually have to be one of those three? Thanks, Ruben
Algorithm Updates | | KempRugeLawGroup0 -
Google is forcing a 301 by truncating our URLs
Just recently we noticed that google has indexed truncated urls for many of our pages that get 301'd to the correct page. For example, we have:
Algorithm Updates | | mmac
http://www.eventective.com/USA/Massachusetts/Bedford/107/Doubletree-Hotel-Boston-Bedford-Glen.html as the url linked everywhere and that's the only version of that page that we use. Google somehow figured out that it would still go to the right place via 301 if they removed the html filename from the end, so they indexed just: http://www.eventective.com/USA/Massachusetts/Bedford/107/ The 301 is not new. It used to 404, but (probably 5 years ago) we saw a few links come in with the html file missing on similar urls so we decided to 301 them instead thinking it would be helpful. We've preferred the longer version because it has the name in it and users that pay attention to the url can feel more confident they are going to the right place. We've always used the full (longer) url and google used to index them all that way, but just recently we noticed about 1/2 of our urls have been converted to the shorter version in the SERPs. These shortened urls take the user to the right page via 301, so it isn't a case of the user landing in the wrong place, but over 100,000 301s may not be so good. You can look at: site:www.eventective.com/usa/massachusetts/bedford/ and you'll noticed all of the urls to businesses at the top of the listings go to the truncated version, but toward the bottom they have the full url. Can you explain to me why google would index a page that is 301'd to the right page and has been for years? I have a lot of thoughts on why they would do this and even more ideas on how we could build our urls better, but I'd really like to hear from some people that aren't quite as close to it as I am. One small detail that shouldn't affect this, but I'll mention it anyway, is that we have a mobile site with the same url pattern. http://m.eventective.com/USA/Massachusetts/Bedford/107/Doubletree-Hotel-Boston-Bedford-Glen.html We did not have the proper 301 in place on the m. site until the end of last week. I'm pretty sure it will be asked, so I'll also mention we have the rel=alternate/canonical set up between the www and m sites. I'm also interested in any thoughts on how this may affect rankings since we seem to have been hit by something toward the end of last week. Don't hesitate to mention anything else you see that may have triggered whatever may have hit us. Thank you,
Michael0 -
Stop google indexing CDN pages
Just when I thought I'd seen it all, google hits me with another nasty surprise! I have a CDN to deliver images, js and css to visitors around the world. I have no links to static HTML pages on the site, as far as I can tell, but someone else may have - perhaps a scraper site? Google has decided the static pages they were able to access through the CDN have more value than my real pages, and they seem to be slowly replacing my pages in the index with the static pages. Anyone got an idea on how to stop that? Obviously, I have no access to the static area, because it is in the CDN, so there is no way I know of that I can have a robots file there. It could be that I have to trash the CDN and change it to only allow the image directory, and maybe set up a separate CDN subdomain for content that only contains the JS and CSS? Have you seen this problem and beat it? (Of course the next thing is Roger might look at google results and start crawling them too, LOL) P.S. The reason I am not asking this question in the google forums is that others have asked this question many times and nobody at google has bothered to answer, over the past 5 months, and nobody who did try, gave an answer that was remotely useful. So I'm not really hopeful of anyone here having a solution either, but I expect this is my best bet because you guys are always willing to try.
Algorithm Updates | | loopyal0 -
Rankings changing every couple of MINUTES in Google?
We've been experiencing some unusual behaviour in the Google.co.uk SERPs recently... Basically, the ranking of some of our websites for certain keywords appears to be changing by the minute. For example, doing a search for "our keyword" might show us at #20. Then a few minutes later, doing the same search shows us at #14, and then the same search a few minutes later shows us at #26, and then sometimes we're not ranked at all, etc etc. I know the algorithm changes a lot, but does it really change every couple of minutes? Has anyone else experienced this kind of behaviour in the SERPs? What could be causing it to happen?
Algorithm Updates | | d4online0