Trouble Indexing one of our sitemaps
-
Hi everyone, thanks for your help. Any feedback is appreciated. We have three separate sitemaps:
blog/sitemap.xml
events.xml
sitemap.xml
Unfortunately, we keep trying to get our events sitemap picked up and it just isn't happening for us. Any input on what could be going on?
-
There also seem to be URLs that are duplicated:
/new-york-city-tickets/elektra-theatre-tickets/50-shades-the-musical-mar-21-2015-1283412.html
/new-york-city-tickets/elektra-theatre-tickets/50-shades-the-musical-mar-25-2015-1283241.html
/new-york-city-tickets/elektra-theatre-tickets/50-shades-the-musical-mar-27-2015-1283246.html
=> 3 different URLs, but the content seems to be identical on these pages.
You could try a full crawl with Screaming Frog and check for semi-duplicates on your site (pages with an identical H1, meta description, and so on).
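For anyone who wants to run the same kind of check without a crawler, here is a minimal sketch in Python. It assumes you have already downloaded the HTML of the pages you want to compare; the names `page_signature` and `find_semi_duplicates` are made up for this example and are not part of any tool mentioned in this thread. It groups pages that share the same title, first H1, and meta description.

```python
from collections import defaultdict
from html.parser import HTMLParser


class SignatureParser(HTMLParser):
    """Collects the <title>, the first <h1>, and the meta description of a page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.h1 = ""
        self.description = ""
        self._current = None   # tag whose text is currently being captured
        self._seen_h1 = False  # only the first <h1> counts

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "title":
            self._current = "title"
        elif tag == "h1" and not self._seen_h1:
            self._current = "h1"
        elif tag == "meta" and (attrs.get("name") or "").lower() == "description":
            self.description = (attrs.get("content") or "").strip()

    def handle_endtag(self, tag):
        if tag == self._current:
            if tag == "h1":
                self._seen_h1 = True
            self._current = None

    def handle_data(self, data):
        if self._current == "title":
            self.title += data
        elif self._current == "h1":
            self.h1 += data


def page_signature(html):
    """Return (title, first H1, meta description) for one HTML document."""
    parser = SignatureParser()
    parser.feed(html)
    return (parser.title.strip(), parser.h1.strip(), parser.description)


def find_semi_duplicates(pages):
    """pages maps URL -> HTML. Returns groups of URLs that share a signature."""
    groups = defaultdict(list)
    for url, html in pages.items():
        groups[page_signature(html)].append(url)
    return [sorted(urls) for urls in groups.values() if len(urls) > 1]
```

Calling `find_semi_duplicates` on a dict of URL to HTML returns one list per group of pages whose title, H1, and meta description all match. Pages that differ only in date or price in the body would still land in the same group, which is the situation described above.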
-
If I do a site:yoursite.com/minneapolis-tickets in Google I get results - so these pages seem to be in the index, even if this is not shown on the sitemap level in WMT.
I notice you use noindex on a substantial number of pages (for expired events). It may be better to use the unavailable_after meta tag, which tells Google the date after which a page should no longer appear in results. See also: http://searchenginewatch.com/sew/news/2334932/ecommerce-seo-tips-for-unavailable-products-from-googles-matt-cutts
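As a rough illustration of the unavailable_after suggestion, the sketch below builds the tag from an event's end time. The helper name `unavailable_after_tag` is hypothetical, and the choice of an ISO 8601 timestamp in UTC is an assumption on my part (Google's documentation lists ISO 8601 among the accepted date formats); how the tag gets into your page templates depends on your platform.

```python
from datetime import datetime, timezone


def unavailable_after_tag(expires_at):
    """Build a robots meta tag asking Google to drop the page after expires_at.

    expires_at must be a timezone-aware datetime; it is converted to UTC
    and written as an ISO 8601 timestamp (an assumed format choice).
    """
    if expires_at.tzinfo is None:
        raise ValueError("expires_at must be timezone-aware")
    stamp = expires_at.astimezone(timezone.utc).isoformat(timespec="seconds")
    return f'<meta name="robots" content="unavailable_after: {stamp}">'
```

The returned string would go in the `<head>` of each event page, with the date set to some point after the event has ended.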
-
Update - if your site is identical to your username, the cause is almost certainly related to the lack of indexable content on these pages. The event pages, while very user-friendly and valuable for end users, are too light for Google in terms of content. If you look at the source code, most of these pages are nearly identical apart from the title (only the maps, dates, and prices differ).
-
Hi Dirk,
Thanks for your response. We have used Fetch as Google to test a couple of the URLs and it worked on 1 out of 3. All the pages do have light content. I checked the pages we fetched that weren't indexed, and they don't have any noindex or nofollow tags. It is frustrating, as we can see our competitors' event pages being indexed with no content. So any help is appreciated.
-
There could be many reasons why this sitemap is not indexed.
Are there any duplicates between the different sitemaps? (If there are duplicates, they are not listed as indexed in the 2nd sitemap.)
It could also be that the pages are too light in terms of content to get indexed. For example, if you only list the event name, date, and place, without additional content, the page will probably not get indexed.
Are you sure that all the URLs in this sitemap can be indexed (not blocked by robots.txt or a noindex tag)? You could try a few URLs from the sitemap in Fetch as Google and see if they are fetched properly.
rgds
Dirk
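Two of the checks above (duplicates between sitemaps, and URLs blocked by robots.txt) can be scripted. The following is a minimal sketch using only the Python standard library; it assumes you have already saved the sitemap XML and the robots.txt contents as strings, and the function names are hypothetical. It does not check for noindex tags, which would require fetching each page.

```python
import xml.etree.ElementTree as ET
from urllib.robotparser import RobotFileParser

# Namespace used by the standard sitemap protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text):
    """Return every <loc> URL listed in a sitemap document."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.iterfind(".//sm:loc", SITEMAP_NS) if loc.text]


def duplicates_between(first_xml, second_xml):
    """Return the URLs that appear in both sitemaps, sorted."""
    return sorted(set(sitemap_urls(first_xml)) & set(sitemap_urls(second_xml)))


def blocked_by_robots(urls, robots_txt, user_agent="Googlebot"):
    """Return the URLs that the given robots.txt text disallows for user_agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]
```

Running `duplicates_between` on events.xml and sitemap.xml would show whether event URLs are already listed in the main sitemap, and `blocked_by_robots` on the events URLs would flag anything robots.txt keeps Googlebot away from. Note that Python's robots.txt parser may not match Google's handling of wildcard rules exactly, so treat the result as a first pass.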