Sitemap Folders on Search Results
-
Hello!
We are managing SEO campaign of a video website. We have an issue about sitemap folders.
I have sitemaps like ** /xml/sitemap-name.xml .** But Google is indexing my /xml/ folder and also sitemaps and they appear in search results.
If i will add Disallow: /xml/ to my robots.txt and remove /xml/ folder from webmaster tools, Google could see my sitemaps? or it ignores them?
Will my site effect negatively after remove /xml/ folder completely from search results?
What should i do?
-
Hi Ahmet
I don't think the sitemap indexed is an issue. Also I wouldn't block it with robots.txt - cause they won't be able to crawl it!
-Dan
-
Hi Ahmet,
You could give it at try by add the lines to your robots.txt for a second and testing your sitemap from within Google Webmaster Tools to see if it's working. I can't give you a definitive answer on this one. In the cases I handled I just accepted that these sitemaps were ranking for some long long long tail queries.
-
Hi Martijn,
I wonder if i block my sitemaps from robots.txt, could google bot reach my sitemaps?
-
Hi Ahmet,
I've seen this a dozen times, somehow Google thinks site maps are in some way great content as they are putting them within the search results. But by using the Disallow in your robots.txt Google won't be able to find your sitemap, or at least they will provide you with an error because you excluded them.
Hope this helps!
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Getting Google to index our sitemap
Hi, We have a sitemap on AWS that is retrievable via a url that looks like ours http://sitemap.shipindex.org/sitemap.xml. We have notified Google it exists and it found our 700k urls (we are a database of ship citations with unique urls). However, it will not index them. It has been weeks and nothing. The weird part is that it did do some of them before, it said so, about 26k. Then it said 0. Now that I have redone the sitemap, I can't get google to look at it and I have no idea why. This is really important to us, as we want not just general keywords to find our front page, but we also want specific ship names to show links to us in results. Does anyone have any clues as to how to get Google's attention and index our sitemap? Or even just crawl more of our site? It has done 35k pages crawling, but stopped.
Intermediate & Advanced SEO | | shipindex0 -
Keyword not provided now in search console
Hello, Is the not provided now available in google search console ? It seems that it is or is it a totally different thing in the search console ? Thank you,
Intermediate & Advanced SEO | | seoanalytics0 -
Crawled page count in Search console
Hi Guys, I'm working on a project (premium-hookahs.nl) where I stumble upon a situation I can’t address. Attached is a screenshot of the crawled pages in Search Console. History: Doing to technical difficulties this webshop didn’t always no index filterpages resulting in thousands of duplicated pages. In reality this webshops has less than 1000 individual pages. At this point we took the following steps to result this: Noindex filterpages. Exclude those filterspages in Search Console and robots.txt. Canonical the filterpages to the relevant categoriepages. This however didn’t result in Google crawling less pages. Although the implementation wasn’t always sound (technical problems during updates) I’m sure this setup has been the same for the last two weeks. Personally I expected a drop of crawled pages but they are still sky high. Can’t imagine Google visits this site 40 times a day. To complicate the situation: We’re running an experiment to gain positions on around 250 long term searches. A few filters will be indexed (size, color, number of hoses and flavors) and three of them can be combined. This results in around 250 extra pages. Meta titles, descriptions, h1 and texts are unique as well. Questions: - Excluding in robots.txt should result in Google not crawling those pages right? - Is this number of crawled pages normal for a website with around 1000 unique pages? - What am I missing? BxlESTT
Intermediate & Advanced SEO | | Bob_van_Biezen0 -
Google Search Results...
I'm trying to download every google search results for my company site:company.com. The limit I can get is 100. I tried using seoquake but I can only get to 100. The reason for this? I would like to see what are the pages indexed. www pages, and subdomain pages should only make up 7,000 but search results are 23,000. I would like to see what the others are in the 23,000. Any advice how to go about this? I can individually check subdomains site:www.company.com and site:static.company.com, but I don't know all the subdomains. Anyone cracked this? I tried using a scrapper tool but it was only able to retrieve 200.
Intermediate & Advanced SEO | | Bio-RadAbs0 -
XML Sitemaps - Multi-lingual website
Hi Mozzers, I am working with a large website that has some of its content translated across multiple languages. I am planning on using The Media Flow to create an HREFLANG Sitemap for content on various languages. Please see the attached image for the questions below. Thanks! Section Highlighted Yellow: When there is a URL that does not have a translated version, should it not be included on the same HREFLANG sitemap? Alternately, could I just remove the languages that are not being targeted, so this would just reflect English language targeting? fqO9Dvk
Intermediate & Advanced SEO | | J-Banz0 -
Block search bots on staging server
I want to block bots from all of our client sites on our staging server. Since robots.txt files can easily be copied over when moving a site to production, how can i block bots/crawlers from our staging server (at the server level), but still allow our clients to see/preview their site before launch?
Intermediate & Advanced SEO | | BlueView13010 -
How would I be able to make sure Google lists search results as a combined listing opposed to a single listing/
I am slightly confused about how to let Google know to index our site as a complex listing opposed to individual page listing. Our site is well established and has over 3500 indexes. Does this have to do with the Sitemap or is it something else? Is there a way to expedite Google to list our site like the example below. Thank you for your help! Whole Foods Market <cite>www.wholefoodsmarket.com/</cite>Owns and operates chain of natural foods supermarkets which sell meat and poultry free of growth hormones and antibiotics, unprocessed grains and cereals, ... | ### Stores Hours: Open 8am to 10pm Seven Days a Week. Note Holiday ... | ### Online Ordering welcome find a store healthy eating about our products ... |
Intermediate & Advanced SEO | | olive13
| ### Coupons Here they are: printable coupons from the latest issue of our in ... | ### Recipes This boldly flavored casserole is an excellent way to use leftover ... |
| ### Careers Hiring Process - Job Fairs and Events - Career Paths - ... | ### Lamar Located just blocks from where Whole Foods Market began as ... |
| More results from wholefoodsmarket.com » |0 -
Organic Search Problems?
Hey guys, I am in need of a little help! I am currently an aspiring SEO (trying to absorb as much information as I can and implement changes to help my site organically)... Most of my experience revolves around SEM. That being said, I have a problem. My site is doing well through paid search... great quality scores, etc. However, the content on my site (and even my site as a whole) does not "appear" to rank well in Organic. To explain further... My site is federalautoloan.com... and when I type in exact article names (or even federal auto loan) into Google, nothing shows up. And yes, my content is all original/unique content. I've even recently added a unique Calculator to my site. site:federalautoloan.com in the search bar shows results for all of my pages... but it just seems as though Google does not like my site for some reason. At least in Organic. The odd thing is, none of my other sites have this problem. Do you guys have any advice? The only thing I can think of is that somehow my 301 redirect was performed improperly. Yes, I had a permanent redirect performed on my site about 4 months back. The URL we were using prior just wasn't performing as well in Paid Search. But seeing as how that is the preferred method by Google... I'm really at a loss... Again, my site is FederalAutoLoan.com. Any help would be GREATLY appreciated. Even generic SEO advice would be appreciated. Edit: Two other things to note... I have plugged my site into the SEOmoz Pro tool... the tool is not showing any issues for my site. I am also making use of Google Webmaster Tools and the only error that shows up for my site is a Soft 404 for one of my pmcs... Not sure why it is even pulling one of my pmcs... but as far as I can tell, there really shouldn't be any problems. Note on the 404 for anyone who might give a response on that issue... http://www.seoconsultants.com/tools/headers returns a 200 OK response. Edit2: Question presented below.
Intermediate & Advanced SEO | | WPColt0