Sitemaps - Format Issue
-
Hi,
I have a little issue with a client site whose programmer seems kind of unwilling to change things that he has been doing a long time.
So, he has had this dynamic site set up for a few years and active in google webmaster tools and others, but is not happy with the traffic it is getting.
When I looked at webmaster tools I see that he has a sitemap registered, but it is /sitemap.php
When I said that we should be offering the SE's /sitemap.xml his response is that sitemap.php checks the site every day and generates /sitemap.xml, but there is no /sitemap.xml registered in webmaster tools.
My gut is telling me that he should just register /sitemap.xml in webmaster tools, but it is a hard sell
Anyone have any definitive experience of people doing this before and whether it is an issue?
My feeling is that it doesn't need to be rocket science...
Any input appreciated,
Sha
-
I have a sitemap.php on my sites. The file contains the php code which generates my xml sitemap. It is perfectly standard and common practice.
The question for your programmer is, where is the output xml file located? A sitemap program will output the file to the same location each time it is updated. He should be able to provide you a link to the file.
I would advise the URL to be placed somewhere like mydomain.com/sitemap directory. If a deeper directory is preferred, then add the location to robots.txt. Either way it cannot hurt to update the sitemap in Google WMT. With that said, it is not necessary to do so as long as you can confirm Google is getting the information.
-
I haven't seen a sitemap.php in a long time, Sha. Certainly Google could read it if they want, but whether they will or not is the question. I would be inclined to doubt it.
If he says that it's generating a sitemap.xml, but none is present on WMT, then I would respond that one of two things is happening:
1. It isn't generating the sitemap in an xml format at all, but only in php, or
2. For some reason, the xml version is either not transmitted, or not received.
The only other possibility that comes to mind is that perhaps the conversion from php to xml is not tagged in a fashion to be recognized as an xml file, and WMT is detecting it as php and assigning it that status accordingly. I suppose that could happen, particularly if he is using an outdated plugin or if of his own coding, the conversion is faulty.
I'd be interested in hearing what you ultimately learn on this.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Are there any SEO issues we should be aware of on Gutenberg?
We are launching a new website and switching to WP 5.0 Gutenberg. Are there any issues we should be aware of related to SEO with the new platform?
Technical SEO | | AegisLiving0 -
Sitelinks Issue - Different Languages
Hey folks, We run different ccTLD's for revolveclothing.com (revolveclothing.es, revolveclothing.com.br, etc. etc.) and they all have their own WMT/Google Console with their own href lang tags etc. The problem is this. https://www.google.fr/#q=revolve+clothing When you look at the sitelinks, you'll see that one of them (sales page) happens to be in Portuguese on the French site. Can anyone investigate and see why?
Technical SEO | | ggpaul5620 -
Development Website Duplicate Content Issue
Hi, We launched a client's website around 7th January 2013 (http://rollerbannerscheap.co.uk), we originally constructed the website on a development domain (http://dev.rollerbannerscheap.co.uk) which was active for around 6-8 months (the dev site was unblocked from search engines for the first 3-4 months, but then blocked again) before we migrated dev --> live. In late Jan 2013 changed the robots.txt file to allow search engines to index the website. A week later I accidentally logged into the DEV website and also changed the robots.txt file to allow the search engines to index it. This obviously caused a duplicate content issue as both sites were identical. I realised what I had done a couple of days later and blocked the dev site from the search engines with the robots.txt file. Most of the pages from the dev site had been de-indexed from Google apart from 3, the home page (dev.rollerbannerscheap.co.uk, and two blog pages). The live site has 184 pages indexed in Google. So I thought the last 3 dev pages would disappear after a few weeks. I checked back late February and the 3 dev site pages were still indexed in Google. I decided to 301 redirect the dev site to the live site to tell Google to rank the live site and to ignore the dev site content. I also checked the robots.txt file on the dev site and this was blocking search engines too. But still the dev site is being found in Google wherever the live site should be found. When I do find the dev site in Google it displays this; Roller Banners Cheap » admin dev.rollerbannerscheap.co.uk/ A description for this result is not available because of this site's robots.txt – learn more. This is really affecting our clients SEO plan and we can't seem to remove the dev site or rank the live site in Google. In GWT I have tried to remove the sub domain. When I visit remove URLs, I enter dev.rollerbannerscheap.co.uk but then it displays the URL as http://www.rollerbannerscheap.co.uk/dev.rollerbannerscheap.co.uk. I want to remove a sub domain not a page. Can anyone help please?
Technical SEO | | SO_UK0 -
Subdomains Issue
Hi , We have created sub domains of our site to target various Geo´s. For example, geo, uk.site.com, de.site,com and all these sub domains have the same content as main domain. Will it affect our SEO Rankings? How can we solve this if it affects our rankings?
Technical SEO | | mikerbrt240 -
Http & https canonicalization issues
Howdyho I'm SEOing a daily deals site that mostly runs on https Versions. (only the home page is on http). I'm wondering what to do for canonicalization. IMO it would be easiest to run all pages on https. But the scarce resources I find are not so clear. For instance, this Youmoz blog post claims that https is only for humans, not for bots! That doesn't really apply anymore, right?
Technical SEO | | zeepartner0 -
What's the issue?
Hi, We have a client who dropped in the rankings (initially from bottom of the first page to page to page 3, and now page 5) for a single keyword (their most important one - targeted on their homepage) back in the middle of March. So far, we've found that the issue isn't the following: Keyword stuffing on the page External anchor text pointing to the page Internal anchor text pointing to the page In addition to the above, the drop didn't coincide with panda or penguin. Any other ideas as to what could cause such a drop for a single keyword (other related rankings haven't moved). We're starting to think that this may just have been another small change in the algorithm but it seems like too big of a drop in a short space of time for that to be the case. Any thoughts would be much appreciated! Thanks.
Technical SEO | | jasarrow0 -
Drupal 1.5 Issue: Taxonomy
Hi there I have a domain which is built in Drupal 1.5 . We managed to redirect all nodes to the actial SEF URL. The one issue we have no is redirecting the taxonomy urls to the SEF url. The obviuos answr is to do a manual 301 redirect n the htaccess file but this will a long process as there are over 500 urls affected. Is there a better way to do this automatically within Drupal? Your thoughts and ideas are welcome.
Technical SEO | | stefanok0 -
What is with WordPress Dupe issues?
Hi, Just wondering if anyone can explain for me why it seems every tag that is entered in WP blog posts on a site creates a duplicate page (identified by ROGER and friends in SEOmoz crawl)? Obviously if you can offer a solution (apart from the extremely obvious "don't use tags") I would be immensely grateful. Thanks so much,
Technical SEO | | ShaMenz0