Replacing a site map
-
We are in the process of changing our folder/url structure. Currently we have about 5 sitemaps submitted to Google.
How is it best to deal with these site maps in terms of either (a) replacing the old URLs with the new ones in the site map and (b) what affect should we have if we removed the site map submission from the Google Webmaster Tools console.
Basically we have in the region of 20,000 urls to redirect to the new format, and to update in the site map.
-
Another thought might be to place a noindex on the new pages to start with and as we migrate and 301 redirect the old to the new remove the noindex on the new pages ?
That can work but it's not an approach i would use. It seems like a lot of extra work, you run the risk of forgetting to remove the noindex tag on some pages, and also you may wind up not having pages properly indexed for a month.
If you publish a page today, Google may crawl the new page and see the noindex tag. You can then remove the noindex tag but Google may not recrawl the page for some time leaving your site without an indexed page.
As part of the process of publishing the page, I would 301 the old URL to the new URL immediately.
-
Another thought might be to place a noindex on the new pages to start with and as we migrate and 301 redirect the old to the new remove the noindex on the new pages ?
Thoughts ??
-
since the site has over 10,000 pages we need to make sure all redirects etc are set-up before we go live with the new URLs ?
Whether your site has 10 pages or a million pages you should ensure all internal links work without the need for redirection. Any old external links should be redirected to the correct page on your site if one exists. Otherwise you can allow the URL to 404 if there is not a current equivalent page.
Set up your site's 404 page so users are offered a basic "page not found" message along with your site's navigation and a search function. You should set up a log to track which URLs are generating 404 errors.
Prior to launching the site run a crawl diagnostic to help ensure nothing has been missed.
-
Perfect thanks. Just one final question, since the site has over 10,000 pages we need to make sure all redirects etc are set-up before we go live with the new URLs ?
What is the best way to go forward with regards launching the site ?
Should we launch the new pages and then go through the URLs redirecting them ?
Thoughts please ??
-
We are changing our site structure for two main reasons
-
Ease of functionality, and having the ability to target friendly URLs suitable for SEO
-
Plus we've a new CMS, that allows this custom written URLs
The current structure has too many folders that are too deep, and is becoming too un-manageable. The new CMS gives us totally control from one control panel.
I understand that we will loose some PR, but believe it will be for the better of the site and user experience.
-
-
We are changing our site structure for two main reasons
-
Ease of functionality, and having the ability to target friendly URLs suitable for SEO
-
Plus we've a new CMS, that allows this custom written URLs
The current structure has too many folders that are too deep, and is becoming too un-manageable. The new CMS gives us totally control from one control panel.
I understand that we will loose some PR, but below it will be for the better of the site and user experience.
-
-
Big question: are you changing folder/url structure for aesthetics or functionality? Often times it's not worth making such a large change in hopes of getting some SEO-friendly URL's, as the weight on SEO-friendly URL's isn't what it once was. And the headache involved, as well as the inevitable loss in traffic, is quite often not worth it at all.
With that said, refresh your entire sitemap with the new URL's once they are made. Remove all old urls.
IMPORTANT: setup 301 redirects, either using .htaccess or PHP (or whatever language your site uses), to redirect all old urls to the respective new urls. You will lose a fair chunk of PR during this change, but if you feel your site will benefit greatly from a structure change, then you will be willing to take the hit.
Don't leave any redirect un-turned. Then, you'll just have to wait it out while Google re-indexes your entire site trying to figure out your new url structure. Could take a week, could take months. All depends on what Google has valued your site as. For example, if CNN changed their entire URL structure, they probably would miss a beat. Smaller websites tend to take much larger hits in the SERP's.
So, just be sure it's a necessary action, trust me. And don't ever remove those 301's from your .htaccess as you never know what Google still has in their index for your site.
-
A sitemap should be a link representation of your site. It should contain a link to every page you wish to be included in Google's index.
How is it best to deal with these site maps in terms of either (a) replacing the old URLs with the new ones in the site map
Just make the switch. If a page no longer exists on your site, remove the link. If you create a new page on your site, add the link.
what affect should we have if we removed the site map submission from the Google Webmaster Tools console
For the most part, none. During the next crawl Google would look for your sitemap at your root address: www.mydomain.com/sitemap.xml. Google will also check your robots.txt file for a path to your sitemap file. If a sitemap is not located, it will crawl your site normally.
The primary purpose of a sitemap is to allow Google to become aware about new pages on your site it otherwise might not find. If your site offers solid navigation, a site map is not necessary at all.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Unsolved URL dynamic structure issue for new global site where I will redirect multiple well-working sites.
Dear all, We are working on a new platform called [https://www.piktalent.com](link url), were basically we aim to redirect many smaller sites we have with quite a lot of SEO traffic related to internships. Our previous sites are some like www.spain-internship.com, www.europe-internship.com and other similars we have (around 9). Our idea is to smoothly redirect a bit by a bit many of the sites to this new platform which is a custom made site in python and node, much more scalable and willing to develop app, etc etc etc...to become a bigger platform. For the new site, we decided to create 3 areas for the main content: piktalent.com/opportunities (all the vacancies) , piktalent.com/internships and piktalent.com/jobs so we can categorize the different types of pages and things we have and under opportunities we have all the vacancies. The problem comes with the site when we generate the diferent static landings and dynamic searches. We have static landing pages generated like www.piktalent.com/internships/madrid but dynamically it also generates www.piktalent.com/opportunities?search=madrid. Also, most of the searches will generate that type of urls, not following the structure of Domain name / type of vacancy/ city / name of the vacancy following the dynamic search structure. I have been thinking 2 potential solutions for this, either applying canonicals, or adding the suffix in webmasters as non index.... but... What do you think is the right approach for this? I am worried about potential duplicate content and conflicts between static content dynamic one. My CTO insists that the dynamic has to be like that but.... I am not 100% sure. Someone can provide input on this? Is there a way to block the dynamic urls generated? Someone with a similar experience? Regards,
Technical SEO | | Jose_jimenez0 -
2 sets of stats for same site
Somehow on OSE I managed to get two different sets of results appear for my page. The column on the left (PA 34) is for mysite.com/ and the second column is for www.mysite.com/ .Note that these are the same site. Why do i have two different sets of results ?(note some things are the same such as google +1 & FB likes)Im concerned ive done something wrong and could have a bigger beast with both sets of results merged together. Any help much appreciated. Chris QFNeGh7
Technical SEO | | cjkimber0 -
Site removed from Google Index
Hi mozers, Two months ago we published http://aquacion.com We registered it in the Google Webmaster tools and after a few day the website was in the index no problem. But now the webmaster tools tell us the URLs were manually removed. I've look everywhere in the webmaster tools in search for more clues but haven't found anything that would help me. I sent the acces to the client, who might have been stupid enough to remove his own site from the Google index, but now, even though I delete and add the sitemap again, the website won't show in Google SERPs. What's weird is that Google Webmaster Tools tells us all the page are indexed. I'm totally clueless here... Ps. : Added screenshots from Google Webmaster Tools. Update Turns out it was my mistake after all. When my client developped his website a few months ago, he published it, and I removed the website from the Google Index. When the website was finished I submited the sitemap, thinking it would void the removal request, but it don't. How to solve In webmaster tools, in the [Google Index => Remove URLs] page, you can reinclude pages there. tGib0
Technical SEO | | RichardPicard0 -
Redirecting the .com of our site
Hey guys, A company I consult for has a different site for its users depending on the geography. Example: When a visitor goes to www.company.com if the user is from the EU, it gets redirected to http://eu.company.com If the user is from the US, it goes to http://us.company.com And so on. I have two questions: Does having a redirect on the .com will influence rankings on each specific sub-site? I suspect it will affect the .com since it will simply not get indexed but not sure if affects the sub domains. The content on this sub-sites are not different (I´m still trying to figure out why they are using the sub-domains). Will they get penalized for duplicate content? Thanks!
Technical SEO | | FDSConsulting0 -
Should this site start again on a new domain
Hi We have not done SEO on this site they have used another company who looks like they outsourced and the links have been built by a third party all blog networks and this company have said they cannot get the links removed. Google flagged artificial links on this web site in February and in April it lost over 10000 visitors in a month and its just free falled ever since. The categories have been recreated and no redirects created due to the amount of backlinks from the blog sites to the original category pages but the site is not recovering its down to 1500 visitors a month and used to get 14000 a month. So should my customer ditch the domain and move this site to fresh domain? http://www.kids-beds-online.com Any answers would really be appreciated. thanks Tracy
Technical SEO | | dashesndots0 -
Site being indexed by Google before it has launched
We are currently coming towards the end of migrating one of our retail sites over to magento. To our horror, we find out today that some pages are already being indexed by Google, and we have started receiving orders through new site. Do you have any suggestions for what may have caused this? Or similarly, what the best solution would be to de-index ourselves? We most recently excluded anything with a certain parameter from robots.txt - could this being implemented incorrectly have caused this issue? Thanks
Technical SEO | | Sayers0 -
Brand New Site Penalized?
I recently launched 2 completely separate and unrelated websites at the same time. Both are new domains and hosting accounts. neither have any links. One is ranking for a branded search and the other is not. The interesting thing is that I tested both sites on the back end of my server before launch. The site that is not ranking for branded search IS ranking still on the back end of my site for the branded search. I have removed all content and 301 redirected the testing urls back to my portfolio page. Could this be do to Google indexing one but not the other. Does it have anything to do with testing on my server first and my DA being higher than current new sites? Or is it something completely different I'm missing completely. Is this a Penalty?
Technical SEO | | CDUBP0 -
.CA site same as .com site - are both necessary?
Dear Friend, We representa a major national brand in the auto care industry, and they have locations in both US and Canada. There is a primary content site at .com that we have duplicated at .ca. We are hosting the .ca site on a separate IP on a server in Canada - but by in large it is the same site. (there are some minor changes we made to change US English to Canadian English - though minor. When we search Google.ca we generally see strong search results for the .com site, but rarely, if ever any evidence of rankings for the .ca site. The .com site was launched several years ago about 18 months before the .ca site. Why doesn't Google.ca show the .ca site? Is this an issue of duplicate content, and Google.ca simply shows the .com version which it knew about first? Are we wasting our time, money and efforts having both? Thanks, Tim ps. this isn't about location. We use a separate site to locate local shops, and have coordinated that well with Google Places, and when looking for local auto care - we do well in both US and Canada. The sites described above are largetl content sites.
Technical SEO | | lunavista-comm0