Removing hundreds of old product pages - Best process
-
Hi guys,
I've got a site about discounts/specials etc. A few months ago we decided it might be useful to have shop specials in PDF documents "pulled" and put on the site individually so that people could find the specials easily. This resulted in over 2000 new pages being added to the site over a few weeks (there are lots of specials).
However, 2 things have happened:1 - we have decided to go in another direction with the site and are no longer doing this
2 - the specials that were uploaded have now ended but the pages are still liveGoogle has indexed these pages already. What would be the best way to "deal" with these pages? Do I just delete them, do I 301 them to the home page? PS the site is build on wordpress.
Any ideas as I am at a complete loss.
Thanks,
Marc -
I am not aware of any benefit to removing the pages slowly over time as opposed to all at once.
-
Hi Ryan,
Thanks, I'll go through now and start deleting them.
Will not deleting almost 2000 pages from the site, make Google a little nervous about the "structure" or "trustworthiness" of my site?
Should I do it a few each week, or just all at once? -
In that case the general advise would be to delete the pages and allow them to 404.
The wordpress information is offered in case you happen to have a WP site. If your site is not built in WP, then a similar process would ideally be used.
-
Hi Ryan,
Thanks.
Yes there would be no other pages that would help the user.
So do I just go through wordpress and mass delete all of those posts? -
When deleting a web page ask yourself the following question:
If I was a user who was looking for this web page and it was not available, is there another page on my site which is very closely related which is likely to satisfy the user's query?
For example, if you have a page on May 2012 Deals of the Month, then I would forward that page to the current "Deals of the Month" page.
If you do not have a closely related page, then I would suggest allowing the URL to go to your 404 page. You should have a solid 404 process in place. Specifically:
-
your 404 page should be helpful. It should contain your site's navigation, a search option, a basic "I am sorry the page you are looking for cannot be found" message, etc.
-
you should track 404 errors so you can understand the popularity of any URLs which generate a 404. If you see a single URL generating multiple 404 errors on a daily basis, then you can take that as a strong indicator of a need to create new content related to that subject.
I recommend Yoast Google Analytics for WP plugin: http://yoast.com/wordpress/google-analytics/. It helps track and resolve 404 errors on WP sites.
-
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Should remove 404 page
Hello, I upload a new website with new web addresses and my current addresses don't work anymore. I don't want to do redirects. Should I just remove the old address from google index using their tool or let google do it on its own. Thank you,
Intermediate & Advanced SEO | | seoanalytics1 -
Product page as homepage
Hello, Is it ok that to use the homepage of website as a product page directly where you present all your products on your homepage or can it penalise you to do that ? and in that case, is it better to have a homepage that you don't rank and create a subpage for your product page. Thank you,
Intermediate & Advanced SEO | | seoanalytics1 -
Fresh page versus old page climbing up the rankings.
Hello, I have noticed that if publishe a webpage that google has never seen it ranks right away and usually in a descend position to start with (not great but descend). Usually top 30 to 50 and then over the months it slowly climbs up the rankings. However, if my page has been existing for let's say 3 years and I make changes to it, it takes much longer to climb up the rankings Has someone noticed that too ? and why is that ?
Intermediate & Advanced SEO | | seoanalytics0 -
New Site Launch - Redirecting Hundreds of Old Invalid URLs?
We have a client whose WMT shows a ton of "Not found" crawl errors after a new site launched. The URLs are largely a bunch of files that they had previously uploaded on their old site which have external links pointing at them. What is their best course of action, if they don't have files on the new site that correspond with these old files? Redirect to the home page? Leave them 404ing? Customize 404 message? (Note: They're mostly generic, non-human friendly URLs that are difficult to identify like /gallery3.php/ppage/3, g/interior.php/pid/3/sid/24, /uploads/1213196781.pdf)
Intermediate & Advanced SEO | | VTDesignWorks0 -
Merging your google places page with google plus page.
I have a map listing showing for the keyword junk cars for cash nj. I recently created a new g+ page and requested a merge between the places and the + page. now when you do a search you see the following. Junk Cars For Cash NJ LLC
Intermediate & Advanced SEO | | junkcars
junkcarforcashnj.com/
Google+ page - Google+ page the first hyperlink takes me to the about page of the G+ and the second link takes me to the posts section within g+. Is this normal? should i delete the places account where the listing was originally created? Or do i leave it as is? Thanks0 -
How to get around Google Removal tool not removing redirected and 404 pages? Or if you don't know the anchor text?
Hello! I can’t get squat for an answer in GWT forums. Should have brought this problem here first… The Google Removal Tool doesn't work when the original page you're trying to get recached redirects to another site. Google still reads the site as being okay, so there is no way for me to get the cache reset since I don't what text was previously on the page. For example: This: | http://0creditbalancetransfer.com/article375451_influencial_search_results_for_.htm | Redirects to this: http://abacusmortgageloans.com/GuaranteedPersonaLoanCKBK.htm?hop=duc01996 I don't even know what was on the first page. And when it redirects, I have no way of telling Google to recache the page. It's almost as if the site got deindexed, and they put in a redirect. Then there is crap like this: http://aniga.x90x.net/index.php?q=Recuperacion+Discos+Fujitsu+www.articulo.org/articulo/182/recuperacion_de_disco_duro_recuperar_datos_discos_duros_ii.html No links to my site are on there, yet Google's indexed links say that the page is linking to me. It isn't, but because I don't know HOW the page changed text-wise, I can't get the page recached. The tool also doesn't work when a page 404s. Google still reads the page as being active, but it isn't. What are my options? I literally have hundreds of such URLs. Thanks!
Intermediate & Advanced SEO | | SeanGodier0 -
How do I fix the error duplicate page content and duplicate page title?
On my site www.millsheating.co.uk I have the error message as per the question title. The conflict is coming from these two pages which are effectively the same page: www.millsheating.co.uk www.millsheating.co.uk/index I have added a htaccess file to the root folder as I thought (hoped) it would fix the problem but I doesn't appear to have done so. this is the content of the htaccess file: Options +FollowSymLinks RewriteEngine On RewriteCond %{HTTP_HOST} ^millsheating.co.uk RewriteRule (.*) http://www.millsheating.co.uk/$1 [R=301,L] RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /index\.html\ HTTP/ RewriteRule ^index\.html$ http://www.millsheating.co.uk/ [R=301,L] AddType x-mapp-php5 .php
Intermediate & Advanced SEO | | JasonHegarty0 -
Old pages still crawled by SE returning 404s. Better to put 301 or block with robots.txt ?
Hello guys, A client of ours has thousand of pages returning 404 visibile on googl webmaster tools. These are all old pages which don't exist anymore but Google keeps on detecting them. These pages belong to sections of the site which don't exist anymore. They are not linked externally and didn't provide much value even when they existed What do u suggest us to do: (a) do nothing (b) redirect all these URL/folders to the homepage through a 301 (c) block these pages through the robots.txt. Are we inappropriately using part of the crawling budget set by Search Engines by not doing anything ? thx
Intermediate & Advanced SEO | | H-FARM0