Removing a large number of unnecessary pages from a site
-
Hi all,
I have a big problem with my website. It has a lot of duplicate pages generated from various combinations of select options, and because of all this duplicate content we were hit by a Panda update two years ago.
I don't want to write new content for all of these pages (about 3,000,000), because most of them are unnecessary. Google has indexed all of them, and I want to redirect the pages I no longer need to the most important ones.
My question: will there be any problem with how Google sees this change, given that only 5,000-6,000 relevant pages will remain afterwards?
-
I can't see this causing you problems. I've often noindexed huge numbers of pages, mostly on sites with Panda issues, and in several cases we've seen large increases in traffic after a later Panda refresh.
-
No problem with doing this at all. I have also worked with a client who had a large number of pages; they cut the site from about 170,000 pages to just under 2,000, and the result was a good one. This was also to combat Panda.
However, when redirecting, take care not to just redirect every one of the 3 million pages to the same few places. If you can't find a good, relevant match for a 301, then just return a 404. It is much better to have a 404 than a poor redirect. I would also review your 404 page to make sure it is user friendly and helps people navigate to something better.
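The 301-or-404 rule above can be sketched in Python. This is only an illustration under stated assumptions: the URL pattern (`/search?category=...`), the `KEEP_PAGES` mapping, and the function name `decide_response` are all hypothetical, since the thread does not say how the duplicate URLs are built.

```python
from typing import Dict, Optional, Tuple
from urllib.parse import parse_qsl, urlsplit

# Hypothetical mapping: the "category" filter of a retired page -> the page we keep.
KEEP_PAGES: Dict[str, str] = {
    "shoes": "/shoes/",
    "bags": "/bags/",
}

def decide_response(url: str) -> Tuple[int, Optional[str]]:
    """Return (status, location) for a retired filter-combination URL.

    301 only when a clearly relevant surviving page exists; otherwise 404,
    rather than sending everything to one catch-all page.
    """
    query = dict(parse_qsl(urlsplit(url).query))
    target = KEEP_PAGES.get(query.get("category", ""))
    if target is not None:
        return 301, target
    return 404, None
```

With these assumed inputs, `decide_response("/search?category=shoes&color=red")` yields a 301 to `/shoes/`, while a URL with no matching category yields a 404.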
Best of luck.
-Andy
-
Ouch, Google will definitely notice. But if you have already taken a big hit on traffic, then this is probably a wise choice. What I would check beforehand is how much traffic is going to the pages you plan to remove, so you know upfront how much traffic you could lose by removing them.
With luck, though, the pages that remain in place will have more authority, as they get better links from across the domain.
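One way to run that traffic check is sketched below, assuming you can export visits per URL from your analytics tool into a dictionary. The function name `traffic_at_risk` and the sample figures are hypothetical and only illustrate the arithmetic.

```python
from typing import Dict, Set, Tuple

def traffic_at_risk(visits_by_url: Dict[str, int],
                    urls_to_remove: Set[str]) -> Tuple[int, float]:
    """Return (visits on pages slated for removal, their share of all visits)."""
    total = sum(visits_by_url.values())
    at_risk = sum(v for url, v in visits_by_url.items() if url in urls_to_remove)
    share = at_risk / total if total else 0.0
    return at_risk, share

# Made-up sample data, for illustration only.
visits = {
    "/shoes/": 900,
    "/search?category=shoes&color=red": 60,
    "/search?color=red": 40,
}
to_remove = {"/search?category=shoes&color=red", "/search?color=red"}
at_risk, share = traffic_at_risk(visits, to_remove)
print(f"{at_risk} visits at risk ({share:.0%} of total)")
```

If the share is small, removing the pages costs little; if a few removed URLs carry real traffic, those are the ones worth a careful 301.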