For large sites, best practices for pages hidden behind internal search?
-
If a website has 1M+ pages, with most of them being hidden behind an internal search, what's the best way to get pages included in an engine's index?
Does a direct clickpath to those pages need to exist from the homepage or other major hub pages on the site?
Is submitting an XML sitemap enough?
-
Hello Vlevit,
You could do several things. I recommend giving Google your product feed, which should accomplish your goals. Another possible solution would be to make those search pages noindex,follow so they don't end up getting indexed, but Google can still use them for discovery.
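As a rough illustration of the noindex,follow idea, the sketch below picks a robots meta tag per URL. The `/search` path prefix and the `robots_meta` helper are assumptions made for the example, not something from the site in question; adjust them to however your internal search URLs are actually structured.

```python
from urllib.parse import urlsplit

# Hypothetical path prefix for internal search result pages (an assumption).
SEARCH_PREFIX = "/search"


def robots_meta(url: str) -> str:
    """Return the robots meta tag to place in <head> for the given URL."""
    path = urlsplit(url).path
    is_search = path == SEARCH_PREFIX or path.startswith(SEARCH_PREFIX + "/")
    if is_search:
        # Keep search result pages out of the index, but let crawlers
        # follow the links on them to discover product pages.
        return '<meta name="robots" content="noindex,follow">'
    return '<meta name="robots" content="index,follow">'


print(robots_meta("https://example.com/search?q=green+widgets"))
print(robots_meta("https://example.com/products/green-widget"))
```

The search pages then act only as a discovery route, which is why they must not also be blocked in robots.txt: a crawler that cannot fetch the page never sees the tag or the links.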
Thanks for explaining the situation.
Below is more on submitting product feeds. It is for Google Product Search, but I would imagine the "link" field where you put the URL to your product detail page will help those pages get indexed in the standard results:
http://support.google.com/merchants/bin/answer.py?hl=en&answer=188494#USEverett
-
Everett, thanks for your reply. I understand the problems of showing internal search pages. I'm not looking to have internal search results indexed, just the pages that the results link to. We're in eCommerce.
I was under the impression that there was a clever way to have the individual product pages indexed without establishing a direct click path, but best practices recommend otherwise.
Question answered. Thanks all for your help.
-
Hello Vlevit,
If you can be more specific we may be able to be of more help. Google doesn't want you to show internal search result pages in its index, but if this is a different type of situation there may be an exception. Are these search result pages, product pages, category pages, content pages? Is it an eCommerce site, a community, a content site?
Generally speaking, 1M+ pages with no links going into them and content that is either sparse/thin or partially/fully duplicated on other similar pages (like a search for widgets and a search for green widgets showing overlapping content) is exactly the type of thing that will get you in hot water, and it could affect even the rankings of your home page.
Do you feel like your question has been answered or would you like to be more specific about your site and goals?
Cheers,
Everett
-
This is what I was assuming, but I was wondering if there was a clever way around creating direct click paths to those pages, while still maintaining their importance to the site. Thanks for the info.
-
Make sure they are part of the actual structure of your website, not just part of search. Meaning, you have to have links pointing at them. You will also want to make sure that those pages have value.
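One way to check whether pages really are part of the site structure is to walk the internal link graph from the homepage and list anything that is never reached. This is only a sketch: the `links` mapping and the page paths are made-up sample data, and in practice you would build them from a crawl or a database export.

```python
from collections import deque
from typing import Dict, Iterable, List


def click_depths(links: Dict[str, List[str]], home: str) -> Dict[str, int]:
    """Breadth-first search: minimum number of clicks from home to each page."""
    depths = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


def orphan_pages(all_pages: Iterable[str], links: Dict[str, List[str]], home: str) -> List[str]:
    """Pages that exist but cannot be reached by following links from home."""
    reachable = click_depths(links, home)
    return sorted(p for p in all_pages if p not in reachable)


# Hypothetical sample data: /p/2 exists but is only reachable via internal search.
links = {"/": ["/widgets"], "/widgets": ["/widgets/green"], "/widgets/green": ["/p/1"]}
all_pages = ["/", "/widgets", "/widgets/green", "/p/1", "/p/2"]

print(click_depths(links, "/"))
print(orphan_pages(all_pages, links, "/"))
```

Pages reported as orphans are the ones that need a category, subcategory, or related-products link before a crawler can reach them on its own.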
-
Hi vlevit,
The best practice is to have a direct click path from the index page, something like: index -> category (filter) -> subcategory (filter) -> page/product. In some cases XML sitemaps can also help with indexing.
BUT beware of overly large XML sitemaps: create more than one sitemap and group the URLs logically where possible.
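A minimal sketch of splitting a large URL list into several sitemaps plus a sitemap index is shown here. The sitemaps.org protocol caps each sitemap file at 50,000 URLs. The file names, the `build_sitemaps` helper, and the example.com URLs are assumptions for illustration, and the sketch chunks by count only; grouping by category or section would need a real site's own logic.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"
MAX_URLS_PER_SITEMAP = 50_000  # per-file URL limit in the sitemaps.org protocol


def build_sitemaps(urls, base_url, max_urls=MAX_URLS_PER_SITEMAP):
    """Return {filename: xml_string} for the child sitemaps and a sitemap index."""
    files = {}
    index = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for start in range(0, len(urls), max_urls):
        name = f"sitemap-{start // max_urls + 1}.xml"  # hypothetical naming scheme
        urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
        for url in urls[start:start + max_urls]:
            ET.SubElement(ET.SubElement(urlset, "url"), "loc").text = url
        files[name] = ET.tostring(urlset, encoding="unicode")
        # Register the child sitemap in the index file.
        ET.SubElement(ET.SubElement(index, "sitemap"), "loc").text = f"{base_url}/{name}"
    files["sitemap-index.xml"] = ET.tostring(index, encoding="unicode")
    return files


# Tiny demo with an artificially low limit so the split is visible.
demo_urls = [f"https://example.com/p/{n}" for n in range(5)]
demo = build_sitemaps(demo_urls, "https://example.com", max_urls=2)
print(sorted(demo))
```

Only the index file then needs to be submitted to the search engine; the child sitemaps are discovered through it.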
A few very good resources can be found at the following links:
http://www.seomoz.org/ugc/solving-new-content-indexation-issues-for-large-b2b-websites
http://www.seomoz.org/qa/view/29009/sitemaps-management-for-big-sites-tens-of-millions-of-pages
I hope it helps,
Istvan