CMS dynamicly created pages indexed?
-
Hey Moz'erz,
Looking at the indexed pages of my clients eCommerce website I noticed that dynamically created pages are being indexed.
For example this page does not "exist" but is created by a drop down filter menu that sorts by product tag:
/collections/tools/TAG
I can only conclude that this page got indexed either through a backlink or once upon a time there was an internal link pointing to this URL and got indexed (currently there is not). Are either of these cases possibilities?
In either case before considering removal or any action I would of-course reference analytics to check for conversions, traffic and any backlinks for those "pages".
I believe at the end of the day is recommend a drop down filer that doesn't create new pages as the best solution.
Thoughts, comments and experience is greatly welcomed
-
Hey Dylan
Either of those are possibilities for Google finding and indexing a page like that. There could be many ways that happened - I've seen them spider "links" in a drop down depending on how it's implemented.
One thing you can do to check how, is looked at the text-only cache of the page (type cache:www.domain.com/page-name in your browser and click text only) - and look to see if the drop down items actually appear and clickable links. You can also try crawling the site with Screaming Frog and set the user-agent to GoogleBot and see if they got picked up.
If the filter is just for example re-sorting the list of items in a category, there is probably not a need to have this crawled or indexed, because it's just the same content in a different order.
If you do want to remove them from the index, you will want to add a meta noindex tag to the HTML, wait for them to drop out of the index, and then block crawling with robots.txt or nofollow the links that might be generated.
Hope that helps!
EDIT - I'd also check to be sure they are not showing up in your XML sitemap.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Website Home page suddenly disappeared after changing Hosting
HI All, My site was ranking very well and was in 1st page of google for most of my keywords. Last week we did some update to the site and moved it to new hosting and from then onwards I dont see my site home page in Google ranking . My Website Name is : royalevents.com.au. We used to be in 1st of Google for keywords like wedding Mandaps, Indian Wedding Mandaps etc, Would be great if some one helps us to figure out whats gone wrong .. I also did Webmaster Fetch as Google but nothing happened. Thanks
Web Design | | Verve-Innovation0 -
SEO page length 4500+ words
I have read varying discussions on this... some say it is good or rather it does not really matter (as long as not stuffed with keywords) and some say more than 1000+ words is bad! I have a travel site and I want to add an historical page about the zone. It is very interesting (very organic, not written for SEO purposes as such). It adds flavor and details to a site that is really all about sales. Does anyone have an opinion whether this is detrimental to SEO or not?
Web Design | | Llanero0 -
What else should you call the Home page?
In the menu bar and footer the main page is called Home. Would it confuse people to rename it to Business Name Home or Business Name? How do you handle this?
Web Design | | CFSSEO0 -
Homepage and Category pages rank for article/post titles after HTML5 Redesign
My site's URL (web address) is: http://bit.ly/g2fhhC Timeline:
Web Design | | mcluna
At the end of March we released a site redesign in HTML5
As part of the redesign we used multiple H1s (for nested articles on the homepage) and for content sections other than articles on a page. In summary, our pages have many many, I mean lots of H1's compared to other sites notable sites that use HTML5 and only one H1 (some of these are the biggest sites on the web) - yet I don't want to say this is the culprit because the HTML5 document outline (page sections) create the equivalent of H1 - H6 tags. We have also have been having Google cache snapshot issues due to Modernzr which we are working to apply the patch. https://github.com/h5bp/html5-boilerplate/issues/1086 - Not sure if this would driving our indexing issues as below. Situation:
Since the redesign when we query our article title then Google will list the homepage, category page or tag page that the article resides on. Most of the time it ranks for the homepage for the article query.
If we link directly to the article pages from a relevant internal page it does not help Google index the correct page. If we link to an article from an external site it does not help Google index the correct page. Here are some images of some example query results for our article titles: Homepage ranks for article title aged 5 hours
http://imgur.com/yNVU2 Homepage ranks for article title aged 36 min.
http://imgur.com/5RZgB Homepage at uncategorized page listed instead of article for exact match article query
http://imgur.com/MddcE Article aged over 10 day indexing correctly. Yes it's possible for Google index our article pages but again.
http://imgur.com/mZhmd What we have done so far:
-Removed the H1 tag from the site wide domain link
-Made the article title a link. How it was on the old version so replicating
-Applying the Modernizr patch today to correct blank caching issue. We are hoping you can assess the number H1s we are using on our homepage (i think over 40) and on our article pages (i believe over 25 H1s) and let us know if this may be sending a confusing signal to Google. Or if you see something else we're missing. All HTML5 and Google documentation makes clear that Google can parse multiple H1s & understand header, sub & that multiple H1s are okay etc... but it seems possible that algorythmic weighting may not have caught up with HTML5. Look forward to your thoughts. Thanks0 -
Flag page elements to not be loaded by Instapaper and co.
Does anybody know if there is a way to mark certain elements (especially navigation menus) so that instapaper and co don't pull them? I'm looking for a quick solution (best would be if it was CSS based) nothing fancy like parsing the user-agent. That would be plan B. I've added role="navigation" id="navigation" and class="navigation" to the nav elements in hope that it would work. Seems like it does not; sometimes the elements are present in the page generated by instapaper, sometimes not. Thank you for any replies and have a great day! Jan
Web Design | | jmueller0 -
Random 302 Redirect (Wordpress CMS)
So this new project that i am working on is a redesigned CMS site using Wordpress (php based). Before i started on the project they made a few major updates that include: Changed from ID based URLS to SEO friendly URLS Added multiple languages in directories www.domain.com/en www.domain.com/fr www.domain.com/de etc... Due to the new languages they wanted to have the previous home page, www.domain.com, redirect to the proper language based on their IP address. Currently they are using a 301 redirect through a php header. So if i was visiting the site from my hometown of Cleveland, Ohio and i type www.domain.com into the browser. I would automatically redirect to www.domain.com/en and i would have the option to switch languages if needed. The issue: When i first added their site into SEOMoz the crawl returned a large amount of 302 redirects coming from their old homepage www.domain.com. So i took a look at the header calls using IE's webmaster tools and Firebug in Firefox. In both profilers it showed the same problem. Before the 301 header redirect there was a 302 redirect called first When viewing the response header it mentioned an x-pingback of some file that didn't even exist on the site: www.domain.com/xmlrpc.php This is obviously a huge issue because any link value from the old homepage will be lost due to the 302 not passing the value. I have tried search the almighty Google for help but it has gotten me no where. I have a hunch it is something to do with Wordpress but that is based on nothing but my gut. Any help is greatly appreciated. I got to get that 302 gone or changed to a 301 🙂 Regards - Kyle
Web Design | | kchandler0 -
Sudden dramatic drops in SERPs along with no snippet and no cached page?
We are a very stable, time tested domain (over 15 yrs old) with thousands of stable, time tested inbound links. We are a large catalog/e commerce business and our web team has over a decade's experience with coding, seo etc. We do not engage in link exchanges, buying links etc and adhere strictly to best white hat seo practices. Our SERPs have generally been very stable for years and years. We continually update content, leverage user generated content etc, and stay abreast of important algorithm and policy changes on Google's end. On Wednesday Jan 18th, we noticed dramatic, disturbing changes to our SERPs. Our formerly very stable positions for thousands of core keywords dropped. In addition, there is no snippet in the SERPs and no cached page for these results. Webmaster tools shows our sitemap most recently successfully downloaded by Google on Jan 14th. Over the weekend and monday the 16th, our cloud hosted site experienced some downtime here and there. I suspect that the sudden issues we are seeing are being caused by one of three possibilities: 1. Google came to crawl when the site was unavailable.
Web Design | | jamestown
However, there are no messages in the account or crawl issues otherwise noted to indicate this. 2. There is a malicious link spam or other attack on our site. 3. The last week of December 2011, we went live with Schema.org rich tagging on product level pages. The testing tool validates all but the breadcrumb, which it says is not supported by Schema. Could Google be hating our Schema.org microtagging and penalizing us? I sort of doubt bc category/subcategory pages that have no such tags are among those suffering. Whats odd is that ever since we went live with Schema.org, Google has started preferring very thin content pages like video pages and articles over our product pages. This never happened in the past. the site is: www.jamestowndistributors.com Any help or ideas are greatly, greatly appreciated. Thank You DMG0 -
What is the best SEO friendly CMS platform?
Would like to build some content sites and would appreciate some guidance on what you believe to be the most SEO friendly CMS platform. I'm also hoping to find something that doesn't require much HTML and is pretty easy to jump into.
Web Design | | EdStaton0