Noindex pages being indexed
-
Hi all
Wondering if anyone could offer a pointer on a problem i am having please. I am developing an affiliate store and to prevent problems with duplicate content I have added name="robots" content="NOINDEX,FOLLOW" /> to all the product pages to avoid google penalties.
However, Google appears to be indexing product pages. When I do a site: search I see a few hundred product pages in the engine. This is odd as the site has always had noindex on these pages. Even viewing the cache of the indexed page shows the noindex meta tag to be in place.
I'm at a loss as to why these pages are being indexed and could do with removing them asap to stop any penalties on the site.
Many thanks for any help.
-
Thanks for taking the time to look at the site.
Not sure why the code is coming in the wrong place it is using a magento seo plugin so will need to chase them up on that. Just searched a random selection of pages and the code seems to be in the header section on all of those, so it seems there are some pages not playing nicely.
I would like to index the product pages but there are over 250,000 items pulled from the merchants and no chance of writing that much unique content so I feel safest to noindex them all. The main traffic strategy will be to use content which will promote items, such as fashion advice pieces etc.
In the example you give, that seems to be a problem with the categories, will check that out, thanks for pointing it out.
-
Okay, i have found the indexing problem. The noindex meta tag should be placed within the head section of your HTML. It is now in your body content so it's not getting used.
Are you sure you want to no-index product pages though? These are very important pages on your site and there are other ways to fix duplicate content issues.
Most duplicate content issues on pages comes from the fact that most CMS systems have the same product at different URL's. On your site i see http://yochic.co.uk/dress-267864/bodycon/black-cut-out-side-bodycon-dress.html and http://yochic.co.uk/black-cut-out-side-bodycon-dress.html
Same product, different URL. This could be fixed by adding a rel="canonical" on product pages which links back to the preferred version. Easiest would be to link back to www.domain.com/product.html
This would solve most duplicate content issues without having to deindex them.
Hope i was helpful, if you have any questions left, feel free to ask me.
-
hi, yes, maybe should have added that to start with. The site is yochic. co. uk - it is still being worked on so bear that in mind with random products in the wrong places or wonky looking menus.
The category pages are set to be indexed so no problem there, the product pages are the ones which contain the most duplicate content so have been blocked, but a number are still going in the listing.
-
Perhaps you could give us a link to your website using the following notation: domainname (dot) TLD It's difficult to tell you what the problem is without being able to take a look at your site.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Why my website some pages is not index in google ?
Hi, I have submitted my pages in Google fetch for consideration tool but they are not indexed yet in the Google search. Additionally, there is also no error shown by the Google.
On-Page Optimization | | seo.kishore890 -
Category page canonical tag
I know this question has been asked a few times on here but I'm looking for very specific advice. Currently when you go to a category, say http://www.bronterose.co.uk/range.html, a canonical tag is added to the head of the page. There are plenty of "variant" pages which carry the same tag, for example: /range.html?p=2
On-Page Optimization | | crichardson9
/range.html?p=3
/range.html?dir=asc&order=price
/range.html?dir=asc&limit=all&order=price Is it wise to push the "link juice" for each of these variant pages to the top level page? Or should each variant page have its own unique canonical tag? After reading many blog posts, guides and papers I'm truly confused! Any general guidance or recommendations would be much appreciated. Chris.1 -
Deleted pages still registering as 404 pages.
I have a an all html site that I can only work on through the ftp. The previous marketing company ran a script that built thousands of location landing pages, but all they did was change the tags and headers and the keywords in the pages, other than that they are all duplicate pages. I removed them, but Google is reading them as 404 pages. How do I tell Google those pages don't exist? or do I just need to let the bots crawl it a few times and it will see that eventually?
On-Page Optimization | | SwanJob0 -
On-Page Report Card: Whats up with the TITLE of the page?
Started to fix the SEO issues on a customers website. When I run a "On-Page Report Card" It says that the title of the webpage:
On-Page Optimization | | maklarlabbet
www.visbymaklarna.se/visbymaklarna.html Is "visbymäklarna - Ditt förstahandsval på gotland." But if I check in the source code of the webbrowser the title should be:
name="title" content="Vi är mäklarna på Gotland som sätter människan i första rummet" /> (Actually this is with special encoding for the swedish characters. The title in coded text is: "Vi är mäklarna på Gotland som sätter människan i första rummet") Anyway the title of the webpage source code and the title of what SEOmoz reports is different. Why is that?0 -
Locating Duplicate Pages
Hi, Our website consists of approximately 15,000 pages however according to our Google Webmaster Tools account Google has around 26,000 pages for us in their index. I have run through half a dozen sitemap generators and they all only discover the 15,000 pages that we know about. I have also thoroughly gone through the site to attempt to find any sections where we might be inadvertently generating duplicate pages without success. It has been over six months since we did any structural changes (at which point we did 301's to the new locations) and so I'd like to think that the majority of these old pages have been removed from the Google Index. Additionally, the number of pages in the index doesn't appear to be going down by any discernable factor week on week. I'm certain it's nothing to worry about however for my own peace of mind I'd like to just confirm that the additional 11,000 pages are just old results that will eventually disappear from the index and that we're not generating any duplicate content. Unfortunately there doesn't appear to be a way to download a list of the 26,000 pages that Google has indexed so that I can compare it against our sitemap. Obviously I know about site:domain.com however this only returned the first 1,000 results which all checkout fine. I was wondering if anybody knew of any methods or tools that we could use to attempt to identify these 11,000 extra pages in the Google index so we can confirm that they're just old pages which haven’t fallen out of the index yet and that they’re not going to be causing us a problem? Thanks guys!
On-Page Optimization | | ChrisHolgate0 -
Your thoughts on page navigation
Hi again SEOmoz community. What are your thoughts on mainpage navigation. How it should be handled? Scenario 1. - links to the main sections with a mouse rollover feature where it shows subsections of the main sections Scenario 2. - links to the main sections, but the subsections are hidden and only visible on the click Scenario 3. links to the main sections and subsections allways visible I would like to hear you oppinions on this. What did you find as the best featrue, or did you try to find someting new entirely. What do you think would be the best scenario SEO wise and in the light of keeping links on page in decent numbers 🙂 Imo, Scenario 2 is the option to go with. Tnx in advance for all your replys. Sincerely, sinisa
On-Page Optimization | | TataSinke0 -
Exstinguishing Page Rank?
Hi Guys, Here is a thought. Google gives more weight to links in content, less to navigation etc.. Therefore if they say give 50% of a pages rank to a link within content and 50% to the the other elements. What happens to the total pagerank from a page if you have not utilized in page content links? (is it lost) If this is the case, and you have a site that does not use content links on every page, are you loosing value (and hard earned) pagerank. Google did mention sometime back about pagerank being exstinguished with the nofollow tag. I would be interested to hear what others think? Cheers Scott
On-Page Optimization | | Jurnii0