I have more pages in my site map being blocked by the robot file than I have being allowed to be crawled. Is Google going to hate me for this?
-
Using some rules to block all pages which start with "copy-of" on my website because people have a bad habit of duplicating new product listings to create our refurbished, surplus etc. listings for those products. To avoid Google seeing these as duplicate pages I've blocked them in the robot file, but of course they are still automatically generated in our sitemap. How bad is this?
-
When you say "people," are you saying your own web team duplicates content to make their job easier? Or am I missing something?...
If that's the case, you really should create unique URL's with unique page titles, product info, etc. That's the correct way to avoid getting hit for duplicate content - don't create it. It seems like what you're doing now is more of a band-aid solution to the problem.
I'd consider that even though creating unique content in situations like this can seem daunting and/or be more expensive, there's probably huge long-term gains to made if you do it right.
-
It is not bad, just not best practices because Google will still index the URL's if they are mentioned on other pages. Just to quote them:
"While Google won't crawl or index the content of pages blocked by robots.txt, we may still index the URLs if we find them on other pages on the web. As a result, the URL of the page and, potentially, other publicly available information..."
What I would do instead is either use rel="canonical" or 301 redirects. I hope that helps.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
URL, page title, item name - which is most important for google ranking
We are a bridal store and are able to use different information in the URL, Page title and item name. In item name we give the product a name for us to identify ie. Alex Lace Dress in Black/Nude, Ivory/Nude, Red/Red In Page Title we use the suppliers name and product code as well as the item name ie. Jadore j8075 Alex Lace Dress Online Australia URL = alex-lace-dress/ Are we using the correct format ? What could we do to improve them?
On-Page Optimization | | CostumeD0 -
When You Add a Robots.txt file to a website to block certain URLs, do they disappear from Google's index?
I have seen several websites recently that have have far too many webpages indexed by Google, because for each blog post they publish, Google might index the following: www.mywebsite.com/blog/title-of-post www.mywebsite.com/blog/tag/tag1 www.mywebsite.com/blog/tag/tag2 www.mywebsite.com/blog/category/categoryA etc My question is: if you add a robots.txt file that tells Google NOT to index pages in the "tag" and "category" folder, does that mean that the previously indexed pages will eventually disappear from Google's index? Or does it just mean that newly created pages won't get added to the index? Or does it mean nothing at all? thanks for any insight!
On-Page Optimization | | williammarlow0 -
One Page Website vs. Multipage Site, if you want to target one specific Keyword only.
Hello! suppose I want to start a website about, let's say spray adhesives. My aim is to rank on the first page for the keyword "spray adhesive". I don't care about my ranking on more specific keywords like "Tesa spray adhesive" or "3M spray adhesive". My ranking for more general keywords like "glue" is unimportant, too. So I thought about creating a single-page website, that writes about spray adhesives, the pros & cons of every manufacturer, and shows the best discounts for spray adhesives. Each section can be accessed through a top-navigation, that links via anchors to the individual sections. The page will be updated every day On the other hand, i could create a blog and write an article for every specific spray adhesive. So I would have a home page that lists the latest articles for every product, with titles like "3M spray adhesive CreativeMount", "3M spray adhesive SprayMount", "Tesa Spray adhesive" ... I will write one article every day What do you think would be the better strategy? Is there a risk to create competing articles for the keyword "spray adhesive" and thus rank lower if I go with the blog strategy? On the other hand, does google rate singe-page websites lower, because google thinks those websites are less valuable than websites with many pages for the same topic? Thank you ver much for you help in advance!
On-Page Optimization | | MGMT0 -
Robots.txt file
Does it serve any purpose if we omit robots.txt file ? I wonder if spider has to read all the pages, why do we insert robots.txt file ?
On-Page Optimization | | seoug_20050 -
Pages not cached
Sorry for all the questions. I have dozens of article pages that are not cached by google. How can I get them cached?
On-Page Optimization | | azguy0 -
Should I use a Page Name variable after the ? for a dynamic web page
I'm converting for static to dynamic web pages. It appears that the page name is used for page ranking in the search engines. Will adding a Page Name variable help to increase our SEO. For example aspecialgift.com/subcat.php?PageName=GiftPage&ProductID=ABCDE. Does the page name variable make a difference?
On-Page Optimization | | NCBob0 -
Can I have a strong brand category page and a strong product page?
It seems Google base and other Comparison Shopping Engines like to see the brand in the product name. But, on my category page for that brand, website optimizer tells me including the brand name with each product is cannabilizes links. For example; I have a page for jewelerABC with 20 pieces of jewelry listed as well as original content about jewelerABC. I do not currently name these products as xyz by jewelerABC. This page comes up nicely in the serps. But in Google base The top listings for jewelry by jewelerABC seem to have every product named xyz by jewelerABC or JewelerABC xyzs. What is the best way to optimize.for both? Stephen
On-Page Optimization | | stephenfishman0 -
Urgent, Duplicate page title and content at eCommerce site- how to solve
Hi, there, does anyone can help to solve 'duplicate page title, duplicate page content' problem? it is a eCommerce site, each categories has hundreds of products, so there are more than 10 pages, but the report crawl the errors, i totally have no idea, can anyone help? Thanks a lot! Anna
On-Page Optimization | | anna-2944510