Google keeps marking different pages as duplicates
-
My website has many pages like this:
mywebsite/company1/valuation
mywebsite/company2/valuation
mywebsite/company3/valuation
mywebsite/company4/valuation
...
These pages describe the valuation of each company.
These pages were never identical, but initially I included a few generic paragraphs (what is valuation, what is a valuation model, etc.) in all of them, so some parts of their content were identical.
Google marked many of these pages as duplicates (in Google Search Console), so I modified their content: I removed the generic paragraphs and added other information that is unique to each company. As a result, these pages are now very different from each other and have few similarities.
Although it has been more than a month since I made the change, Google still marks the majority of these pages as duplicates, even though it has already crawled the new versions. Is there anything else I can do in this situation?
Thanks
-
Google may mark different pages as duplicates if they contain very similar or identical content. This can happen due to issues such as duplicate metadata, URL parameters, or syndicated content. To address this, ensure each page has unique and valuable content, and use canonical tags to point any parameterized or alternate URLs at the preferred version. Note that the URL Parameters tool in Google Search Console was retired in 2022, so parameters can no longer be managed there.
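To check how much text two pages still share after a rewrite, a rough local comparison can help. The sketch below is only an illustration: it compares overlapping word windows ("shingles") with Jaccard similarity, which is a common near-duplicate heuristic and not a description of how Google decides. The function names and the window size `k` are my own choices. Feed it the full visible text of each page, including anything the shared template adds, since shared navigation and footers count too.

```python
from __future__ import annotations

import re


def shingles(text: str, k: int = 5) -> set[tuple[str, ...]]:
    """Return the set of k-word shingles (overlapping word windows) in text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        # Text shorter than one window: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str, k: int = 5) -> float:
    """Jaccard similarity of the two shingle sets: 0.0 = disjoint, 1.0 = identical."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two empty texts are trivially identical
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    # Hypothetical page texts; in practice, use the extracted text of two real pages.
    page_1 = "Company One is valued using a discounted cash flow model with a ten percent rate"
    page_2 = "Company Two is valued using a discounted cash flow model with an eight percent rate"
    print(f"similarity: {jaccard(page_1, page_2, k=3):.2f}")
```

If pairs of pages score high even after the rewrite, the remaining overlap is probably coming from the template rather than the body text.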
-
Yes, there are a few other things you can do if Google is still marking your pages as duplicates after you have modified them to be unique:
-
Check your canonical tags. Canonical tags tell Google which version of a page is the preferred one to index. Each company page should carry a canonical tag that points to its own URL. If a shared template points all of them at the same URL, or the tag is missing, Google may keep grouping the pages together. Once the tags are correct, Google should eventually recognize that the pages are not duplicates.
-
Check for URL parameter variants. Google retired the URL Parameters tool in Search Console in 2022, so you can no longer tell Google there which parameters to ignore. If the same page is reachable under several parameterized URLs, such as different sorting options, use canonical tags and consistent internal links to point at the preferred URL instead.
-
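Request a recrawl of your website. In Google Search Console, the URL Inspection tool lets you request indexing for individual URLs; for a large number of pages, resubmitting your sitemap is more practical. Keep in mind that recrawling and re-evaluating many pages can take Google several weeks.
If you have done all of the above and Google is still marking your pages as duplicates, consider asking in the Google Search Central Help Community, since Google does not offer one-to-one support for Search Console.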
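The canonical check above is easy to automate for a handful of pages. This is a minimal sketch using only Python's standard library; the class name, function name, and status labels are my own, and it assumes you already have each page's HTML as a string (for example, saved from "View source").

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of every <link rel="canonical"> tag in an HTML document."""

    def __init__(self):
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        # rel may hold several space-separated values, so split before matching.
        rel_values = (attr.get("rel") or "").lower().split()
        if "canonical" in rel_values and attr.get("href"):
            self.canonicals.append(attr["href"])


def canonical_status(html, page_url):
    """Classify a page's canonical tag as missing, multiple, self, or points elsewhere."""
    finder = CanonicalFinder()
    finder.feed(html)
    if not finder.canonicals:
        return "missing"
    if len(finder.canonicals) > 1:
        return "multiple"
    # Ignore a trailing slash when comparing the tag with the page's own URL.
    if finder.canonicals[0].rstrip("/") == page_url.rstrip("/"):
        return "self"
    return "points elsewhere"
```

For the pages in the question, every `/companyN/valuation` URL should come back as "self". A result of "points elsewhere" on many pages would suggest the template hard-codes one canonical URL for all of them.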
-
If Google is marking different pages on your website as duplicates, it can negatively impact your website's search engine rankings. Here are some common reasons why Google may be doing this and steps you can take to address the issue:
Duplicate Content: Google's algorithms are designed to filter out duplicate content from search results. Ensure that your website does not have identical or near-identical content on multiple pages. Each page should offer unique and valuable content to users.
URL Parameters: If your website uses URL parameters for sorting, filtering, or tracking purposes, Google may interpret these variations as duplicate content. Use canonical tags to specify which version of the URL you want to be indexed. The URL Parameters tool in Google Search Console was retired in 2022 and is no longer an option.
Pagination: For websites with paginated content (e.g., product listings, blog archives), give each page in the series its own self-referencing canonical URL and link the pages to each other so Google can follow the sequence. Google stated in 2019 that it no longer uses rel="next" and rel="prev" as an indexing signal, although the tags are harmless and other search engines may still read them.
www vs. non-www: Make sure you have a preferred domain (e.g., www.example.com or example.com) and set up 301 redirects to the preferred version. Google may treat www and non-www versions as separate pages with duplicate content.
HTTP vs. HTTPS: Ensure that your website uses secure HTTPS. Google may view HTTP and HTTPS versions of the same page as duplicates. Implement 301 redirects from HTTP to HTTPS to resolve this.
Mobile and Desktop Versions: If you have separate mobile and desktop URLs (e.g., m.example.com alongside www.example.com), use rel="alternate" on the desktop page and rel="canonical" on the mobile page to specify the relationship between the two versions. A responsive design serves a single URL to all devices and does not need these tags.
Thin or Low-Quality Content: Pages with little or low-quality content may be flagged as duplicates. Improve the content on such pages to provide unique value to users.
Canonical Tags: Implement canonical tags correctly to indicate the preferred version of a page when there are multiple versions with similar content.
XML Sitemap: Ensure that your XML sitemap is up-to-date and accurately reflects your website's structure. Submit it to Google Search Console.
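Avoid Scraped Content: Ensure that your content is original and not scraped or copied from other websites. Google usually filters duplicate pages out of its results rather than penalizing them, but sites built on scraped or plagiarized content can be subject to manual action.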
Check for Technical Errors: Use Google Search Console to check for crawl errors or other technical issues that might be causing duplicate content problems.
Structured Data: Ensure that your structured data (schema markup) is correctly implemented on your pages. Incorrectly structured data can confuse search engines.
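Several items in this list (URL parameters, www vs. non-www, HTTP vs. HTTPS) come down to one page being reachable under more than one URL. Here is a rough sketch for spotting such variants in a list of crawled URLs. The preferred host, the set of tracking parameters, and the function names are assumptions for illustration only, and forcing every URL to HTTPS is a simplification that assumes the whole site is served over HTTPS.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical settings; adjust to your own site.
PREFERRED_HOST = "www.example.com"
TRACKING_PARAMS = {
    "utm_source", "utm_medium", "utm_campaign", "utm_term", "utm_content",
    "gclid", "fbclid",
}


def _strip_www(host):
    return host[4:] if host.startswith("www.") else host


def normalize(url):
    """Map a URL variant to the form that should be canonical."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    # Fold the www and non-www forms of the site into the preferred host.
    if _strip_www(host) == _strip_www(PREFERRED_HOST):
        host = PREFERRED_HOST
    # Drop tracking parameters and sort the rest so parameter order does not matter.
    query = sorted(
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in TRACKING_PARAMS
    )
    # Force HTTPS and discard the fragment, which is never sent to the server.
    return urlunsplit(("https", host, parts.path or "/", urlencode(query), ""))


def group_variants(urls):
    """Group URLs by normalized form; groups with more than one entry are variants."""
    groups = defaultdict(list)
    for url in urls:
        groups[normalize(url)].append(url)
    return dict(groups)
```

Any group with more than one member is a set of URLs that should either 301-redirect to the normalized form or declare it in a canonical tag.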
Regularly monitor Google Search Console for any duplicate content issues and take prompt action to address them. It's essential to provide unique and valuable content to your website visitors while ensuring that search engines can correctly index and rank your pages.
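For the XML sitemap item above, a small script can generate the file from a list of preferred URLs so that it only ever lists the version of each page you want indexed. This is a minimal sketch with Python's standard library; the function name is my own and the example URLs are placeholders. Only the required `<loc>` element of the sitemaps.org format is written.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a sitemap XML string with one <url><loc> entry per URL."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    body = ET.tostring(urlset, encoding="unicode")
    return '<?xml version="1.0" encoding="UTF-8"?>\n' + body


if __name__ == "__main__":
    # Placeholder URLs following the pattern from the question.
    pages = [f"https://www.example.com/company{n}/valuation" for n in range(1, 5)]
    print(build_sitemap(pages))
```

Save the output as sitemap.xml on your site and submit it under Sitemaps in Google Search Console.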