Moz Q&A is closed.
After more than 13 years and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we're not completely removing the content (many posts will still be viewable), we have locked both new posts and new replies.
Google keeps marking different pages as duplicates
-
My website has many pages like this:
mywebsite/company1/valuation
mywebsite/company2/valuation
mywebsite/company3/valuation
mywebsite/company4/valuation
...
These pages describe the valuation of each company.
These pages were never identical, but initially I included a few generic paragraphs (what valuation is, what a valuation model is, etc.) on every page, so some parts of their content were identical.
Google marked many of these pages as duplicates (in Google Search Console), so I modified them: I removed the generic paragraphs and added information that is unique to each company. As a result, the pages are now very different from each other and have few similarities.
Although it has been more than a month since I made the change, Google still marks the majority of these pages as duplicates, even though it has already crawled the new versions. Is there anything else I can do in this situation?
Thanks
-
Google may mark different pages as duplicates if they contain very similar or identical content. This can happen because of duplicate metadata (titles and descriptions), URL parameters, or syndicated content. To address this, ensure each page has unique and valuable content and use canonical tags where appropriate. Note that the URL Parameters tool in Google Search Console was retired in 2022, so parameter variations now have to be handled with canonical tags, redirects, or consistent internal linking.
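One way to check how similar two pages still are is to compare their overlapping word sequences ("shingles"). The sketch below is only a rough approximation and not how Google measures duplication; the function names, the shingle size, and the sample texts are all made up for illustration.

```python
import re


def shingles(text: str, k: int = 5) -> set:
    """Return the set of k-word shingles (overlapping word windows) in text."""
    words = re.findall(r"[a-z0-9]+", text.lower())
    if len(words) < k:
        # Text shorter than one window: treat the whole text as one shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str, k: int = 5) -> float:
    """Jaccard similarity of the shingle sets: 0.0 = disjoint, 1.0 = identical."""
    sa, sb = shingles(a, k), shingles(b, k)
    if not sa and not sb:
        return 1.0  # two empty texts are trivially identical
    return len(sa & sb) / len(sa | sb)


# Hypothetical page texts. In practice, compare the full rendered text of
# each page, including navigation, headers and footers, because the shared
# template counts towards similarity too.
page1 = ("Valuation is the process of estimating what a company is worth. "
         "Company1 trades at 12 times earnings.")
page2 = ("Valuation is the process of estimating what a company is worth. "
         "Company2 trades at 30 times earnings.")

print(round(jaccard(page1, page2, k=3), 2))
```

If pairs of pages still score high after the rewrite, the shared template or remaining common text may outweigh the unique content on each page.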
-
Yes, there are a few other things you can do if Google is still marking your pages as duplicates after you have modified them to be unique:
-
Check your canonical tags. A canonical tag tells Google which URL you prefer to have indexed, although Google treats it as a hint rather than a directive. Each of these pages should carry a canonical tag that points to its own URL. If a shared template makes every page point to the same URL, Google is likely to keep treating them as duplicates however different the content is.
-
Check how URL parameters are handled. The URL Parameters tool that used to be in Google Search Console was retired in 2022, so it can no longer be used to tell Google which parameters to ignore. If you have pages with the same content but different URL parameters, such as different sorting options, point them to the preferred URL with canonical tags and link internally only to that URL.
-
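Request a recrawl. In Google Search Console, open the URL Inspection tool and choose "Request indexing" for the affected pages, or resubmit your sitemap. The indexing report can lag behind the crawl, so the duplicate status may take some time to change even after Google has seen the new versions.
If you have done all of the above and Google is still marking your pages as duplicates, ask in the Google Search Central Help Community; Google does not offer one-to-one support for organic search issues.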
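The canonical check in the first point can be automated. The sketch below uses only the Python standard library to read the canonical tag from a page's HTML and report whether it points back to the page itself. The class name, function name, and example.com URLs are hypothetical, and the trailing-slash handling is an assumption about how the site treats URLs.

```python
from html.parser import HTMLParser


class CanonicalFinder(HTMLParser):
    """Collect the href of every <link rel="canonical"> in an HTML document."""

    def __init__(self) -> None:
        super().__init__()
        self.canonicals = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attr = dict(attrs)
        rel = (attr.get("rel") or "").lower().split()
        if "canonical" in rel and attr.get("href"):
            self.canonicals.append(attr["href"].strip())


def canonical_status(page_url: str, html: str) -> str:
    """Classify the canonical tag as missing, multiple, self, or points elsewhere."""
    finder = CanonicalFinder()
    finder.feed(html)
    if not finder.canonicals:
        return "missing"
    if len(finder.canonicals) > 1:
        return "multiple"
    # Assumption: URLs that differ only by a trailing slash are the same page.
    same = finder.canonicals[0].rstrip("/") == page_url.rstrip("/")
    return "self" if same else "points elsewhere"


# Hypothetical example: the template hard-codes company1 as the canonical URL.
html = ('<html><head><link rel="canonical" '
        'href="https://example.com/company1/valuation"></head></html>')
print(canonical_status("https://example.com/company1/valuation", html))
print(canonical_status("https://example.com/company2/valuation", html))
```

A result of "points elsewhere" on most of the valuation pages would suggest a template problem rather than a content problem.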
-
-
If Google is marking different pages on your website as duplicates, it can negatively impact your website's search engine rankings. Here are some common reasons why Google may be doing this and steps you can take to address the issue:
Duplicate Content: Google's algorithms are designed to filter out duplicate content from search results. Ensure that your website does not have identical or near-identical content on multiple pages. Each page should offer unique and valuable content to users.
URL Parameters: If your website uses URL parameters for sorting, filtering, or tracking purposes, Google may interpret these variations as duplicate content. Use canonical tags to specify which version of the URL you want to be indexed; the URL Parameters tool in Google Search Console was retired in 2022 and is no longer available.
Pagination: For websites with paginated content (e.g., product listings, blog archives), give each page in the series its own URL and a self-referencing canonical tag, and link the pages to each other with ordinary links. Google stated in 2019 that it no longer uses rel="next" and rel="prev" as an indexing signal, so these tags alone will not prevent duplicate flags.
www vs. non-www: Make sure you have a preferred domain (e.g., www.example.com or example.com) and set up 301 redirects to the preferred version. Google may treat www and non-www versions as separate pages with duplicate content.
HTTP vs. HTTPS: Ensure that your website uses secure HTTPS. Google may view HTTP and HTTPS versions of the same page as duplicates. Implement 301 redirects from HTTP to HTTPS to resolve this.
Mobile and Desktop Versions: If you serve separate mobile URLs (e.g., m.example.com), use rel="alternate" on the desktop page and rel="canonical" on the mobile page to specify the relationship between the two versions. A responsive design serves one URL to all devices and needs no such annotations.
Thin or Low-Quality Content: Pages with little or low-quality content may be flagged as duplicates. Improve the content on such pages to provide unique value to users.
Canonical Tags: Implement canonical tags correctly to indicate the preferred version of a page when there are multiple versions with similar content.
XML Sitemap: Ensure that your XML sitemap is up-to-date and accurately reflects your website's structure. Submit it to Google Search Console.
Avoid Scraped Content: Ensure that your content is original and not scraped or copied from other websites. Google usually filters duplicates out of its results rather than penalizing the site, but content copied to manipulate rankings can violate its spam policies.
Check for Technical Errors: Use Google Search Console to check for crawl errors or other technical issues that might be causing duplicate content problems.
Structured Data: Ensure that your structured data (schema markup) is correctly implemented on your pages. Incorrectly structured data can confuse search engines.
Regularly monitor Google Search Console for any duplicate content issues and take prompt action to address them. It's essential to provide unique and valuable content to your website visitors while ensuring that search engines can correctly index and rank your pages.
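Several of the points above (URL parameters, www vs. non-www, HTTP vs. HTTPS) come down to one page being reachable under more than one URL. The sketch below groups a list of crawled URLs by a normalized form to surface such variants. The parameter list, function names, and example.com URLs are assumptions for illustration; adjust them to what your site actually uses.

```python
from collections import defaultdict
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Assumed list of parameters that do not change page content on this site.
IGNORED_PARAMS = {"utm_source", "utm_medium", "utm_campaign", "sort", "ref"}


def normalize(url: str) -> str:
    """Collapse scheme, www prefix, trailing slash and ignored parameters."""
    parts = urlsplit(url)
    host = parts.netloc.lower()
    if host.startswith("www."):
        host = host[4:]
    query = [(k, v)
             for k, v in parse_qsl(parts.query, keep_blank_values=True)
             if k.lower() not in IGNORED_PARAMS]
    path = parts.path.rstrip("/") or "/"
    return urlunsplit(("https", host, path, urlencode(sorted(query)), ""))


def group_variants(urls):
    """Return normalized URL -> variants, only where more than one variant exists."""
    groups = defaultdict(list)
    for url in urls:
        groups[normalize(url)].append(url)
    return {key: found for key, found in groups.items() if len(found) > 1}


# Hypothetical crawl output.
crawled = [
    "http://www.example.com/company1/valuation/",
    "https://example.com/company1/valuation?sort=asc",
    "https://example.com/company2/valuation",
]
for preferred, variants in group_variants(crawled).items():
    print(preferred, variants)
```

Each group that comes back is a candidate for a 301 redirect or a canonical tag pointing to the single preferred URL.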