Are there any tricks for checking duplicate content?
-
Hello all,
My Moz weekly scan keeps flagging duplicate content on pages that don't have that much in common. I feel like I must be missing something. Is there anywhere I can plug in two URLs to see what the similarities are, so I can figure out how to make the pages less duplicative?
https://www.stage32.com/happy-writers/pitch-sessions/Pitch-Amar-Hansen-Saturday-October-29th-2016
and
https://www.stage32.com/happy-writers/pitch-sessions/Pitch-Will-Raynor-Wednesday-November-16th-2016
OR
https://www.stage32.com/webinars/Film-Contracts-101-Everything-You-Need-To-Know
and
https://www.stage32.com/webinars/Breaking-Down-IP-Intellectual-Property-For-Development
Any ideas? Thanks in advance.
-
Hi Tomasz,
Loading with a delay may change what a tool reports, I guess (though the tools may wait until the page is fully loaded/DOM ready or something?), but to be honest I don't think it'll solve the issue as such, as the dupe content would still be on the page.
If you don't like the idea of a link opening in a new tab, there's always the option of an iframe I guess, on the relevant tabs? Though it'd be a bit of work to make it responsive.
-
Thanks Tawny, I'm checking it out right now.
You rock!
T.
-
Thanks so much, Mike. I didn't consider the remaining tabs. I suppose loading them with a delay might also be an option. I appreciate your speedy response.
Cheers,
Tomasz
-
Hmmm looking at the first 2 pages, I think I can see what's happening here
Not 100% on how Moz's tool figures out dupe content, but I think this is why (3rd point is strongest, I think):
- Similar URLs (first part up to the last folder, then first word, then both ending in 2016) means the URL is a close match
- The title tags are also quite similar
- Perhaps most importantly, it does look like the VAST MAJORITY of the content on the page is in fact duplicate of other pages. It's not obvious at first BUT... To see what I mean, click on the 'Guidelines' tab (looks like jQuery tabs or similar) and we get a big amount of content - the most on the page - that's duplicated between both pages.
Perhaps consider making that a link to a dedicated page that has this content, opening in a new tab or similar?
For your second pair of examples, check 'About Your Instructor' and 'FAQs'. Where content is repeated over and over, I'd advise moving it to its own page and linking to it from each page that needs it, rather than having the same several hundred words indexed on many pages.
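To illustrate why a large shared block can make two otherwise different pages look like duplicates, here is a minimal Python sketch. It uses word-shingle Jaccard similarity, which is a common near-duplicate measure but is only an assumption here, not a description of how Moz's crawler actually scores pages. The page text and the names `shingles` and `jaccard_similarity` are hypothetical.

```python
import re


def shingles(text, size=3):
    """Return the set of overlapping word n-grams ("shingles") in text."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    if len(words) < size:
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def jaccard_similarity(text_a, text_b, size=3):
    """Fraction of shingles the two texts share, from 0.0 to 1.0."""
    a, b = shingles(text_a, size), shingles(text_b, size)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


# Hypothetical page text: a short unique intro per page...
intro_a = "Pitch your script to executive A on Saturday."
intro_b = "Pitch your script to executive B on Wednesday."

# ...plus a longer block that is identical on both pages (e.g. a shared tab).
guidelines = (
    "Guidelines: arrive ten minutes early, keep your pitch under eight "
    "minutes, have a logline ready, do not send attachments unless asked, "
    "and expect written feedback within two weeks of the session."
)

intro_only = jaccard_similarity(intro_a, intro_b)
full_page = jaccard_similarity(intro_a + " " + guidelines,
                               intro_b + " " + guidelines)

print(f"unique intros only:       {intro_only:.2f}")
print(f"with shared block added:  {full_page:.2f}")
```

The score rises sharply once the shared block is included, which is the effect described above: the longer the repeated content is relative to the unique content, the more alike the pages look to any tool that compares text.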
Hope that helps?
P.S. There may be a better solution for displaying the content (iframing it into the tab, etc.), but if you go down that route be careful: it's an effort to make iframes responsive!
-
Hey there! Tawny from Moz's Customer Support team here.
One tool we use pretty frequently to take a look at the similarity between two pages is this one: http://smallseotools.com/similar-page-checker/
That won't give you suggestions for how to differentiate the content on those pages, but it'll do a pretty good job of pointing out what on the page is similar, which could help highlight areas where you could bulk up the content to differentiate it a bit.
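For a rough local equivalent of "pointing out what on the page is similar", the sketch below uses Python's standard `difflib.SequenceMatcher` to list runs of words that appear in both texts. It assumes you have already copied the visible text of each page into a string; the sample text and the helper name `shared_passages` are hypothetical, and this is not how the tool linked above works internally.

```python
from difflib import SequenceMatcher


def shared_passages(text_a, text_b, min_words=5):
    """Return runs of at least min_words words that appear in both texts."""
    words_a, words_b = text_a.split(), text_b.split()
    # autojunk=False stops difflib from ignoring very frequent words.
    matcher = SequenceMatcher(None, words_a, words_b, autojunk=False)
    return [
        " ".join(words_a[m.a:m.a + m.size])
        for m in matcher.get_matching_blocks()
        if m.size >= min_words
    ]


# Hypothetical visible text pasted from two pages.
page_a = ("Film Contracts 101 covers option agreements. "
          "FAQs: all webinars are recorded and available on demand for one year.")
page_b = ("Breaking Down IP covers rights and chain of title. "
          "FAQs: all webinars are recorded and available on demand for one year.")

for passage in shared_passages(page_a, page_b):
    print(passage)
```

Lowering `min_words` surfaces shorter overlaps (navigation labels, button text), while raising it keeps only the long repeated blocks that are most likely to be driving a duplicate-content flag.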
I hope this helps! If you have any other questions or if there's anything that needs clarifying, feel free to send us a note at help@moz.com!