How does SEOmoz calculate duplicate content?
-
First of all, I have too much duplicate content on my website and I am cleaning it up. But when I look at GWMC, it reports a lot less duplicate content than SEOmoz does. Can someone explain to me what the difference is?
Thanks, Leonie.
-
Hi Andre, thanks for the reply. I'll read it.
-
Moz doesn't just look at the text of a page, it also looks at the template and how "similar" it appears compared to other pages.
Here's a quote from Dr. Pete:
"Our system currently uses a threshold of 95% to determine whether content is duplicated. This is based on the source code (not the text copy), so the amount of actual duplicate content may vary depending on the code/content ratio."
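To make the code/content ratio point concrete, here is a rough sketch in Python. It is not Moz's actual algorithm, which is not described here. It assumes `difflib.SequenceMatcher` as a stand-in similarity measure, and the `render` template, the two product descriptions, and the `visible_text` tag stripper are all made up for illustration. Two pages share a heavy template and differ only in one short paragraph, so they score above 95% on source code even though their visible text has little in common.

```python
import re
from difflib import SequenceMatcher

THRESHOLD = 0.95  # the 95% figure from the quote above


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio (a stand-in, not Moz's real metric)."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def visible_text(html: str) -> str:
    """Crude tag stripper, for illustration only."""
    return " ".join(re.sub(r"<[^>]+>", " ", html).split())


def render(body: str) -> str:
    """Hypothetical page template: lots of markup, very little copy."""
    header = "<div class='menu-item' data-widget='nav'></div>" * 20
    footer = "<span class='footer-icon' data-widget='ft'></span>" * 20
    return f"<html><body>{header}<p>{body}</p>{footer}</body></html>"


page_a = render("Blue cotton shirt with short sleeves.")
page_b = render("Hand-painted ceramic mug, 350 ml.")

# Compare the raw source code, then only the text a visitor would read.
source_score = similarity(page_a, page_b)
text_score = similarity(visible_text(page_a), visible_text(page_b))

print(f"source similarity: {source_score:.2f}, flagged: {source_score >= THRESHOLD}")
print(f"text similarity:   {text_score:.2f}, flagged: {text_score >= THRESHOLD}")
```

In this sketch the shared template dominates the source comparison, which is one plausible reason a source-based crawler can report more duplicates than a tool that weighs the page text differently.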
Here are a few articles you can read to get a deeper understanding.
http://www.seomoz.org/blog/duplicate-content-in-a-post-panda-world
http://www.seomoz.org/blog/duplicate-content-block-redirect-or-canonical
http://www.seomoz.org/blog/the-illustrated-guide-to-duplicate-content-in-the-search-engines
http://www.seomoz.org/blog/rethinking-duplicate-content
http://www.seomoz.org/blog/fat-pandas-and-thin-content
Greg
Related Questions
-
Managing Removed Content
I am a real estate developer. Once a home goes off the market (is sold), I had been returning a 404 for that page. The problem: when the home goes on the market again, Google will not re-index the new page (same URL). I have also tried to manage it a different way. Instead of removing the page, I left it as-is. At some later point in time, the house goes back on the market and the page is refreshed with new content. However, Google decides to use the cached version. Please note that in either case, the property appears on the main page for a period of indexing. I have been doing this for 10 years, and the problem is increasing with time.
Web Design | | Buckey0 -
Duplicate Product Descriptions for Each Variant
Hi, I am setting up a Shopify e-commerce store and I have a question about duplicate product descriptions. I have written unique product descriptions for all our products. Each product has at least 10 color options. I am thinking that it would look better if I created each color variant as a unique product, i.e. store.com/nice-shirt-blue, store.com/nice-shirt-red, etc. Here is the kicker: would I be penalized for using the same product description for each color variant?
Web Design | | Jon_B0 -
Question re. crawlable textual content
I have a client who is struggling to fit crawlable textual content on their pages. I'm wondering if we can add a "Learn More..." feature that works as a mouseover pop-up. When a page visitor runs their cursor over the link or button, a bubble pops up and shows textual content about the page. I don't know much about code, so: can text in this format be crawled by search engines and count as unique and relevant content? Thanks, Dino
Web Design | | Dino640 -
Getting a lot more duplicate content warnings than I expected.
I run WordPress on many of my sites, and a site crawl has found MANY duplicate content pages on the latest domain I started a campaign for. I expected to see quite a lot on the tag pages that only had one post, but even tag pages with multiple posts, and author and category pages with many posts, are showing as duplicate content. Is it normal for a WordPress site to have so many duplicate content warnings from the taxonomy pages? I have the option to bulk "noindex, follow" the category and tag pages, but should I do it? I get some traffic directly to the tag pages, so removing them from search results would dent the site's traffic a little (generally high bounce rate, low engagement traffic anyway), but could removing the apparent duplicate content actually improve the article pages themselves? Or does anyone have any WordPress-specific advice for making the pages not count as duplicate content? I've toyed with the idea of just displaying excerpts, but that would mean creating manual excerpts for 4 years' worth of posts, some on subjects I have no personal knowledge of, so other suggestions are welcome.
Web Design | | williampatton0 -
Duplicate H1 tag IF it holds SAME text?
Hello people, I know that the majority of SEO gurus (?) claim that the H1 tag should only be used once per page. In the landing page design I'm working with, we actually need to repeat the core message stated in our H1 and H2 at the bottom of the page. Now the question is: can that in any way cause a ranking penalty from big G? In my eyes it is not an attempt to over-optimize the page, as it contains the SAME info as the H1 and H2 at the top of the page. It's confusing, so I hope that some SEO gurus here will shed some light on this. Thanks in advance!
Web Design | | RetroOnline0 -
Duplicate Page Content mysite.com and mysite.com/index.html MOZ Dashboard
According to the Moz Dashboard, my site shows Duplicate Page Content for mysite.com and mysite.com/index.html. What can I do about that? Should I redirect mysite.com/index.html to mysite.com? If so, how can I do that using the .htaccess file?
Web Design | | innofidelity0 -
Why is there no sitemap.xml for SEOmoz?
I noticed that SEOmoz does not have a "root" sitemap called sitemap.xml. On the other hand, there do appear to be sitemaps for various sections of the site, such as http://www.seomoz.org/blog-sitemap.xml. I was planning on having a root-level sitemap that referenced different sections of my site (blog, support, etc.), but I'm a little concerned that this site itself doesn't seem to be following that practice. Presumably this website is submitting the individual section sitemaps to Google directly, since they aren't linked through sitemap.xml?
Web Design | | schof0 -
Lazy Loading Content and SEO
I've been seeing a lot of websites use a technique called "lazy loading" to present content to visitors as they scroll down the page. Does this hinder SEO and indexing, since the content is not actually on the page until the user acts or requests it?
Web Design | | JusinDuff0