Crawl reports URLs with duplicate content, but that's not the case
-
Hi guys!
Some hours ago I received my crawl report. I noticed several records with URLs flagged for duplicate content, so I opened those URLs one by one.
None of those URLs really had duplicate content, but I have a concern: the website is a product showcase, and many articles are just images with links behind them. Many of those articles use the same images, so maybe that's why the SEOmoz crawler raises the duplicate content flag. I wonder if Google has a problem with that too. See for yourself how it looks:
http://by.vg/NJ97y
http://by.vg/BQypE
Those two URLs are flagged as duplicates. Please ignore the language (Greek) and focus on the URLs and content.
PS: My example is simplified just for the purpose of my question.
-
Disclaimer: I just answered a question just like this on another thread, so I literally copied and pasted my response from there, and edited where necessary.
The SEOmoz web app uses a similarity threshold of 95% of the HTML code. This takes everything on the page into account, both hidden and visible.
In this case, it's counting all of the navigation and sidebar as well, which is significant. What's left of the unique content, the part that matters, makes up less than 5% of the code.
Here's a tool you can use to check the similarity: http://www.duplicatecontent.net/
I ran the pages through a couple of tools, which showed 98% similarity (but only 75% text similarity, which is good but not great).
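If you want to get a rough feel for this yourself, here is a minimal Python sketch of the idea. It is not the SEOmoz algorithm (the only detail given here is the 95% threshold); it uses the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure, and the two pages, the `strip_tags` helper, and the `THRESHOLD` name are made up for illustration.

```python
import difflib
import re

# Assumption: 95% threshold as quoted above; the real comparison method is unknown.
THRESHOLD = 0.95


def strip_tags(html: str) -> str:
    """Crudely drop tags and collapse whitespace to approximate the visible text."""
    text = re.sub(r"<[^>]+>", " ", html)
    return " ".join(text.split())


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio between 0.0 and 1.0 for two strings."""
    return difflib.SequenceMatcher(None, a, b, autojunk=False).ratio()


# Hypothetical pages: large shared navigation, tiny unique product area.
shared_nav = "<ul>" + "".join(
    f"<li><a href='/c/{i}'>Category {i}</a></li>" for i in range(50)
) + "</ul>"
page_a = (
    f"<html><body>{shared_nav}"
    "<a href='/p/1'><img src='a.jpg'></a><p>Red leather sofa</p></body></html>"
)
page_b = (
    f"<html><body>{shared_nav}"
    "<a href='/p/2'><img src='b.jpg'></a><p>Oak dining table</p></body></html>"
)

html_score = similarity(page_a, page_b)
text_score = similarity(strip_tags(page_a), strip_tags(page_b))
flagged = html_score >= THRESHOLD

print(f"HTML similarity: {html_score:.1%}")
print(f"Text similarity: {text_score:.1%}")
print(f"Would cross a 95% HTML threshold: {flagged}")
```

The point of the sketch is only that shared navigation and sidebar markup can dominate the comparison, so two pages with different products can still score above the threshold.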
SEOKeith is absolutely right that there's very little on those pages to help them rank. Without text, you're fighting an uphill battle.
Hope this helps! Best of luck with your SEO.
-
Yeah, that's what I'm going to do in my next meeting. Either way, I also feel such websites need more pictures than anything else. Maybe a blog page or separate pages with articles could link to those products one by one with related descriptions, serving as a side-content website for the actual product pages.
-
Maybe explain to the client that it's not going to rank as well without text and has less chance of being found in searches (generally speaking...).
I get duplicate content flags as well sometimes; I check the pages manually when it happens.
-
Thanks Keith. I've been using SEOmoz for only a few days, so I wasn't sure about this.
The client wants a website with as little text as possible, so I guess my only hopes are title and alt attributes.
-
Those pages are very similar, so that's probably what is tripping the duplicate content flag in SEOmoz. You might want to ignore it in this case.
Personally, I would add some more text to those pages to aid with ranking; you can position the text over the images with CSS.