Duplication Penalty through Specs?
-
I am trying to figure out how to correct a recently incurred duplication penalty on a partner site. I didn't see any posts specific to my problem yet. The site used to rank on page 1 of Google for all its important keywords, but many pages have now been bumped to position 100 or lower due to duplication issues. This is an aviation site discussing airplanes. Each page covers a different model, but each page also lists the specs of the plane, and while the data is different for each plane, the specification terms are the same, see here:
Primary Function:
Crew:
Engine:
Thrust:
Weight Empty:
Max. Weight:
Length:
Wingspan:
Cruise Speed:
Max. Speed:
Climb:
Ceiling:
Range:
First Flight:
Year Deployed:
Is there an easy way to get Google to stop including these terms (not the data in the 2nd column) in its page analysis, so that they stop causing the duplication issues we are seeing?
Thanks in advance!
-
Dear Dan. THANK YOU for this excellent information. We are still pretty new to this so this helps a lot!
We did some work already and are now also noticing strange behaviour around capitalization, which makes no sense to us at all; any thoughts on that? Here are some more details:
I am finding out some interesting things about G search results as I further explore what has happened to my site.
The results are different if you use lowercase or capital letters, such as RC A-10 Warthog or rc a-10 warthog.
The results are different at different times of the day, sometimes within minutes. Before 7 am today most of the pages which I modified were in the top 10 results, now about 1/2 of them have slipped to page two.
There is absolutely no logic to all the variables. For example, my site returned to its normal traffic (the same way it was before the algorithm change) of over 5,000 views per day on Saturday and Sunday. Just last weekend it barely got to 3,000 views. And, I haven't changed enough pages to the new format to directly affect those results. I wonder if Google has made an adjustment and that is why many of my pages are getting back to near where they were before? I look forward to tonight's final numbers for the day to see if it was solely for the weekend or if it continues.
However, using rc a-10 warthog in lowercase, the air hogs page is nowhere to be found, yet my page is no. 4 as of right now. In caps, RC A-10 Warthog, it is no. 11 right now.
The above example may lead you to believe that the pages on my site should all be higher in the search results when using lowercase. However, check out Japanese Zero vs japanese zero. Just the opposite is true. In this case my page ranks higher when entering it with capital letters.
Yet, if you look at RC A-6 Intruder vs rc a-6 intruder, they are both around no. 7 in the search results.
Check out how little content is on the page that is higher in the results than mine for the A-6:
http://www.dhgate.com/jet-airplane-a-6-intruder-high-grade-rtf/p-ff80808128ed96260128f2db154e1e9f.html
Check out my A-6 page:
http://www.aviationtrivia.info/Grumman-A-6-Intruder.php
I think it is because DH Gate, like Amazon, simply has thousands of pages and G likes that, even if each page has little content. Do you concur?
I think what G is doing is known as the Google dance. It is reminiscent of the cipher codes used to encrypt top secret military transmissions, in that it is constantly changing so as to discourage detection. What may be a top page one day can rank up to 10 positions lower on another.
Many of my pages have slipped only slightly when G changed their algorithm, such as from no. 1 in the search results to no. 3. (Virtually all the top rated pages are now ones which have videos.) However, that has been enough to make a big difference in traffic and sales.
Other than start taking videos of some 500 rc airplanes featured on my site, putting the videos on my site and YouTube, and hoping that the videos will be highly rated, I don't know how to get my pages as high in the search results as the YouTube pages.
The other pages which usually are higher in G's search results are forums like RC Universe and RC Groups. They have tens of thousands of pages, none of which have been changed, dating back to the late 1990s. Not much I can do about that either.
My aim is to be in G's search results right after the videos and forums. I think that someone searching for a rc airplane for sale will go right past them if they know that Aviation Trivia has ALL the rc airplanes of a particular model listed on it on a single page.
The changes I've made, i.e. no longer dividing the page between the actual aircraft and rc models of it, have gotten those pages back into the top ten search results, but not for both caps and lowercase.
-
It was after. The site, however, has some of the best data from a people perspective. We are trying to re-explain that to the G algo.
-
Tried to add as a reply to you Ralf, but it's not working. Anyone know why I cannot reply to a response?
Ralf, I glanced at some pages to see if I could find anything. Here's a couple of things that I think you could change or work on to improve your standing with Google:
1. There is pretty wide consensus that pages with a lot of ads and adsense seem to have been hit the hardest. Some of your pages have up to 5 blocks of adsense. Perhaps 1 or 2 blocks of adsense would be better. Google doesn't seem to like it much when the adsense is "hidden" in a sense, as in, there's so much adsense and it looks so much like the actual content that users cannot tell the difference. Go easy on it and see if that helps.
2. You have very outdated code on your website. It seems your whole site is built in HTML tables. Your code to content ratio is going to suffer because of it, and I believe you are using HTML elements that are deprecated (e.g. font face). Perhaps a face lift of the site and an update of that clunky code could help speed up the site and present a better image to users and search engines.
3. You haven't signaled which URL of your site is the main URL. For example, you have 4 home pages according to Google:
www.aviationtrivia.info/index.php
All go to the exact same page, so that page is showing up under 4 different URLs. That is, one, duplicate content and, two, not making good use of your link juice. According to Open Site Explorer, you have over 22,000 links to your domain, but 13,000 go to the www version of your domain. So your links are split between your main domain and the www subdomain. Redirect either the www version or the non-www version to its counterpart. This will consolidate your link juice much better.
4. From the numbers, 68 domains are giving you 13,000 links to your www url and 10 domains are giving you about 9,000 links to the non www url. 9,000 links from only 10 domains looks a little odd to me. The odd part is your home page(s) only account for about 1,000 of your total links. I didn't take the time to find out which page or pages are getting all those links, but it doesn't appear to be your home page. 9,000 links from 10 domains going to a page other than your home page just seems, well, odd to me. If you have any paid links or have participated in a suspicious link exchange of some kind, this could be harming you as well.
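To put rough numbers on point 4, here is a quick Python sketch that uses only the figures quoted above. The function name is made up for illustration, and an average like this only hints at how concentrated the links are; it says nothing about whether any of them are actually bad.

```python
# Figures quoted in point 4 above (approximate, as reported in the post).
www_links, www_domains = 13_000, 68
bare_links, bare_domains = 9_000, 10


def links_per_domain(links: int, domains: int) -> float:
    """Average number of links contributed by each linking domain."""
    return links / domains


www_avg = links_per_domain(www_links, www_domains)
bare_avg = links_per_domain(bare_links, bare_domains)

print(f"www:     {www_avg:.0f} links per domain")   # about 191
print(f"non-www: {bare_avg:.0f} links per domain")  # 900
print(f"ratio:   {bare_avg / www_avg:.1f}x")        # about 4.7x
```

So the non-www URL gets roughly 4.7 times as many links per linking domain as the www URL, which is why that group stands out.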
Hope that helps. Another tip would be to go into your Google Webmaster Tools account and see what it tells you. Often you can get good information from them to help you out.
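For point 3, the rule a redirect would enforce can be sketched in Python as below. This is only an illustration of the mapping, not the fix itself: the real change is a server-side 301 redirect. Picking the www host as the preferred one, the list of index file names, and the sample variants are all assumptions (the post above lists only one of the four URLs).

```python
from urllib.parse import urlsplit, urlunsplit

CANONICAL_HOST = "www.aviationtrivia.info"  # assumption: www is the preferred host
BARE_HOST = "aviationtrivia.info"
INDEX_FILES = {"index.php", "index.html"}   # assumption: typical default documents


def canonical_url(url: str) -> str:
    """Map www/non-www and index-file variants of a URL onto one form."""
    parts = urlsplit(url if "//" in url else "http://" + url)
    host = parts.netloc.lower()
    if host == BARE_HOST:
        host = CANONICAL_HOST
    path = parts.path or "/"
    head, _, tail = path.rpartition("/")
    if tail.lower() in INDEX_FILES:
        path = head + "/"  # drop the index file, keep the directory
    return urlunsplit((parts.scheme.lower(), host, path, parts.query, ""))


# Hypothetical home-page variants; only the last one appears in the post.
variants = [
    "aviationtrivia.info",
    "http://aviationtrivia.info/index.php",
    "http://www.aviationtrivia.info/",
    "www.aviationtrivia.info/index.php",
]
for v in variants:
    print(v, "->", canonical_url(v))
```

Every variant maps to the same home-page URL, while ordinary pages such as /Sikorsky-CH-53.php only have their host normalized.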
-
Was this before or AFTER the Panda Update? If after, you may have been hit by G's new algo which targets sites they deem to be of low quality.
-
Thank you for your thoughts, much appreciated! The Google change is what made us think it is duplication, and the only thing we could think of was the repeating specs.
It was Copyscape that led us to believe it was a duplication issue, but since we checked on Copyscape before the Google change and have since tested some content changes that made no difference to Copyscape, I am beginning to think that Copyscape doesn't work properly.
Now going back to the problem itself, let me describe what we are seeing and maybe you have a better idea what could cause this and what we need to be looking at.
Virtually all of the pages on my Aviation Trivia website www.aviationtrivia.info have been downgraded by Google. The aircraft pages were mostly in the top ten search results for rc airplanes and the name of the aircraft such as F4U Corsair and the words "for sale", e.g. "F4U Corsair for sale". A great number of those pages are now out of the first 50 search results.
The aircraft pages all contain original content. One such page, Sikorsky CH-53
http://www.aviationtrivia.info/Sikorsky-CH-53.php
is typical of the downgraded pages. It was in the first five results under its name and now is around no. 60. The page even has an exclusive interview with a pilot of the aircraft.
A page that was in the top results for the search words "rc Airwolf Helicopter"
http://www.aviationtrivia.info/Airwolf-Helicopter.php has now been downgraded to about no. 30. The no. 1 search result for rc Airwolf Helicopter is Century Helicopters 30 Size Airwolf Helicopter
http://www.centuryheli.com/products/helikits/cn1070airwolf/index.html?currentid=120
Their page minimally describes three helicopters they sell. My page at Aviation Trivia describes over 30 Airwolf helicopters for sale plus has information from a person who has flown one, plus additional information on where you can find specifics about the helicopter on popular websites.
The most popular page on my site, World's Fastest Aircraft http://www.aviationtrivia.info/THE-100-FASTEST-AIRCRAFT.php
is still no. 1 in the Google search results, however what was the second most popular page, Largest Aircraft
http://www.aviationtrivia.info/THE-LARGEST-AIRCRAFT.php
has gone from no. 1 to about no. 50. What is really interesting is that, although I expect Wikipedia pages to always be above mine in the search results, the highest ranked page after Wikipedia is Global Aircraft - Top 50 Largest Aircraft.
http://www.globalaircraft.org/50_largest.htm
Looking at my page, which describes 100 aircraft in detail and links to pages on my site that go into their histories and full specifications, and then at the Global Aircraft page, which simply states the name, wingspan, and weight of each aircraft, I can't understand how it can now be ranked no. 1 and my page no. 50 when searching for "world's largest aircraft."
I have set up my site from the view point of a person who is interested in scale rc model aircraft and the aviation history of their actual aircraft. The information I put on the site is there to not only inform the people about the full scale aircraft, but to give them choices about the rc model airplanes that are available, and other information that may be helpful in choosing the right rc model for them.
Just to clarify, someone would search for rc F4U Corsair, or rc F-14 Tomcat and that would come up in the top search results as well as F4U Corsair for sale and F-14 Tomcat for sale, etc.
Hope that makes sense
Thanks!
Ralf -
If I understand correctly, you have a site with several pages that discuss different models, and each of those pages has this same spec list? You are not being hit with a duplicate content penalty. If you were, every website in the world that does reviews would be hit hard. I just did a quick search on treadmill reviews. Look at those results on the top page. They all use the same specs on all their pages. Treadmilldoctor.com and treadmillreviews.com seem to use a near identical system to each other even. Yet none of them have duplicate penalties.
Without knowing your website, I can't say what it is, but there is definitely some other reason you are not ranking anymore. There was a recent algorithm change that could have affected you.
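To see why a shared list of spec labels is unlikely to make two pages look like duplicates, here is a small Python sketch that compares the vocabulary of two pages with a Jaccard overlap score. This is a common textbook similarity measure and is NOT a claim about how Google detects duplicates. The spec values and sentences are invented for illustration, and `LABELS` is just a subset of the terms from the question.

```python
import re


def words(text: str) -> set:
    """Lowercased set of alphanumeric tokens in text."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def jaccard(a: str, b: str) -> float:
    """Word-set overlap |A & B| / |A | B|: 0.0 is disjoint, 1.0 is identical."""
    wa, wb = words(a), words(b)
    if not wa and not wb:
        return 1.0
    return len(wa & wb) / len(wa | wb)


LABELS = ["Crew", "Engine", "Length", "Wingspan", "Range"]  # subset of the spec terms


def spec_block(values):
    """Render 'Label: value' pairs the way a spec table would read as text."""
    return " ".join(f"{label}: {value}" for label, value in zip(LABELS, values))


# All values and sentences below are made up for the example.
specs_a = spec_block(["1", "two turbofans", "16 m", "17 m", "1200 km"])
specs_b = spec_block(["2", "one turboprop", "11 m", "12 m", "900 km"])
page_a = specs_a + " The prototype was rolled out in winter and flew from a grass strip."
page_b = specs_b + " Pilots praised its gentle stall and roomy cockpit during trials."

print(f"spec blocks only:  {jaccard(specs_a, specs_b):.2f}")
print(f"with unique prose: {jaccard(page_a, page_b):.2f}")
```

Even the bare spec blocks overlap only partially because the values differ, and the score drops further once each page carries its own text, which matches the point about review sites above.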