Duplication Penalty through Specs?
-
I am trying to figure our how to correct a recently incurred duplication penalty on a partner site. I didn't see any posts on this yet specific to my problem. The site used to be ranked on page 1 of Google for all important keywords but now we ran into the situation that many pages were bumped to pos 100 or lower due to duplication issues. This is an aviation site, discussing airplanes and each page discusses a different model but each page also has the specs of the plane and while the data parts are different for each plane the specification terms are the same ,see here:
Primary Function:
Crew:
Engine:
Thrust:
Weight Empty:
Max. Weight:
Length:
Wingspan:
Cruise Speed:
Max.Speed:
Climb:
Ceiling:
Range:
First Flight:
Year Deployed:Is there an easy way to get Google to stop including these terms (not the data in the 2nd column) from the page anaysis to prevent this causing the duplication issues we are are seeing due to this?
Thanks in advance!
-
Dear Dan. THANK YOU for this excellent information. We are still pretty new to this so this helps a lot!
We did some work already and now are noticing also strange behaviour around capitalization, which makes no sense to us at all; any thoughts on that? Here some more details:
I am finding out some interesting things about G search results as I further explore what has happened to my site.
The results are different if you use small case or capital letters such as RC A-10 Warthog or rc a-10 warthog.
The results are different at different times of the day, sometimes within minutes. Before 7 am today most of the pages which I modified were in the top 10 results, now about 1/2 of them have slipped to page two.
There is absolutely no logic to all the variables. For example, my site returned to its normal traffic (the same way it was before the algorithm change) of over 5,000 views per day on Saturday and Sunday. Just last weekend it barely got to 3,000 views. And, I haven't changed enough pages to the new format to directly affect those results. I wonder if Google has made an adjustment and that is why many of my pages are getting back to near where they were before? I look forward to tonight's final numbers for the day to see if it was solely for the weekend or if it continues.
However, using rc a-10 warthog in small letters, the air hogs page is no where to be found, yet my page is no. 4 as of right now. In Caps, RC A-10 Warthog it is no. 11 right now.
The above example may lead you to believe that the pages on my site should all be in the higher search results when using small case. However, check out Japanese Zero vs japanese zero. Just the opposite is true. In this case my page rates higher when entering it with Capital letters.
Yet, if you look at RC A-6 Intruder vs rc a-6 intruder, they are both around no. 7 in the search results.
Check out how little content is on the page that is higher in the results than mine for the A-6:
http://www.dhgate.com/jet-airplane-a-6-intruder-high-grade-rtf/p-ff80808128ed96260128f2db154e1e9f.htmlCheck out my A-6 page:
http://www.aviationtrivia.info/Grumman-A-6-Intruder.phpI think it is because DH Gate, like Amazon, simply has thousands of pages and G likes that, even if each page has little content. Do you concur?
I think what G is doing is known as the google dance. It is reminiscent of the cypher codes used to encrypt top secret transmissions for the military in that it is constantly changing so as to discourage detection. What may be a top page one day can rate up to 10 positions lower on another.
Many of my pages have slipped only slightly when G changed their algorithm, such as from no. 1 in the search results to no. 3. (Virtually all the top rated pages are now ones which have videos.) However, that has been enough to make a big difference in traffic and sales.
Other than start taking videos of some 500 rc airplanes featured on my site, putting the videos on my site and YouTube, and hoping that the videos will be highly rated, I don't know how to get my pages as high in the search results as the YouTube pages.
The other pages which usually are higher in G's search results are forums like RC Universe and RC Groups. They have tens of thousands of pages, none of which have been changed, dating back to the late 1990's. Not much I can do about that either.
My aim is to be in G's search results right after the videos and forums. I think that someone searching for a rc airplane for sale will go right past them if they know that Aviation Trivia has ALL the rc airplanes of a particular model listed on it on a single page.
The changes I've made, ie. no longer dividing the page between the actual aircraft and rc models of it, has gotten those pages back into the top ten search results, but not for both Caps and small letters.
-
It was after. The site however has some of the best data from a people perspective. We are trying to reexplain that to the G algo
-
Tried to add as a reply to you Ralf, but it's not working. Anyone know why I cannot reply to a response?
Ralf, I glanced at some pages to see if I could find anything. Here's a couple of things that I think you could change or work on to improve your standing with Google:
1. There is pretty wide consensus that pages with a lot of ads and adsense seem to have been hit the hardest. Some of your pages have up to 5 blocks of adsense. Perhaps 1 or 2 blocks of adsense would be better. Google doesn't seem to like it much when the adsense is "hidden" in a sense, as in, there's so much adsense and it looks so much like the actual content that users cannot tell the difference. Go easy on it and see if that helps.
2. You have very outdated code on your website. It seems your whole site is built in HTML tables. Your code to content ratio is going to suffer because of it, and you are using HTML elements that are deprecated I believe. (ie: font face). Perhaps a face lift of the site and an update of that clunky code could help speed up the site and present a better image to users and search engines.
3. You haven't signaled which URL of your site is the main URL. For example, you have 4 home pages according to Google:
www.aviationtrivia.info/index.php
All go to the exact same page, so that page is showing up under 4 different URL's. That is one, duplicate content and two, not making good use of your link juice. According to open site explorer, you have over 22,000 links to your domain, but 13,000 go to the www version of your domain. So your links are split between your main domain and the www subdomain. Redirect the www version or the non www version to its counterpart with or without the www. This will consolidate your link juice much better.
4. From the numbers, 68 domains are giving you 13,000 links to your www url and 10 domains are giving you about 9,000 links to the non www url. 9,000 links from only 10 domains looks a little odd to me. The odd part is your home page(s) only account for about 1,000 of your total links. I didn't take the time to find out which page or pages are getting all those links, but it doesn't appear to be your home page. 9,000 links from 10 domains going to a page other than your home page just seems, well, odd to me. If you have any paid links or have participated in a suspicious link exchange of some kind, this could be harming you as well.
Hope that helps. Another tip would be to go into your Google Webmaster Tools account and see what it tells you. Often you can get good information from them to help you out.
-
Was this before or AFTER the Panda Update? If after, you may have been hit by G's new algo which targets sites they deem to be of low quality.
-
Thanks you for your thoughts, much appreciated! The Google change is what made use think it is duplication and the only thing we could think of was the repeating specs.
It was copyscape that let us to believe it to be the duplication issue but as we checked on copyscape before the Google change and since tested some content changes that didn't make any difference to copyscape, I am beginning to think that copyscape doesn't work properly.
Now going back to the problem itself, let me describe what we are seeing and maybe you have a better idea what could cause this and what we need to be looking at.
Virtually all of the pages on my Aviation Trivia website www.aviationtrivia.info have been downgraded by Google. The aircraft pages were mostly in the top ten search results for rc airplanes and the name of the aircraft such as F4U Corsair and the words "for sale", ie: "F4U Corsair for sale". A great number of those pages are now out of the first 50 search results.
The aircraft pages all contain original content. One such page, Sikorsky CH-53
http://www.aviationtrivia.info/Sikorsky-CH-53.php
is typical of the downgraded pages. It was in the first five page results under its name and now is around no. 60. The page even has an exclusive interview with a pilot of the aircraft.A page that was in the top results for the search words "rc Airwolf Helicopter"
http://www.aviationtrivia.info/Airwolf-Helicopter.php has now been downgraded to about no. 30. The no. 1 search results for rc Airwolf Helicopter is Century Helicopters 30 Size Airwolf Helicopter
http://www.centuryheli.com/products/helikits/cn1070airwolf/index.html?currentid=120
Their page minimally describes three helicopters they sell. My page at Aviation Trivia describes over 30 Airwolf helicopters for sale plus has information from a person who has flown one, plus additional information on where you can find specifics about the helicopter on popular websites.The most popular page on my site, World's Fastest Aircraft http://www.aviationtrivia.info/THE-100-FASTEST-AIRCRAFT.php
is still no. 1 in the Google search results, however what was the second most popular page, Largest Aircraft
http://www.aviationtrivia.info/THE-LARGEST-AIRCRAFT.php
has gone from no. 1 to about no. 50. What is really interesting is that, although I expect Wikipedia pages to always be above mine in the search results, the highest ranked page after Wikipedia is Global Aircraft - Top 50 Largest Aircraft.
http://www.globalaircraft.org/50_largest.htmLooking at my page that describes 100 aircraft in detail and provides links to pages in my site that goes into their histories and full specifications, then looking at the Global Aircraft page that simply states the name of the aircraft, wingspan, and weight, I can't understand how it can now ranked no. 1 and my page no. 50 when searching for "world's largest aircraft."
I have set up my site from the view point of a person who is interested in scale rc model aircraft and the aviation history of their actual aircraft. The information I put on the site is there to not only inform the people about the full scale aircraft, but to give them choices about the rc model airplanes that are available, and other information that may be helpful in choosing the right rc model for them.
Just to clarify, someone would search for rc F4U Corsair, or rc F-14 Tomcat and that would come up in the top search results as well as F4U Corsair for sale and F-14 Tomcat for sale, etc.
Hope that makes sense
Thanks!
Ralf -
If I understand correctly, you have a site with several pages that discuss different models, and each of those pages has this same spec list? You are not being hit with a duplicate content penalty. If you were, every website in the world that does reviews would be hit hard. I just did a quick search on treadmill reviews. Look at those results on the top page. They all use the same specs on all their pages. Treadmilldoctor.com and treadmillreviews.com seem to use a near identical system to each other even. Yet none of them have duplicate penalties.
WIthout knowing your website, there is definitely some other reason you are not ranking anymore. There was a new algorithm change that could have affected you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
ViewState and Duplicate Content
Our site keeps getting duplicated content flagged as an issue... however, the pages being grouped together have very little in common on-page. One area which does seem to recur across them is the ViewState. There's a minimum of 150 lines across the ones we've investigated. Could this be causing the reports?
Technical SEO | | RobLev0 -
Duplicate Titles Aren't Actually Duplicate
I am seeing duplicate title errors, but when I go to fix the problem, the titles are not actually identical. Any advice? Becky
Technical SEO | | Becky_Converge0 -
Content and url duplication?
One of the campaign tools flags one of my clients sites as having lots of duplicates. This is true in the sense the content is sort of boiler plate but with the different countries wording changed. The is same with the urls but they are different in the sense a couple of words have changed in the url`s. So its not the case of a cms or server issue as this seomoz advises. It doesnt need 301`s! Thing is in the niche, freight, transport operators, shipping, I can see many other sites doing the same thing and those sites have lots of similar pages ranking very well. In fact one site has over 300 keywords ranked on page 1-2, but it is a large site with an 12yo domain, which clearly helps. Of course having every page content unique is important, however, i suppose it is better than copy n paste from other sites. So its unique in that sense. Im hoping to convince the site owner to change the content over time for every country. A long process. My biggest problem for understanding duplication issues is that every tabloid or broadsheet media website would be canned from google as quite often they scrape Reuters or re-publish standard press releases on their sites as newsworthy content. So i have great doubt that there is a penalty for it. You only have to look and you can see media sites duplication everywhere, everyday, but they get ranked. I just think that google dont rank the worst cases of spammy duplication. They still index though I notice. So considering the business niche has very much the same content layout replicated content, which rank well, is this duplicate flag such a great worry? Many businesses sell the same service to many locations and its virtually impossible to re write the services in a dozen or so different ways.
Technical SEO | | xtopher660 -
How do I get rid of duplicate content
I have a site that is new but I managed to get it to page one. Now when I scan it on SEO Moz I see that I have duplicate content. Ex: www.mysite.com, www.mysite.com/index and www.mysite.com/ How do I fix this without jeopardizing my SERPS ranking? Any tips?
Technical SEO | | bronxpad0 -
Duplicate Homepage In Google
Hi Just found through my SEO dashboard, Google has two versions of the same homepage, the root page, plus the index.html page, causing duplicate content from both the pages. what is the best option to ensure google only have 1 version of the homepage listed?
Technical SEO | | rfksolutionsltd0 -
Home page penalty?
What does it mean when your home page has a penalty? I have a site that has good rankings for many pages, but my home page seems to be penalized by Google. I tried searching for my home page URL in Google, www.xxxxxx.com and my page doesn't show up, but sub pages do show up? What would cause this penalty and how do you correct this issue.
Technical SEO | | tadden0 -
Duplicate Content Home Page
Hello, I am getting Duplicate Content warning from SEOMoz for my home page: http://www.teacherprose.com http://www.teacherprose.com/index html I tried code below in .htaccess: redirect 301 /index.html http://www.teacherprose.com This caused error "too many re-directs" in browser Any thoughts? Thank You, Eric
Technical SEO | | monthelie10 -
Duplicate Content Penalties, International Sites
We're in the process of rolling out a new domestic (US) website design. If we copy the same theme/content to our International subsidiaries, would the duplicate content penalty still apply? All International sites would carry the Country specific domain, .co.uk, .eu, etc. This question is for English only content, I'm assuming translated content would not carry a penalty.
Technical SEO | | endlesspools0