Duplication Penalty through Specs?
-
I am trying to figure our how to correct a recently incurred duplication penalty on a partner site. I didn't see any posts on this yet specific to my problem. The site used to be ranked on page 1 of Google for all important keywords but now we ran into the situation that many pages were bumped to pos 100 or lower due to duplication issues. This is an aviation site, discussing airplanes and each page discusses a different model but each page also has the specs of the plane and while the data parts are different for each plane the specification terms are the same ,see here:
Primary Function:
Crew:
Engine:
Thrust:
Weight Empty:
Max. Weight:
Length:
Wingspan:
Cruise Speed:
Max.Speed:
Climb:
Ceiling:
Range:
First Flight:
Year Deployed:Is there an easy way to get Google to stop including these terms (not the data in the 2nd column) from the page anaysis to prevent this causing the duplication issues we are are seeing due to this?
Thanks in advance!
-
Dear Dan. THANK YOU for this excellent information. We are still pretty new to this so this helps a lot!
We did some work already and now are noticing also strange behaviour around capitalization, which makes no sense to us at all; any thoughts on that? Here some more details:
I am finding out some interesting things about G search results as I further explore what has happened to my site.
The results are different if you use small case or capital letters such as RC A-10 Warthog or rc a-10 warthog.
The results are different at different times of the day, sometimes within minutes. Before 7 am today most of the pages which I modified were in the top 10 results, now about 1/2 of them have slipped to page two.
There is absolutely no logic to all the variables. For example, my site returned to its normal traffic (the same way it was before the algorithm change) of over 5,000 views per day on Saturday and Sunday. Just last weekend it barely got to 3,000 views. And, I haven't changed enough pages to the new format to directly affect those results. I wonder if Google has made an adjustment and that is why many of my pages are getting back to near where they were before? I look forward to tonight's final numbers for the day to see if it was solely for the weekend or if it continues.
However, using rc a-10 warthog in small letters, the air hogs page is no where to be found, yet my page is no. 4 as of right now. In Caps, RC A-10 Warthog it is no. 11 right now.
The above example may lead you to believe that the pages on my site should all be in the higher search results when using small case. However, check out Japanese Zero vs japanese zero. Just the opposite is true. In this case my page rates higher when entering it with Capital letters.
Yet, if you look at RC A-6 Intruder vs rc a-6 intruder, they are both around no. 7 in the search results.
Check out how little content is on the page that is higher in the results than mine for the A-6:
http://www.dhgate.com/jet-airplane-a-6-intruder-high-grade-rtf/p-ff80808128ed96260128f2db154e1e9f.htmlCheck out my A-6 page:
http://www.aviationtrivia.info/Grumman-A-6-Intruder.phpI think it is because DH Gate, like Amazon, simply has thousands of pages and G likes that, even if each page has little content. Do you concur?
I think what G is doing is known as the google dance. It is reminiscent of the cypher codes used to encrypt top secret transmissions for the military in that it is constantly changing so as to discourage detection. What may be a top page one day can rate up to 10 positions lower on another.
Many of my pages have slipped only slightly when G changed their algorithm, such as from no. 1 in the search results to no. 3. (Virtually all the top rated pages are now ones which have videos.) However, that has been enough to make a big difference in traffic and sales.
Other than start taking videos of some 500 rc airplanes featured on my site, putting the videos on my site and YouTube, and hoping that the videos will be highly rated, I don't know how to get my pages as high in the search results as the YouTube pages.
The other pages which usually are higher in G's search results are forums like RC Universe and RC Groups. They have tens of thousands of pages, none of which have been changed, dating back to the late 1990's. Not much I can do about that either.
My aim is to be in G's search results right after the videos and forums. I think that someone searching for a rc airplane for sale will go right past them if they know that Aviation Trivia has ALL the rc airplanes of a particular model listed on it on a single page.
The changes I've made, ie. no longer dividing the page between the actual aircraft and rc models of it, has gotten those pages back into the top ten search results, but not for both Caps and small letters.
-
It was after. The site however has some of the best data from a people perspective. We are trying to reexplain that to the G algo
-
Tried to add as a reply to you Ralf, but it's not working. Anyone know why I cannot reply to a response?
Ralf, I glanced at some pages to see if I could find anything. Here's a couple of things that I think you could change or work on to improve your standing with Google:
1. There is pretty wide consensus that pages with a lot of ads and adsense seem to have been hit the hardest. Some of your pages have up to 5 blocks of adsense. Perhaps 1 or 2 blocks of adsense would be better. Google doesn't seem to like it much when the adsense is "hidden" in a sense, as in, there's so much adsense and it looks so much like the actual content that users cannot tell the difference. Go easy on it and see if that helps.
2. You have very outdated code on your website. It seems your whole site is built in HTML tables. Your code to content ratio is going to suffer because of it, and you are using HTML elements that are deprecated I believe. (ie: font face). Perhaps a face lift of the site and an update of that clunky code could help speed up the site and present a better image to users and search engines.
3. You haven't signaled which URL of your site is the main URL. For example, you have 4 home pages according to Google:
www.aviationtrivia.info/index.php
All go to the exact same page, so that page is showing up under 4 different URL's. That is one, duplicate content and two, not making good use of your link juice. According to open site explorer, you have over 22,000 links to your domain, but 13,000 go to the www version of your domain. So your links are split between your main domain and the www subdomain. Redirect the www version or the non www version to its counterpart with or without the www. This will consolidate your link juice much better.
4. From the numbers, 68 domains are giving you 13,000 links to your www url and 10 domains are giving you about 9,000 links to the non www url. 9,000 links from only 10 domains looks a little odd to me. The odd part is your home page(s) only account for about 1,000 of your total links. I didn't take the time to find out which page or pages are getting all those links, but it doesn't appear to be your home page. 9,000 links from 10 domains going to a page other than your home page just seems, well, odd to me. If you have any paid links or have participated in a suspicious link exchange of some kind, this could be harming you as well.
Hope that helps. Another tip would be to go into your Google Webmaster Tools account and see what it tells you. Often you can get good information from them to help you out.
-
Was this before or AFTER the Panda Update? If after, you may have been hit by G's new algo which targets sites they deem to be of low quality.
-
Thanks you for your thoughts, much appreciated! The Google change is what made use think it is duplication and the only thing we could think of was the repeating specs.
It was copyscape that let us to believe it to be the duplication issue but as we checked on copyscape before the Google change and since tested some content changes that didn't make any difference to copyscape, I am beginning to think that copyscape doesn't work properly.
Now going back to the problem itself, let me describe what we are seeing and maybe you have a better idea what could cause this and what we need to be looking at.
Virtually all of the pages on my Aviation Trivia website www.aviationtrivia.info have been downgraded by Google. The aircraft pages were mostly in the top ten search results for rc airplanes and the name of the aircraft such as F4U Corsair and the words "for sale", ie: "F4U Corsair for sale". A great number of those pages are now out of the first 50 search results.
The aircraft pages all contain original content. One such page, Sikorsky CH-53
http://www.aviationtrivia.info/Sikorsky-CH-53.php
is typical of the downgraded pages. It was in the first five page results under its name and now is around no. 60. The page even has an exclusive interview with a pilot of the aircraft.A page that was in the top results for the search words "rc Airwolf Helicopter"
http://www.aviationtrivia.info/Airwolf-Helicopter.php has now been downgraded to about no. 30. The no. 1 search results for rc Airwolf Helicopter is Century Helicopters 30 Size Airwolf Helicopter
http://www.centuryheli.com/products/helikits/cn1070airwolf/index.html?currentid=120
Their page minimally describes three helicopters they sell. My page at Aviation Trivia describes over 30 Airwolf helicopters for sale plus has information from a person who has flown one, plus additional information on where you can find specifics about the helicopter on popular websites.The most popular page on my site, World's Fastest Aircraft http://www.aviationtrivia.info/THE-100-FASTEST-AIRCRAFT.php
is still no. 1 in the Google search results, however what was the second most popular page, Largest Aircraft
http://www.aviationtrivia.info/THE-LARGEST-AIRCRAFT.php
has gone from no. 1 to about no. 50. What is really interesting is that, although I expect Wikipedia pages to always be above mine in the search results, the highest ranked page after Wikipedia is Global Aircraft - Top 50 Largest Aircraft.
http://www.globalaircraft.org/50_largest.htmLooking at my page that describes 100 aircraft in detail and provides links to pages in my site that goes into their histories and full specifications, then looking at the Global Aircraft page that simply states the name of the aircraft, wingspan, and weight, I can't understand how it can now ranked no. 1 and my page no. 50 when searching for "world's largest aircraft."
I have set up my site from the view point of a person who is interested in scale rc model aircraft and the aviation history of their actual aircraft. The information I put on the site is there to not only inform the people about the full scale aircraft, but to give them choices about the rc model airplanes that are available, and other information that may be helpful in choosing the right rc model for them.
Just to clarify, someone would search for rc F4U Corsair, or rc F-14 Tomcat and that would come up in the top search results as well as F4U Corsair for sale and F-14 Tomcat for sale, etc.
Hope that makes sense
Thanks!
Ralf -
If I understand correctly, you have a site with several pages that discuss different models, and each of those pages has this same spec list? You are not being hit with a duplicate content penalty. If you were, every website in the world that does reviews would be hit hard. I just did a quick search on treadmill reviews. Look at those results on the top page. They all use the same specs on all their pages. Treadmilldoctor.com and treadmillreviews.com seem to use a near identical system to each other even. Yet none of them have duplicate penalties.
WIthout knowing your website, there is definitely some other reason you are not ranking anymore. There was a new algorithm change that could have affected you.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Deleting Tags without Penalty?
Hello - We have a site with over 1,000 tags. We added too many and would like a fresh start as they are creating a lot of duplicate pages on the site. What is the best way to go about deleting all of these tags without being penalized by Google? Is there a way to tell Google direclty to stop crawling them? We would prefer to not have that many pages just sit as 404 errors on the site. Thank you.
Technical SEO | | FamiliesLoveTravel0 -
Duplicated titles and meta descriptions
Hi, Dealing with both my duplicated titles and meta descriptions i'm wondering if there's a "quick" win I could potentially implement asap. A bit of background:
Technical SEO | | GhillC
Say I've 4 pages structured that way: domain.com/us/productA.html for the US domain.com/gb/productA.html the UK domain.com/fr/productA.html for France domain.com/de/productA.html For Germany At the moment, both my page titles and meta-descriptions are duplicated all over the place for product A.
Title is reading "Product A - company name"
MD is a bit better, being translated in all 3 languages (En, Fr, DE). Therefore being the same for the US and for the UK. Ideally, I would get unique page titles and MD all over the place. However, due to time and resource constraints, I can't make it happen overnight. So my questions are pretty simple:
1. Can I create a rule for page titles to be "Product A - country - company name" or similar? Would that be enough to make the page titles unique? Is there any value doing so?
2. Can I "localize" duplicate MD by simply naming the country? I assume it is not enough in this case as all the rest would be copy/pasted. Ideally speaking, both my page titles and MD would be completely unique but I can't afford doing so in the short term. Thanks!0 -
Issues with Duplicates and AJAX-Loader
Hi, On one website, the "real" content is loaded via AJAX when the visitor clicks on a tile (I'll call a page with some such tiles a tile-page here). A parameter is added to the URL at the that point and the content of that tile is displayed. That content is available via an URL of its own ... which is actually never called. What I want to achieve is a canonicalised tile-page that gets all of the tiles' content and is indexed by google - if possible with also recognising that the single-URLs of a tile are only fallback-solutions and the "tile-page" should be displayed instead. The current tile-page leads to duplicate meta-tags, titles etc and minimal differences between what google considers a page of its own (i.e. the same page with different tiles' contents). Does anybody have an idea on what one can do here?
Technical SEO | | netzkern_AG0 -
Duplicate page issue
Hi, i have a serious duplicate page issue and not sure how it happened and i am not sure if anyone will be able to help as my site was built in joomla, it has been done through k2, i have never come across this issue before i am seem to have lots of duplicate pages under author names, example http://www.in2town.co.uk/blog/diane-walker this page is showing the full articles which is not great for seo and it is also showing that there are hundreds more articles at the bottom on the semoz tool i am using, it is showing these as duplicates although there are hundreds of them and it is causing google to see lots of duplicate pages. Diane Walker
Technical SEO | | ClaireH-184886
http://www.in2town.co.uk/blog/diane-walker/Page-2 5 1 0
Diane Walker
http://www.in2town.co.uk/blog/diane-walker/Page-210 1 1 0
Diane Walker
http://www.in2town.co.uk/blog/diane-walker/Page-297 1 1 0
Diane Walker
http://www.in2town.co.uk/blog/diane-walker/Page-3 5 1 0
Diane Walker can anyone please help me to sort this important issue out.0 -
How to remove the duplicate page title
Hi everyone, I saw many posts related to this query.But i couldnt find a solution for my error.. Here is my question I got 575 Duplicate page title & 600 duplicate page content errors. My site is related to realestate. I created a page title like same sentence differs with locality name Eg: Land for sale - kandy property Land for sale - Galle property Likewise Locality name only differs..I have created meta title & Content like this. Can anyone let me know how to solve this error ASAP ?
Technical SEO | | Rajesh.Chandran0 -
Penalty on two primary keywords
Hi Seomoz, I have been strugling to get www.texaspoker.dk out of what seems to be a keyword specific penalty (we are on page 5 on "poker" and "online poker"). First I thought it was Penguin related, but I'm not so sure any longer. I have removed all the bad links to my site possible (it's not easy to get other people to remove links, I can tell you), and I have reported all the links that I would like google to "ignore" (reconsideration request) ... all in all I have requested for reconsideration 5 times, and - despite some small changes - I got the same answer every time. We violate the quality guide lines and we should be looking for unnatural links pointing to our site. If any one - by having a look at our site - have any idea what could be wrong, please don't hold back, we would love to hear your point of view. Right now we are in the middle of making our partners take off their site wide links to us (the partners you'll find if you click the flags to the right in the top of the home page). On Texaspoker.dk we only link to the partners from the home page, but maybe we should consider to take of even these links? No one is really clicking on them anyway. Another thing, which is only under consideration, is to ask our partners from Betxpert.com (with whom we have exchanged our news feed - you will find their feed if you schroll down the home page) to set the feed to "no follow" and do the same our selves. What do you think of this thought? As far as I can see, there is nothing wrong with the on page optimization, but maybe some one can see what I don't see? Again - what ever thoughts you guys may have - shoot ... i'm ready to take all bulets 🙂 Thanks in advance! Nicolai, Texaspoker.dk
Technical SEO | | MPO0 -
How to fix duplicate page content error?
SEOmoz's Crawl Diagnostics is complaining about a duplicate page error. The example of links that has duplicate page content error are http://www.equipnet.com/misc-spare-motors-and-pumps_listid_348855 http://www.equipnet.com/misc-spare-motors-and-pumps_listid_348852 These are not duplicate pages. There are some values that are different on both pages like listing # , equipnet tag # , price. I am not sure how do highlight the different things the two page has like the "Equipment Tag # and listing #". Do they resolve if i use some style attribute to highlight such values on page? Please help me with this as i am not really sure why seo is thinking that both pages have same content. Thanks !!!
Technical SEO | | RGEQUIPNET0 -
Duplicate page content errors in SEOmoz
Hi everyone, we just launched this new site and I just ran it through SEOmoz and I got a bunch of duplicate page content errors. Here's one example -- it says these 3 are duplicate content: http://www.alicealan.com/collection/alexa-black-3inch http://www.alicealan.com/collection/alexa-camel-3inch http://www.alicealan.com/collection/alexa-gray-3inch You'll see from the pages that the titles, images and small pieces of the copy are all unique -- but there is some copy that is the same (after all, these are pretty much the same shoe, just a different color). So, why am I getting this error and is there any best way to address? Thanks so much!
Technical SEO | | ketanmv
Ketan0