Is it Panda? How to deal with AP and other newswire articles
-
A site I have lost 30% of its traffic in June, then another 10% in July. Is it Panda?
The site has tens of thousands of AP and other syndicated articles on it. They are not there for search engine benefit: they are categorized and relevant to the people who read them, and the site gets half of its traffic from type-ins and bookmarks.
Should I nofollow the articles or add rel="canonical" to them? What can help?
Cheers
-
Thanks again. I guess I will have to look through the keywords and see what traffic these news pages are still getting from Google, then weigh up whether to tag them.
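One rough way to do that weighing, as a sketch only: the URLs, visit counts, and the `min_visits` cutoff below are made-up illustrations, not figures from this site, and the function name `pages_to_tag` is hypothetical. It simply lists the pages whose Google visits fall below a cutoff, i.e. the ones that would lose the least if tagged.

```python
def pages_to_tag(google_visits, min_visits):
    """Return URLs whose Google visits are below min_visits, sorted by URL.

    google_visits: dict mapping page URL -> visits from Google over some
    period (assumed to come from an analytics export).
    min_visits: hypothetical cutoff; pages under it are candidates for tagging.
    """
    return sorted(url for url, visits in google_visits.items() if visits < min_visits)


# Illustrative data only.
example_visits = {"/news/a": 0, "/news/b": 120, "/news/c": 3}
candidates = pages_to_tag(example_visits, min_visits=10)
print(candidates)
```

Pages that still draw meaningful Google traffic stay out of the list, so they can be reviewed by hand before anything is tagged.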
-
Panda isn't a penalty per se. It is an algorithmic change to how Google ranks sites and pages. If your site has duplicated content on it, you will need to fix all of it. Once your site has been cleaned up, it can take a month or more for Google to fully re-index your entire site and see all of the duplicated content gone or properly handled (i.e. noindexed or canonicalized).
Having 1% duplicated content won't by itself get your site affected, but no one knows for sure what exact percentage triggers this effect, so your best course of action is to clean it all up.
If you use the canonical tag, these pages will be removed from the index for your site. The "harm" would be that if someone searches for these pages, your site won't be listed unless you have comments relevant to the search query.
-
Thanks. Is there any way I could trial this on the site by adding the tag to just a few pages or sections? Is Google using domain-level metrics, i.e. has it decided that the whole site is junk now because it has so much duplicated content?
The articles are slightly changed and there are comments on them.
What harm could I cause by trialing the canonical tag? If I took it off later, would there be some recovery time?
thanks a lot
-
If this content merely duplicates articles that exist elsewhere then yes, you can add the canonical tag pointing to the source.
You would definitely not want to "nofollow" these pages. By adding a nofollow tag you are telling search engines not to flow PageRank to the other links they find on the page. That is not the result you desire.
You could noindex the pages as well. Before doing so, I would ask whether you offer comments or other user-generated content (UGC). If you do not, then the noindex tag is fine. If you do offer UGC, then I would recommend the canonical tag.
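To make the choice above concrete, here is a minimal sketch of how a page template might pick the tag. The function name `head_tag_for` and the example URL are assumptions for illustration; the two tags themselves are the standard `rel="canonical"` link element and the robots meta tag. The "follow" value is included on the assumption that links on the page should still be followed, in line with the advice against nofollow.

```python
from html import escape


def head_tag_for(has_ugc, source_url):
    """Return the <head> tag for a syndicated article page.

    has_ugc: True if the page carries comments or other user-generated content.
    source_url: URL of the original article (hypothetical input).
    """
    if has_ugc:
        # Page has comments/UGC: point search engines at the original source.
        return '<link rel="canonical" href="{}">'.format(escape(source_url, quote=True))
    # No UGC: keep the page out of the index but still let links be followed.
    return '<meta name="robots" content="noindex, follow">'


print(head_tag_for(True, "https://example.com/original-article"))
print(head_tag_for(False, "https://example.com/original-article"))
```

Either tag goes inside the page's `<head>` element; only one of the two would normally be emitted for a given page.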
Related Questions
-
Best way to deal with 100 product pages
It feels good to be BACK. I miss Moz. I left for a long time but am happy to be back! 🙂 My client is a local HVAC company. They sell Lennox systems. Lennox provides a tool that we hooked up to that allows visitors to their site to 'see' 120+ different kinds of air quality, furnace and AC units. The problem (I think it's a problem) is that Google and other crawl tools are seeing these 100+ pages that are not unique, helpful or related to my client. There is a little bit of cookie-cutter text and images and specs and that's it. Are these pages potentially hurting my client? I can't imagine they are helping. Best way to deal with these? Thank you! Thank you! Matthew
Technical SEO | | Localseo41440 -
Privacy: Is Whois info used to help establish an admin relationship between sites in addition to host/IP etc ?
Hi. Do you think Google looks at Whois details as a contributing factor in establishing an administrative relationship between two domains (in addition to being hosted on similar hosts/IP blocks etc.), and in regard to link building, would having the same Whois details on both sites have a negative effect or be perfectly OK (if the sites are on different hosts/IP blocks)? Also, do you think having Whois privacy turned on has a negative effect on trust and subsequent SEO? Considering the answers to the above two questions: do you think it's a good or bad idea to have domain registration/Whois 'privacy' turned on for a site of curated content relating to the project/primary site's niche, and linking to this site for contextual link benefit? I'm building out a site of curated content that I want to perform well in itself as well as provide backlink benefit to the primary site, but I'm worried that having the same Whois details on both will cause SEO problems. Or would that only be the case if they also had the same host/IP footprint? Should I enable Whois privacy, use a different address for registration, or actually make a point of using the same Whois details for transparency? All Best
Technical SEO | | Dan-Lawrence
Dan0 -
Can panda penalize News publisher sites?
Hey guys, I was wondering how Panda behaves with news publisher sites. Take a site with about 1M visits a day that publishes about 300 news articles a day, where the life of each article is one week tops, given the nature of news articles (only relevant now). After one week the news articles have virtually no page views. This results in a site with thousands of quality content pages that have had no page views for years. Is it possible that the site gets penalized by Panda for having thousands of pages with no visits?
Technical SEO | | Mr.bfz0 -
How long to recover from Panda Update
Hi there, I think I was affected by the recent Panda update as I had a lot of duplicate content in my product descriptions (about 300). I'm going through and rewriting these to be both helpful and unique. I was ranking quite nicely for a big spread of keywords, but have been seeing my rankings drop day after day since the update. Is it possible to see my rankings improve again after Google re-crawls my site, or would a penalty have been applied to my site, preventing me from regaining my positions for some time? It's probably worth noting that I have a lot of unique and helpful content; it was just my product pages that had duplicate content, but I've seen my rankings drop across the board. Any discussion and insight would be much appreciated.
Technical SEO | | BlueTree_Sean0 -
Panda Recovery ETA?
I have a blog hit by Panda in 2011 and 2012. The thing is, I've no-indexed around 1000 posts out of 11xx. No-indexed tags and archives. But Google was taking a very long time to remove them from its index, so I had to do a manual removal from Google WMT. Removed /2011/ and /2013/ as directories, and removed /pages/ (this is a WordPress site), so all of them are now no longer in the index. It was a smartphone blog started in 2011 which I turned into a tech blog on a new domain (I let the old PR3 DA 30+ domain expire and now someone's asking me for $200 if I am to get it back). I had a team when it was a smartphone blog. Our articles had been featured on places like Engadget, PhoneArena, UberGizmo etc. So, with the loss of the domain, we've lost quite a few important backlinks as well. Also, Authorship doesn't work for the site. The Rich Snippets testing tool says everything's all right, but it never really works / shows up on SERPs. I fear it's because of a penalty. It seems to me like no one has ever thought about a penalty that affects Authorship. So, now you know the problem, and the things I did in order to fix it, could you tell me if:
Google will lift the penalty whenever they wish. (And an ETA?)
They'll lift it when the next major algorithmic update occurs. (I made the changes on September 28th.) But I don't see how this is a possibility since Panda has now been integrated into the core algorithm.
Anything else.
Thanks in advance everyone!
Technical SEO | | RohitPalit0 -
Syndicated content outranks my original article
I have a small site and write original blog content for my small audience. There is a much larger, highly relevant site that is willing to accept guest blogs and they don't require original content. It is one of the largest sites within my niche and many potential customers of mine are there. When I create a new article I first post to my blog, and then share it with G+, twitter, FB, linkedin. I wait a day. By this time G has seen the links that point to my article and has indexed it. Then I post a copy of the article on the much larger site. I have a rel=author tag within the article but the larger site adds "nofollow" to that tag. I have tried putting a link rel=canonical tag in the article but the larger site strips that tag out. So G sees a copy of my content on this larger site. I'm hoping they realize it was posted a day later than the original version on my blog. But if not will my blog get labeled as a scraper? Second: when I Google the exact blog title I see my article on the larger site shows up as the #1 search result but (1) there is no rich snippet with my author creds (maybe because the author tag was marked nofollow?), and (2) the original version of the article from my blog is not in the results (I'm guessing it was stripped out as duplicate). There are benefits for my article being on the larger site, since many of my potential customers are there and the article does include a link back to my site (the link is nofollow). But I'm wondering if (1) I can fix things so my original article shows up in the search results, or (2) am I hurting myself with this strategy (having G possibly label me a scraper)? I do rank for other phrases in G, so I know my site hasn't had a wholesale penalty of some kind.
Technical SEO | | scanlin0 -
What the Panda are we doing wrong?
Starting on June 8 of this year (the exact date of the Panda 3.7 update) the organic search engine traffic to our website dropped by about 30%. We're talking about a fairly new domain (about 8 months old) that has (or at least is supposed to have) pearly white SEO, and no outside parties have ever done any SEO for it. Organic search traffic was very stable in the weeks prior to June 8. Organic search visits have dropped pretty much across the board (due to dropped rankings in the SERPs, as reported by our SEOmoz campaign). The (not provided) keyword has dropped 25%, while traffic from keywords related to our core products (Joomla templates) has dropped almost 50%. Knowing that June 8 saw a Panda update, I dug up some of the old Panda posts (never thought I'd need those for one of my own sites) to see what factors trigger a Panda hit. Based on the factors mentioned in this article at SEW, I'll briefly discuss what is going on at our website.
Affiliate links and ad units: Not a single affiliate link or ad unit can be found on our website.
Low-quality or thin content: Only 163 URLs from the www subdomain have been submitted in our sitemap, of which 152 are indexed. About 25 of those pages (the individual questions on our FAQ page) could in my opinion be characterized as 'thin content' pages.
Canonicalization: Every single page on our www subdomain has a rel="canonical". Given that the demo subdomain is based on Joomla, we have less control over those pages (and there will probably be some duplicate content issues there), but nothing more than any clean Joomla website would have.
Site speed: Our www subdomain receives a near-perfect 97/100 on YSlow; the demo subdomain scores 83/100.
Quality: In the past months several popular resources (blogs, infographics) have been released that were well linked to by other (significant) players in our niche.
Social signals: Our site received about 25 +1's, several dozen (or more) tweets and a few Facebook Likes.
Search result pages: We don't have those.
Questions:
Can anybody spot potentially Panda-triggering issues on our website? I'm aware that our link profile isn't perfect (not very bad either), but to my knowledge Panda was/is an on-page driven algorithm update, right?
We're also running a demo subdomain (click 'demo' in the menu), hosting five full Joomla installations there to showcase our products (just like virtually all other template providers do). This subdomain seems to have taken a hit as well, but less than the www subdomain (about a 15% decrease in organic search visits). Is it possible that the demo subdomain has triggered this issue (and if so, what changes would you advise)?
Any help would be greatly appreciated!
Technical SEO | | Theo-NL1 -
Panda 2.2 Full Recovery In Action
I have had several new clients come to me after Panda and Panda 2. Lots of audits. The client who had the worst problems, and has since corrected the worst issues based on my audit, just bounced back in an epic way, and while it could be a short-term thing, I don't believe that's the case - it's just too big of a jump back - full recovery. I'm curious to find out if anyone sees a similar recovery on your sites. FYI the biggest problems (most of which have been resolved now) include:
Content organization - it was a mess of a site
Extreme over-use of ads on the page and in the content
Topical focus - there was so much going on across every page of the site that confused Google
Major site speed issues
Technical SEO | | AlanBleiweiss1