Need help fixing the duplicate content that keeps growing
-
Can you do me a favour and check whether http://www.............magazine.com is a duplicate of the domain we are talking about above? I used your similar page checker and it said 100%. We have a redirect on the domain, but I am concerned that it may not be effective.
-
Thank you for your help. So these pages have no content on them yet, and I guess I need to put some content on them. Do you think that these issues affect my Google ranking?
T
-
Hey T,
I want to let you know that I removed the link to your campaign from your last post and I would recommend that you don't post those types of links publicly. Our site is secure, and only admins and account owners can access data through those links, but you never know what someone may try to do maliciously with that information.
For the link you provided, we are reporting those pages as duplicate content because they are pretty much 100% similar in the code and content for the page: http://www.screencast.com/t/qMgveoj4i. (Our campaign tolerance is 90% similarity.) The only difference is the state name the pages refer to, which is not enough to make the pages different. You can verify that using this tool here: http://smallseotools.com/similar-page-checker/
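To make the 90% tolerance more concrete, here is a rough sketch of how two pages that differ only in the state name can score above such a threshold. It is only an illustration: it uses Python's standard-library `difflib` on word lists, which is not how our crawler detects near duplicates, and the page template, the function names, and the constant are all made up for the example.

```python
from difflib import SequenceMatcher

# Assumed threshold, taken from the "90% similarity" tolerance mentioned above.
DUPLICATE_THRESHOLD = 0.90

# Hypothetical page template in which only the state name changes.
TEMPLATE = (
    "Find trusted roofing contractors in {state}. Our {state} directory lists "
    "licensed roofers with reviews, prices, and contact details so you can "
    "compare quotes and hire with confidence."
)


def similarity(text_a: str, text_b: str) -> float:
    """Return a word-level similarity ratio between 0.0 and 1.0."""
    words_a, words_b = text_a.split(), text_b.split()
    # autojunk=False keeps difflib from ignoring very common words.
    return SequenceMatcher(None, words_a, words_b, autojunk=False).ratio()


def is_duplicate(text_a: str, text_b: str,
                 threshold: float = DUPLICATE_THRESHOLD) -> bool:
    """Flag two texts as near duplicates when they meet the threshold."""
    return similarity(text_a, text_b) >= threshold


texas_page = TEMPLATE.format(state="Texas")
ohio_page = TEMPLATE.format(state="Ohio")

if __name__ == "__main__":
    print(f"similarity: {similarity(texas_page, ohio_page):.2%}")
    print("flagged as duplicate:", is_duplicate(texas_page, ohio_page))
```

Swapping one word in an otherwise identical template barely moves the score, which is why changing the state name alone is not enough to make the pages different.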
Here is a great resource for learning about canonical tags: https://moz.com/learn/seo/canonicalization
Here is a post about how we detect duplicate content: https://moz.com/devblog/near-duplicate-detection/ And for more information on using canonical tags, check out this great post by our very own Dr. Pete: http://moz.com/blog/rel-confused-answers-to-your-rel-canonical-questions
-
Hi Chiaryn and Bryan,
Thanks for your help. I did not want to be too specific initially as I did not know how open this help request would be. The site is as mentioned above.
One question: will all this duplicate content affect the site in Google?
I can apply the
User-agent: *
Disallow: ?attachment
What is the code to perform the canonical tag?
Also, please can you look at this duplicate content? I don't understand why it is duplicate content.
If I fix these issues, will my site ranking improve?
Thanks again
T
-
Hey there! It looks like you are trying to get some assistance without specifically naming the site you are concerned about, and I definitely understand that, but it is really difficult to give advice on this issue without more detailed information. However, I took a look at your campaigns and I am going to address the issue I am seeing with the site that had the largest increase in duplicate content over the last couple of crawls. I apologize if this isn't the site you are referring to.
The campaign I'm looking at is the 5 Star campaign. It looks like a large number of the pages with duplicate content are related to ?attachment parameters in the URL, such as www.site.com/?attachment_id=77899. There is very little content on these pages and it looks like they are added to the site pretty regularly, since all of the ones I looked at are dated closely together.
I'm not an SEO expert, so Bryan may have better advice for you, but I can give a few suggestions for how to resolve this issue. I don't entirely understand the purpose of these pages, and that would affect which of these options might be best for your personal strategy for the site.
You can add a canonical tag to these pages to point to one specific page as the most important page with this content. For this option, they would all have to point back to the same page, or our crawler will still show them as duplicates, because we assume that the two canonical pages are then also likely to be duplicates. Google, however, will stop indexing these pages.
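For reference, the canonical tag itself is a `<link rel="canonical" href="...">` element placed in the page's `<head>`. The sketch below shows what that looks like in a sample page and how you could check which URL a page declares, using only Python's standard library. The sample HTML, the `/preferred-page/` URL, and the class and function names are assumptions for illustration, not anything taken from your site.

```python
from html.parser import HTMLParser
from typing import Optional

# Hypothetical page source: the <link rel="canonical"> element in <head>
# names the one URL that should be treated as the preferred version.
SAMPLE_PAGE = """<html><head>
<title>Attachment</title>
<link rel="canonical" href="http://www.site.com/preferred-page/" />
</head><body></body></html>"""


class CanonicalFinder(HTMLParser):
    """Collects the href of the first <link rel="canonical"> element."""

    def __init__(self) -> None:
        super().__init__()
        self.canonical: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        rel_values = (attributes.get("rel") or "").lower().split()
        if tag == "link" and "canonical" in rel_values and self.canonical is None:
            self.canonical = attributes.get("href")


def find_canonical(html: str) -> Optional[str]:
    """Return the canonical URL declared in the HTML, or None if absent."""
    finder = CanonicalFinder()
    finder.feed(html)
    return finder.canonical


if __name__ == "__main__":
    print(find_canonical(SAMPLE_PAGE))
```

Running a check like this over a handful of the ?attachment pages is a quick way to confirm that they all declare the same canonical URL.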
You can also block these pages from being accessed using the robots.txt file for this site. For example, it would look something like this:
User-agent: *
Disallow: ?attachment
etc., until you have covered all of the parameters you would like to block. The User-agent: * blocks all crawlers from accessing those pages, but you can also use User-agent: rogerbot to specifically block only our crawler.
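Before relying on a rule like this, it can help to test it against a few real URLs. Here is a minimal sketch using Python's standard-library `urllib.robotparser`. Two caveats: the sketch writes the path with a leading slash (`/?attachment`), which is how robots.txt paths are normally written and is an adjustment to the rule as typed above, and this parser is only one interpretation of robots.txt, so a given crawler may behave differently. The `/blog/` URL and the helper name are assumptions for the example.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt contents; the leading "/" on the path is an adjustment
# made for this sketch, since robots.txt paths normally start at the site root.
ROBOTS_TXT = """\
User-agent: *
Disallow: /?attachment
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text directly, without fetching it over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# An attachment URL of the kind described above should be refused...
attachment_ok = parser.can_fetch("rogerbot", "http://www.site.com/?attachment_id=77899")
# ...while an ordinary page (hypothetical URL) should remain crawlable.
blog_ok = parser.can_fetch("rogerbot", "http://www.site.com/blog/")

if __name__ == "__main__":
    print("attachment page crawlable:", attachment_ok)
    print("blog page crawlable:", blog_ok)
```

To test a rule aimed only at our crawler, change the `User-agent: *` line in the sample text to `User-agent: rogerbot` and call `can_fetch` with a different agent name for comparison.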
I hope this helps! Please let me know if I can help you with anything else.
-
That's still not quite enough to go on. Could you provide the message they're giving you, and/or URLs regarding the duplicate content? Examples in either should prove helpful.
-
Hi, thanks for the heads up. It has just been very frustrating.
Moz and Google show duplicate internal content.
I am using WordPress and the Yoast SEO plugin to try to fix some of the issues.
Blog content
Regards T
-
You'll have to be a lot more specific. Try answering at least some of these questions so we can help you:
- What content is being duplicated?
- Where are the duplicates?
- Is this all internal (on your site only)?
- Are you receiving any duplicate content warnings from Moz, Google, etc.?
- In what way does it "keep growing"?
- What kind of content is this?
Once you provide answers to some of these questions, I'm sure we'll be able to help you fix the issue.