Duplicate Content Report: Duplicate URLs being crawled with "++" at the end
-
Hi,
In our Moz report over the past few weeks I've noticed some duplicate URLs appearing like the following:
Original (valid) URL:
http://www.paperstone.co.uk/cat_553-616_Office-Pins-Clips-and-Bands.aspx?filter_colour=Green
Duplicate URL:
http://www.paperstone.co.uk/cat_553-616_Office-Pins-Clips-and-Bands.aspx?filter_colour=Green**++**
These aren't appearing in Webmaster Tools, or in a Screaming Frog crawl of our site so I'm wondering if this is a bug with the Moz crawler? I realise that it could be resolved using a canonical reference, or performing a 301 from the duplicate to the canonical URL but I'd like to find out what's causing it and whether anyone else was experiencing the same problem.
Thanks,
George
-
So glad to help, George!
-
Hi Chiaryn,
Thanks - you've been really helpful! I had assumed that as the referrer wasn't in the Web UI (per WMT), it wasn't available anywhere. I'd also assumed it was a copywriting issue and not a product data issue.
Need to readdress my assumptions
George
-
Hey George,
Thanks for writing in.
I looked into the pages with the ++ in the URL and it seems that they do actually exist on the site, so it isn't an issue with our crawler that is causing these in your crawl errors. For example, a link to the URL http://www.paperstone.co.uk/cat_553_Desktop-Essentials.aspx?filter_colour=Green++ can be found in the source code of the page http://www.paperstone.co.uk/cat_553_Desktop-Essentials.aspx here: http://screencast.com/t/HpHTlSs5gH8H
You can find the referral pages for the ++ pages on the site by downloading the Full Crawl Diagnostics CSV. In the first column, perform a search for the ++. When you find the correct row, look in the column labeled referrer, AM. This tells you the referral URL of the page where our crawlers first found the URLs that include ++. You can then visit this URL to find the links to those pages.
Since these URLs with the ++ do resolve with a 200 http status and they have the same code and content as the pages without the ++, our crawler will count them as duplicate content. I'm not certain why Screaming Frog and GWT are not find or reporting these pages; it may be that they parse the + signs in the URL differently than our crawler does.
As Keri and bishop23 mentioned, this is most likely not a major issue if GWT isn't reporting the errors, but we prefer to report the issues because we would rather be safe than sorry.
I hope this helps. Please let me know if you have any other questions.
Chiaryn
-
I'm not seeing an answer that jumps out at me for this one. For the immediate future, don't sweat it if you're not seeing it in GWT. This is assigned to our help desk, and we'll have someone from there investigate more and get back to you, though it might be a few days because of the Thanksgiving holiday (if you don't get an answer today, it may be Monday before we have a chance to respond).
-
If they're not appearing on WMT than you should ignore unless it's an exact duplicated content, then delete
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
On Oct 1st I set up a report to be sent to my client MONTLY... however it says the next report will be sent out Dec 1st. Same on other clients...is this monthly or every 2 months
on Oct 1st I set up a report to be sent to my client MONTLY... however it says the next report will be sent out Dec 1st. Same on other clients...is this monthly or every 2 months
Product Support | | Master-Charles0 -
I have removed a subdomain from my main domain. We have stopped the subdomain completely. However the crawl still shows the error for that sub-domain. How to remove the same from crawl reports.
Earlier I had a forum as sub-domain and was mentioned in my main domain. However i have now discontinued the forum and have removed all the links and mention of the forum from my main domain. But the crawler still shows error for the sub-domain. How to make the crawler issues clean or delete the irrelevant crawl issues. I dont have the forum now and no links at the main site, bu still shows crawl errors for the forum which doesnt exist.
Product Support | | potterharry0 -
Keeping getting the same results for "link opportunities"
After solving my 404 error, the results in 'link opportunities' are still the same. How do i refresh the site and get updated results?
Product Support | | kevinbp0 -
Duplicated content issue?
Hi, Moz tools tell me that I have approx. 15xx Pages with High Priority Issues [Duplicate Page Content] going thru the details here are the majority: many products no longer exist and visitor redirected (404) to a same not found page. duplicated content with url appended with utm_source/utm_medium/utm_campaign GA tracking paras. On those page there are already canonical tags plus I've configed those parameters in Google Webmaster "URL Parameters" how much of these warning could compromise the seo proformance or provided that there have been dealt with should I just ignore these warning?
Product Support | | LauraHT0 -
I can't pull September monthly report, only August available, please help?
When i try select date range for my September report, monthly report is not available, it only shows weekly report. August monthly report is available and not September but we are already in October.
Product Support | | francoismuscat0 -
Finding the old reports and running new ones
When I click on a link to a recent report I don't end up there. I end up on a site I do not recognise or have time to plough through. Can anyone please give me a link to the reports. Many thanks in advance.
Product Support | | Niamh20 -
MOZ Crawl help
Our MOZ report says it crawled 1800 pages so it reports a lot of errors based on those pages. We don't have that many pages on our site. What is MOZ crawling? I updated the profile to make sure it crawls the filtered page section of Google Analytics.
Product Support | | JessiK0 -
Are the on page report cards graded according to the keywords associated with your campaign only?
One of the key phrases I have associated with my campaign is Las Vegas wedding venues. I have an on-page report card grade D for a page I am optimizing for the long tail key phrase outdoor wedding venues in Las Vegas. Some of the issues I am being asked to fix are:
Product Support | | leslieevarts
1. Broad Keyword Usage in Page Title: Employ the keyword in the page title, preferably as the first words in the element.
2. Keyword Usage in URL
3. Appropriate Keyword Usage in H1 Tag Should I disregard these fixes because Moz.com is running these report as if I am optimizing for Las Vegas wedding venues as oppose to outdoor wedding venues in Las Vegas? All help is appreciated!! Thank you so much 😃0