7,608 High Priority Crawl Diagnostic problems
-
Hey There,
I have an e-commerce site that is showing 7,608 High Priorities to fix - 7,536 are duplicate content. What's the most effective process to start with?
I'm open to outsourcing some of the work to an expert - email me on dave@emanbee.com
Thanks for your time,
Dave
-
Cheers Kate.
From doing more reading, MOZ/ Google views thin content (300 words or less) or webpages with 95% of the same HTML code as duplicate. That will be the majority of what is showing in my crawl diagnostics.
That means I'm back to your original advice of fixing up duplicate page titles from GWT.
Currently, the canonical tags are generated sitewide through a template function. Without full control over the canonical tag I can't fix or structure things as easily as I'd like so I will see if a web dev can help out with this. We should be able to add the whole link too.
Thanks again,
Dave
-
Looks like moz isn't taking the canonical into effect, as long as it's there, you're fine. But I'd warn you not to use relative canonical links ( /directory/page/ vs http://www.domain.com/directory/page/), link to the whole thing. I've seen this go wrong in the past. It's not causing issues now but could in the future.
-
Hi Katemorris,
Thanks again for getting back to me.
I have started going through and fixing up pages. I'm hoping you can clarify something from MOZ for me?
In MOZ > crawl diagnostics> duplicate page content (the largest and only high priority issue listed for me) > the first link in the list > show the duplicate pages
Below is an example of 4 links that are all showing as duplicates of http://www.mooloolabamusic.com.au/page/brands in the moz software:
http://www.mooloolabamusic.com.au/live-sound-lighting/lighting/atmospheric-effects/?pr=72-82&rf=pr
http://www.mooloolabamusic.com.au/live-sound-lighting/lighting/atmospheric-effects/?pr=0-72&rf=pr
http://www.mooloolabamusic.com.au/studio-production/?pr=1732-1828&rf=pr
http://www.mooloolabamusic.com.au/studio-production/?pr=1770-1827&rf=pr
Can you please clarify how these pages have duplicate content and how to fix this? There are thousands like this.
When I have a look at them using the moz search bar there is already a cononical tag in the header which is either not working or the moz software does not pick it up or is the site template creating 'duplicate content'?
Thanks so much for your time,
Dave
-
Start in Google Webmaster Tools or in the Moz Crawl. Identify those pages with the same title tag and work through that list. The title tag is usually a good indication of duplicate content.
If the content is duplicate for sure, determine if it's a useful duplicate. If so, use a canonical from the duplicate to the original. If it's just duplicated with no real reason, find out how to get rid of the duplicate. This can be anything from unnecessary parameters, to tag pages, and so many more.
You'll start to see trends in the data, try to fix the bigger problems as you see them.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Huge spike in crawl errors today - mozbot ignoring noindex tag?
Hi Mozzers, Today I received a ton of errors and warnings in my weekly crawl due to the mozbot crawling my noindex'd search results pages, such as this - http://www.consumerbase.com/Mailing-Lists.html?q=Construction&type=bus&channel=all&page=7&order=title&orderBy=DESC See image: http://screencast.com/t/qaZzq78j2Udx Anyone else seen a similar error this week with their crawl? Thanks!
Moz Pro | | Travis-W0 -
Problems with csv file from OSE
Hello Support, I have problems with the formatting of csv files from OSE in Excel. I got lines that only contain -- and these lines break up the data. It is possible to correct this manually but a bit annoying if you have 1500+ links generated in the file. I work a lot with csv files from other tools and programs and those give me no problems. Can you help me out please? Greetings Rob
Moz Pro | | FindFactory0 -
My moz only one page was crawled
I recently moved my shopping cart from one provider to another and today moz only crawled one page, could this be because maybe google has not indexed it yet or should i be concerned? I pointed the DNS at the new cart monday night if that helps. I would have expected it to be indexed by now
Moz Pro | | SmartVapes0 -
Can we add sites to the crawl queue for OSE?
Is it possible to request that Open Site Explorer crawls a new URL on its next run? This tool is the first place I go to when working on a new site, and when there is "No Data Available" this is a little frustrating. I fully appreciate that this lack of data is usually a signal that the website is either very new or of low quality, however that if often the reason that I am brought in and would very much like to benchmark and provide initial analysis using this tool. It would make sense that OSE crawls the sites that Moz members are working on wouldnt it? Scott.
Moz Pro | | eseyo0 -
SEomoz slow to crawl?
Hello - I am just trying out the trial and it said the next crawl was nov 1st but I see no change in any of the errors since the initial crawl... so just waiting to find out if what I changed was fixed or not. Is this normal ?
Moz Pro | | Bethany_BabyBrowns0 -
Is The Crawl Diagnostic tool working correctly?
The Crawl Diagnostic tool shows issues and displays a graph but they don't display the page specific results/suggestion like it used to. I get the "Congratulations, there are no pages affected by this issue!" message.
Moz Pro | | -PAUL-0 -
How can I clean up my crawl report from duplicate records?
I am viewing my Crawl Diagnostics Report. My report is filled with data which really shouldn't be there. For example I have a page: http://www.terapvp.com/forums/Ghost/ This is a main forum page. It contains a list of many threads. The list can be sorted on many values. The page is canonicalized, and has been since it was created. My crawl report shows this page listed 15 times. http://www.terapvp.com/forums/Ghost/?direction=asc http://www.terapvp.com/forums/Ghost/?direction=desc http://www.terapvp.com/forums/Ghost/?order=post_date and so forth. Each of those pages uses the same canonicalization reference shared above. I have three questions: Why is this data appearing in my crawl report? These pages are properly canonicalized. If these pages are supposed to appear in the report for some reason, how can I remove them? My desire is to focus on any pages which may have an issue which needs to be addressed. This site has about 50 forum pages and when you add an extra 15 pages per forum, it becomes a lot harder to locate actionable data. To make matters worse, these forum indexes often have many pages. So if I have a "Corvette" forum there that is 10 pages long, then there will be 150 extra pages just for that particular forum in my crawl report. Is there anything I am missing? To the best of my knowledge everything is set up according to the best SEO practices. If there is any other opinions, I would like to hear them.
Moz Pro | | RyanKent0 -
Crawl complete, but nothing changed?
Hi everyone, According to my account, the crawl diagnostics were completed yesterday. However, the duplicate page titles that it mentions aren't correct. The changes that I implemented several days ago are not being shown in the report. When I click the duplicate page title links, the latest date in the graph is 3/26. However, it says the crawl was completed on 3/30. Does it take a few days for the reports to match what the crawl actually discovered?
Moz Pro | | beeneeb0