Why is the exact same URL being seen as duplicate and showing an error in my SEO reports
-
Well, I am still having duplicate page issues.
I have a question about one of the errors SEO is giving me when I download a crawl report. I am going to attach a screen shot of part of the report so you can see for yourself, along with explaining it here.
SEO shows the list of URL's that it crawled in the report. In this(see attachment) portion of the report it has 321 results for the exact same URL. It also says all of these exact same URL's have received a 404 error. What I want to know is how does it make 321 results for the same URL? And with this error that I don't see when I look at the page?
-
Hey Josh,
You may want to speak with your developer on this one PHP is a server language and could be generating the unique pages and causing Roger to crawl twice depending on which way he approaches the page. I apologize for the delay here as I was not receiving notifications on this post.
-
I think it does so by PHP. Is there any easy way I can make sure?
-
Hey Josh,
Rogerbot discovered this link through the /blog/ subfolder on your page which led him to the which battery post. By any chance does the auto forward do so by PHP or Javascript? Sometimes Roger can get a little hung up on these pages and think they don't exist.
Let me know either way so we can get this taken care of!
Thanks
-
Ok I get which page it is now but I do not understand the issue. There isnt a direct link to that blog post from any of the other pages crawled. And if you use either link they both work. The Column a link however does auto forward to the MaxAmps site equivalent page. Could it somehow have to do with this?
-
Hey Josh,
This is James from Moz Help. I'd like to see if I can assist in diagnosing this problem. I had a chance to pull your crawl .CSV and the 321 pages you have in column F are actually the referring pages the actual page we crawled will be in Column A. Essentially the What Battery Can we MaxAmps Build for You has 321 unique pages it links too. While this page does not 404 it seems Roger can not get to the pages after this.
Feel free to follow up here or send us a message to help@moz.com
Have a great day!
-
Okay I will be sure to try and implement this to the blog.
You seem to know how a php dynamic site builds itself from one page. May I ask you another question? We have been going through this duplicate page problem for a long time. I generally get the answer to redirect or canonical the pages. I understand how I can do this to our blog as it is in word press. However, the rest of our site is not. The category section of our page, for instance, generates depending on how the person gets there from one single page of coding. And that is for all of our categories, not just one. How can I implement a redirect or canonical to this type of site. I would not want all of the categories to lead to one particular category. So if I put the canonical tag in my category page with a single url than that won't work. Also if I use a redirct in this page it will still lead me to only one category (correct?) instead of the people having the option to go to several different categories.
-
Hi Josh,
By chance does this url have parameters. Those may not be reporting properly in this report. That would be my first thought. I have seen that frequently in blog / forum crawls, as those usually have many parameters for starting at a certain post number. The simple solution is to just rel canonical the page to its root.
As for the 404 error I would guess that some page is generated or linking to urls with certain parameters that the page itself doesn't know how to handle.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Explain me how you use keyword explorer for SEO
Hello, Can you explain me how you use keyword explorer for SEO because I watched many video and after watching them everything is a little confusing... From my understanding, you have you main keyword let's say "Normandy bike tours". Based on that keyword you decide to cover TOPICS. Let's say I decide to cover D-Day landings, Omaha beach and and Caen on my page about" Normandy bike tours" ( I could cover more or others I imagine but now let's say those TOPICS are the ones I have decided to cover. From that point on what I understand is that I need to cover each of those TOPICS and the way to cover each of them is to type each of those words separately in the keyword explorer and look for words that the keyword explorer gives mes that it considers to be "semantically related" to each of those TOPICS. Then, if I choose the corrects one google will understand the different TOPIC and that should boost my ranking. Is it the whole idea ? PS : How many do you look at to find semantically related words ? 10, 20 or more ? Thank you,
Moz Bar | | seoanalytics1 -
Blocked Resource in Google Index. SSL certificate blocking 718 pages seen in Google Search Console.
My google search console indicates that my SSL certificate is blocking Googlebot. I was wondering if the blocking of my SSL certificate to the GoogleBot is causing any issues. I I'm not sure if this was only blocked recently by Volusion (my host) as a means of accommodating my ssl certificate not being able to address the various url versions of my site, or is this just commonplace and not really harmful to my indexing. I tested one of these "blocked" urls in the robots.txt tester and it showed that the Googlebot was allowed. Could it be just the SSL certificate at the bottom of the page is blocked? Thanks
Moz Bar | | mrkingsley0 -
URL is Inaccesible
Hi, I have tried this url: https://www.3dquickprinting.com/ on Moz onpage grader but it responds "Sorry, but that URL is inaccessible." I have checked my Robots.txt but it does not have any entry to block MOZ crawler. Please see: #User-Agent: *
Moz Bar | | HiteshP
#Crawl-Delay: 30 #For robots.txt
User-agent: BLEXBot
Disallow: /
User-agent: MJ12bot
Disallow: /
User-agent: TwengaBot
Disallow: /
User-agent: 008
Disallow: /
User-agent: WotBox
Disallow: / Please advice what to do get rid from this error. Thanks,0 -
Www and non www / duplicate content / redirects / www resolve issue
I am not getting docked for these specific errors, but I am getting docked for 1 page has a WWW resolve issue and 1 wrong URL in the sitemap... (SEM Rush) but when I use moz, it's not showing any issues. So I have these things set up so far: In .htaccess i have a command that removes the www. 301 redirect from www version to the non www (homepage) canonical on index.html pointing to non www version, I also set up a canonical tag for each page on the site search console with non www, www, https www, https non www all set to non www preference. Also, when I fetch the www version in google search console it says it's being 301 redirected to non www version which is basically what I want.Is there anything that i'm missing? These errors on SEM Rush are giving me anxiety lol.
Moz Bar | | donnieath1 -
How do I pull a rankings report for a specific week in the past?
The date of the rankings I want is January 13th. I can see the individual keyword history in rank tracker, but I have to go each word. I'd like to just have that report to compare it to, since that is when a lot of our rankings changed. So I need that report to compare our future efforts with. Please advise and thank you! Ruben
Moz Bar | | KempRugeLawGroup0 -
Moz Crawler URL paramaters & duplicate content
Hi all, this is my first post on Moz Q&A 🙂 Questions: Does the Moz Crawler take into account rel="canonical" for search results pages with sorting / filtering URL parameters? How much time does it take for an issue to disappear from the issues list after it's been corrected? Does it come op in the next weekly report? I'm asking because the crawler is reporting 50k+ pages crawled, when in reality, this number should be closer to 1000. All pages with query parameters have the correct canonical tag pointing to the root URL, so I'm wondering whether I need to noindex the other pages for the crawler to report correct data?: Original (canonical URL): DOMAIN.COM/charters/search/mx/BS?search_location=cabo-san-lucas Filter active URL: DOMAIN.COM/charters/search/mx/BS?search_location=cabo-san-lucas&booking_date=&booking_days=1&booking_persons=1&priceFilter%5B%5D=0%2C500&includedPriceFilter%5B%5D=drinks-soft Also, if noindex is the only solution, will it impact the ranking of the pages involved? Note: Google and Bing are semi-successful in reporting index page count, each reporting around 2.5k result pages when using the site:DOMAIN.com query. The rel canonical tag was missing for a short period of time about 4 weeks ago, but since fixing the issue these pages still haven't been deindexed. Appreciate any suggestions regarding Moz Crawler & Google / Bing index count!
Moz Bar | | Vukan_Simic0 -
Report Dates are being displayed in a strange format ??
My reports dates are showing a very strange format (month/day/year) how do we fix to correct format (day/month/year) ? 😉
Moz Bar | | Dan-Lawrence0 -
Keyword Difficulty Showing ONLY Bing Search Volume (Exact Match)
Hi I am using the "Keyword Difficulty" tool and selecting "Google US". But the report that gets generated shows "Bing Search Volume (Exact Match). Is there any way to get "Google Search Volume (Exact Match)" being shown in the report? Regards
Moz Bar | | rholt0