Help with Roger finding phantom links
-
It's Monday, Roger has done another crawl, and now I have a couple of issues:
- I have two pages showing 404->302 or 500 because these links do not exist. I have to fix the 500 but the 404 is trapped correctly.
http://www.oznappies.com/nappies.faq & http://www.oznappies.com/store/value-packs/
The issue is that when I do a site scan, no anchor on the site contains these links. So what I would like to find out is where Roger is finding them. I cannot see anywhere in the Crawl Report that tells me the origin of these links.
- I also created a blog on Tumblr, and now every tag and RSS feed entry is producing a duplicate content error in the crawl stats. I cannot see anywhere in Tumblr to fix this issue.
Any Ideas?
-
Thanks again Ryan, you have been very helpful answering a lot of my questions.
-
Someone else asked the same question regarding tag pages yesterday. I would suggest asking a separate Q&A on that topic.
Tag pages & forum category pages are both often used as containers. They don't have any content except links to articles. I would ask for feedback as to the best practice. I suspect setting those pages to "noindex, follow" would be best, but I don't have the experience to feel comfortable offering that advice.
-
I have been looking at the data that Roger is reporting for the duplicate content, and in ALL cases there is either a 301 or a NoIndex. So now I do not know why Roger is reporting them as duplicates; robots should not see the second entry.
-
I did not think of looking at the CSV report. I see it now, thanks Ryan. There should be a soft 404 handler in place to process the bad URLs; I will have to see why it is not working.
With Tumblr, I was looking for an easy way to add a blog to the site.
The RSS is coming from Tumblr, as is all the content.
When we specify tags in Tumblr, it creates URLs such as mypage.com/article/tag1, mypage.com/article/tag2 and mypage.com/article/tag3, which all contain the content of mypage.com/article without a canonical pointing to the original. It is a really strange, non-SEO-friendly approach, so I wondered if anyone had similar problems.
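To make the missing canonical concrete, here is a minimal Python sketch of the mapping described above. It assumes the tag is always the last path segment, as in the mypage.com/article/tag1 example; real Tumblr URLs may be laid out differently, and the function names are hypothetical.

```python
from html import escape
from urllib.parse import urlsplit, urlunsplit


def canonical_for_tag_url(url):
    """Map a tag URL such as /article/tag1 back to /article.

    Assumption: the tag is the final path segment and the article
    URL is everything before it.
    """
    parts = urlsplit(url)
    segments = [s for s in parts.path.split("/") if s]
    if len(segments) >= 2:
        segments = segments[:-1]  # drop the trailing tag segment
    path = "/" + "/".join(segments)
    # Query string and fragment are dropped from the canonical URL.
    return urlunsplit((parts.scheme, parts.netloc, path, "", ""))


def canonical_link_tag(url):
    """Build the <link> element a tag page would need in its <head>."""
    href = escape(canonical_for_tag_url(url), quote=True)
    return '<link rel="canonical" href="%s" />' % href
```

Each of the three tag URLs in the example would then point at the same canonical, mypage.com/article. Whether a Tumblr theme allows this tag to be emitted is a separate question that this sketch does not answer.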
-
The crawl report offers a "referrer" field. That field offers where Roger found the offending link. In my experience that field has always been accurate.
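If the crawl report is exported as CSV, the referrer lookup can be scripted. The sketch below is only an illustration: the column names ("URL", "HTTP Status Code", "Referrer") and the sample rows are assumptions, so adjust them to match the headers in your actual export.

```python
import csv
import io


def find_referrers(csv_text, url_col="URL",
                   status_col="HTTP Status Code", ref_col="Referrer"):
    """Return (url, status, referrer) for every row with a 4xx/5xx status.

    The column names are assumed, not taken from a real crawl export.
    """
    hits = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        status = (row.get(status_col) or "").strip()
        if status[:1] in ("4", "5"):
            hits.append((row[url_col], status, row.get(ref_col) or ""))
    return hits


# Made-up sample data, for illustration only.
sample = """URL,HTTP Status Code,Referrer
http://www.example.com/,200,
http://www.example.com/missing.faq,404,http://www.example.com/help
"""

for url, status, referrer in find_referrers(sample):
    print(status, url, "linked from", referrer)
```

With a real export, read the file contents into a string (or adapt the function to take a file object) and the referrer column shows which page holds the broken link.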
When I try to access www.oznappies.com/faq I receive a 302 redirect and a 500 error. I would recommend sending requests for nonexistent pages to a soft 404 page. Still provide a 404 response to browsers, but offer users a friendly way to find information (i.e. links / search) and stay on your site.
A great example of a soft 404 page is http://www.orangecoat.com/a-404-page.html
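As a rough illustration of that idea (a friendly page for visitors that still returns a real 404 status rather than a 302 or 500), here is a minimal Python WSGI sketch. The page contents, the `KNOWN_PAGES` table and the `serve` helper are all hypothetical; the site in question may run on a completely different stack.

```python
from wsgiref.simple_server import make_server

# Hypothetical site content, keyed by path.
KNOWN_PAGES = {"/": b"<h1>Home</h1>"}

NOT_FOUND_BODY = (
    b"<h1>Page not found</h1>"
    b"<p>Try the <a href='/'>home page</a> or the site search.</p>"
)

HTML = [("Content-Type", "text/html; charset=utf-8")]


def app(environ, start_response):
    """Serve known pages; answer everything else with a friendly 404."""
    path = environ.get("PATH_INFO", "/")
    body = KNOWN_PAGES.get(path)
    if body is not None:
        start_response("200 OK", HTML)
        return [body]
    # Unknown URL: helpful page for users, but the status stays 404
    # so crawlers do not treat the missing page as real content.
    start_response("404 Not Found", HTML)
    return [NOT_FOUND_BODY]


def serve(port=8000):
    """Run the app locally for manual testing (not called on import)."""
    with make_server("", port, app) as server:
        server.serve_forever()
```

Calling `serve()` and requesting any unknown path should show the friendly page while the response status remains 404.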
For the Tumblr issue, I am not clear on the problem. Are you writing content and publishing it on both the oznappies.com site and your Tumblr site? Is this content then being published again on your site via an RSS import?
-
I removed the links and just left the text so these will cut and paste now. It confuses me where Roger found the links.
Thanks for running the Xenu scan. I have tried other site scanners and come up blank.
-
That second link is anchored to the wrong place.
Regardless, I also cannot find the .faq page. I just ran Xenu over it to see what it could find, but no broken links showed up.
Afraid I don't use Tumblr either, so eh, pretty useless post. Sorry.