Help with Roger finding phantom links
-
It Monday and Roger has done another crawl and now I have a couple of issues:
- I have two pages showing 404->302 or 500 because these links do not exist. I have to fix the 500 but the 404 is trapped correctly.
http://www.oznappies.com/nappies.faq & http://www.oznappies.com/store/value-packs/\
The issue is when I do a site scan there is no anchor text that contains these links. So, what I would like to find out is where is Roger finding them. I cannot see any where in the Crawl Report that tells me where the origin of these links is.
- I also created a blog on Tumblr and now every tag and rss feed entry is producing a duplicate content error in the crawl stats. I cannot see anywhere in Tumblr to fix this issue.
Any Ideas?
-
Thanks again Ryan, you have been very helpful answering al lot of my questions.
-
Someone else asked the same question regarding tag pages yesterday. I would suggest asking a separate Q&A on that topic.
Tag pages & forum category pages are both often used as containers. They don't have any content except links to articles. I would ask for feedback as to the best practice. I suspect noindex, following those pages would be best, but I don't have the experience to feel comfortable offering that advice.
-
I have been looking at the data that Roger is reporting for the duplicate content and in ALL cases there is either a 301 or a NoIndex. So now I do not know why Roger is reporting them as a duplicate, robots should not see the second entry.
-
I did not think of looking at the csv report. I see it now thanks Ryan. There should be a soft 404 handler in place to process the bad urls, I will have to see why it is not working.
With tumblr, I was looking for an easy way to add a blog to the site.
The RSS is coming from tumblr as is all the content.
When we specify Tags in tumblr it creates urls e.g. mypage.com/article/tag1 mypage.com/article/tag2 mypage.com/article/tag3 which all contain the content of mypage.com/article with out a canonical to the original. It is a really strange non-seo friendly approach, and so I wondered if anyone had similar problems.
-
The crawl report offers a "referrer" field. That field offers where Roger found the offending link. In my experience that field has always been accurate.
When I try to access www.oznappies.com/faq I receive a 302 redirect and a 500 error. I would recommend adjusting non-existant pages to a soft 404 page. Still provide a 404 response to browsers, but offer users a friendly way to find information (i.e. links / search) and stay on your site.
A great example of a soft 404 page is http://www.orangecoat.com/a-404-page.html
For the Tumblr issue, I am not clear on the problem. Are you writing content and publishing on both the oznappies.com site and your tumblr site? Then this content is being published again on your site via a RSS import?
-
I removed the links and just left the text so these will cut and paste now. It confuses me where Roger found the links.
Thanks for running the Xenu scan. I have tried other site scanner and come up blank.
-
That second link is anchored to the wrong place.
Regardless I also cannot find the .faq page. I just ran Xenu over it to see what it could find, but no broken links showed up.
Afraid I don't use Tumblr either, so eh, pretty useless post. Sorry.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Good tool to track external links from the website
I am in search of a tool that provides me links generating from my site to another site. Is there a software or tool that can scan the whole site and provide me what are the links of other sites in my site.
Moz Pro | | csfarnsworth0 -
Inbound Link Tool?
I signed up for SEOMOZ so that I could get a conclusive list of links for my competition. I hope I can get this information with this tool. Can I? And where can I find the tool? P.S. I don't need my own inbound links; I can get these with google webmaster tools.
Moz Pro | | bachdeg0 -
Google and Open Site Explorer not showing as many links
I've noticed this past week that when you search for the links pointing to a given site, by using the "link:" operator, that Google not showing as many links as they use to. I noticed this also with Open Site Explorer, it is not showing the detail link information as much as it did before. Is Google trying to mask what we can view now on competitors backlinks? If so, how can we see the backlink building that our competitors are doing?
Moz Pro | | tdawson090 -
A grade but no ranking. Can anyone help?
Hi there, I'm new to SEOmoz but familiar with SEO in general. I recently setup a new campaign for my wife's eCommerce business - www.toucankids.com. We optimized 6 pages for 6 well researched key phrases. These pages have been indexed by Google however they didn't show up in the on page reports section of SEOmoz. I've just worked out how to add them manually so i've done that. The good news is that all of the pages get an A grade so we must be doing something right. The bad news is that we haven't got a top 50 result for any of them which is quite disheartening. I'd rather not invest lots of effort more page optimisation if this approach isn't working. Can anyone offer any advice on next steps / fine-tuning? Thanks in advance! P.S. We're going through a process of link building to try and improve our domain authority at the moment so i'm hoping that Friday's linkscape update will give us some better news in that department.
Moz Pro | | rmbdigital0 -
Sometimes we could not download all the data of inbound links from all linking domains
Sometimes we could not download all the data of inbound links from all linking domains. (Around tithe.) Don't you have any idea?
Moz Pro | | crossfinity0 -
Help Understanding Crawl results on this site
I'm just starting to SEO this site http://thefirmbusinessbrokerage.com/welcome and I'm having trouble with the crawl report data. First question, should I be building links to the site above or the main page http://thefirmbusinessbrokerage.com/ (which is a flash intro). If I build links to the flash page, what do I do to the forwarding URL to the welcome page to make it effective? Second question, why does the crawl data report show up almost completely blank? Is this site perfect or are there some onsite issues that I'm not seeing. Thanks for your support and guidance on this site. I'm not hosting the site, just building links and offering optimization advice onsite. JOE
Moz Pro | | KreativElement0 -
SEOmoz API - Links and Anchor Text Calls
Hi, I'm testing out the SEOmoz API - however I'm stuggling to understand the use of the Cols parameter within the "anchor-text" method. I've looped through increasing numbers of "Cols" for a standard query and there just seems to be no logical pattern.
Moz Pro | | AlexThomas
** - Could someone please enlighten me as to how this works?** E.g. of results for query: http://lsapi.seomoz.com/linkscape/anchor-text/www.seomoz.org/?Scope=term_to_page&Sort=domains_linking_page&Cols=1 1Array ( [0] => Array ( [aturid] => 86128451138 ) [1] => Array ( [aturid] => 86128451144 ) [2] => Array ( [aturid] => 86128451131 ) ) 2Array ( [0] => Array ( [atut] => seomoz ) [1] => Array ( [atut] => seomoz.org ) [2] => Array ( [atut] => seo ) ) 3Array ( [0] => Array ( [aturid] => 86128451138 [atut] => seomoz ) [1] => Array ( [aturid] => 86128451144 [atut] => seomoz.org ) [2] => Array ( [aturid] => 86128451131 [atut] => seo ) ) 4Array ( [0] => Array ( [atui] => 38845159274 ) [1] => Array ( [atui] => 38845159274 ) [2] => Array ( [atui] => 38845159274 ) ) 5Array ( [0] => Array ( [atui] => 38845159274 [aturid] => 86128451138 ) [1] => Array ( [atui] => 38845159274 [aturid] => 86128451144 ) [2] => Array ( [atui] => 38845159274 [aturid] => 86128451131 ) ) 6Array ( [0] => Array ( [atui] => 38845159274 [atut] => seomoz ) [1] => Array ( [atui] => 38845159274 [atut] => seomoz.org ) [2] => Array ( [atui] => 38845159274 [atut] => seo ) ) 7Array ( [0] => Array ( [atui] => 38845159274 [aturid] => 86128451138 [atut] => seomoz ) [1] => Array ( [atui] => 38845159274 [aturid] => 86128451144 [atut] => seomoz.org ) [2] => Array ( [atui] => 38845159274 [aturid] => 86128451131 [atut] => seo ) ) 8Array ( [0] => Array ( [atuiu] => 1 ) [1] => Array ( [atuiu] => 1 ) [2] => Array ( [atuiu] => 0 ) ) 9Array ( [0] => Array ( [atuiu] => 1 [aturid] => 86128451138 ) [1] => Array ( [atuiu] => 1 [aturid] => 86128451144 ) [2] => Array ( [atuiu] => 0 [aturid] => 86128451131 ) ) 10Array ( [0] => Array ( [atuiu] => 1 [atut] => seomoz ) [1] => Array ( [atuiu] => 1 [atut] => seomoz.org ) [2] => Array ( [atuiu] => 0 [atut] => seo ) ) Links API: Similar confusion here for:
"TargetCols"
"SourceCols"
"LinkCols" The description here http://apiwiki.seomoz.org/w/page/13991141/Links API - is a bit vague It appears that the links API spits out everything anyway - that one's less of an issue. So... could anyone explain how the Anchor-text API parameter Cols works?? Cheers!0