How do I know which page a link is from
-
I've got an interesting situation. I hope you can help.
I have a list of links but I'm not sure which pages of my site they are from. How do I know which page a specific link is from?
Thanks in advance.
-
From another post I answered:
"Maybe XPATH and PHP could get this? I'll point you at this - http://www.css-resources.com/list-a-websites-external-links-alphabetically-using-xpath-and-php.html - but don't ask me how to do it ;)"
Maybe ask Richard here - http://www.seomoz.org/q/list-of-links-pointing-out-of-my-site - and see if he found a way
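
For what it's worth, here is a rough sketch of the same idea in Python rather than XPath and PHP. It is not the method from the linked article. It assumes you already have the HTML of each page saved somewhere; the `pages` dictionary, the example URLs, and the names `LinkCollector` and `index_links` are all made up for illustration. It builds a lookup from each link to the pages that contain it.

```python
from collections import defaultdict
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag in one HTML document."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def index_links(pages):
    """Map each link to the sorted list of page URLs it appears on.

    `pages` is a hypothetical dict of page URL -> HTML source.
    """
    index = defaultdict(set)
    for page_url, source in pages.items():
        collector = LinkCollector()
        collector.feed(source)
        for href in collector.links:
            index[href].add(page_url)
    return {link: sorted(sources) for link, sources in index.items()}


# Made-up sample data standing in for pages of your own site.
pages = {
    "http://example.com/": '<a href="http://other.example/a">A</a>',
    "http://example.com/about": (
        '<a href="http://other.example/a">A</a> <a href="/contact">C</a>'
    ),
}
link_sources = index_links(pages)
print(link_sources["http://other.example/a"])
```

Links are compared exactly as written in the HTML, so a relative link such as /contact and its absolute form would be listed separately unless you normalise them first.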
-
Hi Vince,
Just to clarify, I'm thinking that perhaps the list of links may be 404 (or similar) errors that you want to fix?
If I am correct, and you found the list of broken links in the SEOmoz Pro App, then you can locate the source URLs by exporting the error report as a CSV and opening it in Excel. Search the list for the broken link, then look at the very last column of that row, "Referrer", to find the source page.
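
If the export is too big to search comfortably in Excel, the same lookup can be sketched in Python. This is only an illustration: the "Referrer" column name comes from the report described above, but the "URL" column name, the sample rows, and the function name `find_referrers` are assumptions, so check them against the header row of your own export.

```python
import csv
import io


def find_referrers(csv_text, broken_url, url_column="URL", referrer_column="Referrer"):
    """Return the Referrer value of every row whose URL matches broken_url.

    The column names are assumptions; adjust them to match the export's header row.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        row[referrer_column]
        for row in reader
        if row.get(url_column) == broken_url
    ]


# Made-up sample standing in for the exported error report.
sample = """URL,HTTP Status Code,Referrer
http://example.com/missing,404,http://example.com/about
http://example.com/ok,200,http://example.com/
http://example.com/missing,404,http://example.com/contact
"""

print(find_referrers(sample, "http://example.com/missing"))
```

For a real export, read the file contents first (for example with `open(path, newline="").read()`) and pass them in as `csv_text`.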
Below is a screenshot of the Pro Tip from the help file, which is located on the Crawl Diagnostics help page
Of course, if my assumption is incorrect, then we just need you to be a little more specific about where the list came from and what you want to achieve.
Hope that helps,
Sha