How to find an internal link that is generating a duplicate
-
Hello Mozers
Can anybody help me. It's a bit OCD, but, I really want to find the internal links within a clients site that are generating duplicate urls.
I did start looking page by page using search, but got a bit stir crazy!
I'm sure one of you smart SEO's will have a simple, clever solution:)
Thanks
Catherine
-
My problem is that I do have the referrer, it's an internal page, often the page that is creating the duplicate content. I keep searching for links that are generating the discrepancies and can't find them.
I've run a report using xenu, but now I've got even more wood (can't see the wood for the trees).
Can you recommend any good resources for using xenu?
That would be a big help!
Thanks Catherine
-
I've just submitted one.
Thank you Brian
Catherine
-
Thanks Alex
Two very sensible suggestions, thanks. I just got myself in a loop!
Ta
-
It's not OCD, it's good practice!
If the duplicate has been indexed, you could try using the link: operator with the URL in Google search.
Do you have Xenu Link Sleuth? If not it's free, and don't worry if it looks dodgy, it is kosher. You could crawl the whole site with Xenu. Once the crawl is complete right-click on the URLs you have as duplicate content and it'll show the pages that link to it.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
How can I stop a tracking link from being indexed while still passing link equity?
I have a marketing campaign landing page and it uses a tracking URL to track clicks. The tracking links look something like this: http://this-is-the-origin-url.com/clkn/http/destination-url.com/ The problem is that Google is indexing these links as pages in the SERPs. Of course when they get indexed and then clicked, they show a 400 error because the /clkn/ link doesn't represent an actual page with content on it. The tracking link is set up to instantly 301 redirect to http://destination-url.com. Right now my dev team has blocked these links from crawlers by adding Disallow: /clkn/ in the robots.txt file, however, this blocks the flow of link equity to the destination page. How can I stop these links from being indexed without blocking the flow of link equity to the destination URL?
Technical SEO | | UnbounceVan0 -
Why are these internal pages not showing any internal links?
If you look at Author profile pages like this one, http://experts.allbusiness.com/author/denise-oberry (THE top contributor on the site with over 82 posts under her belt), or any Author profile page, they show zero internal links or Page Authority. The same goes for most posts for each author on the site. Author pages should show internal links from every post the author has on the site. And specific posts should also have internal links from categories, etc. Yet they show zero. The only posts that show internal links and PA are ones that were either syndicated to the root domain's homepage, or syndicated to Fox Small Business. ZERO internal links. Does anyone know why this is? The root domain does not act this way with Author pages and posts. And I see nothing blocking links or indexing via the robots.txt file or page level nofollow tags. A real head scratcher for this SEO nerd, that I'm sure someone here will have a really simple answer to.
Technical SEO | | MiguelSalcido0 -
No follow links on a blog
Hi On our blog, we have a section called 'Tags'. I have just noticed that these links are all "no follow" links. The tags section does appear on every single page on the blog - is this recommend to have them as 'no follow' links or should I get our developer to change them. Thanks
Technical SEO | | Andy-Halliday0 -
Find all old links from a site to 301
We worked on this site a while ago - http://www.electric-heatingsupplies.co.uk/ Whilst we did a big 301 redirect exercise, I wanted to check that we "got" all of them. Is there a historical way I can check all the old indexed links to make sure they correlate to the new links? Thanks!!
Technical SEO | | lauratagdigital0 -
Bad link profile?
Hi Mozzers! We have recently been handed this client due to the former SEO company building up a bad link profile, which resulted in the site dropping off the search results all together. Forcing them to get a new domain. This happened in July last year and we are unsure whether it would be wise to submit a reconsideration request and then 301 their old sites pages to the new domain. Basically I'm asking whether you can spot any spammy links being built in their profile. Here is the old domain: http://www.claimssolicitors.co.uk/ It would be great if you could help me out! 🙂 Thanks
Technical SEO | | Webrevolve0 -
Anchor links percent
I really don't have a clue about how many internal anchor links are recommended for a page. I think it could be split into anchor text in the article content and also in the whole page. The article content: Only the unique content of this page The whole page: Everthing including menus, sitemap, etc. Does percent really matter? Could an excesive amount of anchor links diminish pagerank in the source page? Can google see an excesive amount of internal content links as spamming? Thanks 🙂 !!
Technical SEO | | heroselohim0 -
Internal Links not Crawled by Open Site Explorer
Can someone plz tell me why www.hotelelgreco.gr has only 2 internal links in OSE despite the fact that the text content has a plethora of them. Thanks in advance.
Technical SEO | | socrateskirtsios0 -
If you add a no follow to a time sensitive link, will it get picked up as broken link 404 in WMT report?
We have a client who publishes deals that are time sensitive. Links to the deals expire and so Google's crawlers are picking them up and finding a 404 If I no follow them, will the 404's still get picked up and reported in WMT? The same question applies to SEOMoz Pro.
Technical SEO | | Red_Mud_Rookie0