How to set up internal linking with subcategories?
-
I'm building a new website and setting up its internal link structure with subcategories, and I'd like to follow SEO best practices. When linking to a subcategory's main page, should the internal link be www.xxx.com/fishing/ or www.xxx.com/fishing/index.html, or does it not matter? I'm mainly trying to avoid duplicate content in case Google treats each URL as a separate page. Are there any other cautions when using subdirectories in my navigation?
-
It seems like you are actually asking two questions in one. Let's approach them separately.
Should you include index.html in your structure or not? Personally, I think the cleanest URL structure works best. Get rid of any URL elements that offer no benefit. Cleaner URLs not only look better, they are also easier to read and share. Who really wants to share DOMAIN.com/fishing/index.html rather than DOMAIN.com/fishing? It may seem petty, but in the grander scheme of things it will just work better. Whichever form you choose in one area of your site, keep it consistent across the whole site.
As for making sure that Google indexes only one version, this can be done with redirect rules on your server. You can create rules that automatically strip any unnecessary URL elements and redirect to the proper version. Once you create your sitemap, make sure it includes only the versions of your URLs you want indexed. One thing I have found throughout my SEO career is this: the more easily and clearly you paint a picture of your site for Google, the better your results will be.
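To make the "strip and redirect" rule a bit more concrete, here is a minimal Python sketch of the logic such a rule would apply. It is not tied to any particular server; the function name `redirect_target` and the `INDEX_FILES` list are made up for illustration, and the real rule would normally live in your web server's or framework's own redirect configuration.

```python
from typing import Optional
from urllib.parse import urlsplit, urlunsplit

# Assumption: these are the default-document names served for a directory.
INDEX_FILES = ("index.html", "index.htm")


def redirect_target(url: str) -> Optional[str]:
    """Return the clean URL to 301-redirect to, or None if `url` is already clean."""
    parts = urlsplit(url)
    for name in INDEX_FILES:
        if parts.path.endswith("/" + name):
            # Drop the file name but keep the directory's trailing slash.
            clean_path = parts.path[: -len(name)]
            return urlunsplit(
                (parts.scheme, parts.netloc, clean_path, parts.query, parts.fragment)
            )
    return None


print(redirect_target("https://www.xxx.com/fishing/index.html"))
print(redirect_target("https://www.xxx.com/fishing/"))
```

The first call returns the directory URL that the request should be redirected to; the second returns `None`, meaning no redirect is needed.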
-
Hello,
Duplicate content is usually pretty simple to deal with; see Sheena's response.
At this point in the design, I would recommend looking at URL structure not primarily as a way to avoid a negative, but as a way to incorporate the most positives. That is, how can you get the most SEO value out of the URLs? Since you're at the point where you can still make these changes, now is the time to evaluate how you want your site to appear to users and search engines.
Here are 2 good, nay VERY GOOD, posts on Moz to help with this process.
1. Moz: Guide To SEO Chapter 4
2. Dr Pete: Anatomy Of A URL
I hope this is helpful,
Don
-
The first thing you need to do is determine your preferred URL (probably www.site.com/fishing/). Once you have that, the key to good linking is consistency: use the same preferred URL for any given page you link to, and use descriptive, naturally occurring anchor text that's relevant to the page's content.
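One way to check that consistency is to scan a page's HTML for internal links that still use the non-preferred form. The sketch below uses only Python's standard library; the class and function names (`LinkCollector`, `non_preferred_links`) are hypothetical, and it assumes the non-preferred form is any href ending in /index.html.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect (href, anchor text) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def non_preferred_links(html, suffix="/index.html"):
    """Return the links whose href ends with the non-preferred suffix."""
    parser = LinkCollector()
    parser.feed(html)
    parser.close()
    return [(href, text) for href, text in parser.links if href.endswith(suffix)]


page = (
    '<a href="/fishing/">Fishing gear</a> '
    '<a href="/fishing/index.html">click here</a>'
)
print(non_preferred_links(page))
```

Printing the anchor text alongside each flagged href also makes it easy to spot vague anchors such as "click here" while you are fixing the URLs.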
The fact that your site has other versions of the page (/fishing/ and /fishing/index.html) means that you probably need a solution to prevent duplicate content issues: either 301 redirect all variations to the preferred URL, or implement rel=canonical tags to tell search bots which URL is the preferred one to include in their index. You can read more about this here:
I hope this helps!
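As a rough illustration of the rel=canonical option mentioned in this answer: every variation of the page would carry the same tag in its head section, pointing at the preferred URL. The helper name `canonical_link_tag` is made up for this sketch, and the URL is just the example from the answer.

```python
from html import escape


def canonical_link_tag(preferred_url: str) -> str:
    """Build the <link rel="canonical"> tag for a page's preferred URL."""
    # Escape the URL so characters like & or " cannot break the attribute.
    return f'<link rel="canonical" href="{escape(preferred_url, quote=True)}">'


# Both /fishing/ and /fishing/index.html would emit this same tag.
print(canonical_link_tag("https://www.site.com/fishing/"))
```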
Related Questions
-
Robots file set up
The robots file looks like it has been set up in a very messy way.
Technical SEO | | mcwork
I understand the # will comment out a line, does this mean the sitemap would
not be picked up?
Disallow: /js/ should this be allowed like /*.js$
Disallow: /media/wysiwyg/ - this seems to be causing alerts in webmaster tools as it can not access
the images within.
Can anyone help me clean this up, please?
#Sitemap: https://examplesite.com/sitemap.xml
Crawlers Setup
User-agent: *
Crawl-delay: 10
Allowable Index
Mind that Allow is not an official standard
Allow: /index.php/blog/
Allow: /catalog/seo_sitemap/category/
Allow: /catalogsearch/result/
Allow: /media/catalog/
Directories
Disallow: /404/
Disallow: /app/
Disallow: /cgi-bin/
Disallow: /downloader/
Disallow: /errors/
Disallow: /includes/
Disallow: /js/
Disallow: /lib/
Disallow: /magento/
Disallow: /media/
Disallow: /media/captcha/
Disallow: /media/catalog/
#Disallow: /media/css/
#Disallow: /media/css_secure/
Disallow: /media/customer/
Disallow: /media/dhl/
Disallow: /media/downloadable/
Disallow: /media/import/
#Disallow: /media/js/
Disallow: /media/pdf/
Disallow: /media/sales/
Disallow: /media/tmp/
Disallow: /media/wysiwyg/
Disallow: /media/xmlconnect/
Disallow: /pkginfo/
Disallow: /report/
Disallow: /scripts/
Disallow: /shell/
#Disallow: /skin/
Disallow: /stats/
Disallow: /var/
Paths (clean URLs)
Disallow: /index.php/
Disallow: /catalog/product_compare/
Disallow: /catalog/category/view/
Disallow: /catalog/product/view/
Disallow: /catalog/product/gallery/
Disallow: */catalog/product/upload/
Disallow: /catalogsearch/
Disallow: /checkout/
Disallow: /control/
Disallow: /contacts/
Disallow: /customer/
Disallow: /customize/
Disallow: /newsletter/
Disallow: /poll/
Disallow: /review/
Disallow: /sendfriend/
Disallow: /tag/
Disallow: /wishlist/
Files
Disallow: /cron.php
Disallow: /cron.sh
Disallow: /error_log
Disallow: /install.php
Disallow: /LICENSE.html
Disallow: /LICENSE.txt
Disallow: /LICENSE_AFL.txt
Disallow: /STATUS.txt
Disallow: /get.php # Magento 1.5+
Paths (no clean URLs)
#Disallow: /.js$
#Disallow: /.css$
Disallow: /.php$
Disallow: /?SID=
Disallow: /rss*
Disallow: /*PHPSESSID
Disallow: /:
Disallow: /😘
User-agent: Fatbot
Disallow: /
User-agent: TwengaBot-2.0
Disallow: /
-
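On the robots.txt question above, the effect of the leading # can be checked with Python's standard-library parser. This is only a sketch: `urllib.robotparser` is not Googlebot, so treat it as an approximation of how a crawler reads the file, and note that `banner.png` is a made-up file name (`site_maps()` needs Python 3.8 or later).

```python
from urllib.robotparser import RobotFileParser


def parse_robots(text: str) -> RobotFileParser:
    """Parse robots.txt content given as a string."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = "User-agent: *\nDisallow: /media/wysiwyg/\n"

# With the leading #, the Sitemap line is a comment and is ignored.
commented = parse_robots("#Sitemap: https://examplesite.com/sitemap.xml\n" + rules)
# Without the #, the sitemap is picked up.
active = parse_robots("Sitemap: https://examplesite.com/sitemap.xml\n" + rules)

print(commented.site_maps())  # None
print(active.site_maps())     # ['https://examplesite.com/sitemap.xml']

# Hypothetical image path: the Disallow rule blocks everything under /media/wysiwyg/.
print(active.can_fetch("*", "https://examplesite.com/media/wysiwyg/banner.png"))  # False
```

So under this parser the commented-out sitemap is not seen at all, and anything under /media/wysiwyg/ is blocked, which is consistent with the alerts about inaccessible images.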
Why are these internal pages not showing any internal links?
If you look at Author profile pages like this one, http://experts.allbusiness.com/author/denise-oberry (THE top contributor on the site with over 82 posts under her belt), or any Author profile page, they show zero internal links or Page Authority. The same goes for most posts for each author on the site. Author pages should show internal links from every post the author has on the site. And specific posts should also have internal links from categories, etc. Yet they show zero. The only posts that show internal links and PA are ones that were either syndicated to the root domain's homepage, or syndicated to Fox Small Business. ZERO internal links. Does anyone know why this is? The root domain does not act this way with Author pages and posts. And I see nothing blocking links or indexing via the robots.txt file or page level nofollow tags. A real head scratcher for this SEO nerd, that I'm sure someone here will have a really simple answer to.
Technical SEO | | MiguelSalcido
-
Are sitewide links bad for SEO?
I have 11 real estate sites and have had links from one to another for about 7 years, but someone just suggested I take them all out because I might get penalized or affected by Penguin. My main site was affected in July 2012 and organic visits have dropped 43%... I've been working on many aspects of my SEO, but it's been difficult to come back. Any suggestions are very welcome, thanks 🙂
Technical SEO | | mbulox
-
Devalued links or negative effect?
Hi there, I'm looking into an issue with a site that was hit after Penguin was introduced. The site lost 70% of its traffic overnight. The site in question had a large number of backlinks with over-optimized anchor text, which seems the most likely reason for the drop in rankings. Unfortunately, there are also some links from blog networks. So my question is: does Google just devalue these links and discount them from its ranking algorithm, or do the links still count but have a negative effect in the SERPs instead of a positive one? I'm asking because I'm trying to determine whether it's worth saving this website or just starting fresh with a new domain. That brings me to another question: if I have to start fresh on a new domain, is it possible to reuse the content from the old site (provided I remove the URLs from Google via Webmaster Tools)? Any help/advice/answers here would be greatly appreciated. Thanks in advance.
Technical SEO | | jayderby
-
Rel=nofollow for affiliate links?
Hi, For a holiday/travel website including hotels and holiday packages from affiliates, I am currently using the rel="nofollow" attribute to link out to the affiliate's website and want to know if this is the right approach. To be more precise: there are distinct pages for each city, and on a city-specific page there are ~50 available hotels listed with some other information such as price and address. Each of these hotels has an outlink to the affiliate's hotel website, which uses private branding and as such is running on a subdomain, hotels.mytraveldomain.tld. So in order not to pass link juice to the affiliate's website, I thought I would simply use rel="nofollow". Would you also use nofollow, or are there any other opinions out there about that?
Technical SEO | | socialtowards
-
What should I do about links coming in that are from link farm type sites?
I just noticed two backlinks to a couple of sites around pharmaceuticals/attorneys. One link is from a Chinese site with the URL http://e.lifestyle.com.cn/fashionweekly/nzj/353093_2.shtml, and the other is from a site called Adroo: http://adroo.com/us/?view=list&list_id=104154&lang=en. Both appear to be some type of link farm site; one has come in as a nofollow (surprise, you can buy "ads" on their site), and both have decent DA. There is no reason for them to link to these sites. Should I find a way to stop the links? Also, on one of the sites we had a dmoz link and it is not showing in OSE, although the link is still open in dmoz. Thanks for any input.
Technical SEO | | RobertFisher
-
Explain Competitive Link Analysis
Hi all...bit of a newbie in the SEO world! So excuse me if this question makes me sound simple 🙂 Basically, we recently had a link analysis done on our site, but I'm finding it a little bit difficult to understand what it all means. What are followed and nofollowed links? What sort of ratio between the two is needed for best results? And my next question: what are followed linking root domains and nofollowed linking root domains?
Technical SEO | | cttgroup
-
Preserving Link Value
My client has an existing domain, domain A. They recently purchased and absorbed another company with their own domain, domain B. For marketing purposes company B will be rebranded as company A. They want to redirect domain B to domain A. The problem is that company B has by far the more visible domain, with 4x the number of inbound links. If I redirect domain B to domain A, what will happen to these links? I'm thinking their value will be lost.
Technical SEO | | waynekolenchuk