Blog archives vs individual articles
-
In a client's blog, you can find each individual article pages as well as aggregate of articles per month or sometimes per day (including each entire article).
The problem is that the article appears twice, once in a dedicated page (article page) and once with other articles (in the archive).
Is there a specific SEO approach to this type of situation? Is there duplicate content?
What page name should I give each archive (if at all), as there are quite a few?
Thank you
-
Thank you Egol.
Your insights were very helpful.
David
-
I believe that when you mention indexing category pages and index pages you refer to titles only.
I use the title and about 20 words. I use WordPress where that is possible.
For now, the CMS is indexing each entire article in the monthly archive page. Which can create quite long pages as articles are not truncated.
I would try to use the first 20 words or first sentence if possible. If not possible I would move to a different content manager.
Just my two cents.
-
Thank you Egol,
I believe that when you mention indexing category pages and index pages you refer to titles only.
For now, the CMS is indexing each entire article in the monthly archive page. Which can create quite long pages as articles are not truncated.
-
I believe that Google is smart enough to know that millions of blogs have article pages, category pages and archive pages.
If your blog posts are unique content of substantive length and you only include a snippet on the category and archive pages then it is unlikely that you will suffer a duplicate content problem.
If you do have a duplicate content problem it will more likely come from scrappers grabbing your content or republishing your feed (that has full post content).
My approach is to allow indexing of article pages, category pages and index page but block only the pagination of the index and category pages.
If I blocked indexing of category or article pages I would lose thousands of visitors per day.
-
Thanks a lot Jeffrey,
Very helpful!
David
-
I'd leave it as "follow" since there's no reason to make it "nofollow" in this case. I believe that's what Yoast recommends via the plugin as well.
-
Thank you for your input, it is helpful.
Do you think I should simply do "noindex" or should I also say "follow" or "nofollow"?
Thanks
-
I would add a "noindex" tags to the archive pages and leave the article page alone. If it's the same archive setup I'm thinking of, there's little value to leaving this in the Google index so that's it's searchable.
Are you using WordPress? This can be easily done with the Yoast SEO plugin.
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Google Webmaster tools Sitemap submitted vs indexed vs Index Status
I'm having an odd error I'm trying to diagnose. Our Index Status is growing and is now up to 1,115. However when I look at Sitemaps we have 763 submitted but only 134 indexed. The submitted and indexed were virtually the same around 750 until 15 days ago when the indexed dipped dramatically. Additionally when I look under HTML improvements I only find 3 duplicate pages, and I ran screaming frog on the site and got similar results, low duplicates. Our actual content should be around 950 pages counting all the category pages. What's going on here?
Technical SEO | | K-WINTER0 -
CNAME vs 301 redirect
Hi all, Recently I created a website for a new client and my next job is trying to get them higher in Google. I added them in OSE and noticed some strange backlinks. To my surprise the client has about 20 domain names. All automatically poiting to (showing) the same new mainsite now. www.maindomain.nl www.maindomain.be
Technical SEO | | Houdoe
www.maindomain.eu
www.maindomain.com
www.otherdomain.nl
www.otherdomain.com
... Some of these domains have backlinks too (but not so much). I suggested to 301 redirect them all to the main site. Just to avoid duplicate content. But now the webhoster comes into play: "It's a problem, client has only 1 hosting account, blablabla...". They told me they could CNAME the 20 domains to the main domain. Or A-record them to an IP address. This is too technical stuff for me. So my concrete questions are: Is it smart to do anything at all or am I just harming my client? The main site is ranking pretty well now. And some backlinks are from their copy sites (probably because everywhere the logo links to the full mainsite url). Does the CNAME or A-record solution has the same effect as a 301 redirect, from SEO perspective? Many thanks,
Hans0 -
Duplicate Titles on Wordpress blog pages
Hi, I have an issue where I am getting for duplicate page titles for pages that shouldn't exist. The issue is on the blog index page's (from 0 - 16) and involves the same set of attachment_id for each page, i.e. /blog/page/10/?attachment_id=minack /blog/page/10/?attachment_id=ponyrides /blog/page/11/?attachment_id=minack /blog/page/11/?attachment_id=ponyrides There are 6 attachment_id values (and they are not ID values either) which repeat for every page on the index now what I can't work out is where those 6 links are coming from as on the actual blog index page http://www.bosinver.co.uk/blog/page/10/ there are no links to it and the links just go to blog index page and it ignores the attachment_id value. There is no sitemap.xml file either which I thought might have contained the links. Thanks
Technical SEO | | leapSEO0 -
Multilingual Website - Sub-domain VS Sub-directory
Hi Folks - Need your advice on the pros and cons of going with a sub-domain vs a sub-directory approach for a multi lingual website. The best would be a ccTLD but that is not possible now, so I would be more interested in knowing your take on these 2 options. Though, I have gone through http://www.stateofsearch.com/international-multilingual-sites-criteria-to-establish-seo-friendly-structure/ and this somewhat vouches for a sub-directory, but what would you say'?
Technical SEO | | RanjeetP0 -
.com & .ie website how to avoid duplicate blog content?
We have 2 websites .com & .ie (both are more or less identical except 2 different markets). How can I avoid duplicate blog content as lots of our .com/blog and .ie/blog is the same? Maybe.... Our main .com blog articles are searchable then on our .ie blog content non searchable? (This way both markets get to view the content but only Google actually searches our .com blog) Alliteratively I would need to rewrite each article so that is unique Advise would be appreciated, thank you.
Technical SEO | | AdvanceSystems0 -
My Article Post Title in both the h1 and the h2 are the same. Is this good seo?
I'm seeing a common practice in wordpress themes where the h1 tag for a page has the logo in it, then the h2 would be the title to the article. I've decided to place the title in the h1 dynamically, like this: - Joe's Auto Store where '' is the actual title to the post - the logo is still being used as a background image in the h1... So for example, the page would show this: How install a car battery - Joe's Auto Store I think this is good seo still, but the other issue is that the first, subsequent also has the exact same title because this is the actual post title, which uses the first h2 on the page to display the title. So the code would look like this: - My Company paragraph content text stuff an example would be How install a car battery - Joe's Auto Store How install a car battery At Joe's we teach how to install batteries on site. There are mor...(etc.) Is this an issue since the post title in both the h1 and h2 are nearly the same (except for the company name)? Is this good seo still?
Technical SEO | | johnnydigital0 -
Backlink: External blog Vs. Internal blog. Which is the best?
Hi, some weeks ago a created a blog: mykeyword.wordpress.com Some one told me that it has got more trust that a "normal" www.mykeyword.com
Technical SEO | | Greenman
Is it true? So, i wrote some articles and dropped a guide (linking inside to mysite.com) to blog. My question is:
Right now i'm writing a lot of article ad i'm looking for the best channel where publish my content (post with link inside). My focus is improving quantity and quality of backlinks. Which way must i use? 1. Use my mykeyword.wordpress.com (give freshness to blog and new backlink)
2. Create ad internal blog mysite.com/blog and add article (without link?)
3. "Don't lose time" - Put new article only in external blog that will link to my site. I must manage a lot of new sites and i should increase SERP position. So, i have to choose the right way right now. Thanks 😉0 -
Very well established blog, new posts now being indexed very late
I have an established blog.We update it on daily basis. In the past, when I would publish a new post, it would get indexed within a minute or so. But since a month or so, its taking hours. Sometimes like 10-12 hours for new posts to get indexed. Only thing I have changed is robots.txt. This is the current robots file. User-agent: * Disallow: /cgi-bin Disallow: /wp-admin Disallow: /wp-includes Disallow: /wp-content/plugins Disallow: /wp-content/cache Disallow: /wp-content/themes Disallow: /wp-login.php Disallow: /*wp-login.php* Disallow: /trackback Disallow: /feed Disallow: /comments Disallow: /author Disallow: /category Disallow: */trackback Disallow: */feed Disallow: */comments Disallow: /login/ Disallow: /wget/ Disallow: /httpd/ Disallow: /*.php$ Disallow: /*?* Disallow: /*.js$ Disallow: /*.inc$ Disallow: /*.css$ Disallow: /*.gz$ Disallow: /*.wmv$ Disallow: /*.cgi$ Disallow: /*.xhtml$ Disallow: /*?* Disallow: /*? Allow: /wp-content/uploads User-agent: TechnoratiBot/8.1 Disallow: # ia_archiver User-agent: ia_archiver Disallow: / # disable duggmirror User-agent: duggmirror Disallow: / # allow google image bot to search all images User-agent: Googlebot-Image Disallow: /wp-includes/ Allow: /* # allow adsense bot on entire site User-agent: Mediapartners-Google* Disallow: Allow: /* Sitemap: http://www.domainname.com/sitemap.xml.gz Site has tons of backlinks. Just wondering if something is wrong with the robots file or if it could be something else.
Technical SEO | | rookie1230