Thin Content due to Photo Galleries
-
Hi folks,
I've got a question: we have about 3 million image pages on our site, each with a unique URL. Every image with a caption is submitted to the Google index, which covers about 2/3 of all images.
We are afraid that this could cause some problems due to thin content.
Please take a look at one of our article pages with such a photo gallery: http://goo.gl/hq6bxG
All gallery pics with a caption are indexed: http://goo.gl/gd9TQ6
Do you have any advice on how to handle those photo galleries? How should they be flagged for Google? Should every picture get "noindex" and a canonical tag pointing to the article?
Thx a lot!
Matthias
-
Hi. I wouldn't use "noindex", so that the images still get into Google's image search etc., but canonical sounds fine.
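To make the suggestion concrete, here is a minimal sketch of the head tags an image page could emit: a canonical link to the parent article, with "noindex" left off by default. The helper name `image_page_head_tags` and the example.com URL are made up for illustration; this is not how the site in question is actually built.

```python
from html import escape


def image_page_head_tags(article_url: str, noindex: bool = False) -> list[str]:
    """Build <head> tags for one gallery image page (hypothetical helper).

    article_url: URL of the article the image belongs to (assumed known).
    noindex: off by default, so the page can still feed image search.
    """
    # Escape the URL so characters like & or " cannot break the attribute.
    tags = [f'<link rel="canonical" href="{escape(article_url, quote=True)}">']
    if noindex:
        tags.append('<meta name="robots" content="noindex">')
    return tags


if __name__ == "__main__":
    # Placeholder URL, not taken from the thread.
    print("\n".join(image_page_head_tags("https://www.example.com/article/123")))
```

Mixing "noindex" with a canonical on the same page is possible with the flag, but the default follows the advice above and only sets the canonical.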
-
Dear Dimitrii,
thanks for your answer.
We considered your recommendation to build a slider gallery, but as we are looking for a short-term solution this is not an option right now (we are planning it anyway for the near future).
Can't we optimize our galleries by taking all image pages out of the index and setting a canonical tag to the article, as described above? Or do you have any advice on how to tag our image pages for Google without changing our site structure - for example, images with a unique caption stay in the index and images without a caption are removed from the index?
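The caption-based rule could be sketched roughly like this, assuming the captions can be exported as a mapping from image page URL to caption text. The function name `plan_indexing` and the sample paths are hypothetical, and treating duplicated captions like missing ones is an assumption made for this sketch.

```python
from collections import Counter


def plan_indexing(captions_by_url: dict[str, str]) -> dict[str, str]:
    """Decide per image page whether it stays indexed (hypothetical rule).

    A page is marked "index" only if its caption is non-empty and no other
    page uses the same caption (ignoring case and extra whitespace).
    """
    # Normalize captions so "Same" and "same " count as the same text.
    normalized = {
        url: " ".join((caption or "").split()).lower()
        for url, caption in captions_by_url.items()
    }
    counts = Counter(c for c in normalized.values() if c)
    return {
        url: "index" if c and counts[c] == 1 else "noindex"
        for url, c in normalized.items()
    }


if __name__ == "__main__":
    # Sample data invented for illustration.
    pages = {"/img/1": "Mazda MX-5 from the front", "/img/2": "", "/img/3": "Interior"}
    for url, decision in plan_indexing(pages).items():
        print(url, decision)
```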
Thx a lot!
Matthias
-
Hi Matthias,
I agree that the content is pretty thin and that it would probably be better to present the images in a slider (check the example from Autobild: http://www.autobild.de/bilder/mazda-mx-5-gegen-bw-z4-6937517.html#bild23). While the presentation is quite similar to yours, the source contains all the captions and all the images, which makes the content much richer.
From a usability perspective: each image requires the page to reload completely which is not really great.
I imagine that moving the images from separate URLs to a slider can be an enormous amount of work. Having thin content / semi-duplicate content on your site is not necessarily a cause for a penalty (unless there is clear malicious intent) - the issue is mainly that these thin pages will not show up in search results. If you are not optimising for image search (which I assume, based on the captions you put under the pictures), you could just as well leave them as is (your normal articles look OK at first sight, so you have more than just thin content pages).
If you do want to optimise for images, you should make your captions a little more descriptive and longer, and you definitely need to change your alt titles (they look too much like keyword stuffing). You might check this WBF - it's old, but not much has changed in image search since then (well, at least in Germany, as you are still using the "old" type of image search).
rgds,
Dirk
-
Good morning, my friend.
Well, I have questions about your website's structure, and the answers may in turn answer your question. What I see is a page with a link to the gallery without any content. Each of the gallery's images is a separate page without any content. Of course it's going to be thin content! Is there a reason the website has been structured this way?
What I recommend is either to add content, not just a caption, to every image of the gallery if you want to keep the current structure, or to rebuild the website architecture. I'd do it this way:
A page with a slider/gallery and a description of the gallery, where the images are not separate pages but part of something like a carousel. Make sure that all images in the same carousel share the same subject/event and that each image has its own unique caption. This way you'll combine all pages belonging to the same gallery into one, and that page will not be thin, that's for sure.
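Roughly, the single-page version could be generated like this. It is only a sketch: the function name `render_gallery`, the markup, and the `bild1`, `bild2`, ... anchors (modelled on the `#bild23` fragment in the Autobild URL) are assumptions, and reusing the caption as alt text is just a placeholder choice.

```python
from html import escape


def render_gallery(title, description, images):
    """Render one gallery page with every image and caption in the source.

    images: list of (image_url, caption) tuples; all names are illustrative.
    Raises ValueError if a caption is empty or used more than once.
    """
    cleaned = [(url, caption.strip()) for url, caption in images]
    captions = [caption for _, caption in cleaned]
    if any(not c for c in captions) or len(set(captions)) != len(captions):
        raise ValueError("every image needs its own unique, non-empty caption")

    # One <figure> per image; a slider script could show them one at a time.
    figures = "\n".join(
        f'  <figure id="bild{i}">\n'
        f'    <img src="{escape(url, quote=True)}" alt="{escape(caption, quote=True)}">\n'
        f'    <figcaption>{escape(caption)}</figcaption>\n'
        f'  </figure>'
        for i, (url, caption) in enumerate(cleaned, start=1)
    )
    return (
        '<section class="gallery">\n'
        f'  <h1>{escape(title)}</h1>\n'
        f'  <p>{escape(description)}</p>\n'
        f'{figures}\n'
        '</section>'
    )


if __name__ == "__main__":
    # Sample data invented for illustration.
    print(render_gallery(
        "Mazda MX-5 vs. BMW Z4",
        "All photos from the comparison test.",
        [("mx5.jpg", "The MX-5 on the test track"), ("z4.jpg", "The Z4 in the pit lane")],
    ))
```

Because all captions end up in one document, the page carries the gallery description plus every caption instead of one caption per URL.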
Hope this helps.