Black Hat or Bulletproof?
-
I run a blog and a e-commerce website. Their not connected but their about the same thing. I want to put my blog articles onto my website (just a couple not every last one) but I'm afraid of the duplicate content issues.
Can I take an image of a blog post, make it a PDF, and put it under a category of my e-commerce site which is helping users with useful content.
This sounds like a great idea that Google wouldn't be able to tell the difference, in fact Google would like it and see it as a useful document.
To me this seems to good to be true, perhaps a form of black hat.
So my question is, is it black hat? Could I ever get penalized for doing this?
-
Just a quick note -- I've seen Google index PDFs that were scanned images of a cut-and-paste newsletter from the 1980s with a variety of different fonts. This is not a guaranteed way to keep Google out, and images will also make your files much bigger than just text.
-
I don't think so, but it will help keep you from dropping. You are doing it for your users and that is great I just worry if that would not be obvious to Google - that's all.
-
That is what I thought, I was hoping it wouldn't be considered a bad thing to do though. Oh well it is still useful for customers. So making these canonical will not boost my overall website ranking in the least bit?
-
It Takes about 2 minutes per post. Print Screen, Crop, Use Acrobat to make the PDF, upload to site, & write a quick paragraph.
-
Wouldn't you still need some supplemental text to go along with the pdf to explain why a visitor should download it? Seems like a lot of extra work converting blogs into pdfs, uploading them, and extra writing work. A link back makes more sense to me.
-
I wouldn't do that. It would work, but in case your site was ever manually looked at for any reason and that was noticed, that could look like an attempt to manipulate search results and you could get hit. I would just put it on as text and either noindex the page in your robots.txt file or do as Raymond and Nakul suggest and set up a canonical tag. In my very humble opinion I think the safest thing would just be to block bots from the page but the canonical isn't a bad suggestion at all.
-
Seeing how it would be an image and google's crawlers cant crawl the text in that image does it still need to be no follow or canonical?
-
Is your blog blog.yourdomain.com or yourdomain.com/blog/ or yourblogdomain.com ? As Raymond recommended, I would suggest doing a cross domain canonical and you should be good. I hope this helps.
-
Or you can link back to the original article with a rel="cannonical" or if you want to be 100% sure just make it rel="nofollow".
-
You want to add the content to help your users, right? You aren't trying to get it indexed, correct? Just noindex those pages...
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Need help in diagnosing what I may be doing wrong
I have a site that has been having problems ranking. Initially, spam rate was at 18%. I have since changed the URL and forwarded to the original so now the spam rate is under 5%. Phone calls started picking back up very slowly but then by August 2024 things came to a screeching halt. Phone has been dead and very little business has been written. I did notice on the robots.txt file it had this: User-agent: *
Technical SEO | | SOM24
Disallow: /
User-agent: Googlebot
Disallow:
User-agent: bingbot
Disallow: /no-bing-crawl/
Disallow: wp-admin and now I have since changed it to this:
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php Sitemap: https://urlishere.com/sitemap_index.xml My question is what else do I need to do to get this site to start ranking again? We are blogging once a month, writing press releases once a month, updating the social media a few times a week. I feel like maybe there is something on the backend that needs to be done to get this site back to ranking. I am using SEO by Yoast and I have filled in the title and meta description fields for all pages. There is a spot in Yoast where I can validate the site with Google, Bing, etc. I'm trying to figure out how to do that. I do see in the site's Google Webmaster Tools there are several pages not indexing. Any ideas on what else I can do to get this site to start ranking again? Thank you.0 -
How to fix core web vital issue on shopify website , any recommned app from shopfiy store?
I'm facing challenges optimizing Core Web Vitals on my Shopify store. Does anyone have experience with Shopify apps that effectively address LCP, FID, and CLS issues? Any specific recommendations would be greatly appreciated.
Technical SEO | | faizalialiali0 -
Blog not ranking for my name
Hi everyone. I'm new here so apologies if I'm not asking an appropriate question - just let me know! Can anyone help me figure out why my blog (https://www.jamescrowley.net/) isn't ranking at all for my name? I've run it through the standard Moz audit tools and it hasn't picked up any major issues. It ranks fine for my name plus " CTO", but doesn't appear anywhere in the top 50 without that qualifier. I realise there are many other 'James Crowley's to compete with but weirdly even my GitHub profile page appears to rank higher (https://github.com/jamescrowley) I moved the domain a while back (18 months+) and I used to rank highly, but it never seems to have recovered (all the standard redirects are in place, and told Google at the time about the move). Any suggestions would be very much appreciated!
Technical SEO | | james.crowley0 -
Google will index us, but Bing won't. Why?
Bing is crawling our site, but not indexing it, and we cannot figure out why -- plus it's being indexed fine in Google. Any ideas on what the issue with Bing might be? Here's are some details to let you know what we've already checked/established: We have 4 301’s and the rest of our site checks out We’ve already established our Robots is ok, and that we are fixing our site map/it's in fine shape We do not see anything blocking bingbot access to the site There is no varnish or any load balancers, so nothing on that end that would be blocking the access We also don't see any rules in the apache or the .htaccess config that would be blocking the access
Technical SEO | | Alex_RevelInteractive0 -
Black listed or not, struggling on this one.
I have a client who said they are black listed and they do not come up for any search query other than their name. I have done what I would expect to find the issues, like hurtful backlinks, poor coding etc however the code is fine, yes backlinks are a little slim. They have also said Penguin hit them hard last year. I am confused with this one as I have worked with clients who got hit by penguin and they improved but this particular client has not. http://www.specialistpaintsonline.co.uk is the website, and if anyone can shed some light as I may be missing something head on. regards
Technical SEO | | Shuffled0 -
Is it bad (black hat) to have an H1 text as a text indent?
Is it bad practice to use a text indent through CSS for H1 text on a homepage(basically hiding h1 text)? I'm just trying to compensate for the fact that some text that should really be in the h1 tag is actually an image.
Technical SEO | | inc.com1 -
Redesign an SEO-Disaster | Help with Redirects of Gray Hat Pages
Hi gang. I'm a new SEO and I'm currently working on the redesign of a website. I have just discovered a ton of hidden pages that are filled with duplicate content, basically reiterating the main keyword in a variety of different variations. Each page is titled with the variation on the keyword phrase and then has one paragraph of text very similar to the previous page, etc. Here is an example of one of the offensive pages (nice lookin' site, eh?): http://www.vasectomy-reversals.com/vasectomy_reversal_surgery.html The new site will not have any of these pages. I'm writing the 301 redirects now and want to redirect these offensive pages to the most relevant page on the new site. But, I'm afraid to redirect the offensive pages. Should I leave them alone, or can I have the former developer remove them? Help. Don't know how to handle these pages and their redirects. Thanks for your help! ~ Mills
Technical SEO | | Mills0