Black Hat or Bulletproof?
-
I run a blog and a e-commerce website. Their not connected but their about the same thing. I want to put my blog articles onto my website (just a couple not every last one) but I'm afraid of the duplicate content issues.
Can I take an image of a blog post, make it a PDF, and put it under a category of my e-commerce site which is helping users with useful content.
This sounds like a great idea that Google wouldn't be able to tell the difference, in fact Google would like it and see it as a useful document.
To me this seems to good to be true, perhaps a form of black hat.
So my question is, is it black hat? Could I ever get penalized for doing this?
-
Just a quick note -- I've seen Google index PDFs that were scanned images of a cut-and-paste newsletter from the 1980s with a variety of different fonts. This is not a guaranteed way to keep Google out, and images will also make your files much bigger than just text.
-
I don't think so, but it will help keep you from dropping. You are doing it for your users and that is great I just worry if that would not be obvious to Google - that's all.
-
That is what I thought, I was hoping it wouldn't be considered a bad thing to do though. Oh well it is still useful for customers. So making these canonical will not boost my overall website ranking in the least bit?
-
It Takes about 2 minutes per post. Print Screen, Crop, Use Acrobat to make the PDF, upload to site, & write a quick paragraph.
-
Wouldn't you still need some supplemental text to go along with the pdf to explain why a visitor should download it? Seems like a lot of extra work converting blogs into pdfs, uploading them, and extra writing work. A link back makes more sense to me.
-
I wouldn't do that. It would work, but in case your site was ever manually looked at for any reason and that was noticed, that could look like an attempt to manipulate search results and you could get hit. I would just put it on as text and either noindex the page in your robots.txt file or do as Raymond and Nakul suggest and set up a canonical tag. In my very humble opinion I think the safest thing would just be to block bots from the page but the canonical isn't a bad suggestion at all.
-
Seeing how it would be an image and google's crawlers cant crawl the text in that image does it still need to be no follow or canonical?
-
Is your blog blog.yourdomain.com or yourdomain.com/blog/ or yourblogdomain.com ? As Raymond recommended, I would suggest doing a cross domain canonical and you should be good. I hope this helps.
-
Or you can link back to the original article with a rel="cannonical" or if you want to be 100% sure just make it rel="nofollow".
-
You want to add the content to help your users, right? You aren't trying to get it indexed, correct? Just noindex those pages...
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
cross canonicalization with redirect
I'm working with a website that has turned one of its pages into its own website within the main website - mostly for the ease of customers, making it simpler to access that page using www.page.com rather than www.mainsite.com/about/page.
Technical SEO | | Shrine.SEO.Gal
As a result, there are two urls for that page (the ones just mentioned), both pointing to the exact same page, but with different urls. Now, they have made it so www.mainsite.com/about/page permanently redirects to www.page.com. which I thought was a good call. However, what do I do about canonicalization? Is it good to point the canonicalization of www.page.com to www.mainsite.com/about/page so that the rankings and link equity are maintained in the main website? Or would the fact that the www.mainsite.com/about/page redirects to www.page.com mess that up? I hope this makes sense!0 -
Need help in diagnosing what I may be doing wrong
I have a site that has been having problems ranking. Initially, spam rate was at 18%. I have since changed the URL and forwarded to the original so now the spam rate is under 5%. Phone calls started picking back up very slowly but then by August 2024 things came to a screeching halt. Phone has been dead and very little business has been written. I did notice on the robots.txt file it had this: User-agent: *
Technical SEO | | SOM24
Disallow: /
User-agent: Googlebot
Disallow:
User-agent: bingbot
Disallow: /no-bing-crawl/
Disallow: wp-admin and now I have since changed it to this:
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php Sitemap: https://urlishere.com/sitemap_index.xml My question is what else do I need to do to get this site to start ranking again? We are blogging once a month, writing press releases once a month, updating the social media a few times a week. I feel like maybe there is something on the backend that needs to be done to get this site back to ranking. I am using SEO by Yoast and I have filled in the title and meta description fields for all pages. There is a spot in Yoast where I can validate the site with Google, Bing, etc. I'm trying to figure out how to do that. I do see in the site's Google Webmaster Tools there are several pages not indexing. Any ideas on what else I can do to get this site to start ranking again? Thank you.0 -
How to fix core web vital issue on shopify website , any recommned app from shopfiy store?
I'm facing challenges optimizing Core Web Vitals on my Shopify store. Does anyone have experience with Shopify apps that effectively address LCP, FID, and CLS issues? Any specific recommendations would be greatly appreciated.
Technical SEO | | faizalialiali0 -
URLs dropping from index (Crawled, currently not indexed)
I've noticed that some of our URLs have recently dropped completely out of Google's index. When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'. Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case. I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people. Here are a few examples of the URLs that have gone missing: https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training https://www.ihasco.co.uk/courses/detail/conflict-resolution-training https://www.ihasco.co.uk/courses/detail/prevent-duty-training Any help here would be massively appreciated!
Technical SEO | | iHasco0 -
Black listed or not, struggling on this one.
I have a client who said they are black listed and they do not come up for any search query other than their name. I have done what I would expect to find the issues, like hurtful backlinks, poor coding etc however the code is fine, yes backlinks are a little slim. They have also said Penguin hit them hard last year. I am confused with this one as I have worked with clients who got hit by penguin and they improved but this particular client has not. http://www.specialistpaintsonline.co.uk is the website, and if anyone can shed some light as I may be missing something head on. regards
Technical SEO | | Shuffled0 -
Is buying a domain with a high PR and redirecting it to your site considered black hat?
I want to buy a domain that has clean backlinks and then redirect it to my new domain to bump up my PR. Is this considered a black hat technique? Thanks Carla
Technical SEO | | Carla_Dawson1 -
Redesign an SEO-Disaster | Help with Redirects of Gray Hat Pages
Hi gang. I'm a new SEO and I'm currently working on the redesign of a website. I have just discovered a ton of hidden pages that are filled with duplicate content, basically reiterating the main keyword in a variety of different variations. Each page is titled with the variation on the keyword phrase and then has one paragraph of text very similar to the previous page, etc. Here is an example of one of the offensive pages (nice lookin' site, eh?): http://www.vasectomy-reversals.com/vasectomy_reversal_surgery.html The new site will not have any of these pages. I'm writing the 301 redirects now and want to redirect these offensive pages to the most relevant page on the new site. But, I'm afraid to redirect the offensive pages. Should I leave them alone, or can I have the former developer remove them? Help. Don't know how to handle these pages and their redirects. Thanks for your help! ~ Mills
Technical SEO | | Mills0