Black Hat or Bulletproof?
-
I run a blog and a e-commerce website. Their not connected but their about the same thing. I want to put my blog articles onto my website (just a couple not every last one) but I'm afraid of the duplicate content issues.
Can I take an image of a blog post, make it a PDF, and put it under a category of my e-commerce site which is helping users with useful content.
This sounds like a great idea that Google wouldn't be able to tell the difference, in fact Google would like it and see it as a useful document.
To me this seems to good to be true, perhaps a form of black hat.
So my question is, is it black hat? Could I ever get penalized for doing this?
-
Just a quick note -- I've seen Google index PDFs that were scanned images of a cut-and-paste newsletter from the 1980s with a variety of different fonts. This is not a guaranteed way to keep Google out, and images will also make your files much bigger than just text.
-
I don't think so, but it will help keep you from dropping. You are doing it for your users and that is great I just worry if that would not be obvious to Google - that's all.
-
That is what I thought, I was hoping it wouldn't be considered a bad thing to do though. Oh well it is still useful for customers. So making these canonical will not boost my overall website ranking in the least bit?
-
It Takes about 2 minutes per post. Print Screen, Crop, Use Acrobat to make the PDF, upload to site, & write a quick paragraph.
-
Wouldn't you still need some supplemental text to go along with the pdf to explain why a visitor should download it? Seems like a lot of extra work converting blogs into pdfs, uploading them, and extra writing work. A link back makes more sense to me.
-
I wouldn't do that. It would work, but in case your site was ever manually looked at for any reason and that was noticed, that could look like an attempt to manipulate search results and you could get hit. I would just put it on as text and either noindex the page in your robots.txt file or do as Raymond and Nakul suggest and set up a canonical tag. In my very humble opinion I think the safest thing would just be to block bots from the page but the canonical isn't a bad suggestion at all.
-
Seeing how it would be an image and google's crawlers cant crawl the text in that image does it still need to be no follow or canonical?
-
Is your blog blog.yourdomain.com or yourdomain.com/blog/ or yourblogdomain.com ? As Raymond recommended, I would suggest doing a cross domain canonical and you should be good. I hope this helps.
-
Or you can link back to the original article with a rel="cannonical" or if you want to be 100% sure just make it rel="nofollow".
-
You want to add the content to help your users, right? You aren't trying to get it indexed, correct? Just noindex those pages...
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
Best redirect destination for 18k highly-linked pages
Technical SEO question regarding redirects; I appreciate any insights on best way to handle. Situation: We're decommissioning several major content sections on a website, comprising ~18k webpages. This is a well established site (10+ years) and many of the pages within these sections have high-quality inbound links from .orgs and .edus. Challenge: We're trying to determine the best place to redirect these 18k pages. For user experience, we believe best option is the homepage, which has a statement about the changes to the site and links to the most important remaining sections of the site. It's also the most important page on site, so the bolster of 301 redirected links doesn't seem bad. However, someone on our team is concerned that that many new redirected pages and links going to our homepage will trigger a negative SEO flag for the homepage, and recommends instead that they all go to our custom 404 page (which also includes links to important remaining sections). What's the right approach here to preserve remaining SEO value of these soon-to-be-redirected pages without triggering Google penalties?
Technical SEO | | davidvogel0 -
Need help in diagnosing what I may be doing wrong
I have a site that has been having problems ranking. Initially, spam rate was at 18%. I have since changed the URL and forwarded to the original so now the spam rate is under 5%. Phone calls started picking back up very slowly but then by August 2024 things came to a screeching halt. Phone has been dead and very little business has been written. I did notice on the robots.txt file it had this: User-agent: *
Technical SEO | | SOM24
Disallow: /
User-agent: Googlebot
Disallow:
User-agent: bingbot
Disallow: /no-bing-crawl/
Disallow: wp-admin and now I have since changed it to this:
User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php Sitemap: https://urlishere.com/sitemap_index.xml My question is what else do I need to do to get this site to start ranking again? We are blogging once a month, writing press releases once a month, updating the social media a few times a week. I feel like maybe there is something on the backend that needs to be done to get this site back to ranking. I am using SEO by Yoast and I have filled in the title and meta description fields for all pages. There is a spot in Yoast where I can validate the site with Google, Bing, etc. I'm trying to figure out how to do that. I do see in the site's Google Webmaster Tools there are several pages not indexing. Any ideas on what else I can do to get this site to start ranking again? Thank you.0 -
How to fix core web vital issue on shopify website , any recommned app from shopfiy store?
I'm facing challenges optimizing Core Web Vitals on my Shopify store. Does anyone have experience with Shopify apps that effectively address LCP, FID, and CLS issues? Any specific recommendations would be greatly appreciated.
Technical SEO | | faizalialiali0 -
Google Not Picking Up Posts
I am trying to work out why from March 4th Google is not seeing my posts. Our google impressions have dropped from 8,000 to 40. If you put in the full article name with speach marks it does not find it, and instead shows the home page in google. We have not had any warnings. We did have work done on our site but nothing else i could think of to cause this. Can anyone let me know what may have caused this. All articles are original
Technical SEO | | headlinesplus0 -
URLs dropping from index (Crawled, currently not indexed)
I've noticed that some of our URLs have recently dropped completely out of Google's index. When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'. Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case. I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people. Here are a few examples of the URLs that have gone missing: https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training https://www.ihasco.co.uk/courses/detail/conflict-resolution-training https://www.ihasco.co.uk/courses/detail/prevent-duty-training Any help here would be massively appreciated!
Technical SEO | | iHasco0 -
Is this Link Black Hat SEO Cloaking or is it OK?
I am a relatively new SEO professional, Can someone please look at this link and tell me if this is white or black hat SEO cloaking practices? http://loghomeconstructionpro.com/ It has an overlay landing page over a html page. I had a partner promote this to me as a proprietary software when really it just looks like cloaking. I want to do my business above board and this doesn't feel right. However, I would like some opinion on it before i pull the plus on my partner. Thanks all for the advice and the help. GD
Technical SEO | | gdavey0 -
Google will index us, but Bing won't. Why?
Bing is crawling our site, but not indexing it, and we cannot figure out why -- plus it's being indexed fine in Google. Any ideas on what the issue with Bing might be? Here's are some details to let you know what we've already checked/established: We have 4 301’s and the rest of our site checks out We’ve already established our Robots is ok, and that we are fixing our site map/it's in fine shape We do not see anything blocking bingbot access to the site There is no varnish or any load balancers, so nothing on that end that would be blocking the access We also don't see any rules in the apache or the .htaccess config that would be blocking the access
Technical SEO | | Alex_RevelInteractive0 -
Redesign an SEO-Disaster | Help with Redirects of Gray Hat Pages
Hi gang. I'm a new SEO and I'm currently working on the redesign of a website. I have just discovered a ton of hidden pages that are filled with duplicate content, basically reiterating the main keyword in a variety of different variations. Each page is titled with the variation on the keyword phrase and then has one paragraph of text very similar to the previous page, etc. Here is an example of one of the offensive pages (nice lookin' site, eh?): http://www.vasectomy-reversals.com/vasectomy_reversal_surgery.html The new site will not have any of these pages. I'm writing the 301 redirects now and want to redirect these offensive pages to the most relevant page on the new site. But, I'm afraid to redirect the offensive pages. Should I leave them alone, or can I have the former developer remove them? Help. Don't know how to handle these pages and their redirects. Thanks for your help! ~ Mills
Technical SEO | | Mills0