Black Hat or Bulletproof?
-
I run a blog and a e-commerce website. Their not connected but their about the same thing. I want to put my blog articles onto my website (just a couple not every last one) but I'm afraid of the duplicate content issues.
Can I take an image of a blog post, make it a PDF, and put it under a category of my e-commerce site which is helping users with useful content.
This sounds like a great idea that Google wouldn't be able to tell the difference, in fact Google would like it and see it as a useful document.
To me this seems to good to be true, perhaps a form of black hat.
So my question is, is it black hat? Could I ever get penalized for doing this?
-
Just a quick note -- I've seen Google index PDFs that were scanned images of a cut-and-paste newsletter from the 1980s with a variety of different fonts. This is not a guaranteed way to keep Google out, and images will also make your files much bigger than just text.
-
I don't think so, but it will help keep you from dropping. You are doing it for your users and that is great I just worry if that would not be obvious to Google - that's all.
-
That is what I thought, I was hoping it wouldn't be considered a bad thing to do though. Oh well it is still useful for customers. So making these canonical will not boost my overall website ranking in the least bit?
-
It Takes about 2 minutes per post. Print Screen, Crop, Use Acrobat to make the PDF, upload to site, & write a quick paragraph.
-
Wouldn't you still need some supplemental text to go along with the pdf to explain why a visitor should download it? Seems like a lot of extra work converting blogs into pdfs, uploading them, and extra writing work. A link back makes more sense to me.
-
I wouldn't do that. It would work, but in case your site was ever manually looked at for any reason and that was noticed, that could look like an attempt to manipulate search results and you could get hit. I would just put it on as text and either noindex the page in your robots.txt file or do as Raymond and Nakul suggest and set up a canonical tag. In my very humble opinion I think the safest thing would just be to block bots from the page but the canonical isn't a bad suggestion at all.
-
Seeing how it would be an image and google's crawlers cant crawl the text in that image does it still need to be no follow or canonical?
-
Is your blog blog.yourdomain.com or yourdomain.com/blog/ or yourblogdomain.com ? As Raymond recommended, I would suggest doing a cross domain canonical and you should be good. I hope this helps.
-
Or you can link back to the original article with a rel="cannonical" or if you want to be 100% sure just make it rel="nofollow".
-
You want to add the content to help your users, right? You aren't trying to get it indexed, correct? Just noindex those pages...
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
cross canonicalization with redirect
I'm working with a website that has turned one of its pages into its own website within the main website - mostly for the ease of customers, making it simpler to access that page using www.page.com rather than www.mainsite.com/about/page.
Technical SEO | | Shrine.SEO.Gal
As a result, there are two urls for that page (the ones just mentioned), both pointing to the exact same page, but with different urls. Now, they have made it so www.mainsite.com/about/page permanently redirects to www.page.com. which I thought was a good call. However, what do I do about canonicalization? Is it good to point the canonicalization of www.page.com to www.mainsite.com/about/page so that the rankings and link equity are maintained in the main website? Or would the fact that the www.mainsite.com/about/page redirects to www.page.com mess that up? I hope this makes sense!0 -
How to fix core web vital issue on shopify website , any recommned app from shopfiy store?
I'm facing challenges optimizing Core Web Vitals on my Shopify store. Does anyone have experience with Shopify apps that effectively address LCP, FID, and CLS issues? Any specific recommendations would be greatly appreciated.
Technical SEO | | faizalialiali0 -
Google Not Picking Up Posts
I am trying to work out why from March 4th Google is not seeing my posts. Our google impressions have dropped from 8,000 to 40. If you put in the full article name with speach marks it does not find it, and instead shows the home page in google. We have not had any warnings. We did have work done on our site but nothing else i could think of to cause this. Can anyone let me know what may have caused this. All articles are original
Technical SEO | | headlinesplus0 -
URLs dropping from index (Crawled, currently not indexed)
I've noticed that some of our URLs have recently dropped completely out of Google's index. When carrying out a URL inspection in GSC, it comes up with 'Crawled, currently not indexed'. Strangely, I've also noticed that under referring page it says 'None detected', which is definitely not the case. I wonder if it could be something to do with the following? https://www.seroundtable.com/google-ranking-index-drop-30192.html - It seems to be a bug affecting quite a few people. Here are a few examples of the URLs that have gone missing: https://www.ihasco.co.uk/courses/detail/sexual-harassment-awareness-training https://www.ihasco.co.uk/courses/detail/conflict-resolution-training https://www.ihasco.co.uk/courses/detail/prevent-duty-training Any help here would be massively appreciated!
Technical SEO | | iHasco0 -
Blog not ranking for my name
Hi everyone. I'm new here so apologies if I'm not asking an appropriate question - just let me know! Can anyone help me figure out why my blog (https://www.jamescrowley.net/) isn't ranking at all for my name? I've run it through the standard Moz audit tools and it hasn't picked up any major issues. It ranks fine for my name plus " CTO", but doesn't appear anywhere in the top 50 without that qualifier. I realise there are many other 'James Crowley's to compete with but weirdly even my GitHub profile page appears to rank higher (https://github.com/jamescrowley) I moved the domain a while back (18 months+) and I used to rank highly, but it never seems to have recovered (all the standard redirects are in place, and told Google at the time about the move). Any suggestions would be very much appreciated!
Technical SEO | | james.crowley0 -
Is this Link Black Hat SEO Cloaking or is it OK?
I am a relatively new SEO professional, Can someone please look at this link and tell me if this is white or black hat SEO cloaking practices? http://loghomeconstructionpro.com/ It has an overlay landing page over a html page. I had a partner promote this to me as a proprietary software when really it just looks like cloaking. I want to do my business above board and this doesn't feel right. However, I would like some opinion on it before i pull the plus on my partner. Thanks all for the advice and the help. GD
Technical SEO | | gdavey0 -
Google will index us, but Bing won't. Why?
Bing is crawling our site, but not indexing it, and we cannot figure out why -- plus it's being indexed fine in Google. Any ideas on what the issue with Bing might be? Here's are some details to let you know what we've already checked/established: We have 4 301’s and the rest of our site checks out We’ve already established our Robots is ok, and that we are fixing our site map/it's in fine shape We do not see anything blocking bingbot access to the site There is no varnish or any load balancers, so nothing on that end that would be blocking the access We also don't see any rules in the apache or the .htaccess config that would be blocking the access
Technical SEO | | Alex_RevelInteractive0 -
Is it bad (black hat) to have an H1 text as a text indent?
Is it bad practice to use a text indent through CSS for H1 text on a homepage(basically hiding h1 text)? I'm just trying to compensate for the fact that some text that should really be in the h1 tag is actually an image.
Technical SEO | | inc.com1