How to change URL of RSS Feed?
-
Hi,
Some websites keep scraping my content. I have already blocked them from accessing my website using .htaccess, but they still get my content via the RSS feed.
I have tried delaying the RSS feed, but I think this affected my Google rankings. My question is: is there a way to change the URL of my RSS feed?
From: http://www.mysite.com/feed to http://www.mysite.com/feed2
-
I think you had better consider legal steps. If they can access your content from a third domain or IP, they can also do it from a fourth or fifth one. So no matter how many IPs you block, if they know your feed address they can subscribe from a completely new one. In my opinion, if this is the case, a legal solution would be best for you. Copyright your articles.
-
Hi Zsolt,
Unfortunately, the scrapers are smart enough to remove the links from the content they copy. I don't know how they do it, but even though my post has links and the h1 title is also a link, they are able to strip the URLs out.
-
Maybe one of the simplest solutions is not to block those sites from accessing your content. Make your h1 tag a link to the actual post. So if you have a post titled post1 at the URL domain.com/post1, the post1 heading at the top of the page should point to domain.com/post1. Then if anybody steals your content, their copy will point back to the original on your site, so of all the copies, yours will be the strongest one, with the most links.
I would also place links in the content body pointing to my other pages, so anybody copying my content would be appreciated, as they would be giving me a handful of backlinks in return. I would also add a little info panel at the bottom: this article was originally posted on www.domain.com and written by xy.com. Find similar articles here: domain.com/relatedposts.
In the meantime I would post a legal statement that copying my content is all right, but only with the links included.
-
Hi Zsolt,
What might be the best solution to prevent the scrapers from stealing my content? Some examples of scrapers of my site are
and many more... I have tried blocking these sites in .htaccess by both IP and domain name, using the following code format:
# Block by IP and Domain
order allow,deny
allow from all
deny from 208.43.239.80
deny from hownews.info
But even though this code is already in the .htaccess file, they can still scrape my content. I know they are getting it from my RSS feed because when I added some code to delay my RSS feed, they were not able to get my latest posts.
I hope you can help me find out how to put a stop to these scrapers, as they are really hurting my rankings. They even rank higher than my site, which is the source of the original content.
Thanks in advance....
-
You have to contact your programmer to change the URL for you, or in some CMS systems you can do it in the backend.
What exactly do you mean by scraping? If they steal your content, then using a new URL is not the best solution for you.
RSS and rankings: RSS usually contains the same information that is already available on your site at some URL (not in all cases, of course, but usually). If that is the case, then the feed only has negative effects on your rankings, as it duplicates the content: the exact same text that you can find on domain.com/xy can be found on domain.com/feed/xy. So if that is the case, you should not worry about your rankings.
If you change your URL, you should also redirect the old one at the same time; if you do not, all of those who are currently subscribed will lose your feed, and you do not want that. But if you redirect, then anybody who knew the old URL will get to the new one, scrapers included. So I think it is pointless.
I would block the IP addresses I do not want to access my content. You could also try the legal route: state that nobody is allowed to use your copy on their own sites. It is easy to find out if anybody does.
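For anyone who does want to move the feed while keeping existing subscribers, the redirect described above might look like the following in .htaccess. This is only a sketch: it assumes an Apache server with mod_alias enabled and that the CMS already serves the feed at the new address; the URLs are the ones from the question. Scrapers polling the old address will follow the redirect too.

```apache
# Send requests for the old feed address to the new one with a permanent (301) redirect.
# The pattern is anchored so it matches only /feed or /feed/, and not /feed2 itself.
RedirectMatch 301 ^/feed/?$ http://www.mysite.com/feed2
```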
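A rough sketch of such an IP block for servers running Apache 2.4 or later, where the older order/deny directives are only available through mod_access_compat. It assumes mod_authz_core is loaded and that .htaccess overrides are permitted (AllowOverride AuthConfig); the address and domain are the ones quoted elsewhere in this thread.

```apache
# Allow everyone except the listed client address.
<RequireAll>
    Require all granted
    Require not ip 208.43.239.80
    # A host rule matches the reverse DNS name of the connecting client,
    # not the domain where the copied content is published, so it often
    # will not catch a scraper whose server has a different hostname.
    Require not host hownews.info
</RequireAll>
```

Blocking by IP tends to be the more dependable of the two rules, though a scraper can still switch to a new address.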