Does frequency of content updates affect the likelihood that outbound links will be indexed?
-
I have several pages on our website with low PR that themselves link to lots and lots of service/product-specific pages. Since there are so many outbound links, I know that the small amount of PR will be spread thin as it is. My question is: if I were to supply fresh content to the top-level pages and change it often, would that influence whether or not Google indexes the underlying pages? Also, if I supply fresh content to the underlying pages, once Google crawls them, would that guarantee that Google considers them 'important' enough to be indexed?
I guess my real question is: can freshness of content and frequency of updates convince Google that the underlying pages are 'worthy of being indexed', and can producing fresh content on those pages 'keep Google's interest', so to speak, despite their having little if any PageRank?
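To put a rough number on the "spread thin" worry, here is a small sketch based on the simplified formula from the original PageRank paper, where a page passes a damped share of its score split evenly across its outbound links. The function name, the damping factor of 0.85, and the sample scores are illustrative assumptions; Google's real crawling and indexing decisions depend on far more than this model.

```python
def pagerank_passed_per_link(page_score: float, outbound_links: int,
                             damping: float = 0.85) -> float:
    """Score passed through each outbound link in the simplified PageRank model."""
    if outbound_links <= 0:
        raise ValueError("outbound_links must be positive")
    return damping * page_score / outbound_links


if __name__ == "__main__":
    # Hypothetical numbers: the same low-scoring hub page with 50 vs. 500 links.
    for count in (50, 500):
        share = pagerank_passed_per_link(1.0, count)
        print(f"{count} outbound links -> {share:.4f} passed per link")
```

In this model, multiplying the number of links by ten divides what each linked page receives by ten, which is the dilution described above.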
-
Hello Ilya,
There are several good responses here, and I think some of them would depend on how large your site is and what types of pages they are. Judging by your URL example below, I'm guessing it is real estate related or at least that you have localized pages in different geographic areas.
You have a few issues here. First, this video might help, but it is somewhat outdated and misleading in some ways. There may not be a set limit (i.e. "we're only going to index 10k pages"), but how much of your site gets indexed and how often it gets crawled is based largely on the quality of your site (assuming all other factors are in place, such as sitemaps, crawlable navigation, etc.). And the quality of your site depends on many, many different factors. Of course, the two most important for this discussion would probably be the uniqueness/usefulness of the content and the number of links that the site, its sections, and its deep pages have.
The more links you can get into those deep pages, the more likely it is that Google is going to crawl more often, and index those pages. You said you "can't" get links into those pages. If you can't get links into them, they probably aren't "quality" and therein lies your problem.
If by "can't" you just mean there isn't enough time in the day for you to build links into ALL of these pages, you can still build links into as many as you can. This will get the bots crawling down to that level of your site more often, and make it more likely that this level of your site will be indexed.
Here is another useful link, although it is dated as well:
http://www.seomoz.org/blog/googles-indexation-cap
Having fresh content (with a fresh "last modified" date) usually does, in my experience, entice Googlebot to come back more often. Does that translate into "indexing" more pages? I don't know. But I do know that having better content and more links into those inner pages does translate into more indexation, and not just for the pages linked to externally, but for that entire section/folder/directory of your site.
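For anyone wondering what a fresh "last modified" date looks like on the server side, here is a minimal sketch of producing an HTTP `Last-Modified` value and answering a conditional request that carries `If-Modified-Since`. The function names and timestamps are made up for illustration, and whether this changes how often Googlebot returns is my experience rather than a documented guarantee.

```python
from datetime import datetime, timezone
from email.utils import format_datetime, parsedate_to_datetime


def last_modified_header(updated_at: datetime) -> str:
    """Format a timezone-aware timestamp as an HTTP Last-Modified value."""
    return format_datetime(updated_at.astimezone(timezone.utc), usegmt=True)


def is_not_modified(if_modified_since: str, updated_at: datetime) -> bool:
    """Return True when a 304 Not Modified response would be appropriate."""
    try:
        since = parsedate_to_datetime(if_modified_since)
    except (TypeError, ValueError):
        return False  # unparseable header: serve the full page
    if since.tzinfo is None:
        since = since.replace(tzinfo=timezone.utc)
    # HTTP dates have one-second resolution, so drop microseconds first.
    return updated_at.replace(microsecond=0) <= since


if __name__ == "__main__":
    # Hypothetical update time for one city-level page.
    updated = datetime(2024, 1, 15, 12, 0, 0, tzinfo=timezone.utc)
    header = last_modified_header(updated)
    print("Last-Modified:", header)
    print("Send 304?", is_not_modified(header, updated))
```

The point is only that the date should change when the content really changes; sending a new date for an unchanged page gives the crawler nothing new to find.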
Consider user-generated content on those pages if you can. A lot of VERY popular review and real estate sites' deep pages would go unindexed without it.
-
We shouldn't confuse a query that deserves freshness (QDF) with enticing Google to recrawl a page or set of pages by giving them fresh content. Maybe I read your response wrong, but those are two different things. QDF would apply, for instance, if you were writing an article right now about the nuclear disaster in Japan; not if you were updating a page from three years ago about how to lose weight after pregnancy, or how to optimize a webpage.
-
From my experience, adding fresh content on a regular basis, even when the pages are rather empty, will make Google crawl more and more of your website. As the crawl budget gets bigger, deeper pages will be crawled.
Although I have never worked on a case similar to yours, I would suggest adding fresh content on a regular basis and linking to those new pages from the homepage to get them crawled ASAP. In those new pages, put internal links to the pages you want crawled, if they are relevant.
-
Not as much. You may have to engineer some process for feed generation. The idea is to have the content in RSS and help it propagate through stuff like ping.
-
It can. As Rand has said in the past, some results deserve freshness; that is, results seem to always include a few such pages.
-
saibose... do you think a service like Linklicious (link -> RSS) would work?
-
The 100 links figure is more of a guideline than a strict rule. Your first objective should be to enable the page to be indexed. The Query Deserves Freshness (QDF) algorithms in Google will eventually index your URL; it's a matter of time, as long as you link to that page from at least one page.
My advice would be to link it from more pages (if possible) and keep the content fresh.
Maybe you can even try the RSS idea as well.
-
I guess it would depend a little on how you're doing it; however, the best way to get Google to crawl your product pages is to get links directly to them from other sites that are crawled often or have authority. I would also suggest creating an XML sitemap and submitting it to them if you haven't already.
If all your links are coming to your homepage (not uncommon on smaller sites), then Google is usually going to enter your site that way, and if there are a lot of links on the homepage and the site only has a little authority, then it has to prioritise how many and which pages to visit.
Having regular content updates may get Google to change which pages it crawls at any one time, though some of your other pages may then have longer cache dates.
Ultimately, if your site structure is good enough, then you really need to work on building links to the product pages to regularly 'convince' Google to crawl them, though adding relevant content is one way of doing this.
-
Thank you guys.
Anthony, I am not sure I agree; indexing and crawling are two different things. I guess that is really what I'm getting at here. I can force Google to crawl my whole site daily (or almost daily) with RSS feeds, sitemaps, proper structure, frequent updates, etc., but WILL that freshness of content force Google to go, "Hm... despite the page being very insignificant, it might be important enough to go into my index"?
Saibose, unfortunately I'm well beyond the 100-link limit. I am noticing that quite a few of the pages that ARE indexed ARE ranking, since they're well optimized on-page and they target extremely long-tail keyphrases. So my main goal is to convince Google to index these pages, because once I do, they will rank.
What I have done so far:
1. Made sure that the page is easily accessible from at least one page on the website.
2. Created a sitemap (a proper sitemap index and several underlying sitemap files).
3. Submitted the sitemaps and increased the Google crawl rate (I noted Google is crawling around 1,700 pages/day on my site).
4. Made sure that the page is at most three levels deep (site/state/city); we're talking about city-level pages.
5. Created proper URLs (/site/state/city).
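For reference, the sitemap setup from step 2 could be sketched roughly as below: the URL list is split into files of at most 50,000 entries (the limit in the sitemaps.org protocol), and a sitemap index points at each file. The helper names and the example.com URLs are placeholders, not my real setup.

```python
import xml.etree.ElementTree as ET

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def chunk(items, size=50_000):
    """Yield slices of at most `size` items (protocol limit per sitemap file)."""
    for start in range(0, len(items), size):
        yield items[start:start + size]


def build_urlset(urls):
    """Return one sitemap file (bytes) for a list of (loc, lastmod) pairs."""
    root = ET.Element("urlset", xmlns=SITEMAP_NS)
    for loc, lastmod in urls:
        entry = ET.SubElement(root, "url")
        ET.SubElement(entry, "loc").text = loc
        ET.SubElement(entry, "lastmod").text = lastmod
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)


def build_index(sitemap_urls):
    """Return a sitemap index (bytes) pointing at each sitemap file."""
    root = ET.Element("sitemapindex", xmlns=SITEMAP_NS)
    for loc in sitemap_urls:
        ET.SubElement(ET.SubElement(root, "sitemap"), "loc").text = loc
    return ET.tostring(root, encoding="utf-8", xml_declaration=True)


if __name__ == "__main__":
    # Hypothetical city-level pages following the /site/state/city pattern.
    pages = [(f"https://www.example.com/site/ny/city-{i}", "2024-01-15")
             for i in range(5)]
    files = [build_urlset(part) for part in chunk(pages, size=2)]
    names = [f"https://www.example.com/sitemap-{n}.xml"
             for n in range(1, len(files) + 1)]
    print(build_index(names).decode("utf-8"))
```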
I think maybe I misspoke. I am not doubting that Google will 'crawl' the page. What I am asking is: if I can't link externally to it, and the internal PageRank passed is very small, will adding fresh content and making Google think that the page gets updated frequently convince Google to index it? Does frequent crawling finally force indexing, or is it possible Google may say "no matter how often you update this page, it's just NOT important enough for me to index it" if no one links to it from outside your site?
-
I think you are getting at the concept of continually updating the content on a few pages of your site to make sure they are indexed by Google. If the page is not indexed already, that means it likely isn't being crawled by Google at all, so changing the content on the page won't make much of a difference.
Instead, make sure the page you want indexed is easily found within the website's internal linking structure, preferably only a handful of clicks away from the homepage. An even better way to make sure the page is indexed is to get a few external links pointed at it. If you are simply trying to achieve indexation and not expecting the page to rank high in the SERPs, something as easy as bookmarking the site to a few websites and tweeting it once or twice will probably get the job done.
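One way to check the "handful of clicks away from the homepage" point is a breadth-first search over your internal links. This is only a sketch: the link graph below is a made-up dictionary, and in practice you would build it from a crawl of your own site.

```python
from collections import deque


def click_depths(links, home):
    """Breadth-first search: fewest clicks from `home` to each reachable page."""
    depths = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:  # first visit is the shortest path
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


if __name__ == "__main__":
    # Hypothetical internal link graph: page -> pages it links to.
    site = {
        "/": ["/ny", "/ca"],
        "/ny": ["/ny/albany", "/ny/buffalo"],
        "/ca": ["/ca/fresno"],
        "/orphan": ["/"],  # nothing links here, so it is never reached
    }
    depths = click_depths(site, "/")
    for page, depth in sorted(depths.items(), key=lambda item: item[1]):
        print(depth, page)
    print("Unreachable:", sorted(set(site) - set(depths)))
```

Pages that never show up in the result are not reachable through internal links at all, which is worth fixing before worrying about freshness.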
As for your comment on whether or not Google will consider your page 'important' enough to be indexed, I don't think you will have a problem with that as long as you are writing unique content.
-
The problem is very common for content-heavy websites where content lies somewhere way down the hierarchy.
I am assuming a few things here:
1. The webpage you are referring to has already been crawled at least once.
2. It is accessible from at least one link on your homepage.
3. It does not have a huge number of outbound links, that is, more than around 100 (within and outside your domain).
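If you are not sure how a page stands against the third assumption, a quick count of its anchor tags is enough. Below is a rough sketch using Python's standard-library HTML parser; the class name, the sample markup, and the use of 100 as a threshold are illustrative, since 100 is a guideline rather than a hard rule.

```python
from html.parser import HTMLParser


class LinkCounter(HTMLParser):
    """Collect the href of every <a> tag that has one."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.hrefs.append(href)


def count_links(html: str) -> int:
    """Number of links (internal and external) found in the markup."""
    parser = LinkCounter()
    parser.feed(html)
    return len(parser.hrefs)


if __name__ == "__main__":
    # Hypothetical page fragment; in practice, pass the fetched page source.
    sample = '<ul><li><a href="/ny">NY</a></li><li><a href="/ca">CA</a></li></ul>'
    total = count_links(sample)
    print(total, "links;", "over" if total > 100 else "within", "the ~100 guideline")
```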
Your first task should be to get Google to crawl the page(s):
1. Get a tool like GSiteCrawler and crawl your entire website. Create and submit an XML sitemap of your website to Google Webmaster Tools. Create links from your pages that are already indexed to this page (or pages). That way, Googlebot will find its way there eventually.
2. Update the page with fresh content. Create an RSS feed of the frequent content updates and serve it up front on the homepage or an important page of your website (one that ranks well in Google).
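The RSS feed from step 2 does not need to be elaborate. Here is a minimal sketch that builds an RSS 2.0 document listing recently updated pages; the function name, the feed details, and the example.com URLs are placeholders, and your CMS may already generate something equivalent.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import format_datetime


def build_rss(title, link, description, items):
    """Return an RSS 2.0 feed (str) for a list of updated pages."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for item in items:
        node = ET.SubElement(channel, "item")
        ET.SubElement(node, "title").text = item["title"]
        ET.SubElement(node, "link").text = item["link"]
        # RSS uses RFC 822 style dates, which format_datetime produces.
        ET.SubElement(node, "pubDate").text = format_datetime(item["updated"])
    return ET.tostring(rss, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical deep pages that were just updated.
    updates = [
        {"title": "Albany, NY", "link": "https://www.example.com/site/ny/albany",
         "updated": datetime(2024, 1, 15, 9, 30, tzinfo=timezone.utc)},
        {"title": "Fresno, CA", "link": "https://www.example.com/site/ca/fresno",
         "updated": datetime(2024, 1, 14, 17, 0, tzinfo=timezone.utc)},
    ]
    print(build_rss("Latest updates", "https://www.example.com/",
                    "Recently updated city pages", updates))
```

Each item links straight to a deep page, so the feed doubles as a list of fresh internal links for the crawler to follow.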
All said, you have to wait and watch. There is no way you can force Google to crawl your webpage. Also, updating your homepage content (just text with no links to your deep pages) wouldn't help speed up the process. But it's good practice to keep your homepage content fresh so that Googlebot visits your website regularly and you get Google love.
Hope that answers your question.