• Google Adds 'Caffeine' for More Up-to-date Results
    11 replies, posted
[quote]Google has introduced a new Web indexing system to provide users with more up-to-date search results, the company said Tuesday. The new system, called Caffeine, delivers results that are closer to "live" than Google's previous system, the company said. Previously, Google would crawl a fraction of the Web each night, index it and push it out in its results. With Caffeine, as Google crawls the Web and finds new information, it indexes it immediately. "We process it immediately so we can serve it seconds later," said Matt Cutts, the head of Google's webspam team. He unveiled the news at the Search Marketing Expo in Seattle. When Google started, it would update its index only every four months, he said. Around 2000, it started indexing every month in a process that took a week to 10 days. "The funny thing is, we didn't have enough capacity to update all our data centers at once," he said. That meant that people might get different results when searching for the same term if they were hitting different Google data centers. Caffeine went live "in the last few days" and is now being used in all Google data centers, he said. In addition to serving "fresher" results, Caffeine "massively increases our ability to scale up," Cutts said. The company will be able to index many more documents -- "on the order of 100 petabytes," he said. Caffeine adds new information at a rate of hundreds of thousands of gigabytes per day, Google said in a blog post. The progression in how Google does its indexing mirrors how people increasingly expect to find the very latest information online. Google noticed that after the Sept. 11 attacks on the U.S., when people were looking for the most up-to-the-minute information possible, Cutts said.[/quote] [b]SOURCE:[/b] [url]http://www.pcworld.com/article/198349/google_adds_caffeine_for_more_uptodate_results.html[/url] Basically you'll be able to access more information that was "just updated".
Sounds good.
[quote=Google]on the order of [B]100 petabytes[/B][/quote] Holy crap, that is a lot of information, wonder if this creates additional server load when accessing latest information at such a constant rate.
[quote]In addition to serving "fresher" results, Caffeine "massively increases our ability to scale up," Cutts said. The company will be able to index many more documents -- "on the order of 100 petabytes," he said.[/quote] [quote]The company will be able to index many more documents -- "on the order of 100 petabytes," he said.[/quote] [quote]"on the order of 100 petabytes,"[/quote] [quote][b]100 petabytes[/b][/quote] :psyboom:
needs exabytes
100 petabytes can also be expressed as: 900,719,925,474,099,200 bits 112,589,990,684,262,400 bytes 109,951,162,777,600 kilobytes 107,374,182,400 megabytes 104,857,600 gigabytes 102,400 terabytes 100 petabytes 0.09765625 exabytes 0.0000953674 zettabytes Holy shit.
That's a lot of linux distro space right there.
Awesome name for a product, not sure how to use it.
Not enough exabytes for 42.zip
And this is why Google is a better search engine.
[QUOTE=YWNJack;22481955]Awesome name for a product, not sure how to use it.[/QUOTE] Just use Google the same way as always, this is a change to their crawler back-end.
Holy shit.
Sorry, you need to Log In to post a reply to this thread.