<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Elephantsquared. &#187; search</title>
	<atom:link href="http://elephantsquared.com/tag/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://elephantsquared.com</link>
	<description>hypertext &#38; software</description>
	<lastBuildDate>Sat, 19 May 2012 10:59:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Google is about to get a lot faster</title>
		<link>http://elephantsquared.com/2010/01/google-is-about-to-get-a-lot-faster/</link>
		<comments>http://elephantsquared.com/2010/01/google-is-about-to-get-a-lot-faster/#comments</comments>
		<pubDate>Thu, 14 Jan 2010 12:39:06 +0000</pubDate>
		<dc:creator>George T</dc:creator>
				<category><![CDATA[google]]></category>
		<category><![CDATA[cpu]]></category>
		<category><![CDATA[ext2]]></category>
		<category><![CDATA[ext3]]></category>
		<category><![CDATA[ext4]]></category>
		<category><![CDATA[filesystem]]></category>
		<category><![CDATA[google goggles]]></category>
		<category><![CDATA[linux]]></category>
		<category><![CDATA[raid]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://elephantsquared.com/?p=1216</guid>
		<description><![CDATA[I just ran into a linux-ext4 mailing list concerning Linux ext4 filesystem development and I found an interesting read regarding Google, filesystems and the future of search that I wanted to share. The particular discussion was about jfs filesystem benchmark results where Michael Rubin (from Google) said: Google is currently in the middle of upgrading [...]]]></description>
			<content:encoded><![CDATA[<p>I just ran into a <a href="http://lists.openwall.net/linux-ext4/" target="_blank">linux-ext4</a> mailing list concerning Linux ext4 filesystem development and I found an interesting read regarding Google, filesystems and the future of search that I wanted to share. The particular discussion was about jfs filesystem benchmark results where Michael Rubin (from Google) <a href="http://lists.openwall.net/linux-ext4/2010/01/04/8" target="_blank">said</a>:</p>
<blockquote><p>Google is currently in the middle of upgrading from ext2 to a more up to date file system. We ended up choosing ext4. This thread touches upon many of the issues we wrestled with, so I thought it would be interesting to share.</p></blockquote>
<p>Just think about this for a moment. Google was still running on ext2 filesystem. Although ext2 remains one super stable filesystem, imagine the size of the impact of the forthcoming upgrade to ext4 filesystem. In case your are not familiar with the ext4 filesystem, ext4 is a journaling file system developed as the successor to ext3. Ext4&#8242;s main features range from <strong>backward compatibility</strong>, <strong>bigger filesystem/file sizes</strong> and <strong>multiblock allocation</strong> to <strong>fast fsck</strong>, <strong>online defragmentation</strong> and <strong>inode-related features</strong>. For more detailed information you should check out <strong>kernel.org</strong>&#8216;s <a href="http://ext4.wiki.kernel.org/index.php/Main_Page" target="_blank">article</a> on ext4.</p>
<blockquote><p>The driving performance reason to upgrade is that while ext2 had been &#8220;good enough&#8221; for a very long time the metadata arrangement on a stale file system was leading to what we call &#8220;read inflation&#8221;. This is where we end up doing many seeks to read one block of data. In general latency from poor block allocation was causing performance hiccups. We spent a lot of time with unix standard benchmarks (dbench, compile bench, et al) on xfs, ext4, jfs to try to see which one was going to perform the best.</p>
<p style="text-align: right;">&#8211;Michael Rubin</p>
</blockquote>
<p>Now, back to Google, what do you think this could mean to our future searches? Search results showing up in light speed time? Probably. For example, take a look at <a href="http://www.google.com/mobile/goggles/" target="_blank">Google Goggles</a>. It is an amazing product right? However the current status quo of the speed of its results is limited by the perception we have about the device&#8217;s speed <strong>and not only</strong>. I mean most of us realize it is not an easy thing to match a photo of the Parthenon temple I just took with trillions of random photos indexed somehow. Every query like this one, is narrowed by the network bottleneck and the processing speeds, which include hard disk&#8217;s read/write speed.</p>
<p>Imagine a super-fast network, with lets say 10Gbps bandwidth, where on the one end you press a &lt;button&gt; and all it does is erasing 10^100 rows and recreating 10^100 rows in a SQL database on the other end. Given that, apart from the CPU usage which is required for a process like this, there is the limit of read and write speed of the hard disks even if using RAID technology. This is where ext4 comes in to make this whole process a lot faster.</p>
<p>I can only imagine what&#8217;s next for our future searches. Fast and responsive holographic search results? I really don&#8217;t think there is a limit here, right?</p>
]]></content:encoded>
			<wfw:commentRss>http://elephantsquared.com/2010/01/google-is-about-to-get-a-lot-faster/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

