<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>lucasjosh.com &#187; REST</title>
	<atom:link href="http://lucasjosh.com/blog/category/rest/feed/" rel="self" type="application/rss+xml" />
	<link>http://lucasjosh.com/blog</link>
	<description></description>
	<lastBuildDate>Mon, 01 Mar 2010 14:15:08 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Looking into HBase</title>
		<link>http://lucasjosh.com/blog/2008/11/11/looking-into-hbase/</link>
		<comments>http://lucasjosh.com/blog/2008/11/11/looking-into-hbase/#comments</comments>
		<pubDate>Tue, 11 Nov 2008 17:04:23 +0000</pubDate>
		<dc:creator>josh</dc:creator>
				<category><![CDATA[HBase]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[REST]]></category>
		<category><![CDATA[Ruby]]></category>

		<guid isPermaLink="false">http://lucasjosh.com/blog/?p=197</guid>
		<description><![CDATA[HBase is the open source implementation of Google&#8217;s Bigtable.  I&#8217;ve been keeping my eye on it in combination with Hadoop.  I had some extra time today so I decided to see how easy it would be to hook it up with the aggregator we built for things like Topics.
One of the nice things [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://hadoop.apache.org/hbase">HBase</a> is the open source implementation of <a href="http://labs.google.com/papers/bigtable.html">Google&#8217;s Bigtable</a>.  I&#8217;ve been keeping my eye on it in combination with <a href="http://wiki.apache.org/hadoop/">Hadoop</a>.  I had some extra time today so I decided to see how easy it would be to hook it up with the aggregator we built for things like <a href="http://topics.latimes.com">Topics</a>.</p>
<p>One of the nice things about HBase is the <a href="http://wiki.apache.org/hadoop/Hbase/HbaseRest">REST interface</a> that can read and write data.  I hooked up <a href="http://github.com/sishen/hbase-ruby">the Ruby client</a> so that whenever I saved posts from the feed to MySQL, it would also send data to HBase.  </p>
<p>The writing to HBase is pretty straightforward and the REST client makes it really easy.  However, getting the data out needs to be looked at a bit more closely.</p>
<p>HBase is NOT a relational database.  If you approach like it is, you will get utterly confused and frustrated.  Instead, it can be thought of as a collection of Maps.  So, in order to get data out, you need to iterate over the Maps looking for particular columns.</p>
<p>When you use the REST API, you do this via the creation of a scanner and <i>pop</i>&#8216;ing off the results like from a queue.</p>
<p>That&#8217;s some of what I found out, let&#8217;s see what else I can dig into today.</p>
]]></content:encoded>
			<wfw:commentRss>http://lucasjosh.com/blog/2008/11/11/looking-into-hbase/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
