<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>MrBalky Heavy Industries &#187; lijit</title>
	<atom:link href="http://www.mrbalky.com/category/lijit/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.mrbalky.com</link>
	<description>Fat free with endorphins, anti-oxidants, bioflavonoids, creatine, glucosamine and caffeine!</description>
	<lastBuildDate>Fri, 27 Jan 2012 23:23:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Hive tables, partitions and LZO compression</title>
		<link>http://www.mrbalky.com/2011/02/24/hive-tables-partitions-and-lzo-compression/</link>
		<comments>http://www.mrbalky.com/2011/02/24/hive-tables-partitions-and-lzo-compression/#comments</comments>
		<pubDate>Thu, 24 Feb 2011 11:55:41 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hive]]></category>
		<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://www.mrbalky.com/?p=587</guid>
		<description><![CDATA[At Lijit we&#8217;ve been working with lots of the projects in the Hadoop ecosystem.  In particular, we&#8217;re using Hive quite a bit, since it abstracts map/reduce into a familiar SQL-like language. We deal with fairly large amounts of webserver log data, so are also saving HDFS space and job i/o by using the hadoop-lzo package. [...]]]></description>
			<content:encoded><![CDATA[<p>At <a href="http://www.lijit.com">Lijit</a> we&#8217;ve been working with lots of the projects in the <a href="http://hadoop.apache.org/">Hadoop</a> ecosystem.  In particular, we&#8217;re using <a href="http://hive.apache.org/">Hive</a> quite a bit, since it abstracts map/reduce into a familiar SQL-like language.</p>
<p>We deal with fairly large amounts of webserver log data, so are also saving HDFS space and job i/o by using the <a href="https://github.com/kevinweil/hadoop-lzo">hadoop-lzo</a> package.  It gives fast compression that retains our ability to use the data through Hive queries.</p>
<p>If you are only interested in compression, and have Hadoop and Hive configured appropriately, you can even mix compressed and uncompressed data in separate partitions of a Hive table.  A normal table definition will work:</p>
<pre><code>CREATE EXTERNAL TABLE foo (
                       columnA string,
                       columnB string )
       PARTITIONED BY (date string)
       ROW FORMAT DELIMITED FIELDS TERMINATED BY "\t"
       LOCATION '/path/to/hive/tables/foo';</code></pre>
<p>One big advantage of LZO, though, is its ability to be split in map/reduce jobs.  This is done by creating an index of the LZO file with the LzoIndexer tool of the hadoop-lzo project. To actually use the index, you will need to use a special input format for your Hive table:</p>
<pre><code>CREATE EXTERNAL TABLE foo (
         columnA string,
         columnB string )
    PARTITIONED BY (date string)
    ROW FORMAT DELIMITED FIELDS TERMINATED BY "\t"
    STORED AS INPUTFORMAT "com.hadoop.mapred.DeprecatedLzoTextInputFormat"
          OUTPUTFORMAT "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat"
    LOCATION '/path/to/hive/tables/foo';</code></pre>
<p>Now to actually come to the point.  In my case, I had already created the table, and was trying to add indexing after the fact.  Hive permits changing input format with an alter statement:</p>
<pre><code>ALTER TABLE foo
    SET FILEFORMAT
        INPUTFORMAT "com.hadoop.mapred.DeprecatedLzoTextInputFormat"
        OUTPUTFORMAT "org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat";</code></pre>
<p>But this alters only future partitions, not existing partitions.  They retain their TextInputFormat.  So now when I ran my Hive queries, instead of the LZO index file being used for splitting the input, it wound wind up used as table data.  My results were mostly correct, but there were some result rows that were garbage.</p>
<p>I fixed this by dropping and recreating the table and partitions with the correct input format.  Because I use EXTERNAL tables, the data itself was preserved.</p>
<p>While this is not a big deal, I have lost the ability to mix compressed and uncompressed data in the table.  The Hive language manual claims I can alter partition metadata, which would be another way to deal with this, but so far I&#8217;ve not been able to make that work in versions 0.5 and 0.6.</p>
<p>Thanks to Dmitriy and Johan from Twitter for helping me understand all this.</p>
<p>hadoop-lzo:<br />
<a href="https://github.com/kevinweil/hadoop-lzo"></p>
<p>https://github.com/kevinweil/hadoop-lzo</a></p>
<p>The original hadoop-gpl-compression project:<br />
<a href="http://code.google.com/a/apache-extras.org/p/hadoop-gpl-compression/wiki/FAQ?redir=1">http://code.google.com/a/apache-extras.org/p/hadoop-gpl-compression/wiki/FAQ?redir=1</a></p>
<p>Hive language manual:<br />
<a href="http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL"></p>
<p>http://wiki.apache.org/hadoop/Hive/LanguageManual/DDL</a></p>
<p>Discussion of Hive and table attributes:<br />
<a href="https://issues.apache.org/jira/browse/HIVE-957">https://issues.apache.org/jira/browse/HIVE-957</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2011/02/24/hive-tables-partitions-and-lzo-compression/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Moab</title>
		<link>http://www.mrbalky.com/2007/12/18/moab/</link>
		<comments>http://www.mrbalky.com/2007/12/18/moab/#comments</comments>
		<pubDate>Tue, 18 Dec 2007 21:01:06 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[lijit]]></category>
		<category><![CDATA[mtb]]></category>

		<guid isPermaLink="false">http://www.mrbalky.com/2007/12/18/moab/</guid>
		<description><![CDATA[The posting binge continues. Lots of people search me for &#8220;moab&#8220;, so I thought I&#8217;d meet my readers expectations and write a Moab post, complete with link to a flickr set. Of course, when speaking of MTB or Moab (or crafting a post for search results), I also need to be sure to link to [...]]]></description>
			<content:encoded><![CDATA[<p>The posting binge continues.  Lots of people search <a href="http://www.lijit.com/users/MrBalky">me</a> for &#8220;<a href="http://www.lijit.com/pvs/MrBalky?q=moab">moab</a>&#8220;, so I thought I&#8217;d meet my readers expectations and write a Moab post, complete with link to a <a href="http://flickr.com/photos/mrbalky/sets/72157603491185886/">flickr set</a>.</p>
<p>Of course, when speaking of MTB or Moab (or crafting a post for search results), I also need to be sure to link to my buddy Brian&#8217;s site <a href="http://www.singletrackrides.com/">SingleTrackRides</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2007/12/18/moab/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>You mean somebody might actually read this?</title>
		<link>http://www.mrbalky.com/2007/12/17/you-mean-somebody-might-actually-read-this/</link>
		<comments>http://www.mrbalky.com/2007/12/17/you-mean-somebody-might-actually-read-this/#comments</comments>
		<pubDate>Mon, 17 Dec 2007 18:59:32 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://www.mrbalky.com/2007/12/17/you-mean-somebody-might-actually-read-this/</guid>
		<description><![CDATA[I noticed today that Todd has added me to his blogroll. And somebody&#8217;s been searching me on physical therapy. That must be my physical therapist Larry Meyer up at BCSM. (I&#8217;m recovering from a double-osteotomy to correct a valgus malalignment. I have way better pictures than that Wikipedia article.) So my reaction? Uh oh; I [...]]]></description>
			<content:encoded><![CDATA[<p>I noticed today that <a href="http://falseprecision.typepad.com">Todd</a> has added me to his blogroll.  And somebody&#8217;s been <a href="http://www.lijit.com/users/MrBalky">searching me</a> on <a href="http://www.lijit.com/pvs/MrBalky?q=physical%20therapy">physical therapy</a>.  That must be my physical therapist Larry Meyer up at <a href="http://www.bch.org/sportsmedicine/">BCSM</a>.  (I&#8217;m recovering from a double-<a href="http://en.wikipedia.org/wiki/Osteotomy">osteotomy</a> to correct a valgus malalignment.  I have <a href='http://www.mrbalky.com/wp-content/uploads/2007/12/front-view-sm.jpg' title='front-view-sm.jpg'>way better pictures</a> than that Wikipedia article.)</p>
<p>So my reaction?  Uh oh;   I guess I&#8217;d better do something more here than just post ridiculous YouTube videos and <a href="http://www.mrbalky.com/2007/04/20/random-text-post/">random text</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2007/12/17/you-mean-somebody-might-actually-read-this/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Lijit Cocktail</title>
		<link>http://www.mrbalky.com/2007/08/21/the-lijit-cocktail/</link>
		<comments>http://www.mrbalky.com/2007/08/21/the-lijit-cocktail/#comments</comments>
		<pubDate>Tue, 21 Aug 2007 15:04:04 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://mrbalky.com/2007/08/21/the-lijit-cocktail/</guid>
		<description><![CDATA[One of the bloggers over at drinkoftheweek.com that is using our service dreamed up a cocktail in our honor. We had to give him an iPhone, though: http://www.drinkoftheweek.com/blog/this-weeks-drink-lijit-cocktail/ We tried &#8216;em out yesterday. Just a little bit too sweet for my taste, but it&#8217;s hard to argue with a drink named after you. We&#8217;d like [...]]]></description>
			<content:encoded><![CDATA[<p>One of the bloggers over at <a href="http://www.drinkoftheweek.com/">drinkoftheweek.com</a> that is using our service dreamed up a cocktail in our honor. We had to give him an iPhone, though:<br />
<a href="http://www.drinkoftheweek.com/blog/this-weeks-drink-lijit-cocktail/">http://www.drinkoftheweek.com/blog/this-weeks-drink-lijit-cocktail/</a></p>
<p><a href="http://iquitforlijit.typepad.com/i_quit_for_lijit/2007/08/a-lijit-cocktai.html">We tried &#8216;em out yesterday</a>.  Just a little bit too sweet for my taste, but it&#8217;s hard to argue with a drink named after you. </p>
<p>We&#8217;d like to call it the Lijito, but we&#8217;re not sure if that&#8217;s something derogatory in Portugues or something, so we might just wind up calling it the Lijitini.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2007/08/21/the-lijit-cocktail/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Random text post</title>
		<link>http://www.mrbalky.com/2007/04/20/random-text-post/</link>
		<comments>http://www.mrbalky.com/2007/04/20/random-text-post/#comments</comments>
		<pubDate>Fri, 20 Apr 2007 20:58:06 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[experiments]]></category>
		<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://mrbalky.com/2007/04/20/random-text-post/</guid>
		<description><![CDATA[Here&#8217;s some random text for The Google to index: I thought, &#8220;sldfkjdlkjd wwerrrjrjs ttlljjjgllkkttj sldfe sdfw?&#8221; But naturally, fjwjejrb jboegj and wllejrbbbjelti3! So instead I wnne85gnns dlfkjwrng yot! Shortly thereafter, wnern6xioa hgqqp38ng alasjtq8p6 sgklgfnqp lanffsan ouq dlfn sda gqpq dnfgga. But still, there is the matter of akl e7a z fngfn lqn fqi ore. &#8220;IuthlzJKSf [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s some random text for The Google to index:</p>
<p>I thought, &#8220;sldfkjdlkjd wwerrrjrjs ttlljjjgllkkttj sldfe sdfw?&#8221;  But naturally, fjwjejrb jboegj and wllejrbbbjelti3!  So instead I wnne85gnns dlfkjwrng yot!</p>
<p>Shortly thereafter, wnern6xioa hgqqp38ng alasjtq8p6 sgklgfnqp lanffsan ouq dlfn sda gqpq dnfgga.  But still, there is the matter of akl e7a z fngfn  lqn fqi ore.</p>
<p>&#8220;IuthlzJKSf hnaa jasdj tjla5!&#8221; I said.  &#8220;A hafaljsda nfqo ietyalf hld sh q5a.&#8221;</p>
<p>No, that will never work.  Aj awfahqeol q89 ahlfqp3984 bdfalkf.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2007/04/20/random-text-post/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wijit Gadget</title>
		<link>http://www.mrbalky.com/2007/04/10/wijit-gadget/</link>
		<comments>http://www.mrbalky.com/2007/04/10/wijit-gadget/#comments</comments>
		<pubDate>Tue, 10 Apr 2007 14:22:28 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[experiments]]></category>
		<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://mrbalky.com/?p=22</guid>
		<description><![CDATA[Or is that gadget wijit? Anyway, there&#8217;s an official one now, so I turned off the old, experimental one. Get the new one here:]]></description>
			<content:encoded><![CDATA[<p>Or is that gadget wijit?  Anyway, there&#8217;s an official one now, so I turned off the old, experimental one.</p>
<p>Get the new one here: <a href="http://www.lijit.com/wijitgadget/gadgetInstall.php?username=MrBalky"><img src="http://www.lijit.com/wijitgadget/add-to-google-small.gif" alt="add to google" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2007/04/10/wijit-gadget/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Lijit Wijit Gadget</title>
		<link>http://www.mrbalky.com/2007/03/22/lijit-wijit-gadget/</link>
		<comments>http://www.mrbalky.com/2007/03/22/lijit-wijit-gadget/#comments</comments>
		<pubDate>Thu, 22 Mar 2007 13:34:22 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[experiments]]></category>
		<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://mrbalky.com/?p=18</guid>
		<description><![CDATA[A few weeks ago I quit using My Yahoo! and began using Google Personalized Home instead. Call me old school; I still like that kind of portal page. Google Home is just a collection of &#8220;Gadgets&#8221;, and it turns out it&#8217;s pretty easy to write one. So I converted the Lijit Wijit into a Gadget; [...]]]></description>
			<content:encoded><![CDATA[<p>A few weeks ago I quit using <a href="http://my.yahoo.com">My Yahoo!</a> and began using <a href="http://www.google.com/ig?hl=en">Google Personalized Home</a> instead.  Call me old school; I still like that kind of portal page.</p>
<p>Google Home is just a collection of &#8220;Gadgets&#8221;, and it turns out it&#8217;s pretty easy to write one.  So I converted the <a href="http://www.lijit.com/informers/wijit_kit/pvs">Lijit Wijit</a> into a Gadget; the Lijit Wijit Gadget.</p>
<p>So if you use Google Home and want to add MrBalky search to your page, just <a href="http://www.lijit.com/wijitgadget/gadgetInstall.php?username=MrBalky"> <img src="http://www.google.com/webmasters/add-to-google-plus.gif" alt="add to google home" /></a>.</p>
<p>It&#8217;s really just a test.  Look for a better Lijit Wijit Gadget, coming soon from your good friends at <a href="http://www.lijit.com">Lijit Networks</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2007/03/22/lijit-wijit-gadget/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Personal Network Search</title>
		<link>http://www.mrbalky.com/2006/12/27/personal-network-search/</link>
		<comments>http://www.mrbalky.com/2006/12/27/personal-network-search/#comments</comments>
		<pubDate>Wed, 27 Dec 2006 18:39:12 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://mrbalky.com/?p=6</guid>
		<description><![CDATA[Search BalkyWorld over there on the right &#8211;> Would you like to know more? Read what Todd Vernon and Stan James have to say about PNS.]]></description>
			<content:encoded><![CDATA[<p>Search BalkyWorld over there on the right &#8211;></p>
<p>Would you like to know more?  Read what <a href="http://falseprecision.typepad.com/my_weblog/2006/12/its_the_year_of.html">Todd Vernon</a> and <a href="http://wanderingstan.com/2006-12-26/personal_network_search">Stan James</a> have to say about PNS.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2006/12/27/personal-network-search/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Add lijit personal search to WordPress</title>
		<link>http://www.mrbalky.com/2006/12/18/add-lijit-search-to-wordpress/</link>
		<comments>http://www.mrbalky.com/2006/12/18/add-lijit-search-to-wordpress/#comments</comments>
		<pubDate>Mon, 18 Dec 2006 17:03:05 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[lijit]]></category>
		<category><![CDATA[tips]]></category>

		<guid isPermaLink="false">http://mrbalky.com/?p=4</guid>
		<description><![CDATA[Here&#8217;s how I did it, anyway: Go to your lijit page (mine is http://www.lijit.com/users/MrBalky) Click &#8220;Customize your search&#8221; to enter your Google account info Back on your lijit page, click &#8220;Get this wijit&#8221; in the search box Copy the javascript wijit code. Head to your WordPress admin pages -> Presentation -> Theme Editor. Click &#8220;searchform.php&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s how <strong>I</strong> did it, anyway:</p>
<ol>
<li>Go to your lijit page (mine is <a title="MrBalky's lijit" href="http://www.lijit.com/users/MrBalky">http://www.lijit.com/users/MrBalky</a>)</li>
<li>Click &#8220;Customize your search&#8221; to enter your Google account info</li>
<li>Back on your lijit page, click &#8220;Get this wijit&#8221; in the search box</li>
<li>Copy the javascript wijit code.</li>
<li>Head to your WordPress admin pages -> Presentation -> Theme Editor.</li>
<li>Click &#8220;searchform.php&#8221; on the lower-right.</li>
<li>Add the wijit code.  (Since there&#8217;s nothing to search on this blog, I just removed the existing search code.)</li>
<li>Click &#8220;update file>>&#8221;, and you&#8217;re done.</li>
</ol>
<p>It&#8217;s a snap.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2006/12/18/add-lijit-search-to-wordpress/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>First Post</title>
		<link>http://www.mrbalky.com/2006/12/16/first-post/</link>
		<comments>http://www.mrbalky.com/2006/12/16/first-post/#comments</comments>
		<pubDate>Sat, 16 Dec 2006 23:06:12 +0000</pubDate>
		<dc:creator>mrbalky</dc:creator>
				<category><![CDATA[lijit]]></category>

		<guid isPermaLink="false">http://mrbalky.com/mrbalky/?p=3</guid>
		<description><![CDATA[Tryin&#8217; out WordPress just to be a little more lijit.]]></description>
			<content:encoded><![CDATA[<p>Tryin&#8217; out WordPress just to be a little more <a title="lijit.com" href="http://lijit.com">lijit.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.mrbalky.com/2006/12/16/first-post/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

