My colleagues and clients at StyleFeeder are good enough to let me post on their tech blog from time to time. I’m exploring Hadoop on their behalf, as partially described here: http://blog.tech.stylefeeder.com/2010/01/14/hadoop-for-the-lone-analyst/. That post is basically a HOWTO for Hadoop 0.20 + Apache logs + MySQL on EC2, with tips on streaming, compression, Pig, Red Hat/CentOS and the Cloudera Python scripts for EC2.