Me on Hadoop Setup, at StyleFeeder

My colleagues and clients at StyleFeeder are good enough to let me post on their tech blog from time to time. I'm exploring Hadoop on their behalf, and part of that work is written up in a post there. It's basically a HOWTO for Hadoop 0.20 + Apache logs + MySQL on EC2, with tips on streaming, compression, Pig, Red Hat/CentOS, and the Cloudera Python scripts for EC2.