Sep 14 2010

Episode 86: Hadoop

Special guest Eric Wendelin tells us about Hadoop

News/Follow-Ups – 01:29

Geek Tools – 03:50

Web Apps – 06:14

  • SynchTube – Watch YouTube videos in sync with your friends
  • SwipeGood – Rounds your transactions up to the nearest dollar and donates the difference (via Pol Llovet)

Hadoop – 18:25

  • Eric Wendelin
  • What is Hadoop?
  • MapReduce?
  • Where is it useful? Give some examples.
    • Server log analysis, image processing, search indexes, sorting, recommendations (Netflix, Amazon)
  • Isn’t it a lot of work to get a Hadoop cluster running?
  • Suppose I own a medium-size business that uses an Oracle database. Why would I go through the effort of building a Hadoop cluster when I can just buy a bigger database machine?
  • When would I NOT want to use Hadoop?
    • Eric’s rule of 1 million
  • Testing Hadoop jobs
  • How can I get started?
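
For anyone who wants a feel for the MapReduce idea before listening, here is a minimal sketch in plain Python of the server log analysis example from the list above. It does not use Hadoop or any Hadoop API. It only imitates the map, shuffle, and reduce steps in a single process. The log format (a path followed by a status code) and the function names map_phase, shuffle, reduce_phase, and run_job are assumptions made up for illustration, not something taken from the episode.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (status_code, 1) pair for one log line.

    Assumes a hypothetical log format of "<path> <status>".
    Lines that do not have at least two fields are skipped.
    """
    parts = line.split()
    if len(parts) >= 2:
        yield parts[1], 1


def shuffle(pairs):
    """Shuffle step: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: sum the counts collected for one key."""
    return key, sum(values)


def run_job(lines):
    """Run map, shuffle, and reduce in order, all in one process."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Made-up sample data, just to show the shape of the input.
sample_log = [
    "/index.html 200",
    "/missing 404",
    "/about.html 200",
]

print(run_job(sample_log))  # {'200': 2, '404': 1}
```

On a real cluster the map and reduce steps would run on many machines at once, with the framework handling the grouping in between. That distribution is the part this sketch leaves out.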