"Scalding is a Scala library that makes it easy to specify Hadoop MapReduce jobs. Scalding is built on top of Cascading, a Java library that abstracts away low-level Hadoop details. Scalding is comparable to Pig, but offers tight integration with Scala, bringing advantages of Scala to your MapReduce jobs."
"HPaste unlocks the rich functionality of HBase for a Scala audience. In so doing, it attempts to achieve the following goals:
Provide a strong, clear syntax for querying and filtration
Perform as fast as possible while maintaining idiomatic Scala client code -- the abstractions should not show up in a profiler!
Re-articulate HBase's data structures rather than force them into an ORM-style atmosphere.
Provide a rich set of base classes for writing MapReduce jobs in Hadoop against HBase tables.
Provide a maximum amount of code re-use between general HBase client usage and operation from within a MapReduce job.
Use Scala's type system to its advantage--the compiler should verify the integrity of the schema.
Be a verbose DSL--minimize boilerplate code, but be human readable!"
If you've been tasked with maintaining large and complex Hadoop clusters, or are about to be, this book is a must. You'll learn the particulars of Hadoop operations, from planning, installing, and configuring the system to providing ongoing maintenance.