This book is a practical guide to building solutions with Apache Hadoop. Unlike most books on the subject that are “a mile wide and an inch deep”, this book provides a deeper, code-level dive that shows how to use Hadoop technologies, in concert, to deliver real-world solutions. The authors provide in-depth code examples in Java and XML from applications that they have successfully built and deployed.
- Storing data with HDFS and Hbase
- Processing data with MapReduce and other technologies
- Automating data processing with Oozie
- Delivering real-time solutions with Hadoop
- Hadoop security
- Running Hadoop with Amazon Web Services
- And more.
The authors explain not just "how it works", but the when and why behind using these tools effectively. For example, they describe best practices for storing data and for calculations; customizing how data is read and executed; automating Hadoop processes in real-time; and building secure enterprise solutions that protect the company's investment without sacrificing availability. The book also covers recent additions to the Hadoop ecosystem, including multiple namespaces and MapReduce2. Not only does this book cover the use of the APIs that various Hadoop systems are exposing, but exposes their inner workings, allowing architects and developers to better leverage and customize them.
Unless otherwise noted above, most orders ship within 1 to 2 days. We will promptly notify you if there is a stock problem with any items on your order and provide you with an estimated delivery date. If you have a firm need by date, please provide such information in the comment section at checkout.
Page Count (est.): 504
Pub Date: 10/14/2013