Professional Hadoop is the complete reference and resource for experienced developers looking to employ Apache Hadoop in real-world settings. Written by an expert team of certified Hadoop developers, committers, and Summit speakers, this book details every key aspect of Hadoop technology to enable optimal processing of large data sets. Designed expressly for the professional developer, this book skips over the basics of database development to get you acquainted with the framework's processes and capabilities right away. The discussion covers each key Hadoop component individually, culminating in a sample application that brings all of the pieces together to illustrate the cooperation and interplay that make Hadoop a major big data solution.
Chapter 1 Hadoop Introduction
Chapter 2 Storage
Chapter 3 Computation
Chapter 4 User Experience
Chapter 5 Integration with Other Systems
Chapter 6 Hadoop Security
Chapter 7 Ecosystem at Large: Hadoop with Apache Bigtop
Chapter 8 In-Memory Computing in Hadoop Stack
Benoy Antony is an Apache Hadoop Committer and is a Senior MTS, Architect at eBay. He is also a Hadoop Summit speaker.
Konstantin Boudnik is an Apache Hadoop Committer and Vice President, Open Source Development at WANdisco. He has over 20 years of extensive background in software development, clustered and distributed systems' design and implementation.
Cheryl Adams is a Senior Cloud Data & Infrastructure Architect. Her work includes supporting HealthCare Data for large government contracts, deploying production based changes through scripting, monitoring and troubleshooting, and monitoring environments using the latest tools for databases, web servers, web API and storage.
Branky Shao is a software engineer at eBay in San Francisco where he is building real time applications with Kafka and Storm. He has extensive experience designing and implementing various software: distributed systems, data integration, framework/APIs and web applications.
Cazen Lee is a Software Architect at Samsung SDS. He is currently in charge of the Hadoop module for Samsung's big data platform. Prior to joining Samsung, Cazen served as a developer and architect for the integrated data warehouse layer in the financial industry, including work with Samsung Life Insurance and Korea Securities Finance Corp.
Kai Sasaki is a Software Engineer, Yahoo! He is an engineer focused on web and data processing platforms. He develops and maintains notification platforms using APNS and GCM, develops data processing platforms such as Hadoop and Storm, contributor to Hadoop, Spark and Storm projects, has operated and scaled thousands of Hadoop clusters and is a committer to the DeepLearning4J project.