Hadoop in Action
Hadoop in Action
“A guide for beginners, a source of insight for advanced users.” —Philipp K. Janert Principal Value, LLC
B ook Description
Big data can be difficult to handle using traditional databases. Apache Hadoop is a NoSQL
applications framework that runs on distributed clusters. This lets it scale to huge datasets. if you
need analytic information from your data, Hadoop’s the way to go.
Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce
style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex
data analysis tasks. Included are best practices and design patterns of MapReduce programming.
This book requires basic Java skills. Knowing basic statistical concepts can help with the more
advanced examples.
F eatures
Introduction to MapReduce | Examples illustrating ideas in practice | Hadoop’s Streaming API
|Other related tools, like Pig and Hive
C ontents
Hadoop–a Distributed Programming Framework | Introducing Hadoop | Starting Hadoop |
Components of Hadoop | Hadoop In Action | Writing basic MapReduce programs | Advanced
MapReduce | Programming Practices | Cookbook | Managing Hadoop |Hadoop Gone Wild |
Running Hadoop in the cloud | Programming with Pig | Hive and the Hadoop herd | Case studies
Published by: DREAMTECH PRESS WILEY INDIA PVT. LTD. Distributed by:
19-A, Ansari Road, Daryaganj, New Delhi-110 002, INDIA 4435-36/7, Ansari Road, Daryaganj, New Delhi-110 002, INDIA
Tel: +91-11-2324 3463-73, Fax: +91-11-2324 3078 Tel: +91-11-4363 0000, Fax: +91-11-2327 5895
Email: feedback@dreamtechpress.com, Website: www.dreamtechpress.com Email: csupport@wiley.com, Website: www.wileyindia.com
Regional Offices: Bangalore: Tel: +91-80-2313 2383, Fax: +91-80-2312 4319, Email: blrsales@wiley.com
Mumbai: Tel: +91-22-2788 9263, 2788 9272, Telefax: +91-22-2788 9263, Email: mumsales@wiley.com
Dreamtech books are exclusively sold by Wiley India Pv t. Ltd.