Book review - Planning for Big Data

I review for the O'Reilly Reader Review Program. Planning for Big Data - A CIO's Handbook to the Changing Data Landscape (published by O'Reilly and released in March 2012), by Edd Dumbill, is more a collection of articles than a book.

It starts with an introductory chapter on who should read the book and why. There are then two chapters on big data: what it is, what its attributes are, where it comes from, how it is used, and the challenges it poses.

The book then talks about Apache Hadoop, the flagship big data framework. It looks at how enterprise vendors have used Hadoop, taking EMC, IBM, Microsoft and Oracle as examples. It also devotes sections to vendors focused exclusively on big data, such as Cloudera and Hortonworks.

There is a separate chapter on Microsoft's plan for big data.

Another chapter deals with the challenges of keeping big data in the cloud and talks about the leading cloud players in big data - Amazon, Google and Microsoft.

Data marketplaces come next, an interesting concept in itself. This is followed by a chapter on NoSQL (by Mike Loukides), an alternative to relational databases.

Another chapter is devoted to how to turn big data into visualisations and why that matters.

The final chapter summarizes the discussions in the earlier chapters and offers an outlook on the future.

The book is written with the layperson in mind and is thus easy to read. While it pays sufficient attention to detail, it does not get into the technical nitty-gritty. The authors' experience with the subject shows in the chapters.

Parts of the book may be dated, since it was written in 2012, but the underlying concepts remain the same.

All in all, a nice book for gaining an insight into big data.

[[This book was reviewed as a part of the O'Reilly Blogger Review Program]]


