Description
Integrating Hadoop leverages the discipline of data integration and applies it to Hadoop, the open-source software framework for storing data on clusters of commodity hardware. It is packed with the need-to-know for managers, architects, designers, and developers responsible for populating Hadoop in the enterprise, allowing you to harness big data in such a way that the solution:

- Complies with (and even extends) enterprise standards
- Integrates seamlessly with the existing information infrastructure
- Fills a critical role within enterprise architecture

Integrating Hadoop covers the gamut of the setup, architecture, and possibilities for Hadoop in the organization, including:

- Supporting an enterprise information strategy
- Organizing for a successful Hadoop rollout
- Loading and extracting data in Hadoop
- Managing Hadoop data once it's in the cluster
- Utilizing Spark, streaming data, and master data in Hadoop processes

Examples are provided to reinforce concepts.