Elasticsearch for Hadoop PDF

Name:
Elasticsearch for Hadoop PDF

Published Date:
10/27/2015

Status:
Active

Publisher:
PACKT - Packt Publishing, Inc.

Format:
Electronic (PDF)

Delivery time:
10 minutes

Delivery time (for Russian version):
200 business days

Price:
$10.8

ISBN: 9781785288999

Integrate Elasticsearch into Hadoop to effectively visualize and analyze your data

About This Book

• Build production-ready analytics applications by integrating the Hadoop ecosystem with Elasticsearch

• Learn complex Elasticsearch queries and develop real-time monitoring Kibana dashboards to visualize your data

• Use Elasticsearch and Kibana to search data in Hadoop easily with this comprehensive, step-by-step guide

Who This Book Is For

This book is aimed at Java developers with a basic knowledge of Hadoop. No prior Elasticsearch experience is expected.

What You Will Learn

• Set up the Elasticsearch-Hadoop environment

• Import HDFS data into Elasticsearch with MapReduce jobs

• Perform full-text search and aggregations efficiently using Elasticsearch

• Visualize data and create interactive dashboards using Kibana

• Check and detect anomalies in streaming data using Storm and Elasticsearch

• Inject and classify real-time streaming data into Elasticsearch

• Get production-ready for Elasticsearch-Hadoop based projects

• Integrate with Hadoop ecosystem tools such as Pig, Storm, Hive, and Spark

In Detail

The Hadoop ecosystem is a de facto standard for processing terabytes and petabytes of data. Lucene-based Elasticsearch is becoming an industry standard for its full-text search and aggregation capabilities. Elasticsearch-Hadoop bridges the Elasticsearch and Hadoop ecosystems so that you can get the best of both worlds. Combined with Kibana, this stack makes it easy to quickly draw insights from the massive amounts of data in your Hadoop ecosystem.

In this book, you'll learn to use Elasticsearch, Kibana and Elasticsearch-Hadoop effectively to analyze and understand your HDFS and streaming data.

You begin with an in-depth look at setting up Hadoop, Elasticsearch, Marvel, and Kibana. You will then learn to import Hadoop data into Elasticsearch by writing a MapReduce job in a real-world example. This is followed by a comprehensive look at Elasticsearch essentials, such as full-text search analysis, queries, filters, and aggregations, after which you will learn to create various visualizations and interactive dashboards using Kibana. Classifying real-world streaming data and identifying trends in it using Storm and Elasticsearch are among the other topics covered. You will also gain insight into the key concepts of Elasticsearch and Elasticsearch-Hadoop in distributed mode, along with advanced configurations and some common configuration presets you may need for production deployments. You will get a "go to production" checklist and a high-level view of cluster administration after deployment. Towards the end, you will learn to integrate Elasticsearch with other Hadoop ecosystem tools, such as Pig, Hive, and Spark.

Style and approach

A concise yet comprehensive approach has been adopted with real-time examples to help you grasp the concepts easily.


Edition: 15
File Size: 1 file, 3.6 MB
Number of Pages: 222

Related products

Hands-On GPU Computing with Python
Published Date: 05/14/2019
$9
Instant Magento Shipping How-To
Published Date: 05/23/2013
$5.1
Advanced Machine Learning with Python
Published Date: 07/28/2016
$12
