Spark: Big Data Cluster Computing in Production PDF

Spark: Big Data Cluster Computing in Production PDF

Name:
Spark: Big Data Cluster Computing in Production PDF

Published Date:
03/01/2016

Status:
Active

Description:

Publisher:
John Wiley and Sons

Document status:
Active

Format:
Electronic (PDF)

Delivery time:
10 minutes

Delivery time (for Russian version):
200 business days

SKU:

Choose Document Language:
Need Help?

Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you through the challenges in moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, ML Lib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, db connectors, streaming, security, and much more.

Spark has become the tool of choice for many Big Data problems, with more active contributors than any other Apache Software project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

  • Review Spark hardware requirements and estimate cluster size
  • Gain insight from real-world production use cases
  • Tighten security, schedule resources, and fine-tune performance
  • Overcome common problems encountered using Spark in production

Spark works with other big data tools including MapReduce and Hadoop, and uses languages you already know like Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.


ISBN(s) : 9781119254010
Published : 03/01/2016

History


Related products


Best-Selling Products

SA/SNZ HB 119:2019
Published Date: 12/20/2019
Mines and quarries electrical protection
$43.362
SA/SNZ HB 146:2018
Published Date: 05/23/2018
Management of electrical cable in mines and quarries
$30.492
SA/SNZ HB 168:2017
Published Date: 09/18/2017
Document control
$30.492
SA/SNZ HB 205:2017
Published Date: 06/30/2017
Managing health-and-safety-related risk
$23.562
SA/SNZ HB 252:2014
Published Date: 04/28/2014
Communications cabling manual - Module 3: Residential communications cabling handbook
$33.264
SA/SNZ HB 331:2020 Amd 1:2020
Published Date: 10/01/2020
Overhead line design handbook
Free Download