Descargas de libros gratis para Android Professional Spark: Big Data Cluster Computing in Production

Professional Spark: Big Data Cluster Computing in Production

Professional Spark: Big Data Cluster Computing in Production

by Ema Iancuta, Kai Sasaki, Anikate Singh, Brennon York

Editorial: Wiley
ISBN: 9781119254010
Número de páginas: 260
Formatos: pdf, ePub, mobi, fb2
Tamaño de archivo: 7 Mb
Fecha de publicación: 2020-08-08
DESCARGAR

Descripción

Professional Spark addresses the challenges in moving from proof-of-concept or demo Spark applications to live Spark in production. It covers: Cluster Managers Performance Tuning Security Lifecycle of Jobs Shared experience about what problems are encountered while running Spark in production. Real use cases where Spark fits best. Data warehouse with Spark SQL How to schedule resources on a production cluster between Spark applications. Tips and tricks. Spark with Tachyon. The benefits of storing the the RDDs of heap. Describe the available spark db connectors. Tips and tricks, and how to use them. What are the limitations and the advantages of using spark MLlib? Spark streaming in production. What are the limitations? The problems encountered? How were they fixed? most important for a production environment : security. How to ensure security on spark (Kerberos) Spark on Yarn Spark on Mesos Spark Hardware requirements / estimating cluster size