Learning Real-time Processing with Spark Streaming

Learning Real-time Processing with Spark Streaming

by Sumit Gupta
English | 2015 | ISBN: 1783987669 | 198 Pages | True PDF | 5.6 MB

This book is intended for big data developers with basic knowledge of Scala but no knowledge of Spark. It will help you grasp the basics of developing real-time applications with Spark and understand efficient programming of core elements and applications.

What You Will Learn:

- Install and configure Spark and Spark Streaming to execute applications
- Explore the architecture and components of Spark and Spark Streaming to use it as a base for other libraries
- Process distributed log files in real-time to load data from distributed sources
- Apply transformations on streaming data to use its functions
- Integrate Apache Spark with the various advance libraries like MLib and GraphX
- Apply production deployment scenarios to deploy your application

Using practical examples with easy-to-follow steps, this book will teach you how to build real-time applications with Spark Streaming.

Starting with installing and setting the required environment, you will write and execute your first program for Spark Streaming. This will be followed by exploring the architecture and components of Spark Streaming along with an overview of libraries/functions exposed by Spark. Next you will be taught about various client APIs for coding in Spark by using the use-case of distributed log file processing. You will then apply various functions to transform and enrich streaming data. Next you will learn how to cache and persist datasets. Moving on you will integrate Apache Spark with various other libraries/components of Spark like Mlib, GraphX, and Spark SQL. Finally, you will learn about deploying your application and cover the different scenarios ranging from standalone mode to distributed mode using Mesos, Yarn, and private data centers or on cloud infrastructure.



[Fast Download] Learning Real-time Processing with Spark Streaming

Ebooks related to "Learning Real-time Processing with Spark Streaming" :
Semantic Data Mining : An Ontology-based Approach
PHP and MySQL for Dynamic Web Sites: Visual QuickPro Guide, 5th Edition
Oracle Database 12c PL/SQL Programming
MariaDB and MySQL Common Table Expressions and Window Functions Revealed
Data Security in Cloud Computing
Cloud Data Management
Phrase Mining from Massive Text and Its Applications
Learning Apache Mahout Classification
Database Modeling and Design, Fifth Edition: Logical Design, 5 edition
Microsoft SQL Server 2005 XML
Copyright Disclaimer:
This site does not store any files on its server. We only index and link to content provided by other sites. Please contact the content providers to delete copyright contents if any and email us, we'll remove relevant links or contents immediately.