Amazon cover image
Image from Amazon.com

Learning Spark

By: Contributor(s): Material type: TextTextPublication details: New Delhi : SPD, 2015.Description: xvi, 254 p. : ill. ; 24 cmISBN:
  • 9789351109945
Subject(s): DDC classification:
  • 006.312 23 KAR-L
LOC classification:
  • QA76.9.D343 K363 2015
Contents:
Introduction to data analysis with Spark -- Downloading Spark and getting started -- Programming with RDDs -- Working with key/value pairs -- Loading and saving your data -- Advanced Spark programming -- Running on a cluster -- Tuning and debugging Spark -- Spark SQL -- Spark streaming -- Machine learning with MLlib.
Summary: This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.--
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Collection Call number Status Date due Barcode Item holds
Books Books IIITD General Stacks Computer Science and Engineering 006.312 KAR-L (Browse shelf(Opens below)) Available 006120
Total holds: 0
Browsing IIITD shelves, Shelving location: General Stacks, Collection: Computer Science and Engineering Close shelf browser (Hides shelf browser)
No cover image available
006.312 FRA-9 97 things about ethics everyone in data science should know : 006.312 JAN-D Data analysis : 006.312 JUR-A Agile data science: 006.312 KAR-L Learning Spark 006.312 KOT-D Data science 006.312 MAC-9 97 things every data engineer should know : 006.312 MAH-D Data analytics

Introduction to data analysis with Spark -- Downloading Spark and getting started -- Programming with RDDs -- Working with key/value pairs -- Loading and saving your data -- Advanced Spark programming -- Running on a cluster -- Tuning and debugging Spark -- Spark SQL -- Spark streaming -- Machine learning with MLlib.

This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run. You'll learn how to express parallel jobs with just a few lines of code, and cover applications from simple batch jobs to stream processing and machine learning.--

There are no comments on this title.

to post a comment.
© 2024 IIIT-Delhi, library@iiitd.ac.in