Guide to High Performance Distributed Computing
Case Studies with Hadoop, Scalding and Spark
2015
Springer International Publishing (publisher)
978-3-319-13496-3 (ISBN)
This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; provides detailed case studies on approaches to clustering, data classification and regression analysis; explains the process of creating a working recommender system using Scalding and Spark.
Part I: Programming Fundamentals of High Performance Distributed Computing
Introduction
Getting Started with Hadoop
Getting Started with Spark
Programming Internals of Scalding and Spark
Part II: Case Studies using Hadoop, Scalding and Spark
Case Study I: Data Clustering using Scalding and Spark
Case Study II: Data Classification using Scalding and Spark
Case Study III: Regression Analysis using Scalding and Spark
Case Study IV: Recommender System using Scalding and Spark
Publication date (per publisher) | 9 March 2015 |
---|---|
Series | Computer Communications and Networks |
Additional info | XVII, 304 p. 43 illus. |
Place of publication | Cham |
Language | English |
Dimensions | 155 x 235 mm |
Weight | 649 g |
Subject area | Mathematics / Computer Science ► Computer Science ► Networks |
Keywords | algorithms • Case Studies • Hadoop • High Performance Computing • Spark |
ISBN-10 | 3-319-13496-5 / 3319134965 |
ISBN-13 | 978-3-319-13496-3 / 9783319134963 |
Condition | New |