Agile Data Science 2.0 - Russell Jurney

Agile Data Science 2.0

Building Full-Stack Data Analytics Applications with Spark

Russell Jurney (Author)

Book | Softcover
352 pages
2017 | 2nd, revised edition
O'Reilly Media (publisher)
978-1-4919-6011-0 (ISBN)
53.95 incl. VAT
Data science teams looking to turn research into useful analytics applications require not only the right tools, but also the right approach if they’re to succeed. With the revised second edition of this hands-on guide, up-and-coming data scientists will learn how to use the Agile Data Science development methodology to build data applications with Python, Apache Spark, Kafka, and other tools.

Author Russell Jurney demonstrates how to compose a data platform for building, deploying, and refining analytics applications with Apache Kafka, MongoDB, Elasticsearch, d3.js, scikit-learn, and Apache Airflow. You’ll learn an iterative approach that lets you quickly change the kind of analysis you’re doing, depending on what the data is telling you. Publish data science work as a web application, and effect meaningful change in your organization.
  • Build value from your data in a series of agile sprints, using the data-value pyramid
  • Extract features for statistical models from a single dataset
  • Visualize data with charts, and expose different aspects through interactive reports
  • Use historical data to predict the future via classification and regression
  • Translate predictions into actions
  • Get feedback from users after each sprint to keep your project on track

Russell Jurney cut his data teeth in casino gaming, building web apps to analyze the performance of slot machines in the US and Mexico. After dabbling in entrepreneurship, interactive media, and journalism, he moved to Silicon Valley to build analytics applications at scale at Ning and LinkedIn. He lives on the ocean in Pacifica, California, with his wife Kate and two fuzzy dogs.

Chapter 1 Theory
Chapter 2 Agile Tools
Chapter 3 Data
Chapter 4 Collecting and Displaying Records
Chapter 5 Visualizing Data with Charts and Tables
Chapter 6 Exploring Data with Reports
Chapter 7 Making Predictions
Chapter 8 Deploying Predictive Systems
Chapter 9 Improving Predictions
Chapter 10 Climbing The Pyramid: Incorporating the Weather
Appendix A Manual Installation

Publication date:
Place of publication: Sebastopol
Language: English
Dimensions: 150 x 250 mm
Weight: 666 g
Binding: Paperback
Subject area: Computer Science > Databases > Data Warehouse / Data Mining
Computer Science > Software Development > Agile Software Development
Keywords: Airflow • Apache • Data Analytics Tools • Data Science • elastic • Kafka • MongoDB • Python • Search • Spark
ISBN-10: 1-4919-6011-6 / 1491960116
ISBN-13: 978-1-4919-6011-0 / 9781491960110
Condition: New