Guide to Intelligent Data Science - Michael R. Berthold, Christian Borgelt, Frank Höppner, Frank Klawonn, Rosaria Silipo

Guide to Intelligent Data Science

How to Intelligently Make Use of Real Data
Book | Softcover
XIII, 420 pages
2021 | 2nd ed. 2020
Springer International Publishing (publisher)
978-3-030-45576-7 (ISBN)
60,98 incl. VAT

Making use of data is no longer a niche project but central to almost every project. With access to massive compute resources and vast amounts of data, it seems at least in principle possible to solve any problem. However, successful data science projects result from the intelligent application of: human intuition in combination with computational power; sound background knowledge with computer-aided modelling; and critical reflection on the obtained insights and results.

Substantially updating the previous edition, then entitled Guide to Intelligent Data Analysis, this core textbook continues to provide a hands-on instructional approach to many data science techniques, and explains how these are used to solve real-world problems. The work balances the practical aspects of applying and using data science techniques with the theoretical and algorithmic underpinnings from mathematics and statistics. Major updates on techniques and subject coverage (including deep learning) are included.

Topics and features: guides the reader through the process of data science, following the interdependent steps of project understanding, data understanding, data blending and transformation, modeling, as well as deployment and monitoring; includes numerous examples using the open source KNIME Analytics Platform, together with an introductory appendix; provides a review of the basics of classical statistics that support and justify many data analysis methods, and a glossary of statistical terms; integrates illustrations and case-study-style examples to support pedagogical exposition; supplies further tools and information at an associated website.

This practical and systematic textbook/reference is a "need-to-have" tool for graduate and advanced undergraduate students and essential reading for all professionals who face data science problems. Moreover, it is a "need to use, need to keep" resource following one's exploration of the subject.

Prof. Dr. Michael R. Berthold is Professor for Bioinformatics and Information Mining in the Department of Computer Science at the University of Konstanz, Germany.

Prof. Dr. Christian Borgelt is Professor for Data Science in the departments of Mathematics and Computer Sciences at the Paris Lodron University of Salzburg, Austria; he also co-authored the Springer textbook, Computational Intelligence.

Prof. Dr. Frank Höppner is Professor of Information Engineering in the Department of Computer Science at Ostfalia University of Applied Sciences, Wolfenbüttel, Germany.

Prof. Dr. Frank Klawonn is Professor for Data Analysis and Pattern Recognition at the same institution and head of the Biostatistics Group at the Helmholtz Centre for Infection Research, Braunschweig, Germany; he has authored the Springer textbook, Introduction to Computer Graphics.

Dr. Rosaria Silipo is a Principal Data Scientist and Head of Evangelism at KNIME AG, Zurich, Switzerland.

Introduction.- Practical Data Analysis: An Example.- Project Understanding.- Data Understanding.- Principles of Modeling.- Data Preparation.- Finding Patterns.- Finding Explanations.- Finding Predictors.- Evaluation and Deployment.- The Labelling Problem.- Appendix A: Statistics.- Appendix B: KNIME.

Publication date
Series Texts in Computer Science
Additional info XIII, 420 p. 179 illus., 122 illus. in color.
Place of publication Cham
Language English
Dimensions 155 x 235 mm
Weight 659 g
Subject areas Computer Science > Databases > Data Warehouse / Data Mining
Computer Science > Theory / Studies > Artificial Intelligence / Robotics
Keywords Bioinformatics • Calculus • classification • Cognition • Data Analysis • Databases • KNIME • Knowledge • Modeling • pattern recognition • Statistics
ISBN-10 3-030-45576-9 / 3030455769
ISBN-13 978-3-030-45576-7 / 9783030455767
Condition New