Exploratory Data Analysis with MATLAB - Wendy L. Martinez, Angel R. Martinez, Jeffrey Solka

Exploratory Data Analysis with MATLAB

Buch | Hardcover
590 Seiten
2017 | 3rd edition
Chapman & Hall/CRC (Verlag)
978-1-4987-7606-6 (ISBN)
149,85 inkl. MwSt
This book describes the various methods used for exploratory data analysis with an emphasis on MATLAB implementation. It covers approaches for visualizing data, data tours and animations, clustering (or unsupervised learning), dimensionality reduction, and more. A set of graphical user interfaces allows users to apply the ideas to their own data.
Praise for the Second Edition:
"The authors present an intuitive and easy-to-read book. … accompanied by many examples, proposed exercises, good references, and comprehensive appendices that initiate the reader unfamiliar with MATLAB."—Adolfo Alvarez Pinto, International Statistical Review

"Practitioners of EDA who use MATLAB will want a copy of this book. … The authors have done a great service by bringing together so many EDA routines, but their main accomplishment in this dynamic text is providing the understanding and tools to do EDA.

—David A Huckaby, MAA Reviews

Exploratory Data Analysis (EDA) is an important part of the data analysis process. The methods presented in this text are ones that should be in the toolkit of every data scientist. As computational sophistication has increased and data sets have grown in size and complexity, EDA has become an even more important process for visualizing and summarizing data before making assumptions to generate hypotheses and models.

Exploratory Data Analysis with MATLAB, Third Edition presents EDA methods from a computational perspective and uses numerous examples and applications to show how the methods are used in practice. The authors use MATLAB code, pseudo-code, and algorithm descriptions to illustrate the concepts. The MATLAB code for examples, data sets, and the EDA Toolbox are available for download on the book’s website.

New to the Third Edition






Random projections and estimating local intrinsic dimensionality



Deep learning autoencoders and stochastic neighbor embedding



Minimum spanning tree and additional cluster validity indices



Kernel density estimation



Plots for visualizing data distributions, such as beanplots and violin plots



A chapter on visualizing categorical data

Wendy L. Martinez is a mathematical statistician with the U.S. Bureau of Labor Statistics. She is a fellow of the American Statistical Association, a co-author of several popular Chapman & Hall/CRC books, and a MATLAB® user for more than 20 years. Her research interests include text data mining, probability density estimation, signal processing, scientific visualization, and statistical pattern recognition. She earned an M.S. in aerospace engineering from George Washington University and a Ph.D. in computational sciences and informatics from George Mason University. Angel R. Martinez is fully retired after a long career with the U.S. federal government and as an adjunct professor at Strayer University, where he taught undergraduate and graduate courses in statistics and mathematics. Before retiring from government service, he worked for the U.S. Navy as an operations research analyst and a computer scientist. He earned an M.S. in systems engineering from the Virginia Polytechnic Institute and State University and a Ph.D. in computational sciences and informatics from George Mason University. Since 1984, Jeffrey L. Solka has been working in statistical pattern recognition for the Department of the Navy. He has published over 120 journal, conference, and technical papers; has won numerous awards; and holds 4 patents. He earned an M.S. in mathematics from James Madison University, an M.S. in physics from Virginia Polytechnic Institute and State University, and a Ph.D. in computational sciences and informatics from George Mason University.

Introduction to Exploratory Data Analysis. Dimensionality Reduction – Linear Methods. Dimensionality Reduction – Nonlinear Methods. Data Tours. Finding Clusters. Model-Based Clustering. Smoothing Scatterplots. Visualizing Clusters. Distribution Shapes. Multivariate Visualization. Appendices.

Erscheinungsdatum
Reihe/Serie Chapman & Hall/CRC Computer Science & Data Analysis
Zusatzinfo 16 Illustrations, color; 84 Illustrations, black and white
Sprache englisch
Maße 156 x 234 mm
Gewicht 940 g
Themenwelt Mathematik / Informatik Mathematik Angewandte Mathematik
ISBN-10 1-4987-7606-X / 149877606X
ISBN-13 978-1-4987-7606-6 / 9781498776066
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Wie bewerten Sie den Artikel?
Bitte geben Sie Ihre Bewertung ein:
Bitte geben Sie Daten ein:
Mehr entdecken
aus dem Bereich