Data Mining Algorithms in C++ -  Timothy Masters

Data Mining Algorithms in C++ (eBook)

Data Patterns and Algorithms for Modern Applications
eBook Download: PDF
2017 | 1. Auflage
XIV, 286 Seiten
Apress (Verlag)
978-1-4842-3315-3 (ISBN)
Systemvoraussetzungen
79,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Discover hidden relationships among the variables in your data, and learn how to exploit these relationships.  This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications.  All algorithms include an intuitive explanation of operation, essential equations, references to more rigorous theory, and commented C++ source code.

Many of these techniques are recent developments, still not in widespread use.  Others are standard algorithms given a fresh look.  In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program.  The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.

What you'll learn
  • Monte-Carlo permutation tests provide statistically sound assessment of relationships present in your data.
  • Combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the data.
  • Feature weighting as regularized energy-based learning ranks variables according to their predictive power when there is too little data for traditional methods.
  • The eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the data.
  • Plotting regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is high, provides visual insight into anomalous relationships.

Who this book is for

The techniques presented in this book and in the DATAMINE program will be useful to anyone interested in discovering and exploiting relationships among variables.  Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.


Timothy Masters has a PhD in statistics and is an experienced programmer.  His dissertation was in image analysis.  His career moved in the direction of signal processing, and for the last 25 years he's been involved in the development of automated trading systems in various financial markets.


Discover hidden relationships among the variables in your data, and learn how to exploit these relationships.  This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications.  All algorithms include an intuitive explanation of operation, essential equations, references to more rigorous theory, and commented C++ source code.Many of these techniques are recent developments, still not in widespread use.  Others are standard algorithms given a fresh look.  In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program.  The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.What You'll LearnUse Monte-Carlo permutation tests to provide statistically sound assessments of relationships present in your dataDiscover how combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the dataWork with feature weighting as regularized energy-based learning to rank variables according to their predictive power when there is too little data for traditional methodsSee how the eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the dataPlot regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is highWho This Book Is ForAnyone interested in discovering and exploiting relationships among variables.  Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.

Timothy Masters has a PhD in statistics and is an experienced programmer.  His dissertation was in image analysis.  His career moved in the direction of signal processing, and for the last 25 years he's been involved in the development of automated trading systems in various financial markets.

1. Information and Entropy 2. Screening for Relationships 3. Displaying Relationship Anomalies 4. Fun With Eigenvectors 5. Using the DATAMINE Program

Erscheint lt. Verlag 15.12.2017
Zusatzinfo XIV, 286 p.
Verlagsort Berkeley
Sprache englisch
Themenwelt Informatik Datenbanken Data Warehouse / Data Mining
Informatik Programmiersprachen / -werkzeuge C / C++
Informatik Theorie / Studium Algorithmen
Informatik Theorie / Studium Compilerbau
Mathematik / Informatik Mathematik Analysis
Schlagworte algorithms • Big Data • C++ • Code • Data Mining • Mining • programming • Software • technique
ISBN-10 1-4842-3315-8 / 1484233158
ISBN-13 978-1-4842-3315-3 / 9781484233153
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 4,7 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
Achieve data excellence by unlocking the full potential of MongoDB

von Marko Aleksendric; Arek Borucki; Leandro Domingues …

eBook Download (2024)
Packt Publishing (Verlag)
53,99
A guide to developing efficient and elegant T-SQL code

von Pam Lahoud; Pedro Lopes

eBook Download (2024)
Packt Publishing (Verlag)
35,99