Astronomy and Big Data

A Data Clustering Approach to Identifying Uncertain Galaxy Morphology
Book | Hardcover
XII, 105 pages
2014
Springer International Publishing (publisher)
978-3-319-06598-4 (ISBN)


Astronomy and Big Data - Kieran Jay Edwards, Mohamed Medhat Gaber
106.99 incl. VAT

With the onset of massive cosmological data collection through media such as the Sloan Digital Sky Survey (SDSS), galaxy classification has been accomplished for the most part with the help of citizen science communities like Galaxy Zoo. Seeking the wisdom of the crowd for such Big Data processing has proved extremely beneficial. However, an analysis of one of the Galaxy Zoo morphological classification data sets has shown that a significant majority of all classified galaxies are labelled as "Uncertain".

This book reports on how to use data mining, and more specifically clustering, to identify galaxies for which the public has shown some uncertainty about which morphology type they belong to. The book shows the importance of transitions between different data mining techniques in an insightful workflow. It demonstrates that clustering makes it possible to identify discriminating features in the analysed data sets, adopting a novel feature selection algorithm called Incremental Feature Selection (IFS). The book also shows how state-of-the-art classification techniques, Random Forests and Support Vector Machines, are used to validate the acquired results. It concludes that a vast majority of these galaxies are, in fact, of spiral morphology, with a small subset potentially consisting of stars, elliptical galaxies or galaxies of other morphological variants.

Introduction.- Astronomy, Galaxies and Stars: An Overview.- Astronomical Data Mining.- Adopted Data Mining Methods.- Research Methodology.- Development of Data Mining Models.- Experimentation Results.- Conclusion and Future Work.

Publication date (per publisher): 29 April 2014
Series: Studies in Big Data
Additional info: XII, 105 p. 54 illus., 24 illus. in color.
Place of publication: Cham
Language: English
Dimensions: 155 x 235 mm
Weight: 350 g
Subject areas: Computer Science > Databases > Data Warehouse / Data Mining
Computer Science > Theory / Studies > Artificial Intelligence / Robotics
Natural Sciences > Physics / Astronomy > Astronomy / Astrophysics
Technology
Keywords: Astronomy • Big Data • citizen science • Data Clustering • Galaxy morphology • Galaxy Zoo Project
ISBN-10: 3-319-06598-X / 331906598X
ISBN-13: 978-3-319-06598-4 / 9783319065984
Condition: New
Discover more
from this subject area
Auswertung von Daten mit pandas, NumPy und IPython

by Wes McKinney

Book | Softcover (2023)
O'Reilly (publisher)
44.90
Das umfassende Handbuch

by Wolfram Langer

Book | Hardcover (2023)
Rheinwerk (publisher)
49.90
Erfolgskonzepte für die datengetriebene Organisation

by Sebastian Wernicke

Book | Softcover (2023)
Vahlen (publisher)
29.80