Faculty and Staff Scholarship

Galaxy and Mass Assembly: Automatic morphological classification of galaxies using statistical learning

Sreevarsha Sreejith, University of Innsbruck
Sergiy Pereverzyev, University of Innsbruck
Lee S. Kelvin, University of Innsbruck
Francine R. Marleau, University of Innsbruck
Markus Haltmeier, University of Innsbruck
Judith Ebner, University of Innsbruck
Joss Bland-Hawthorn, The University of Sydney
Simon P. Driver, The University of Western Australia
Alister W. Graham, Swinburne University of Technology
Benne W. Holwerda, University of LouisvilleFollow
Andrew M. Hopkins, Australian Astronomical Observatory
Jochen Liske, Universität Hamburg
Jon Loveday, University of Sussex
Amanda J. Moffett, Vanderbilt University
Kevin A. Pimbblet, University of Hull
Edward N. Taylor, Swinburne University of Technology
Lingyu Wang, SRON Netherlands Institute for Space Research
Angus H. Wright, Universität Bonn

Document Type

Article

Publication Date

3-1-2018

Department

Physics and Astronomy

Abstract

We apply four statistical learning methods to a sample of 7941 galaxies (z < 0.06) from the Galaxy And Mass Assembly survey to test the feasibility of using automated algorithms to classify galaxies. Using 10 features measured for each galaxy (sizes, colours, shape parameters, and stellar mass), we apply the techniques of Support Vector Machines, Classification Trees, Classification Trees with Random Forest (CTRF) and Neural Networks, and returning True Prediction Ratios (TPRs) of 75.8 per cent, 69.0 per cent, 76.2 per cent, and 76.0 per cent, respectively. Those occasions whereby all four algorithms agree with each other yet disagree with the visual classification ('unanimous disagreement') serves as a potential indicator of human error in classification, occurring in ~ 9 per cent of ellipticals, ~ 9 per cent of little blue spheroids, ~ 14 per cent of early-type spirals, ~ 21 per cent of intermediate-type spirals, and ~ 4 per cent of late-type spirals and irregulars. We observe that the choice of parameters rather than that of algorithms is more crucial in determining classification accuracy. Due to its simplicity in formulation and implementation, we recommend the CTRF algorithm for classifying future galaxy data sets. Adopting the CTRF algorithm, the TPRs of the five galaxy types are: E, 70.1 per cent; LBS, 75.6 per cent; S0-Sa, 63.6 per cent; Sab-Scd, 56.4 per cent, and Sd-Irr, 88.9 per cent. Further, we train a binary classifier using this CTRF algorithm that divides galaxies into spheroid-dominated (E, LBS, and S0-Sa) and disc-dominated (Sab-Scd and Sd-Irr), achieving an overall accuracy of 89.8 per cent. This translates into an accuracy of 84.9 per cent for spheroid-dominated systems and 92.5 per cent for disc-dominated systems.

ThinkIR Citation

Sreejith, Sreevarsha; Pereverzyev, Sergiy; Kelvin, Lee S.; Marleau, Francine R.; Haltmeier, Markus; Ebner, Judith; Bland-Hawthorn, Joss; Driver, Simon P.; Graham, Alister W.; Holwerda, Benne W.; Hopkins, Andrew M.; Liske, Jochen; Loveday, Jon; Moffett, Amanda J.; Pimbblet, Kevin A.; Taylor, Edward N.; Wang, Lingyu; and Wright, Angus H., "Galaxy and Mass Assembly: Automatic morphological classification of galaxies using statistical learning" (2018). Faculty and Staff Scholarship. 509.
https://ir.library.louisville.edu/faculty/509

DOI

10.1093/MNRAS/STX2976

ORCID

0000-0002-4884-6756

Download

Find in your library

Included in

Astrophysics and Astronomy Commons

COinS

Faculty and Staff Scholarship

Galaxy and Mass Assembly: Automatic morphological classification of galaxies using statistical learning

Document Type

Publication Date

Department

Abstract

ThinkIR Citation

DOI

ORCID

Included in

Search

Browse

Author Corner

Links

Contact:

Faculty and Staff Scholarship

Galaxy and Mass Assembly: Automatic morphological classification of galaxies using statistical learning

Authors

Document Type

Publication Date

Department

Abstract

ThinkIR Citation

DOI

ORCID

Included in

Share

Search

Browse

Author Corner

Links

Contact: