Galaxy Zoo Talk

"Improving galaxy morphologies for SDSS with Deep Learning"

  • JeanTate by JeanTate

    That's the title of an astro-ph preprint, dated 15 November (but it is listed under 17 November), by H. Domínguez Sánchez, M. Huertas-Company, M. Bernardi, D. Tuccillo, and J. L. Fisher.

    Here's the abstract:

    We present a morphological catalogue for ∼ 670,000 galaxies in the Sloan Digital Sky Survey in two flavours: T-Type, related to the Hubble sequence, and Galaxy Zoo 2 (GZ2 hereafter) classification scheme. By combining accurate existing visual classification catalogues with machine learning, we provide the largest and most accurate morphological catalogue up to date. The classifications are obtained with Deep Learning algorithms using Convolutional Neural Networks (CNNs).
    We use two visual classification catalogues, the GZ2 and the Nair et al. 2010, for training CNNs with color images in order to obtain T-Types and a series of GZ2 type questions (disk/features, edge-on galaxies, bar signature, bulge prominence, roundness and mergers). We also provide an additional probability enabling a separation between pure elliptical (E) from S0, where the T-Type model is not so efficient. For the T-Type, our results show smaller offset and scatter than previous models trained with support vector machines. For the GZ2 type questions, our models have large accuracy (> 97%), precision and recall values (> 90%) when applied to a test sample with the same characteristics as the one used for training, i.e., with small uncertainties in the GZ2 classification. When we apply our models to a subset of galaxies with larger uncertainties in GZ2 (low agreement between classifiers), the resulting probability distributions show a clear bimodality. This allows us to recover a significant fraction of galaxies with a robust classification, for which the GZ2 was classification was uncertain. The catalogue is publicly released with the paper.

    link

    Posted