The universe is the vast expanse of cosmic space
consisting of billions of galaxies. Galaxies are made of billions of
stars that revolve around a gravitation center of the black hole.
Quasars are quasi-stellar object which emits electromagnetic
radiation more potent than the luminosities of the galaxies,
combined. In this paper, the fourth version, the Sloan Digital
Sky Survey (SDSS-4), Data Release 16 dataset, was used to
classify the SDSS dataset into galaxies, stars, and quasars using
machine learning and deep learning architectures. We
efficiently utilize both image and metadata in tabular format to
build a novel multi-modal architecture and achieve state-of-the-
art results. For the tabular data, we compared classical machine
learning algorithms (Logistic Regression, Random Forest,
Decision Trees, Adaboost, LightGBM, etc.) with artificial neural
networks. Deep learning architecture such as Resnet50, VGG16,
EfficientNetB2, Xception, and Densenet121 have been used for
images. Our works shed new light on multi-modal deep learning
with their ability to handle imbalanced class datasets. The multi-
modal architecture further resulted in higher metrics (accuracy,
precision, recall, F1 score) than models using only images or
tabular data.