Semantic Comparison of State-of-the-Art Deep Learning Methods for Image Multi-Label Classification.
arXiv (Cornell University)
Article 2019 English
Authors
AK
Adam Kubany
SI
Shimon Ben Ishay
RO
R. Ohayon
Abstract
1 min read
Image understanding relies heavily on accurate multi-label classification. In
recent years, deep learning (DL) algorithms have become very successful tools
for multi-label classification of image objects, and various implementations of
DL algorithms have been released for public use in the form of application
programming interfaces (APIs). In this study, we evaluate and compare 10 of the
most prominent publicly available APIs in a best-of-breed challenge. The
evaluation is performed on the Visual Genome labeling benchmark dataset using
12 well-recognized similarity metrics. In addition, for the first time in this
kind of comparison, we use a semantic similarity metric to evaluate the
semantic similarity performance of these APIs. In this evaluation, Microsoft's
Computer Vision, TensorFlow, Imagga, and IBM's Visual Recognition performed
better than the other APIs. Furthermore, the new semantic similarity metric
provided deeper insights for comparison.
Discussion(0)
No comments yet. Be the first to comment.