Detailed Information

Cited 1 time in webofscience Cited 0 time in scopus
Metadata Downloads

Scale-aware token-matching for transformer-based object detectoropen access

Authors
Jung, AecheonHong, SungeunHyun, Yoonsuk
Issue Date
Sep-2024
Publisher
Elsevier B.V.
Keywords
Object detection; Small object detection; Vision transformer
Citation
Pattern Recognition Letters, v.185, pp 197 - 202
Pages
6
Indexed
SCIE
SCOPUS
Journal Title
Pattern Recognition Letters
Volume
185
Start Page
197
End Page
202
URI
https://scholarx.skku.edu/handle/2021.sw.skku/112640
DOI
10.1016/j.patrec.2024.08.006
ISSN
0167-8655
1872-7344
Abstract
Owing to the advancements in deep learning, object detection has made significant progress in estimating the positions and classes of multiple objects within an image. However, detecting objects of various scales within a single image remains a challenging problem. In this study, we suggest a scale-aware token matching to predict the positions and classes of objects for transformer-based object detection. We train a model by matching detection tokens with ground truth considering its size, unlike the previous methods that performed matching without considering the scale during the training process. We divide one detection token set into multiple sets based on scale and match each token set differently with ground truth, thereby, training the model without additional computation costs. The experimental results demonstrate that scale information can be assigned to tokens. Scale-aware tokens can independently learn scale-specific information by using a novel loss function, which improves the detection performance on small objects. © 2024 The Author(s)
Files in This Item
There are no files associated with this item.
Appears in
Collections
Computing and Informatics > Convergence > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher HONG, SUNGEUN photo

HONG, SUNGEUN
Computing and Informatics (Convergence)
Read more

Altmetrics

Total Views & Downloads

BROWSE