Home > Published Issues > 2023 > Volume 14, No. 6, 2023 >
JAIT 2023 Vol.14(6): 1261-1272
doi: 10.12720/jait.14.6.1261-1272

Observation-Centric with Appearance Metric for Computer Vision-Based Vehicle Counting

Allysa Kate Brillantes 1,2,*, Edwin Sybingco 1,2,3, Robert Kerwin Billones 2,3,4, Argel Bandala 1,2, Alexis Fillone 2,3,5, and Elmer Dadios 2,3,4
1. Department of Electronics and Computer Engineering, De La Salle University, Manila, Philippines;
Email: edwin.sybingco@dlsu.edu.ph (E.S.), argel.bandala@dlsu.edu.ph (A.B.)
2. Center for Engineering and Sustainable Development Research, Manila, Philippines;
Email: robert.billones@dlsu.edu.ph (R.K.B.), alexis.fillone@dlsu.edu.ph (A.F.), elmer.dadios@dlsu.edu.ph (E.D.)
3. Intelligent Systems Innovation (ISI, Inc.), Philippines
4. Department of Manufacturing Engineering and Management, De La Salle University, Manila, Philippines
5. Department of Civil Engineering, De La Salle University, Manila, Philippines
*Correspondence: allysa_brillantes@dlsu.edu.ph (A.K.B.)

Manuscript received April 13, 2023; revised June 19, 2023; accepted July 19, 2023; published November 24, 2023.

Abstract—Tracking objects in video sequences is a key step for applications involving computer vision like traffic monitoring and security systems. Occlusion is a frequent problem in object tracking which can result in the tracker losing track of the occluded object or misidentifying it with the occluding object. Moreover, the limited memory and computing power of traffic analysis systems presents a scaling problem, especially in object tracking applications. This paper aims to improve object tracking performance by minimizing data association errors in low frame rate tracking applications. Reducing frame rates alleviates memory and computing power limitations, and utilizing a tracker that can handle occlusion can address occlusion-related issues in object tracking. The proposed tracking method, Mask-OCSORT, uses the observation-tracking method with cosine similarity, intersection-over-union, and velocity consistency metrics for the association problem. The paper analyzes the effect of using bounding box and mask predictions of deep learning models in generating tracks. This study uses evaluation metrics like HOTA, MOTA, and IDF1 to assess the proposed tracking method, and employs evaluation metrics such as precision, recall, and F-score to assess the counting based on generated IDs from the tracking method. The study applied the Mask-OCSORT tracking for vehicle counting application and achieved an F-score of 87.18% at 5 frames per second (fps), and 75% at 1 fps.
 
Keywords—mask-OCSORT, object tracking, instance segmentation, traffic surveillance systems, low framerate, data association

Cite: Allysa Kate Brillantes, Edwin Sybingco, Robert Kerwin Billones, Argel Bandala, Alexis Fillone, and Elmer Dadios, "Observation-Centric with Appearance Metric for Computer Vision-Based Vehicle Counting," Journal of Advances in Information Technology, Vol. 14, No. 6, pp. 1261-1272, 2023.

Copyright © 2023 by the authors. This is an open access article distributed under the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits use, distribution and reproduction in any medium, provided that the article is properly cited, the use is non-commercial and no modifications or adaptations are made.