An Improved Speed Estimation Using Deep Homography Transformation Regression Network on Monocular Videos

Ervin Yohannes, Chih Yang Lin, K. Shih, Tipajin Thaipisutikul, Avirmed Enkhbat, Fitri Utaminingrum

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Vehicle speed estimation is one of the most critical issues in intelligent transportation system (ITS) research, while defining distance and identifying direction have become an inseparable part of vehicle speed estimation. Despite the success of traditional and deep learning approaches in estimating vehicle speed, the high cost of deploying hardware devices to get all related sensor data, such as infrared/ultrasonic devices, Global Positioning Systems (GPS), Light Detection and Ranging (LiDAR systems), and magnetic devices, has become the key barrier to improvement in previous studies. In this paper, our proposed model consists of two main components: 1) a vehicle detection and tracking component - this module is designed for creating reliable detection and tracking every specific object without doing calibration; 2) homography transformation regression network - this module has a function to solve occlusion issues and estimate vehicle speed accurately and efficiently. Experimental results on two datasets show that the proposed method outperforms the state-of-the-art methods by reducing the mean square error (MSE) metric from 14.02 to 6.56 based on deep learning approaches. We have announced our test code and model on GitHub with https://github.com/ervinyo/Speed-Estimation-Using-Homography-Transformation-and-Regression-Network.

Original languageEnglish
Pages (from-to)5955-5965
Number of pages11
JournalIEEE Access
Volume11
DOIs
StatePublished - 2023

Keywords

  • deep learning
  • homography transformation
  • Intelligent transportation system
  • speed estimation
  • vehicle detection and recognition
  • vehicle tracking

Fingerprint

Dive into the research topics of 'An Improved Speed Estimation Using Deep Homography Transformation Regression Network on Monocular Videos'. Together they form a unique fingerprint.

Cite this