Real-time object detection for autonomous driving-based on deep learning

Liu, Guangrui

dc.contributor.advisor	Rahnemoonfar, Maryam
dc.contributor.author	Liu, Guangrui
dc.contributor.committeeMember	Li, Longzhuang
dc.contributor.committeeMember	Belkhouche, Mohammed
dc.date.accessioned	2017-11-02T21:46:37Z
dc.date.available	2017-11-02T21:46:37Z
dc.date.issued	2017-05
dc.description	A thesis Submitted in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE in COMPUTER SCIENCE from Texas A&M University-Corpus Christi in Corpus Christi, Texas.	en_US
dc.description.abstract	Optical vision is an essential component for autonomous cars. Accurate detection of vehicles, street buildings, pedestrians, and road signs could help self-driving cars drive as safely as humans. However, object detection has been a challenging task for decades, since images of objects in the real-world environment are affected by illumination, rotation, scale, and occlusion. In recent years, many Convolutional Neural Network (CNN) based classification-after-localization methods have improved detection results in various conditions. However, the slow recognition speed of these two-stage methods limits their usage in real-time situations. Recently, a unified object detection model, You Only Look Once (YOLO) [20], was proposed, which could directly regress from the input image to object class scores and positions. Its single network structure processes images at 45 fps on the PASCAL VOC 2007 dataset [7] and has higher detection accuracy than other current real-time methods. However, when applied to auto-driving object detection tasks, this model still has limitations. It processes images individually despite the fact that an object's position changes continuously in the driving scene. Thus, the model ignores a lot of important information between consecutive frames. In this research, we applied YOLO to three different datasets to test its general applicability. We fully analyzed its performance from various aspects on the KITTI dataset [10], which is specialized for autonomous driving. We proposed a novel technique called memory map, which considers inter-frame information, to strengthen YOLO's detection ability in driving scenes. We broadened the model's applicability scope by applying it to a new orientation estimation task. KITTI is our main dataset. Additionally, the ImageNet [5] dataset was used for pre-training, and three other datasets, Pascal VOC 2007/2012 [7], Road Sign [2], and Face Detection Dataset and Benchmark (FDDB) [15], were used for other class domains.	en_US
dc.description.college	College of Science and Engineering	en_US
dc.description.department	Computing Sciences	en_US
dc.format.extent	92 pages.	en_US
dc.identifier.uri	http://hdl.handle.net/1969.6/5637
dc.language.iso	en_US	en_US
dc.rights	This material is made available for use in research, teaching, and private study, pursuant to U.S. Copyright law. The user assumes full responsibility for any use of the materials, including but not limited to, infringement of copyright and publication rights of reproduced materials. Any materials used should be fully credited with its source. All rights are reserved and retained regardless of current or future development or laws that may apply to fair use standards. Permission for publication of this material, in part or in full, must be secured with the author and/or publisher.	en_US
dc.rights.holder	Liu, Guangrui
dc.subject	auto-driving	en_US
dc.subject	computer vision	en_US
dc.subject	convolutional neural network	en_US
dc.subject	deep learning	en_US
dc.subject	object detection	en_US
dc.subject	orientation estimation	en_US
dc.title	Real-time object detection for autonomous driving-based on deep learning	en_US
dc.type	Text	en_US
dc.type.genre	Thesis	en_US
thesis.degree.discipline	Computer Science	en_US
thesis.degree.grantor	Texas A & M University--Corpus Christi	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US

Files

Original bundle

Name: Liu, Guangrui thesis.pdf
Size: 2.18 MB
Format: Adobe Portable Document Format

Collections

Theses
College of Engineering Theses and Dissertations