Real-time object detection for autonomous driving-based on deep learning

dc.contributor.advisorRahnemoonfar, Maryam
dc.contributor.authorLiu, Guangrui
dc.contributor.committeeMemberLi, Longzhuang
dc.contributor.committeeMemberBelkhouche, Mohammed
dc.date.accessioned2017-11-02T21:46:37Z
dc.date.available2017-11-02T21:46:37Z
dc.date.issued2017-05
dc.descriptionA thesis Submitted in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE in COMPUTER SCIENCE from Texas A&M University-Corpus Christi in Corpus Christi, Texas.en_US
dc.description.abstractOptical vision is an essential component for autonomouscars. Accurate detection of vehicles, street buildings, pedestrians and road signs could assist self-driving cars the drive as safely as humans. However, object detection has been a challenging task for decades since images of objects in the real-world environment are affected by illumination, rotation, scale, and occlusion. In recent years, many Convolutional Neural Network (CNN) based classification-after-localization methods have improved detection results in various conditions. However, the slow recognition speed of these two-stage methods limits their usage in real-time situations. Recently, a unified object detection model, You Only Look Once (YOLO) [20], was proposed, which could directly regress from input image to object class scores and positions. Its single network structure processes images at 45 fps on PASCAL VOC 2007 dataset [7] and has higher detection accuracy than other current real-time methods. However, when applied to auto-driving object detection tasks, this model still has limitations. It processes images individually despite the fact that an object's position changes continuously in the driving scene. Thus, the model ignores alot of important information between continuous frames. In this research, we applied YOLO to three different datasets to test its general applicability. We fully analyzed its performance from various aspects on KITTI dataset [10] which is specialized for autonomous driving. We proposed a novel technique called memory map, which considers inter-frame information, to strengthen YOLO's detection ability in driving scene. We broadened the model's applicability scope by applying it to a new orientation estimation task. KITTI is our main dataset. Additionally, ImageNet [5] dataset is used for pre-training, and three other datasets. And Pascal VOC 2007/2012 [7], Road Sign [2], and Face Detection Dataset and Benchmark (FDDB) [15] were used for other class domains.en_US
dc.description.collegeCollege of Science and Engineeringen_US
dc.description.departmentComputing Sciencesen_US
dc.format.extent92 pages.en_US
dc.identifier.urihttp://hdl.handle.net/1969.6/5637
dc.language.isoen_USen_US
dc.rightsThis material is made available for use in research, teaching, and private study, pursuant to U.S. Copyright law. The user assumes full responsibility for any use of the materials, including but not limited to, infringement of copyright and publication rights of reproduced materials. Any materials used should be fully credited with its source. All rights are reserved and retained regardless of current or future development or laws that may apply to fair use standards. Permission for publication of this material, in part or in full, must be secured with the author and/or publisher.en_US
dc.rights.holderLiu, Guangrui
dc.subjectauto-drivingen_US
dc.subjectcomputer visionen_US
dc.subjectconvolutional neural networken_US
dc.subjectdeep learningen_US
dc.subjectobject detectionen_US
dc.subjectorientation estimationen_US
dc.titleReal-time object detection for autonomous driving-based on deep learningen_US
dc.typeTexten_US
dc.type.genreThesisen_US
thesis.degree.disciplineComputer Scienceen_US
thesis.degree.grantorTexas A & M University--Corpus Christien_US
thesis.degree.levelMastersen_US
thesis.degree.nameMaster of Scienceen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Liu, Guangrui thesis.pdf
Size:
2.18 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.72 KB
Format:
Item-specific license agreed upon to submission
Description: