Treffer: Proposal-free framework for object instance segmentation using deep learning

Title:
Proposal-free framework for object instance segmentation using deep learning
Contributors:
Wang, Kun (author.), Wang, Xiaogang , active 2003 (thesis advisor.), Chinese University of Hong Kong Graduate School. Division of Electronic Engineering. (degree granting institution.)
Publication Year:
2018
Collection:
The Chinese University of Hong Kong: CUHK Digital Repository / 香港中文大學數碼典藏
Document Type:
Fachzeitschrift text
File Description:
electronic resource; remote; 1 online resource (xvi, 68 leaves) : illustrations (some color); computer; online resource
Language:
English
Chinese
Relation:
cuhk:2188673; local: ETD920200755; local: 991039750405303407
Rights:
Use of this resource is governed by the terms and conditions of the Creative Commons "Attribution-NonCommercial-NoDerivatives 4.0 International" License (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Accession Number:
edsbas.33A55C11
Database:
BASE

Weitere Informationen

M.Phil. ; Object detection, which deals with finding instances of semantic objects of predefined classes (e.g., humans, cars, or dogs) in digital images and videos, is one of the fundamental tasks in the field of computer vision. In the research community, object detection is the base for many other tasks, such as 3D detection, human identification, relationship,and etc, Also, it is the key element and has been deployed in real-world applications including smartphone apps, autonomous driving cars, and intelligent surveillance cameras. ; The task definition of object detection is simple. it is nothing but just to locate objects and tell the categories. However, it is quite challenging. Due to large variations in viewpoints, poses, occlusions, lighting conditions and background, object detection is challenging. The visual cues from multiple support regions of different sizes and resolutions are complementary in classifying a candidate box in object detection. In natural digital images, there exists a complex structure between objects, e.g, it is very likely that the water bottles should be on tables. But how to model this relationship is still an open question. Recent years, several deep learning based detection frame works are proposed. Thanks to the powerful representations, we achieve a significant improvement. However, most of them are quite time-consuming. How to make existing detectors more efficient, or even propose a new one is vital. ; In this dissertation. we address these challenges from four aspects to make object detection scalable to real-world data and applications: ; •First, how to learn better feature representations and classifiers in object detection. Feature matters. We propose to take features of different resolutions and support regions and pass messages to each other to validate their existence through the bi-directional structure. And the gate function is used for controlling the message passing rate among these features. ; •Second, we introduce graphic model. Graphic model is good at ...