Treffer: Semantic segmentation by deep convolutional network

Title:
Semantic segmentation by deep convolutional network
Contributors:
Li, Xiaoxiao (author.), Loy, Chen Change (thesis advisor.), Chinese University of Hong Kong Graduate School. Division of Information Engineering. (degree granting institution.)
Publication Year:
2018
Collection:
The Chinese University of Hong Kong: CUHK Digital Repository / 香港中文大學數碼典藏
Document Type:
Fachzeitschrift text
File Description:
electronic resource; remote; 1 online resource (xvii, 115 leaves) : illustrations (chiefly color); computer; online resource
Language:
English
Chinese
Relation:
cuhk:2187998; oclc: 1244235443; local: ETD920200146; local: AAI13837849; local: 991039750397903407
Rights:
Use of this resource is governed by the terms and conditions of the Creative Commons "Attribution-NonCommercial-NoDerivatives 4.0 International" License (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Accession Number:
edsbas.69CBFF7C
Database:
BASE

Weitere Informationen

Ph.D. ; Object segmentation is a fundamental and long-standing problem in computer vision. It is defined as a multi-label classification problem, aiming to part ition different objects into different segments. According to the different types of inputs and goals, this task can be further divided into sub-tasks, i.e., semantic image segmentation , video object segmentation and so on . This thesis mainly focuses on addressing the segmentation t ask by using deep learning techniques, include increasing accuracy and accelerating the algorithm. By studying two typical segmentation t asks, we propose three novel approaches to address the common challenges in object segmentation. ; To modeling contextual information in natural images, Markov Random Field (MRF) is widely used in t he semantic segmentation task. On the one hand, convolut ional Neural Network(CNN) has shown its pot ential for learning strong features and greatly advanced the field of segmentation. From these, we propose a novel framework to combine CNN and MRF to achieve the benefit of both, namely Deep Parsing Network (DPN). Specifically, DPN extends a contemporary CNN architecture to model unary terms and additional layers are carefully devised to approximate the mean field algorithm (MF) for pairwise terms DPN is able to achieve high performance by approximating one iteration of MF and makes MF easier to be parallelized and speeded up in GPU. Our method achieves state-of-the-art accuracy 77.5% on the PASCAL VOC 2012 dataset. ; Video segmentation is also a valuable topic and more applicable in the real-world application. However, conventional video segmentation methods oft en rely on temporal cont inuity to propagate masks and suffer from issues like drift ing and inability to handle occlusions and large displacement. We formulate an effective mechanism to prevent the target from being lost via adaptive object re-identification. Specifically, we proposed a novel DyeNet which joint the mask propagation module and reidentification module into a single ...