Abstract
This work proposes a semi-supervised online video object segmentation algorithm that accepts user annotations of a target object in the first frame. We propagate the segmentation labels from the previous frame to the current frame using optical flow vectors. Because this propagation is error-prone, we develop the convolutional trident network (CTN), which has three decoding branches: separative, definite foreground, and definite background decoders. We then perform Markov random field optimization based on the outputs of the three decoders. These steps are carried out sequentially from the second frame to the last to extract a segment track of the target object. Experimental results demonstrate that the proposed algorithm significantly outperforms the state-of-the-art conventional algorithms on the DAVIS benchmark dataset.
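The label-propagation step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the flow is estimated or how labels are warped, so everything here is an assumption made for illustration: the function name `propagate_mask`, the `backward_flow` layout (one `(dx, dy)` vector per current-frame pixel, pointing to its match in the previous frame), and the nearest-neighbour lookup. It is not the authors' implementation, and it leaves out the CTN and Markov random field stages entirely.

```python
import numpy as np

def propagate_mask(prev_mask, backward_flow):
    """Warp a label mask from the previous frame into the current frame.

    prev_mask:     (H, W) array of labels for the previous frame.
    backward_flow: (H, W, 2) array; backward_flow[y, x] = (dx, dy) is the
                   assumed displacement from current-frame pixel (x, y) to
                   its matching location in the previous frame.
    """
    h, w = prev_mask.shape
    ys, xs = np.mgrid[0:h, 0:w]
    # Nearest-neighbour lookup; coordinates are clamped to the image border.
    src_x = np.clip(np.rint(xs + backward_flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + backward_flow[..., 1]).astype(int), 0, h - 1)
    return prev_mask[src_y, src_x]

# Toy example: a single foreground pixel that moves one pixel to the right.
prev = np.zeros((5, 5), dtype=np.uint8)
prev[2, 1] = 1
flow = np.zeros((5, 5, 2))
flow[..., 0] = -1.0  # each current pixel looks one pixel to its left
cur = propagate_mask(prev, flow)
```

With real flow estimates, occlusions and flow errors make a warped mask like this unreliable near object boundaries, which is the kind of error the abstract says the later stages are meant to address.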
Original language | English |
---|---|
Title of host publication | Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 7474-7483 |
Number of pages | 10 |
ISBN (Electronic) | 9781538604571 |
Publication status | Published - 2017 Nov 6 |
Event | 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 - Honolulu, United States; Duration: 2017 Jul 21 → 2017 Jul 26 |
Publication series
Name | Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 |
---|---|
Volume | 2017-January |
Other
Other | 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017 |
---|---|
Country/Territory | United States |
City | Honolulu |
Period | 17/7/21 → 17/7/26 |
Bibliographical note
Publisher Copyright: © 2017 IEEE.
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition