Seeing Glass: Joint Point Cloud and Depth Completion for Transparent Objects

Haoping Xu, Yi Ru Wang, Sagi Eppel, Alàn Aspuru-Guzik, Florian Shkurti, Animesh Garg

Research output: Contribution to journalConference articlepeer-review

16 Scopus citations

Abstract

The basis of many object manipulation algorithms is RGB-D input. Yet, commodity RGB-D sensors can only provide distorted depth maps for a wide range of transparent objects due light refraction and absorption. To tackle the perception challenges posed by transparent objects, we propose TranspareNet, a joint point cloud and depth completion method, with the ability to complete the depth of transparent objects in cluttered and complex scenes, even with partially filled fluid contents within the vessels. To address the shortcomings of existing transparent object data collection schemes in literature, we also propose an automated dataset creation workflow that consists of robot-controlled image collection and vision-based automatic annotation. Through this automated workflow, we created Toronto Transparent Objects Depth Dataset (TODD), which consists of nearly 15000 RGB-D images. Our experimental evaluation demonstrates that TranspareNet outperforms existing state-of-the-art depth completion methods on multiple datasets, including ClearGrasp, and that it also handles cluttered scenes when trained on TODD. Code and dataset will be released at https://www.pair.toronto.edu/TranspareNet/.

Original languageEnglish
Pages (from-to)827-838
Number of pages12
JournalProceedings of Machine Learning Research
Volume164
StatePublished - 1 Jan 2021
Externally publishedYes
Event5th Conference on Robot Learning, CoRL 2021 - London, United Kingdom
Duration: 8 Nov 202111 Nov 2021

Keywords

  • 3D Perception
  • Dataset
  • Depth Completion
  • Transparent Objects

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software
  • Control and Systems Engineering
  • Statistics and Probability

Fingerprint

Dive into the research topics of 'Seeing Glass: Joint Point Cloud and Depth Completion for Transparent Objects'. Together they form a unique fingerprint.

Cite this