OCID-Ref|计算机视觉数据集|物体识别数据集
收藏OCID-Ref: A 3D Robotic Dataset with Embodied Language for Clutter Scene Grounding
数据集概述
OCID-Ref 是一个包含 305,694 个引用表达式的新型数据集,源自 2,300 个场景,提供 RGB 图像和点云输入。该数据集专注于引用表达式分割任务,特别针对被遮挡物体的视觉定位。
数据集内容
- 引用表达式数量:305,694
- 场景数量:2,300
- 数据类型:RGB 图像和点云
数据集下载
使用说明
详细的使用说明请参考 instruction.txt。
引用
@inproceedings{wang-etal-2021-ocid, title = "{OCID}-Ref: A 3{D} Robotic Dataset With Embodied Language For Clutter Scene Grounding", author = "Wang, Ke-Jyun and Liu, Yun-Hsuan and Su, Hung-Ting and Wang, Jen-Wei and Wang, Yu-Siang and Hsu, Winston and Chen, Wen-Chin", booktitle = "Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies", month = jun, year = "2021", address = "Online", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/2021.naacl-main.419", doi = "10.18653/v1/2021.naacl-main.419", pages = "5333--5338" }
许可证
该数据集遵循 MIT 许可证(详细信息见 LICENSE)。