Paper Project Page Code

Step 1: 3D Scanning. Follow the to record a 3D scan (.ply). Make sure the z-axis is up or choose the correct up-axis. The .ply should be less than 100 MB, include color and vertex normals. The scene should be metric scale, i.e., the units are in meters. Our model is trained on indoor scenes (appartments, hotels, offices), this is where it will perform best. The performance will likely be reduced on other scenes, for example outdoor scenes.

Instructions Here we provide instructions for recording 3D scans using the "3D Scanner App" for iPhone or iPad with a Lidar sensor (3D meshes recorded with other devices are also welcome). For privacy reasons, please do not record people, remove any personal objects from the scene, and only record on locations where you have permission to do so.

Step 2: 3D Instance Segmentation. Upload your 3D scene (.ply) below to obtain the semantic instance segmentation.

Step 3: Results. For the processed scans, we provide:
- Visualizations
- Predictions (ScanNet FileFormat).

Selected Examples

Selected Failure Cases

Mask3D is currently limited in its ability to identify object classes that were not encountered during training.
Examples of such classes include futons (Japanese beds).