The format of our multi-view dataset is derived from VolSDF.
scan<scan_id>/
cameras.npz
image/ -> {:04d}.png # tone-mapped LDR images
depth/ -> {:04d}.exr
normal/ -> {:04d}.exr
mask/ -> {:04d}.png
val/ -> {:04d}.png # validation images (LDR)
hdr/ -> {:04d}.exr # raw HDR images
# the following are optional
light_mask/ -> {:04d}.png # emitter mask images
material/ -> {:04d}_kd.exr, {:04d}_ks.exr, {:04d}_rough.exr # diffuse albedo, specular albedo, and roughness
Zero-valued areas in the depth maps and normal maps indicate invalid regions such as windows.
Note that not all areas in the scenes use the GGX material model; we will provide a mask of these invalid areas.
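For example, a per-pixel validity mask can be built from these zero-valued pixels (an illustrative sketch; it assumes depth has already been loaded as an (H, W) float32 array and normal as an (H, W, 3) float32 array, see the EXR loading snippet below):
import numpy as np
# Pixels with zero depth or a zero normal vector are invalid (e.g. windows)
# and should be excluded from supervision.
valid = (depth > 0) & (np.linalg.norm(normal, axis=-1) > 0)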
The cameras.npz contains each image's camera projection matrix 'world_mat_{i}' and a normalization matrix 'scale_mat_{i}', the same as VolSDF. In addition, we provide a validation set of images for novel view synthesis, whose camera projection matrices are stored as 'val_mat_{i}'. The validation and training sets share the same normalization matrices.
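Each projection matrix can be decomposed into intrinsics and a camera-to-world pose in the usual IDR/VolSDF way, for example (a minimal sketch; the scan directory name and frame index are placeholders):
import numpy as np
import cv2

cameras = np.load('scan1/cameras.npz')  # 'scan1' is a placeholder scan directory
i = 0                                   # frame index
P = (cameras[f'world_mat_{i}'] @ cameras[f'scale_mat_{i}'])[:3, :4]
K, R, t = cv2.decomposeProjectionMatrix(P)[:3]
K = K / K[2, 2]                         # normalize intrinsics so that K[2, 2] == 1
pose = np.eye(4)
pose[:3, :3] = R.T                      # camera-to-world rotation
pose[:3, 3] = (t[:3] / t[3]).ravel()    # camera center in world coordinates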
If the normalization matrices are not included in cameras.npz, you can run data/normalize_cameras.py manually to generate cameras_normalize.npz. Since our method requires the entire scene to lie within a bounding sphere of radius 3, we suggest normalizing the cameras to a radius of 2.0 or 2.5.
An example of running normalize_cameras.py:
python normalize_cameras.py --id <scan_id> -n <synthetic/real/...> -r 2.0
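Conceptually, normalization finds a scale_mat that places the scene inside a sphere of the given radius; the sketch below illustrates the idea using the camera centers and is not the actual implementation of normalize_cameras.py:
import numpy as np
import cv2

def compute_scale_mat(world_mats, radius=2.0):
    # Recover camera centers from the 3x4 projection matrices.
    centers = []
    for P in world_mats:
        t = cv2.decomposeProjectionMatrix(P[:3, :4])[2]
        centers.append((t[:3] / t[3]).ravel())
    centers = np.stack(centers)
    # Translate to the centroid and scale so all centers lie within `radius`.
    offset = centers.mean(axis=0)
    scale = np.linalg.norm(centers - offset, axis=1).max() / radius
    scale_mat = np.eye(4)
    scale_mat[:3, :3] *= scale
    scale_mat[:3, 3] = offset
    return scale_mat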
Note that we follow the OpenCV camera coordinate system (X right, Y down, Z into the image plane).
If you want to convert the dataset to the NeRF Blender format, run npz_to_blender.py:
python npz_to_blender.py --root /path/to/dataset
The script will automatically scale all pose matrices to fit within a [-1, 1] bounding box.
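The underlying coordinate change is the standard OpenCV-to-OpenGL/Blender flip of the camera Y and Z axes, roughly as follows (a sketch only; c2w_opencv is an assumed 4x4 camera-to-world matrix, and the actual script additionally applies the [-1, 1] scaling mentioned above):
import numpy as np

# OpenCV camera: X right, Y down, Z forward.
# Blender/NeRF camera: X right, Y up, Z backward -> flip the Y and Z axes.
c2w_blender = c2w_opencv @ np.diag([1.0, -1.0, -1.0, 1.0])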
Our real dataset comes from Inria and Scalable-NISR, with depth estimated by MVS tools and manually labeled light masks (for the 2 living-room scenes). All depths have an absolute scale, so no shifting is needed as in MonoSDF. All camera calibrations and depths were provided by the authors of Scalable-NISR; we thank them for providing the datasets.
Normals are not provided in the real dataset; we find that plausible reconstruction is achievable without normal supervision in these scenes. Of course, you can estimate normals with any method if you want to enable normal supervision.
We suggest using OpenCV to load an .exr format float32 image:
import os
os.environ['OPENCV_IO_ENABLE_OPENEXR'] = '1' # Enable OpenCV support for EXR
import cv2
...
im = cv2.imread(im_path, -1)  # im is a numpy.float32 array of shape (H, W, C)
im = cv2.cvtColor(im, cv2.COLOR_BGR2RGB)  # OpenCV loads images in BGR channel order; convert to RGB
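For a quick programmatic preview of the raw HDR frames, a simple gamma curve can be applied after loading (an illustrative sketch continuing from the snippet above; the provided LDR images in image/ may use a different tone-mapping operator, and hdr_path is a placeholder for an hdr/*.exr file):
import numpy as np

hdr = cv2.imread(hdr_path, -1)
hdr = cv2.cvtColor(hdr, cv2.COLOR_BGR2RGB)
ldr = np.clip(hdr, 0.0, 1.0) ** (1.0 / 2.2)   # clamp and gamma-correct
ldr8 = (ldr * 255.0 + 0.5).astype(np.uint8)   # 8-bit image for display or saving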
We suggest using tev to conveniently preview HDR .exr images.
Due to copyright issues, we cannot release the original 3D meshes of our synthetic scenes. Instead, we will provide a point cloud sampled from the GT mesh to enable 3D reconstruction evaluations.
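With the provided GT point cloud, reconstruction quality can be measured with, for example, a symmetric Chamfer distance (a minimal sketch; the exact evaluation protocol, point counts, and distance units may differ):
import numpy as np
from scipy.spatial import cKDTree

def chamfer_distance(pred_pts, gt_pts):
    # pred_pts: (N, 3) points sampled from the reconstructed surface.
    # gt_pts:   (M, 3) points from the provided GT point cloud.
    d_pred, _ = cKDTree(gt_pts).query(pred_pts)
    d_gt, _ = cKDTree(pred_pts).query(gt_pts)
    return d_pred.mean() + d_gt.mean()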