Variational Autoencoder for 3D Voxel Compression

Juncheng Liu, Steven Mills, Brendan McCane

Year: 2020
Citations: 6

Abstract

3D scene sensing and understanding are fundamental tasks in computer vision and robotics. One widely used representation for 3D data is a voxel grid. However, an explicit representation of 3D voxels typically requires large storage space, which makes it unsuitable for lightweight applications such as robotic navigation and exploration. In this paper we propose a method to compress 3D voxel grids using an octree representation and Variational Autoencoders (VAEs). We first capture a 3D voxel grid, in our application with Realsense D435 and T265 cameras working together. The voxel grid is decomposed into three types of octants which are then compressed by the encoder and reconstructed by feeding the latent code into the decoder. We demonstrate the efficiency of our method through two applications: scene reconstruction and path planning.
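The octree decomposition mentioned in the abstract can be illustrated with a small sketch. The abstract does not say what the three octant types are, so this example assumes they are empty, fully occupied, and mixed, a common convention for octrees. The function names, the `leaf_size` parameter, and the nested-list grid are hypothetical choices for illustration, not the authors' implementation. The VAE itself is omitted; the mixed leaves are simply the blocks a learned encoder would be asked to compress.

```python
from itertools import product

# Assumed octant types; the abstract does not name them.
EMPTY, FULL, MIXED = "empty", "full", "mixed"


def classify(grid, x0, y0, z0, size):
    """Label a cubic block of a binary grid as empty, full, or mixed."""
    total = sum(
        grid[x][y][z]
        for x in range(x0, x0 + size)
        for y in range(y0, y0 + size)
        for z in range(z0, z0 + size)
    )
    if total == 0:
        return EMPTY
    if total == size ** 3:
        return FULL
    return MIXED


def decompose(grid, x0=0, y0=0, z0=0, size=None, leaf_size=2):
    """Recursively split a cubic binary grid into labelled octants.

    Assumes the grid side is a power of two and voxels are 0 or 1.
    Returns a list of (label, (x0, y0, z0), size) tuples. Mixed blocks
    are subdivided until they reach leaf_size.
    """
    if size is None:
        size = len(grid)
    label = classify(grid, x0, y0, z0, size)
    if label != MIXED or size <= leaf_size:
        return [(label, (x0, y0, z0), size)]
    half = size // 2
    leaves = []
    for dx, dy, dz in product((0, half), repeat=3):
        leaves.extend(
            decompose(grid, x0 + dx, y0 + dy, z0 + dz, half, leaf_size)
        )
    return leaves


# Usage: a 4x4x4 grid with one full 2x2x2 corner and one stray voxel.
n = 4
grid = [[[0] * n for _ in range(n)] for _ in range(n)]
for x, y, z in product(range(2), repeat=3):
    grid[x][y][z] = 1
grid[3][3][3] = 1

leaves = decompose(grid)
for leaf in leaves:
    print(leaf)
```

In this toy grid only one of the eight octants is mixed, so only that block would need a latent code. Empty and full octants can be stored as a single label each, which is where the storage saving of an octree representation comes from.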

Keywords

Voxel, Computer science, Octree, Autoencoder, Artificial intelligence, Computer vision, Grid, Representation (politics), Encoder, Pattern recognition (psychology)
