Abusing Heightfields as Fast Image Canvas
# The Problem
A big part of getting a machine learning model to run is preparing and importing the data in the right shape. Usually this shape will be some n-dimensional tensor.
When working with 2D image data, the files are usually read from disk. Most DL frameworks come with tons of tools and functions that convert all sorts of files to correctly shaped tensors. In Houdini, however, the data at hand is most likely not in a format (.jpg, .png, etc.) that a premade data loading solution can extract easily.
To extract the data more or less live, without writing anything to disk and reading it back in a separate step, it is more efficient to use a function that reads directly from Houdini geometry.
This brings us to the main issue: Where to store the data?
Unfortunately, storing image data on 3D geometry isn’t very efficient. Polygon grids also store connectivity information and other data, which makes them rather slow to work with, especially at higher resolutions.
I tested this while trying to do digit recognition in Houdini.
# The Solution
Heightfields or ‘2D-Volumes’ (weird name).
They have a grid-like topology and only store a single value per voxel instead of unnecessary connectivity information or other data.
Houdini also ships with a Python function that extracts voxel data quickly, which allows us to convert the 2D information into the necessary shape.
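As a rough sketch of that conversion step, the snippet below reshapes a flat list of voxel values into a 2D array and then adds batch and channel axes. The helper name `voxels_to_image` is hypothetical, and the hard-coded list is only a stand-in for real heightfield data. Inside Houdini the values could come from something like `hou.Volume.allVoxels()` and the size from `hou.Volume.resolution()`. The assumption here is that the voxels arrive with x varying fastest, which is worth checking against your own setup.

```python
import numpy as np


def voxels_to_image(flat_voxels, res_x, res_y):
    """Reshape a flat sequence of voxel values into a (res_y, res_x) array.

    Hypothetical helper. Assumes the values are ordered with x varying
    fastest, so each run of res_x values forms one row of the image.
    """
    image = np.asarray(flat_voxels, dtype=np.float32)
    if image.size != res_x * res_y:
        raise ValueError(
            "expected %d voxels, got %d" % (res_x * res_y, image.size)
        )
    return image.reshape(res_y, res_x)


# Stand-in for the values read from a 3 x 2 heightfield layer.
# In Houdini these might come from volume.allVoxels() (assumption).
flat = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5]

image = voxels_to_image(flat, res_x=3, res_y=2)

# Many frameworks expect (batch, channels, height, width), so add two axes.
batch = image[np.newaxis, np.newaxis, :, :]

print(image.shape, batch.shape)
```

Whether the extra axes go in front or at the end depends on the framework, so treat the `(batch, channels, height, width)` layout as one possible choice.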
See “Training a Neural Net to Understand my Drawings” to find out more about the applications of this technique.