
Tutorial 2: Representation

The micro-CT scan converts the scroll into a volumetric grid of density measurements: a mathematical description in which each small adjacent cube (voxel) has integer coordinates and stores the average material density inside it.

Physical scroll to voxel representation
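As a rough illustration, the sketch below loads such a slice stack into a 3D array using tifffile and NumPy. The directory name and file layout are assumptions for the example, not the actual dataset structure.

```python
# A minimal sketch of loading a micro-CT slice stack into a 3D voxel grid.
# The directory and filenames here are hypothetical.
import glob

import numpy as np
import tifffile

# Each .tif file is one horizontal slice; stacking them gives a (z, y, x) volume.
slice_paths = sorted(glob.glob("scan_slices/*.tif"))
volume = np.stack([tifffile.imread(p) for p in slice_paths], axis=0)

print(volume.shape)           # e.g. (number_of_slices, height, width)
print(volume.dtype)           # typically 16-bit integer density values
print(volume[100, 250, 300])  # density at integer voxel coordinate (z=100, y=250, x=300)
```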

While the raw scan volume represents the maximum amount of information available, this dataset is huge and the written surface is difficult to locate throughout the scroll. Mathematical calculations can be used to manipulate the digital dataset into different representations that are easier to interpret.

Besides the raw scan volume, three types of representations have been explored for segmentation, using both traditional methods and machine learning:

  • Traditional voxel processing. Thresholding, edge detection, noise reduction, colourization, etc.
  • Point clouds. Assigning points in a new coordinate system to represent papyrus surfaces via traditional edge/surface detection.
  • Semantic masks. Set background voxels = 0, papyrus voxels = 1 via machine learning, thresholding, noise reduction, morphological operations.

There have been varying levels of success using each of these types of representations.

Traditional voxel processing

Input: 3D scan volume (voxels in .tif “image stack”)
Output: new 3D volume (voxels in .tif “image stack”)

Traditional image processing techniques have mostly been intended to aid manual segmentation, either to make the papyrus easier to interpret or to help segmentation algorithms follow the surface better. Popular techniques include thresholding, noise reduction, various colourization schemes, edge detection, and skeletonization.

Voxel to voxel representations using traditional techniques.
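The sketch below shows what slice-level processing of this kind can look like with scikit-image. The filename and parameter values are illustrative assumptions, not the pipeline of any particular tool.

```python
# A minimal sketch of slice-by-slice voxel processing with scikit-image.
import numpy as np
import tifffile
from skimage import filters

slice_img = tifffile.imread("scan_slices/02500.tif").astype(np.float32)  # hypothetical slice

# Noise reduction: smooth high-frequency scanner noise.
denoised = filters.gaussian(slice_img, sigma=1.0)

# Thresholding: separate dense papyrus from surrounding air.
threshold = filters.threshold_otsu(denoised)
papyrus_mask = denoised > threshold

# Edge detection: highlight sheet boundaries for inspection.
edges = filters.sobel(denoised)

tifffile.imwrite("slice_mask.tif", papyrus_mask.astype(np.uint8) * 255)
tifffile.imwrite("slice_edges.tif", (edges / edges.max() * 255).astype(np.uint8))
```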

Most of these techniques have not been successful at improving segmentation.

Point clouds

Input: 3D scan volume (voxels in .tif “image stack”)
Output: point cloud (.ply or .obj)

The goal of point clouds is to extract a limited but useful subset of information from the raw scan volume. In the point cloud representation, calculations select voxels that describe the surface of a sheet.

Transforming voxels into a point cloud representation.

The fixed grid and density values are discarded; traditional edge/surface detection instead generates infinitesimal points characterized by real-valued 3D coordinates. If needed, the discarded information can be reattached to the points to revert to the voxel representation.
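As a hedged sketch of this idea, the code below marks voxels with strong density gradients as surface points, converts their integer indices to real-valued coordinates, and writes them out as an ASCII .ply file. The input filename, voxel size, and the crude gradient threshold are all assumptions; real pipelines use more careful surface detection.

```python
# A minimal sketch of turning detected surface voxels into a point cloud.
import numpy as np
import tifffile
from skimage import filters

volume = tifffile.imread("subvolume.tif").astype(np.float32)  # hypothetical small (z, y, x) crop

# Crude surface detection: strong density gradients mark sheet boundaries.
edges = filters.sobel(volume)
surface = edges > np.percentile(edges, 99)

# Integer voxel indices -> real-valued coordinates (voxel size in micrometers, assumed).
voxel_size_um = 7.91
zyx = np.argwhere(surface)
points = zyx[:, ::-1] * voxel_size_um  # reorder to (x, y, z) and scale

# Write an ASCII .ply file by hand to avoid extra dependencies.
with open("surface_points.ply", "w") as f:
    f.write("ply\nformat ascii 1.0\n")
    f.write(f"element vertex {len(points)}\n")
    f.write("property float x\nproperty float y\nproperty float z\n")
    f.write("end_header\n")
    for x, y, z in points:
        f.write(f"{x:.2f} {y:.2f} {z:.2f}\n")
```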

Semantic mask

Input: 3D scan volume (voxels in .tif “image stack”)
Output: binary 3D volume (voxels in .tif “image stack”)

Volumetric segmentation involves mapping the exact 3D shape of the papyrus sheets: labelling which voxels are papyrus, and distinguishing voxels that belong to the same papyrus wrap from those in adjacent wraps. The semantic mask assists in this process by labelling papyrus vs. air.

Binarizing papyrus vs. non-papyrus voxels

The semantic mask is generated by combining machine learning with thresholding, noise reduction, and morphological operations. It can also be computed on smaller sections of the scan rather than the entire scroll.
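The sketch below illustrates the classical part of such a pipeline on a small subvolume, with a simple Otsu threshold standing in for the machine-learning step. Filenames and parameter values are assumptions.

```python
# A minimal sketch of building a binary semantic mask with classical operations
# (thresholding, noise reduction, morphology); the ML component is replaced by
# a global Otsu threshold purely for illustration.
import numpy as np
import tifffile
from scipy import ndimage
from skimage import filters, morphology

volume = tifffile.imread("subvolume.tif").astype(np.float32)  # hypothetical crop

# Noise reduction, then a global threshold as a crude papyrus/air classifier.
smoothed = ndimage.gaussian_filter(volume, sigma=1.0)
mask = smoothed > filters.threshold_otsu(smoothed)

# Morphological clean-up: close small holes and drop isolated speckles.
mask = morphology.binary_closing(mask, morphology.ball(2))
mask = morphology.remove_small_objects(mask, min_size=500)

# Background voxels = 0, papyrus voxels = 1, stored as another .tif stack.
tifffile.imwrite("semantic_mask.tif", mask.astype(np.uint8))
```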

Segmentation

So what comes next? The goal of the representation step was to make segmentation easier. In our next tutorial on “Segmentation”, we look at the different approaches used to map and extract the written surface of the scroll.