Scrolls


8µm 3D X-ray scans of two intact scrolls (top halves only), scanned in horizontal slices from bottom to top. Each half scroll scan is 14,000 .tif files, 120MB each. Each slice is 8µm tall, so this scroll half is 11.2cm tall. Both were scanned at 54keV, though we also released a smaller slice of Scroll 2 at 88keV. These are the scans you need to read to win the Grand Prize.
The two scrolls are from the Institut de France and were scanned at about 8µm resolution at the Diamond Light Source particle accelerator. We are only releasing data from the top halves of these scrolls (they were standing on end when they were scanned). We will work with you to apply your techniques to the bottom halves in order to validate your submission.
Data format
The scroll data is published on the data server on in the .volpkg
format, which is the data format used by Volume Cartographer (learn more in Tutorial 3).
A .volpkg
needs to have the following files and directories, otherwise Volume Cartographer will crash. You can, however, add more files or directories.
*.volpkg/
/config.json
: Contains some metadata, such as the name and the thickness of the papyrus./volumes/
: The actual CT scans./<id>/meta.json
: Contains metadata about the image stack./<id>/<number>.tif
: The actual image stack itself.
/renders/
: Where rendering data will be stored. May be empty./paths/
: This is where segments are stored (see next chapter: Segments). May be empty.
A typical .tif file from the scrolls look like this, giving a top-down view:

Data server
For the full scrolls specifically, you’ll find the following files on the data server:
/full-scrolls/
/Scroll{1,2}.volpkg/
/volumes/
: Both scrolls have a main volume at 54keV. Scroll 2 also has a much smaller volume (“central slab”) taken at 88keV. Both have a 8µm pixel size. Scroll 2 has a scanning artifact in the middle of the scroll./working/<name>.mp4
: Video montage of the slices.
At 8µm resolution, the data files are big. We believe this resolution is necessary to detect ink, as suggested by the excellent paper From invisibility to readability: Recovering the ink of Herculaneum. Each .tif file in the full scroll sans is 8 micrometers tall so if you want to grab a centimeter of them from the middle of the scroll, you can just download 1,250 of them from the middle of the scan. We provide scripts for doing this once you've registered.
This is the video montage of Scroll 1:
Community-contributed data
/volumes_small/
: 10x smaller “thumbnails” of the slices./<volume_id>/{*.tif,meta.json}
: Image stack of the 10x smaller volume, e.g. for use in Volume Cartographer. Generated usingbuild_small
./<volume_id>_small.tif
: The same data, but in a single tif file with multiple layers. Generated usingbuild_small_volume
./<volume_id>_volumes_small_axis{1,2}/*.png
: The same data but across the two orthogonal axes, and as pngs instead of tifs. Generated usingperpSlices.py
/volumes_masked/
: Same as/volumes/
, but with non-scroll data blacked out, and the files compressed. About 2x smaller than the original volumes, though may take longer to load in software due to the compression. See this repo for more details (and this thread for details about the compression vs performance tradeoff)./volume_grids/<volume_id>/cell_yxz_*.tif
: The volume split into “cells” of 500x500x500 voxels. Generated usingbuild_grid_layer
.
One community member, Ahron Wayne (@WayneWayneHello on Discord), has bought a benchtop CT scanner and has been making various scans with them, with the goal of making a simulated scroll (a “campfire scroll”) that can be used as ground truth data. His scans can be found in /waynewaynehello-uploads/
.
