Skip to main content

The Data

To download: Fill out the registration form and then visit the data server (LICENSE).

To learn more about the data, see the linked pages below. Also be sure to check out:

Scrolls

Micro-CT scans of intact Herculaneum scrolls. The mission is to virtually unwrap the contents of the scrolls from the CT scans, revealing the text hidden within. Scroll 1 was used to win the 2023 Grand Prize, but 95% of the scroll remains unread!

More information

Scroll 1 (PHerc. Paris. 4)
Scroll 2 (PHerc. Paris. 3)
Scroll 3 (PHerc. 332)
Scroll 4 (PHerc. 1667)

Fragments

Micro-CT scans of detached scroll fragments. Since the fragments have exposed text on their surfaces, they can be used as ground truth for machine learning-based ink detection approaches (see Tutorial 5: Ink Detection).

More information

Fragment 1 (PHerc. Paris. 2 Fr 47)
Fragment 2 (PHerc. Paris. 2 Fr 143)
Fragment 3 (PHerc. Paris. 1 Fr 34)
Fragment 4 (PHerc. Paris. 1 Fr 39)
Fragment 5 (PHerc. 1667 Cr 1 Fr 3)
Fragment 6 (PHerc. 51 Cr 4 Fr 48)

Segments

Segmentation is the mapping of sheets of papyrus in a 3D X-ray volume. The resulting surface volumes can be used directly to look for ink.

More information

Some segments from Scroll 1.