This folder contains the digitally born CEB documents. 

"selected_pages.txt" describes which pages of the documents are selected to construct the dataset. 
In this file, each line begins with a file name, followed by a list of pages which are selected.

"number_of_formulas.csv" describes the number of formulas in each page.
