...
HTRC Extracted Features Dataset :https://sandbox.htrc.illinois.edu/HTRC-UI-Portal2/Features documentation and download.
Features are data attributes defined in such a way that they can be identified by a computer and analyzed at scale. The HTRC Feature Extraction alpha dataset has already processed the underlying text, identifying headers and footers, rejoining hyphenated words, and offering page-level details such as:
...
Questions? Please contact <htrc-support-l@list.indianaiu.edu>.