Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Stubbytree places files in a directory structure based on the file name, with the highest level directory being the HathiTrust source library ID code (i.e. volume ID prefix), and then using every third character of the cleaned volume ID, starting with the first, to create a sub-directory. For example the Extracted Features file for the volume with HathiTrust ID nyp.33433070251792 would be located at: 

...

  • Start with a root directory and HathiTrust volume ID (htid), such as nyp.33433070251792
  • Split the HathiTrust volume ID at the first period into a library ID (libid) and volume ID (volid), resulting in nyp and 33433070251792
  • Clean the volume ID, replacing colons, forward slashes and periods with + , / =, and = , respectively
  • Create a stub name by taking every third character of the cleaned volume ID, starting with the first (33433070251792 to 33759)

...