Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

1) use case examples for the HTRC feature extraction functionality, available in a Google doc here https://docs.google.com/document/d/14-be-4VBNeVPZsFO-e7UWephgf71LfYvhlr9qssEfTg/edit  (document prepared by Sayan Bhattacharyya)
2) challenges presented to HTRC text mining, or more broadly, historical text: scalability, curse of dimensionality, plus OCR errors. 

...