On this job result page, I can use the tabs to see the outputs of the topic modeling algorithm. The algorithm presents the top words in my workset as a tag cloud (tagcloudcleantokencounts.html) and as a list (tagcouldcleantokencounts.csv.txt), and it also writes logs (stdout.txt and stderror.txt).

Download a workset

After you have created a workset, you can download it as a list of volume identifiers in comma-separated values (CSV) format. Because each workset is functionally a list of pointers to content in the HathiTrust Digital Library, the full text of the volumes is not included in the download. If you are interested in receiving a dataset from the HathiTrust to do research on your own machine, please refer to the directions for requesting a custom dataset. The volume identifiers in a workset are consistent with the volume identifiers used elsewhere across the HathiTrust.

From the homepage of HTRC Analytics, sign in and then navigate to Worksets.

Click the name of the workset you would like to download, then click the "Download" button.

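Once the file is downloaded, the volume identifiers can be read with a few lines of Python. The following is a minimal sketch, not an official HTRC tool. It assumes the CSV has a header row with a column named volume_id; that column name, the function name read_volume_ids, and the sample identifiers are illustrative assumptions, so check the header of your own download and adjust accordingly.

```python
import csv
import io


def read_volume_ids(csv_file, id_column="volume_id"):
    """Return the non-empty volume identifiers found in an open CSV file.

    `id_column` is an assumed header name; change it to match the
    header in your downloaded workset file.
    """
    reader = csv.DictReader(csv_file)
    if reader.fieldnames is None or id_column not in reader.fieldnames:
        raise ValueError(f"column {id_column!r} not found in CSV header")
    ids = []
    for row in reader:
        value = (row.get(id_column) or "").strip()
        if value:  # skip blank rows
            ids.append(value)
    return ids


# Made-up sample standing in for a downloaded workset file.
sample = (
    "volume_id,title\n"
    "mdp.39015000000001,Example A\n"
    "uc1.b000000001,Example B\n"
)
ids = read_volume_ids(io.StringIO(sample))
print(len(ids), "volumes in workset")
```

To read a real download, open it with open("my_workset.csv", newline="", encoding="utf-8") (the file name here is hypothetical) and pass the file object to read_volume_ids. Because the download contains only identifiers, the resulting list points to volumes in the HathiTrust Digital Library and does not include their full text.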