...
You can now see the status of the job, as shown below. The status of the job will initially show as “Staging”. (Refresh the screen after some time and you will see the status to have changed to “Queued”. )
3.5 Open the completed job
Eventually, the job will have “completed”, and the screen, on refreshing, will look as follows. Click on the link representing the job name.
3.6 Prepare to download the results returned by the EF_Rsync_Script_Generator algorithm
At this time, the screen should look like the following:
At the very bottom left of your browser window, you will see a message like the following. (The number you see within the parentheses may vary, depending on how many times you have executed this step before. If doing this step for the first time, there will be no parentheses.) Press the “Keep” button.
3.7 Download the script returned by the EF_Rsync_Script_Generator algorithm
At this point, the script will be downloaded to your computer’s hard disk, and you will see the message at the bottom left of your browser window be replaced by just the name of the downloaded file:
3. 8 Run the script returned by the EF_Rsync_Script_Generator algorithm
Find the downloaded file on your hard disk. It is a shell script that you can run from a terminal window. (When you do so, the basic features data file and the advanced features data file will both be transferred to your hard disk. ) Check the file size ( if you want) by using the ls -l command at the Unix shell prompt, and then execute the script, as shown below.
gwork025:Downloads sayan$ ls -l EF_Rsync.sh
-rw-r-----@ 1 sayan GSLIS-AD\sayan 320 May 5 00:53 EF_Rsync.sh
gwork025:Downloads sayan$ sh EF_Rsync.sh
mdp.39015012864743.advanced.json.bz2
sent 152 bytes received 200 bytes 704.00 bytes/sec
total size is 10192 speedup is 28.95
mdp.39015012864743.basic.json.bz2
sent 1538 bytes received 1121 bytes 5318.00 bytes/sec
total size is 171977 speedup is 64.68
If your workset contained N volumes with HathiTrust volume IDs V1, V2, V3,... VN respectively, then executing the shell script as shown above will cause the following compressed advanced and basic feature datafiles for the corresponding volumes,
V1.advanced.json.bz2, V1.basic.json.bz2,
V2.advanced.json.bz2, V2.basic.json.bz2,
V3.advanced.json.bz2, V3.basic.json.bz2,...
VN.advanced.json.bz2, VN.basic.json.bz2,
to be transferred to your computer’s hard disk via rsync. You will then be able to uncompress these files into text files in json format. You will be able to view the features by opening the uncompressed files in a suitable editor (such as Oxygen), and be able to manipulate the files programmatically. (For this particular workset, recall that there was only one volume, the book Buch der Lieder by Heinrich Heine, with the HathiTrust volumeID mdp.39015012864743. Therefore, the files mdp.39015012864743.advanced.json.bz2 and mdp.39015012864743.basic.json.bz2 were transferred.