Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

You can now see the status of the job, as shown below. The status of the job will initially show as “Staging”. (Refresh the screen after some time and you will see the status to have changed to “Queued”. )

 


 

3.5 Open the completed job


Eventually, the job will have “completed”, and the screen, on refreshing, will look as follows. Click on the link representing the job name.


 

3.6 Prepare to download the results returned by the EF_Rsync_Script_Generator algorithm

At this time, the screen should look like the following:


At the very bottom left of your browser window, you will see a message like the following. (The number you see within the parentheses may vary, depending on how many times you have executed this step before. If doing this step for the first time, there will be no parentheses.) Press the “Keep” button.


 

3.7 Download the script returned by the EF_Rsync_Script_Generator algorithm

 

At this point, the script will be downloaded to your computer’s hard disk, and you will see the message at the bottom left of your browser window be replaced by just the name of the downloaded file:


Image Added


 

3. 8 Run the script returned by the EF_Rsync_Script_Generator algorithm

 

Find the downloaded file on your hard disk. It is a shell script that you can run from a terminal window. (When you do so, the basic features data file and the advanced features data file will both be transferred to your hard disk. ) Check the file size ( if you want) by using the  ls -l command at the Unix shell prompt, and then execute the script, as shown below.

gwork025:Downloads sayan$ ls -l EF_Rsync.sh

-rw-r-----@ 1 sayan  GSLIS-AD\sayan  320 May  5 00:53 EF_Rsync.sh


gwork025:Downloads sayan$ sh EF_Rsync.sh


mdp.39015012864743.advanced.json.bz2


sent 152 bytes  received 200 bytes  704.00 bytes/sec

total size is 10192  speedup is 28.95


mdp.39015012864743.basic.json.bz2


sent 1538 bytes  received 1121 bytes  5318.00 bytes/sec

total size is 171977  speedup is 64.68


If your workset contained N volumes with HathiTrust volume IDs V1, V2, V3,... VN respectively, then executing the shell script as shown above will cause the following compressed advanced and basic feature datafiles for the corresponding volumes,



V1.advanced.json.bz2, V1.basic.json.bz2, 

V2.advanced.json.bz2, V2.basic.json.bz2,

V3.advanced.json.bz2, V3.basic.json.bz2,...

VN.advanced.json.bz2, VN.basic.json.bz2,


to be transferred to your computer’s hard disk via rsync. You will then be able to uncompress these files into text files in json format. You will be able to view the features by opening the uncompressed files in a suitable editor (such as Oxygen), and be able to manipulate the files programmatically. (For this particular workset, recall that there was only one volume, the book Buch der Lieder by Heinrich Heine, with the HathiTrust volumeID mdp.39015012864743. Therefore, the files mdp.39015012864743.advanced.json.bz2 and mdp.39015012864743.basic.json.bz2 were transferred.