HTRC Algorithms tutorial

See step-by-step instructions for running an HTRC algorithm.

Run a text analysis algorithm 

HTRC Analytics provides a number of text analysis algorithms for users to analyze their worksets, ranging from simple word count to sophisticated ones such as topic modeling. Typically, you need to choose an algorithm of your interest and then select the workset you want to run analysis on with proper parameter settings.

Make sure you are logged in to HTRC Analytics. Navigate to the algorithms by clicking Run algorithms on the menu bar on the top part of page.


On the Run algorithms page, select an algorithm from the list. You can read the description to learn about what the algorithms can do on your workset. For this example we chose the "Token Count and Tag Cloud Creator" algorithm. Click “Execute” to use this algorithm.


Fill in the required parameters. Some parameters have optional default values while for some others you will need to fill them in. You will also need to select a workset (i.e. a collection) that you want to work with. Below shows the parameters entered for this demo. Click on the "Submit" button to submit the job.


After submitting, you will be taken to the "Algorithm results" page for viewing job status. You can stay on the "Algorithm results" page and refresh the page to see the most up-to-date status of the job. Depending on how fast the job is computed, you will probably see the job listed in the "Active Jobs" section. You can also see all the active jobs submitted by you.


Examine results of algorithm

During and after submitting a text analysis job, you can see your work on the "Algorithm results" page.


After the job is finished, you can see the job is listed in the "Completed results" section. Click on the job name to see its result.

This is also the page you will come to to see your results in the future. To get here again, click the "Algorithm results" button from the "Run algorithms" page. (Note that you must be signed in to access these options.)

If you want to view past jobs, you can filter the results by name.

To view results, click on the job name.

On this result page, I can use the tabs to see outputs of running the tag cloud algorithm. This algorithm displays the top words in my workset as a tag cloud, in a list, and logs (stdout.txt and stderr.txt).