HTRC Research Impact

This is a list of scholarly articles, datasets, and other research products that have made substantial use of HTRC’s tools and services and HathiTrust’s unique data.

Monographs

Franklin, Samuel W. (2023). The Cult of Creativity: A Surprisingly Recent History. Chicago: The University of Chicago Press.

Based in part on Franklin’s 2016 HTRC Advanced Collaborative Support grant, “Inside the Creativity Boom.” Read the project report.

Sinykin, Dan (2023). Big Fiction: How Conglomeration Changed the Publishing Industry and American Literature. New York: Columbia University Press.

Based in part on Sinykin’s 2019 HTRC Advanced Collaborative Support grant, “Supporting The Conglomerate Era Project.” Read the project report.

So, Richard Jean (2021). Redlining Culture: A Data History of Racial Inequality and Postwar Fiction. New York: Columbia University Press.

Based in part on So’s 2017 HTRC Advanced Collaborative Support grant, “A Computational History of the U.S. Novel, 1950-2000.” Read the project report.

Underwood, Ted (2019). Distant Horizons: Digital Evidence and Literary Change. Chicago: University of Chicago Press.

Based in part on Underwood’s longstanding collaborations with HTRC, including his co-creation of HTRC’s “NovelTM Datasets for English-Language Fiction, 1700-2009” (read the dataset project report here); his dataset of “Word Frequencies in English-Language Literature, 1700-1922”; and others.

Scholarly Articles

Adams, A.L. (2021). Online tools for digital humanities. Public Services Quarterly, 17(3), 177-182, DOI: https://doi.org/10.1080/15228959.2021.1938789

Bagga, S., & Piper, A. (2022). HATHI 1M: Introducing a Million Page Historical Prose Dataset in English from the Hathi Trust. Journal of Open Humanities Data, 8(7). DOI: https://doi.org/10.5334/johd.71

Beausang, C. (2022). Diachronic delta: A computational method for analysing periods of accelerated change in literary datasets. Digital Scholarship in the Humanities, 37(3), 644–659. https://doi.org/10.1093/llc/fqab041

Brown, N.M., Mendenhall, R., Black, M.L., Van Moer, M., Zerai, A., Flynn, K. (2016). Mechanized Margin to Digitized Center: Black Feminism's Contributions to Combatting Erasure within the Digital Humanities. International Journal of Humanities and Arts Computing, 10(1): 110-125. https://doi.org/10.3366/ijhac.2016.0163  

Craig, K. (2018). Introduction to Bookworm; Robots Reading Vogue; Bookworm: HathiTrust; Bookworm: Open Library; Building a Bookworm. Journal of American History, 105(1): 244–247. DOI: https://doi.org/10.1093/jahist/jay139

Dobson, J. (2021). Interpretable Outputs: Criteria for Machine Learning in the Humanities. Digital Humanities Quarterly, 15(2). http://digitalhumanities.org:8081/dhq/vol/15/2/000555/000555.html

Dobson, J. (2022). Vector hermeneutics: On the interpretation of vector space models of text. Digital Scholarship in the Humanities, 37(1). DOI: https://doi.org/10.1093/llc/fqab079

Ehrlich, H. (2015). Poe in Cyberspace: Balloons! Drones!! The Global Internet!!! The Edgar Allan Poe Review, 16(2), 242–246.

Erlin, M., Piper, A., Knox, D., Pentecost, S. and Blank, A. (2022). The TRANSCOMP Dataset of Literary Translations from 120 Languages and a Parallel Collection of English-language Originals. Journal of Open Humanities Data, 8(0), p. 29. DOI: https://doi.org/10.5334/johd.94

Grallert, T. (2022). Open Arabic Periodical Editions: A Framework for Bootstrapped Scholarly Editions Outside the Global North. Digital Humanities Quarterly, 16(2). http://digitalhumanities.org:8081/dhq/vol/16/2/000593/000593.html 

Hamilton, S., & Piper, A. (2023). MultiHATHI: A Complete Collection of Multilingual Prose Fiction in the HathiTrust Digital Library. Journal of Open Humanities Data, 9(3). DOI: http://doi.org/10.5334/johd.95

Kelly, N.M., White, N., Glass, L. (2021). Squatter Regionalism: Postwar Fiction, Geography, and the Program Era. Journal of Cultural Analytics 6(2). DOI:

Kilner, K., & Fitch, K. (2017). Searching for My Lady’s Bonnet: discovering poetry in the National Library of Australia’s newspapers database. Digital Scholarship in the Humanities, 32(1), i69–i83. DOI:

Lee, A. S., Chiarawongse, P., Guldi, J., & Zsom, A. (2020). The Role of Critical Thinking in Humanities Infrastructure: The Pipeline Concept with a Study of HaToRI (Hansard Topic Relevance Identifier). Digital Humanities Quarterly, 14(3). http://digitalhumanities.org:8081/dhq/vol/14/3/000481/000481.html 

Le-Khac, L., & Hao, K. (2021). The Asian American Literature We’ve Constructed. Journal of Cultural Analytics 6(2). DOI:

Moravec, M., Chang, K.K. (2021). Feminist Bestsellers: A Digital History of 1970s Feminism. Journal of Cultural Analytics 6(2). DOI:

Nurmikko-Fuller, T. (2022). Teaching Linked Open Data using Bibliographic Metadata. Journal of Open Humanities Data, 8(6). DOI: http://doi.org/10.5334/johd.60

Ravenscroft, A., Allen, C. (2019). Finding and Interpreting Arguments: An Important Challenge for Humanities Computing and Scholarly Practice. DHQ: Digital Humanities Quarterly, 13(4). http://digitalhumanities.org:8081/dhq/vol/13/4/000436/000436.html

Shanahan, J., Burke, R., Lučić, A. (2020). Reading Chicago Reading: Quantitative Analysis of a Repeating Literary Program. DHQ: Digital Humanities Quarterly, 14(2). http://digitalhumanities.org:8081/dhq/vol/14/2/000461/000461.html

Sinykin, D., Roland, E. (2021). Against Conglomeration: Nonprofit Publishing and American Literature After 1980. Journal of Cultural Analytics, 6(2). DOI:

Stevens, G. (2017). New Metadata Recipes for Old Cookbooks: Creating and Analyzing a Digital Collection Using the HathiTrust Research Center Portal. Code4Lib Journal, 37(1). Accessed November 23, 2022.

Conference Papers, Presentations, and Posters

Ball, L., & Bothma, T. (2020). The capability of search tools to retrieve words with specific properties from large text collections. In Proceedings of ISIC, the Information Behaviour Conference, Pretoria, South Africa, 28 September - 1 October, 2020. Information Research, 25(4), paper isic2030. DOI:

Ledbetter, W., & Spring, J. (2020). Peace Speech Identification Using ABBYY Fine Reader and HathiTrust. 2020 IEEE International Conference on Big Data (Big Data), 5739-5743. DOI: https://doi.org/10.1109/BigData50022.2020.9377870

VandenBosch, A., Schmidt, B.M., Matusiak, K.K. and Organisciak, P. (2021). Moving Past Metadata: Improving Digital Libraries with Content-Based Methods. Proceedings of the Association for Information Science and Technology, 58: 849-851.

Datasets

Bagga, Sunyam; Piper, Andrew (2021). HATHI 1M: Introducing a Million Page Historical Prose Dataset in English from the Hathi Trust. https://doi.org/10.7910/DVN/HAKKUA, Harvard Dataverse, V2.

Hamilton, S., & Piper, A. (2023). MultiHATHI: A Complete Collection of Multilingual Prose Fiction in the HathiTrust Digital Library. Journal of Open Humanities Data, 9(3). DOI: http://doi.org/10.5334/johd.95

Wilkins, Matthew, and Guangchen Ruan (2020). Geographic Locations in English-Language Literature, 1701-2011. DOI: https://doi.org/10.13012/2K5C-RF13.