Abstract
This project is about developing an open-source AI framework for digital libraries to explore, curate, and analyze large collections of digitized poetry texts. The project team will identify potentially underrepresented poetry groups; build labeled poetry datasets based on theme and emotion categories; develop artificial intelligence (AI) models for understanding poetry; and incorporate user evaluation and feedback to improve the AI model’s performance, fairness, as well as reduce bias. They will publish project results in a toolkit to help future librarians adopt the ethical AI lifecycle to their collections. In addition, the project results will allow public users to explore massive digital collections of poetry using richer metadata tags, such as theme and emotion. Ultimately, this research will increase the exposure and discovery of underrepresented poetry, leading to increased access to diverse literature in digital libraries.
Bio
Kahyun Choi is an assistant professor and 2022 Luddy Fellow in the Department of Information and Library Science and Data Science Program at Indiana University Bloomington. She earned her Ph.D. in the School of Information Sciences at the University of Illinois at Urbana-Champaign. Before her Ph.D., she also worked as a software engineer in Naver, a search engine company in Korea. Her research interests include ethical AI workflow for Libraries, Archives, and Museums (LAMs), music information retrieval, public library- based AI education program, computational lyrics analysis, and computational poetry analysis. Her research applies computational methods and machine learning algorithms to audio and text data. She has received awards and fellowships, including the 2021 Institute of Museum and Library Services (IMLS) National Leadership Grant, 2021 Luddy Faculty Fellowship, and 2022 IMLS Early Career Research Development Project Grant.