The LGBTQ+ AV Archive Mining Project (2020-2021) was aimed at developing and making available models for extracting text, building usable text datasets, and developing public-facing data visualizations based on audiovisual (AV) materials in the LGBTQ+ collections in the UWM Archives. The LGBTQ+ AV Archive Mining Project used machine learning tools and human quality control and oversight to build and process the text datasets available here.
Creation of this corpus was funded by the Andrew W. Mellon Foundation as part of the second cohort for Collections as Data: Part to Whole.
Submissions from 2021
(1) LGBTQ+ AV Collections: Full Text Corpus, UWM Libraries
ACT UP Milwaukee Records: Text Corpus of AV Materials, UWM Libraries
Cream City Foundation Records: Text Corpus of AV Materials, UWM Libraries
Gay Peoples Union Records, 1971-1984: Text Corpus of AV Materials, UWM Libraries
Milwaukee Gay/Lesbian Cable Network Records: Text Corpus of AV Materials, UWM Libraries
Miriam Ben-Shalom Papers, 1971-1999: Text Corpus of AV Materials, UWM Libraries
Oral History Interviews of the Milwaukee LGBT History Project: Text Corpus, UWM Libraries
PrideFest Records: Text Corpus of AV Materials, UWM Libraries
Shall Not Be Recognized Exhibition Records: Text Corpus of AV Materials, UWM Libraries
Vivent Health Records: Text Corpus of AV Materials, UWM Libraries