UWM Libraries

Document Type


Publication Date



PrideFest is a nationally recognized festival celebrating the LGBT community that has been held annually in Milwaukee since 1988. This text corpus is derived from the AV materials in that collection. The full finding aid for the PrideFest Records can be found here:


This corpus was created as part of a project to develop workflows and best practices to use machine learning tools to extract text from archival AV materials, with a focus on the LGBTQ+ collections that are part of the UWM Archives. In addition to creating the corpus, the project also developed a prototype dashboard to demonstrate the teaching and research potential of the corpus using text analysis and engaging new modes of discovery.

Creation of this corpus was funded by the Andrew W. Mellon Foundation as part of the second cohort for Collections as Data: Part to Whole.