Newspapers as Data: A Collections as Data Project

About the project


The University of Arizona Libraries was awarded a Collections as Data: Part to Whole grant funded by The Andrew W. Mellon Foundation and re-granted by the University of Nevada Las Vegas, in collaboration with the University of Iowa.

"Using Newspapers as Data for Collaborative Pedagogy: A Multidisciplinary Interrogation of the Borderlands in Undergraduate Classrooms" brings together a group of library faculty and disciplinary scholars to introduce students to data literacy and computational analysis using digitized historical newspapers from Arizona.

About the newspapers

In an earlier project, the University of Arizona Libraries partnered with the State Library of Arizona to digitize Arizona newspapers for the National Digital Newspaper Program. The newspapers are now available open-access on the Library of Congress' Chronicling America.

This computational analysis project focuses on eight papers, including Spanish-language newspapers, newspapers of African American communities, and newspapers from predominantly white English-speaking communities, all located within the Southwest during two time periods: 1915 to 1922 and 1941 to 1959.

  • Apache Sentinel and Post Script of the Apache Sentinel, published 1943-1946, digital files available 1943-1946
  • Arizona Post, published 1946-2020, digital files available 1946-1962
  • Arizona Sun, published 1942-1965, digital files available 1946-1962
  • The Bisbee Daily Review, published 1896 (as the Weekly Orb)-2020, digital files available 1901-1922
  • The Border Vidette, published 1894-1934, digital files available 1897-1934
  • El Sol, published 1939-1981, digital files available 1942-1962
  • El Tucsonense, published 1915-1959, digital files available 1915-1957
  • Phoenix Tribune, published 1918-1931, digital files available 1918-1931

About the courses

Students used the newspapers as data to explore topics in assignments for these Fall 2020 courses:

ENGL 696E: Archival Research Methods in Rhetoric and Composition

HIST 358: Natural History of Disasters

HIST 495/595: Archives, Museums, and Zoos: Introduction to Public History

JOUR 405/505: Arizona Student Media Apprenticeship

SPAN 350: Introduction to Literary Analysis


View the recording of the symposium.

Learn more

Newspaper data

  • Borderlands newspapers on GitHub: Includes Jupyter Notebook lessons and a sample data set of three newspapers: Bisbee Daily Review, Border Vidette, and El Tucsonense, 1917-1919
  • UA Data Repository: Includes text data files of the eight newspapers used for the CAD project.
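
As a rough illustration of the kind of analysis the text data files support, the sketch below counts word frequencies in a single page of newspaper text. It is not taken from the project's notebooks: the function name term_counts, the stopword list, and the sample sentence are all hypothetical, and the actual layout of the data files may call for different loading steps.

```python
import re
from collections import Counter


def term_counts(text, stopwords=frozenset()):
    """Count lowercase word tokens in a string, skipping any stopwords."""
    # [^\W\d_]+ matches runs of letters only, including accented
    # letters, so Spanish-language text is tokenized correctly.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    return Counter(t for t in tokens if t not in stopwords)


# Hypothetical sample text, not drawn from the project's data files.
sample_page = "La influenza llega a Tucson. Influenza cases rise in Bisbee."
counts = term_counts(sample_page, stopwords={"la", "a", "in"})
print(counts.most_common(3))
```

Running the same function over the pages of each newspaper, grouped by year, would give a simple way to compare how often a term appears across titles and time periods.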

Virtual student showcase

The Newspapers as Data Virtual Student Showcase highlights student work from several of the courses in which text mining of historical newspapers was incorporated.

Recommended readings

Examples of text data mining:

Text data mining in Python:

Project Team

  • Project lead: Mary Feeney, Librarian, Research & Learning Department, The University of Arizona Libraries
  • Disciplinary lead: Anita Huizar-Hernández, Associate Professor, Department of Spanish and Portuguese, The University of Arizona
  • Administrative lead: Sarah Shreeves, Vice Dean, The University of Arizona Libraries
  • Celeste González de Bustamante, Associate Professor, School of Journalism, and Director of the Center for Border and Global Journalism, The University of Arizona
  • Marya McQuirter, Director of the Public History Collaborative and Assistant Professor, Department of History
  • Katherine Morrissey, Associate Professor, Department of History
  • Cristina Ramírez, Associate Professor, Department of English, and Program Director of the Rhetoric, Composition, and the Teaching of English graduate program
  • Jeff Oliver, Data Science Specialist, Office of Digital Innovation and Stewardship, The University of Arizona Libraries
  • Verónica Reyes-Escudero, Katheryne B. Willock Head of Special Collections, The University of Arizona Libraries
  • Megan Senseney, Office of Digital Innovation and Stewardship Department Head, The University of Arizona Libraries
  • Erika Castaño, Assistant Librarian and Archivist, Special Collections, The University of Arizona Libraries