How OHD Works

OHD uses a CSV-driven approach to manage your collection. Rather than creating individual web pages for each interview, you’ll manage all your interviews through spreadsheet/CSVs for your metadata and transcripts and then let the system automatically generate the web presentation.

Project Structure Overview

your-repository/
├── _data/
│   ├── your-metadata.csv     # Your collection metadata file
│   ├── filters.csv           # Your coding vocabulary
│   ├── theme.yml             # Your site configuration (basic customizations)
│   └── transcripts/          # Folder for transcript CSVs
│       └── transcript1.csv   # Sample transcript 1
│       └── transcript2.csv   # Sample transcript 2
├── _config.yml               # Project configuration (title, metadata, etc.)
└── objects/                  # Folder for interview images

The Complete Workflow

1. Setup Your Environment

Create a copy of the OHD GitHub Repository
Configure your project settings

2. Prepare Your Content

Transform interview transcripts into structured CSV files –> see examples

Add metadata through a spreadsheet (based on CollectionBuilder-CSV)
- Connect transcripts with audio/video sources via Metadata (optional)
Code Your Transcripts (Optional)
- Create a controlled vocabulary for coding
- Apply thematic codes to your transcript content

3. Customize Your Site

Configure theme settings and appearance
Set up visualization options and coding structure

4. Publish and Share

Publish your collection using GitHub Pages or via other deployment items
Share your new site!

The entire process is designed to be flexible, allowing you to focus on the content and analysis rather than web development. Whether you’re working with a single interview or a large collection, OHD scales to meet your needs.

At the end of the process, you’ll have a professional-quality website that publishes your transcripts with powerful search, browse, and visualization features to promote your research and engage the public.

About Oral History as Data

Oral History as Data was first built in 2018, coming out of work at the University of Idaho Library’s Center for Digital Inquiry and Learning (CD?L). The framework served as the foundation for several digital humanities projects, including Voices of Gay Rodeo, Idaho Queered, and CTRL+Shift.

Devin Becker (@dcnb) is the primary developer for this project.

The project is closely related to the CollectionBuilder project, and the current iteration was built on top of a CollectionBuilder-CSV template.

Quick Start