How OHD Works

OHD uses a CSV-driven approach to manage your collection. Rather than creating individual web pages for each interview, you’ll manage all your interviews through spreadsheet/CSVs for your metadata and transcripts and then let the system automatically generate the web presentation.

Project Structure Overview

your-repository/
├── _data/
│   ├── your-metadata.csv     # Your collection metadata file
│   ├── filters.csv           # Your coding vocabulary
│   ├── theme.yml             # Your site configuration (basic customizations)
│   └── transcripts/          # Folder for transcript CSVs
│       └── transcript1.csv   # Sample transcript 1
│       └── transcript2.csv   # Sample transcript 2
├── _config.yml               # Project configuration (title, metadata, etc.)
└── objects/                  # Folder for interview images and/or recordings

Quick Start

Prerequisites

Before you begin, you should have:

  1. GitHub Account: A free account at github.com
  2. Interview Content: At least one interview transcript ready to work with
  3. Basic Familiarity: Some understanding of spreadsheets and CSV files

1. Set Up Your Repository

  1. Go to the Oral History as Data Template repository
  2. Click the green “Use this template” button
  3. Name your new repository and click “Create repository from template”
  4. Your repository is now ready to customize

2. Configure Your Project

  1. Edit the _config.yml file to update:
    • Project title and description
    • Metadata CSV filename (defaults to demo-ohd-metadata.csv)
    • Other project details

3. Prepare Your CSV Files

You’ll need two types of CSV files:

  • Metadata File: One CSV with info about each interview (required fields: objectid, title, display_template).
  • Transcript Files: One CSV per interview with columns: speaker, words, tags, timestamp. Store in _data/transcripts/ folder.

To see samples of these files, check out our example CSVs

4. Create Your Coding System (Optional)

  1. Create _data/filters.csv to define tag codes and descriptions
  2. Apply these codes in the “tags” column of your transcript files for visualization

5. Upload your CSV files

  1. Upload your metadata and filters CSVs to the /_data/ folder
  2. Uplaod your transcript CSVs to the /_data/transcripts/

5. Publish Your Site

  1. Go to Settings → Pages in your GitHub repository
  2. In the Build and Deployment section, Select “GitHub Actions” from the dropdown options (change from “Deploy from a branch”)
  3. GitHub will automatically detect the existing workflow file in your repository - no additional configuration needed
  4. Wait a few minutes for your site to build
  5. Access your site at the URL provided

More Detail

For more detail on the instructions above, explore the rest of this site:

  1. Explore the Prepare section to learn how to format your transcripts and metadata
  2. Review the Setup section for information on setting up and customizing your site
  3. See our publishing guide for details on sharing your site publicly

Continue to Preparation