How OHD Works
OHD uses a CSV-driven approach to manage your collection. Rather than creating individual web pages for each interview, you’ll manage all your interviews through spreadsheet/CSVs for your metadata and transcripts and then let the system automatically generate the web presentation.
Project Structure Overview
your-repository/
├── _data/
│ ├── your-metadata.csv # Your collection metadata file
│ ├── filters.csv # Your coding vocabulary
│ ├── theme.yml # Your site configuration (basic customizations)
│ └── transcripts/ # Folder for transcript CSVs
│ └── transcript1.csv # Sample transcript 1
│ └── transcript2.csv # Sample transcript 2
├── _config.yml # Project configuration (title, metadata, etc.)
└── objects/ # Folder for interview images and/or recordings
Quick Start
Prerequisites
Before you begin, you should have:
- GitHub Account: A free account at github.com
- Interview Content: At least one interview transcript ready to work with
- Basic Familiarity: Some understanding of spreadsheets and CSV files
1. Set Up Your Repository
- Go to the Oral History as Data Template repository
- Click the green “Use this template” button
- Name your new repository and click “Create repository from template”
- Your repository is now ready to customize
2. Configure Your Project
- Edit the
_config.yml
file to update:- Project title and description
- Metadata CSV filename (defaults to
demo-ohd-metadata.csv
) - Other project details
3. Prepare Your CSV Files
You’ll need two types of CSV files:
- Metadata File: One CSV with info about each interview (required fields: objectid, title, display_template).
- Transcript Files: One CSV per interview with columns: speaker, words, tags, timestamp. Store in
_data/transcripts/
folder.
To see samples of these files, check out our example CSVs
4. Create Your Coding System (Optional)
- Create
_data/filters.csv
to define tag codes and descriptions - Apply these codes in the “tags” column of your transcript files for visualization
5. Upload your CSV files
- Upload your metadata and filters CSVs to the
/_data/
folder - Uplaod your transcript CSVs to the
/_data/transcripts/
5. Publish Your Site
- Go to Settings → Pages in your GitHub repository
- In the Build and Deployment section, Select “GitHub Actions” from the dropdown options (change from “Deploy from a branch”)
- GitHub will automatically detect the existing workflow file in your repository - no additional configuration needed
- Wait a few minutes for your site to build
- Access your site at the URL provided
More Detail
For more detail on the instructions above, explore the rest of this site:
- Explore the Prepare section to learn how to format your transcripts and metadata
- Review the Setup section for information on setting up and customizing your site
- See our publishing guide for details on sharing your site publicly