Quick Start
If you just want to jump in to creating an Oral History as Data project without reading all of the sections, this guide will help you understand the basic workflow.
Prerequisites
Before you begin, you should have:
- GitHub Account: A free account at github.com
- Interview Content: At least one interview transcript ready to work with
- Basic Familiarity: Some understanding of spreadsheets and CSV files
Quick Start
1. Set Up Your Repository
- Go to the Oral History as Data Template repository
- Click the green “Use this template” button
- Name your new repository and click “Create repository from template”
- Your repository is now ready to customize
2. Configure Your Project
- Edit the
_config.yml
file to update:- Project title and description
- Metadata CSV filename (defaults to
demo-ohd-metadata.csv
) - Other project details
3. Prepare Your CSV Files
You’ll need a main CSV and transcript CSV files:
Metadata File (metadata.csv
or your custom name):
- Contains information about each interview using the CollectionBuilder-CSV format
- Each row represents one interview
- Required fields include: objectid, title, date
- Compatible with other CollectionBuilder projects
- Store this file in the root directory
- See our examples/demo-ohd-metadata.csv for a sample, or browse all examples
Transcript Files:
- For each interview, create a CSV with four columns: speaker, words, tags, timestamp
- The timestamp field is optional but enables synchronization with audio/video
- Each row represents one segment of dialogue
- Store these in the
_data/transcripts/
directory - Name each file to match the item’s objectid in your metadata
- Check our examples
for sample transcript files
4. Create Your Coding System
- Create or edit
_data/filters.csv
- Define your tag codes and descriptions
- Apply these codes in the “tags” column of your transcript files
- You can also use Python scripts to automatically tag your transcripts.
5. Publish Your Site
- Go to Settings → Pages in your GitHub repository
- In the Build and Deployment section, Select “GitHub Actions” from the dropdown options (change from “Deploy from a branch”)
- An option for “GitHub Pages Jekyll” will appear. Click “Configure”
- A new page will open with a GitHub Action file opened
- Click the green “Commit Changes” button at the top right
- A modal will appear: just click the green “Commit Changes” on the bottom right
- Wait a few minutes for your site to build
- Access your site at the URL provided
Project Structure Overview
your-repository/
├── _data/
│ ├── your-metadata.csv # Your collection metadata file
│ ├── filters.csv # Your coding vocabulary
│ ├── theme.yml # Your site configuration (basic customizations)
│ └── transcripts/ # Folder for transcript CSVs
│ └── transcript1.csv # Sample transcript 1
│ └── transcript2.csv # Sample transcript 2
├── _config.yml # Project configuration (title, metadata, etc.)
└── objects/ # Folder for interview images
More Detail
For more detail on the instructions above, explore the rest of this site:
- Explore the Prepare section to learn how to format your transcripts and metadata
- Review the Setup section for information on setting up and customizing your site
- Visit the Publish section for details on sharing your site publicly