Glossary of Terms
This glossary defines key terms and concepts used throughout the Oral History as Data documentation.
A
- Alt Text (Alternative Text)
- Descriptive text for images that displays when images can’t be loaded and is read by screen readers. Essential for accessibility. Example:
alt="Portrait of John Smith from 1975".
B
- Baseurl
- The URL path where your site is hosted, used in
_config.yml. For GitHub Pages, this is usually/repository-name. Example: If your site is atusername.github.io/my-project/, the baseurl is/my-project. - Bootstrap
- A CSS framework used by CollectionBuilder and OHD to provide responsive design and pre-built components. Bootstrap 5 is the current version used.
- Browse Page
- A page that displays all items in your collection in a grid or list view, allowing users to filter and search through the collection.
C
- Child Object
- In a compound object structure, a child object is a related item (photo, document, artifact) that appears below a parent transcript. Connected via
parentidfield. - CollectionBuilder
- The open-source framework that Oral History as Data is built upon. CollectionBuilder creates digital collection sites from CSV metadata.
- CollectionBuilder-CSV
- The specific version of CollectionBuilder that uses CSV files for metadata, which OHD extends with transcript-specific features.
- Commit
- In Git, saving changes to your repository with a descriptive message. Changes must be committed before they can be pushed to GitHub.
- Compound Object
- A collection item that consists of multiple related components, such as an interview transcript with associated photographs. In OHD, the interview transcript is the parent and related items are children.
- CSV (Comma-Separated Values)
- A plain text file format that stores tabular data. Used in OHD for metadata, transcript content, and configuration. Can be created in Excel or Google Sheets.
D
- Display Template
- A field in your metadata that determines how an item is displayed. For oral history interviews, use
display_template: transcript. Other values includeimage,pdf,video,audio.
F
- Featured Image
- An image used for the home page background or to represent an item in browse views. Set in
_data/theme.ymlor metadata fields (image_small,image_thumb). - Filters.csv
- A CSV file located at
_data/filters.csvthat defines your tag vocabulary (coding scheme). Contains two columns:taganddescription. Tags from this file are used to code and visualize transcript segments. - Front Matter
- YAML-formatted metadata at the top of a markdown file, enclosed by three dashes (
---). Controls page settings like title, layout, and navigation order.
G
- GitHub
- A web platform for hosting and collaborating on Git repositories. OHD projects are typically hosted on GitHub.
- GitHub Actions
- An automation service that builds and deploys your OHD site. The preferred method for GitHub Pages publishing, allowing custom Jekyll plugins.
- GitHub Pages
- Free web hosting service provided by GitHub that publishes static websites directly from a repository. Perfect for OHD collections.
- Git
- Version control software that tracks changes to files. GitHub is built on Git.
H
- Hex Code
- A six-character color code used in web design, starting with
#. Example:#0d6efdfor blue. Used in_data/theme.ymlfor customizing colors.
I
- Include
- A reusable component in Jekyll, stored in the
_includes/directory. OHD uses includes for Bootstrap components, figures, and transcript features. - Interviewee
- The person being interviewed in an oral history. This is an OHD-specific metadata field.
- Interviewer
- The person conducting the interview. This is an OHD-specific metadata field.
J
- Jekyll
- The static site generator that powers OHD and CollectionBuilder. Converts markdown files and data into a complete website. Written in Ruby.
- JSON (JavaScript Object Notation)
- A data format used by OHD for large transcript collections. Can be generated from CSV files for better performance on the visualization page.
K
- Kramdown
- The markdown parser used by Jekyll. Supports extended markdown syntax including footnotes and tables.
L
- Layout
- A template that defines how a page is structured, stored in
_layouts/. Common OHD layouts includehome-cover,transcript, andlesson-content. - Liquid
- A templating language used by Jekyll to generate dynamic content. Uses tags like
{% if %}and variables like{{ page.title }}.
M
- Markdown
- A lightweight markup language used to write content for OHD pages. Converts plain text to formatted HTML. File extension:
.md. - Metadata
- Structured information about each interview in your collection. Stored in a CSV file in the
_data/directory. Required fields includeobjectid,title, anddisplay_template.
N
- Nav Order
- A front matter value that determines the order of pages in your site’s navigation menu. Lower numbers appear first.
O
- Object Location
- A metadata field containing the URL or path to media files (audio, video, images). For YouTube, use format:
https://youtu.be/VIDEO_ID. For local files:/objects/filename.mp3. - Objectid
- A unique identifier for each item in your collection. Must match the transcript CSV filename exactly (without .csv extension). Use lowercase letters, numbers, and underscores only. Example:
smith_john. - OHD (Oral History as Data)
- A framework for creating digital oral history collections with interactive transcript visualizations, built on top of CollectionBuilder.
P
- Parent Object
- In a compound object structure, the main interview transcript item. Child objects reference the parent via
parentidfield matching the parent’sobjectid. - Parentid
- A metadata field used in compound objects. Links child objects to their parent by matching the parent’s
objectid. - Permalink
- The permanent URL for a page, set in front matter. Example:
permalink: /about/creates a page atyoursite.com/about/. - Push
- In Git, uploading committed changes from your local computer to GitHub. Changes must be pushed to appear on GitHub and trigger site rebuilds.
R
- Repository (Repo)
- A project folder that contains all your site files, tracked by Git. Hosted on GitHub for OHD projects.
- Ruby
- The programming language that Jekyll is written in. Must be installed to run Jekyll locally.
- RubyGems
- A package manager for Ruby that installs Jekyll and its dependencies.
S
- SASS/SCSS
- CSS preprocessor languages that add features like variables and nesting. OHD uses SCSS files in the
_sass/directory for styling. - Section
- A navigation grouping in OHD docs. The first page in a section uses
section_id, subsequent pages usesectionto create dropdown menus. - Speaker
- A column in transcript CSV files indicating who is speaking in each segment. Typically the interviewer or interviewee name.
- Static Site
- A website consisting of fixed HTML, CSS, and JavaScript files (no database required). OHD generates static sites using Jekyll.
- Subject
- A metadata field containing topics or themes covered in an interview. Multiple subjects separated by semicolons. Example:
mining; labor history; safety. Powers subject browse pages and filtering.
T
- Tags
- Thematic codes applied to transcript segments for visualization and analysis. Tags must be defined in
_data/filters.csvand applied in thetagscolumn of transcript CSVs. Multiple tags separated by semicolons. - Template
- A starter repository containing all necessary files for an OHD project. Create new projects by clicking “Use this template” on the OHD repository.
- Theme.yml
- A configuration file at
_data/theme.ymlthat controls display options, colors, and feature toggles for your OHD site. - Timestamp
- A column in transcript CSV files indicating when each segment occurs in the audio/video recording. Formats:
MM:SSorHH:MM:SS. Enables click-to-navigate functionality. - Transcript CSV
- A CSV file containing interview transcript content, stored in
_data/transcripts/. Columns includespeaker,words,tags, andtimestamp. Filename must match metadataobjectid.
U
- UTF-8
- A character encoding that supports all languages and special characters. Always save CSV files with UTF-8 encoding to prevent display issues.
- URL Variables
- Settings in
_config.ymlthat define your site’s web addresses, includingurl,baseurl, andsource-code.
V
- Visualization
- The interactive colored bar chart at the top of transcript pages, showing coded segments. Users can filter by tags to focus on specific themes.
- Vocabulary
- See Filters.csv. The set of tags/codes defined for analyzing transcripts across a collection.
W
- Words
- A column in transcript CSV files containing the actual spoken text from the interview. The only required column in transcript CSVs.
- Workflow
- In GitHub Actions, an automated process defined by a YAML file in
.github/workflows/. OHD uses a Jekyll workflow to build and deploy sites.
Y
- YAML (YAML Ain’t Markup Language)
- A human-readable data format used in
_config.yml,_data/theme.yml, and front matter. Uses key-value pairs and strict indentation (spaces, not tabs).
Related Resources
- Troubleshooting Guide - Solutions to common problems
- Resources for Further Learning - Technology documentation
- Accessibility Guide - Making your collection accessible to all users