I had the pleasure of being on Circulating Ideas with Steve Thomas. We talked about a bunch of things including open textbooks, accessibility, alternate formats, and being a systems librarian. He’s a great host and an interesting person to chat with. The interview went up last week.
Without a transcript a podcast isn’t accessible to Deaf and some Hard of Hearing people. It felt strange to be talking about accessibility and universal design and have it be in an audio-only format. So I decided to produce a transcript.
I heard the folks from Pop Up Archive present at code4lib in Portland. Pop Up Archive makes sound searchable using speech-to-text technology. Their clients are mostly public radio broadcasters who are looking to make their sound archives searchable. I remember thinking at code4lib that this could be an interesting tool to help make politics more accessible and transparent. For example, transcripts could be made available fairly quickly after a municipal committee (or provincial or federal committee) met. The transcript is almost the byproduct of this process.
I was curious how it could be used to produce a transcript. I was also curious about how accurate the machine transcript was, as well as how long it would take me to clean up. First, you upload the sound file. Next, you can add metadata about the file you uploaded. Then Pop Up Archive processes your sound file. The machine transcript takes as long as your file is, in my case 39 minutes, to process. The machine transcript was about 80% accurate. Finally you can edit the machine transcript on their platform. It took me about 2 hours to clean up a 39 minute interview.