We’ve moved!

You will be redirected to our new home in five seconds. If not, click here.

GigaOM Network: GigaOM | Earth2Tech | jkOnTheRun | NewTeeVee | OStatic | TheAppleBlog | WebWorkerDaily | Jobs Live Events | About | Contact

December 22, 2005

Auto-transcribing podcasts?

I think it would be really cool if someone could come up with a way to automatically derive written transcriptions from podcasts. This would work similar to speech recognition programs but it needs to be speaker independent to be effective, kind of like closed-captioning used by TV stations. The purpose for producing written transcripts would be two-fold- make podcasts available to those in the community who are deaf and unable to access the information shared in the many podcasts now available and to make the shows searchable once the transcripts are published.

I would love to hear from those in a position who have thoughts about possible solutions that could be used from existing technology. I am willing to work with anyone using any of the podcasts I produce as a test case, with the hope of eventually using this on a wide range of podcasts. Please contact me if you have any ideas along these lines, or post a comment here. Let's see what we can come up with as a community effort.

Enjoy this post? Receive more jkOnTheRun content for FREE by subscribing to the RSS feed!

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d83451b75769e200d83459eede69e2

Listed below are links to weblogs that reference Auto-transcribing podcasts?:

» Transcribing podcasts/audio files from James E. Lee
jkOnTheRun:I think it would be really cool if someone could come up w... [Read More]

» Transcribing podcasts/audio files from James E. Lee's Blog
jkOnTheRun: I think it would be really cool if someone could come up with a way to automatically derive written transcriptions from podcasts. This would work similar to speech recognition programs but it needs to be speaker independent to be... [Read More]

» Transcribing podcasts/audio files from James E. Lee's Blog
jkOnTheRun: I think it would be really cool if someone could come up with a way to automatically derive written transcriptions from podcasts. This would work similar to speech recognition programs but it needs to be speaker independent to... [Read More]

Comments

Great idea James.
I have often thought of working on somthing like that with the Microsoft Speach SDK.

Would also be interested in this - more from a business point of view though.

I work for a market research company - if we could process a recording of an interview through software and come out with a transcript at the other end, that would help us tremendously.

We've tried basic speech recognition, but that isn't accurate enough.

Will watch this with interest. . .

Compaq Research had a project called SpeechBot some years ago - it was a search engine for Internet radio shows.

Unfortunately, it seems to be offline now. Read something about it here:
http://www.corante.com/getreal/archives/003865.html

I've been looking at stuff like this for years, from people like Virage and other digital AV transcription services. All of it is expensive, though very cool. I wanted to use it to provide transcripts of digitally recorded Distance Ed classes. Still do, just wish it was cheaper.

I've written prototypes and talked with some of the technical architects at MS about speech recognition technology. The current APIs are geared for realtime recognition -- dictation -- and it's actually quite difficult to use them in a way that exploits offline CPU power (that is, "I'll wait 8 hours for the job to finish, if the result is better").

This is not to say that the task is impossible; it will certainly be achieved someday. With today's APIs, lower-quality but still valuable work could be done (extraction of keywords and so forth). However, it is a decidedly non-trivial task.

James, I asked this same question on a podcast a few weeks ago and a listener directed me to www.podzinger.com, which scans your mp3 and turns it into text. It still has a few shortcomings and will need to improve, but the recognition is about the same as using Word and a microphone. It is not a complete solution, but a good beginning.

Podzinger somehow transcribes podcasts into text for searching, but does not offer a transcript service. Too bad

www.podziner.com

Comments are temporarily disabled for site maintenance and will return at 6 PM PDT.

 

RSS and Mobile-Friendly View

Contributors

Kevin C. Tofel

James Kendrick

Kevin's gear   JK's gear

Awards

Microsoft MVP Awardees

CNET100 2004Weblog Awards
2004ReadersChoice 2004_BoardOfExperts
Powered by TypePad
Member since 05/2004

Copyright Notice


  • Copyright 2008 The GigaOM Network. All rights reserved. The content in this RSS feed, as well as the content presented on the web pages of the blog, is provided for your personal non-commercial use only and may not be republished in whole or in part without the express written or verbal consent of the publisher. All rights are reserved.
StatCounter