SpokenMedia

Peter Wilkins August 24, 2010

Customizing the Google Search Appliance for Greenfield OCW

MIT’s OpenCourseWare uses MIT’s Google Search Appliance (GSA) to search its content. MIT supports customization of GSA results through XSL transformation. This post describes how we plan to use GSA to search lecture transcripts and return results containing the lecture videos that the search terms appear in. Since OCW publishes static content, it doesn’t incorporate an integral search engine. Search is provided through Continue Reading

Filed Under: Developer Tagged With: Developer, GoogleSearchAppliance, Greenfield, MIT, Search

Brandon Muramatsu August 12, 2010

How Google Translate Works

Google posted a high level overview of how Google Translate works. Source: Google

Filed Under: Developer, News Tagged With: Video, YouTube Video

Peter Wilkins August 6, 2010

Running the Baseline Recognizer

The software that processes lecture audio into a textual transcript is comprised of a series of scripts that marshall input files and parameters to a speech recognition engine. Interestingly, since the engine is data driven, its code seldom changes; improvements in performance and accuracy are achieved by refining the data it uses to perform its […]

Filed Under: Developer Tagged With: Baseline, Developer, Recognizer

Brandon Muramatsu July 28, 2010

An interesting hack from Yahoo! Openhack India

Sound familiar? Automatic, Real-time close captioning/translation for flickr videos. How? We captured the audio stream that comes out to speaker and gave as input to mic. Used Microsoft Speech API and Julius to convert the speech to text. Used a GreaseMonkey script to sync with transcription server(our local box) and video and displayed the transcribed […]

Filed Under: News Tagged With: Flickr Video, Transcription, Translation

Brandon Muramatsu July 24, 2010

Converting .sbv to .trans/continuous text

As a step in comparing the output from YouTube’s Autocaptioning, we need to transform their .sbv file into something we can use in our comparison tests (a .trans file). We needed to strip the hours out of the timecode, drop the end time, and bring everything to a single line. Update: It turns out we […]

Filed Under: Developer Tagged With: .sbv, .trans, Autocaption, conversion, YouTube

Brandon Muramatsu July 19, 2010

Caption File Formats

There’s been some discussion on the Matterhorn list recently about caption file formats, and I thought it might be useful to describe what we’re doing with file formats for SpokenMedia. SpokenMedia uses two file formats, our original .wrd files output from the recognition process and Timed Text Markup Language (TTML). We also need to handle […]

Filed Under: Developer Tagged With: .sbv, .srt, .stm, .wrd, File Format, YouTube

Brandon Muramatsu July 14, 2010

SpokenMedia at T4E 2010 Conference

Brandon Muramatsu presented on SpokenMedia at the Technology for Education 2010 Conference in Mumbai, India on July 1, 2010. Implementing SpokenMedia for the Indian Institute for Human Settlements from Brandon Muramatsu Source: Brandon Muramatsu Download Video (MP4, 230MB) View more presentations from Brandon Muramatsu. Cite as: Muramatsu, B., McKinney, A. & Wilkins, P. (2010, July […]

Filed Under: Presentation Tagged With: Slideshare, T4E, T4E 2010, Technology for Education, TechTV

Brandon Muramatsu June 21, 2010

Towards cross-video search

Preparing Transcripts for Search Across Multiple Videos

Here’s a workflow diagram I put together to demonstrate how we’re approaching the problem of searching over the transcripts of multiple videos and ultimately returning search results that maintain time-alignment for playback. You’ll notice I included using OCW on lecture slides to help in search and retrieval–this is not an area we’re currently focusing on, […]

Filed Under: Developer Tagged With: Search

Brandon Muramatsu June 17, 2010

Making Progress

In the last month or two we’ve made some good progress with getting additional parts of the SpokenMedia workflow into a working state. Here’s a workflow diagram showing what we can do with SpokenMedia today. (The bright yellow indicates features working in the last two months, the gray indicates features we’ve had working since December […]

Filed Under: Developer, News Tagged With: Acoustic Model, Domain Model

Peter Wilkins June 16, 2010

Using Lucene/Solr for Transcript Search

Overview In any but a trivial implementation, searching lecture transcripts presents challenges not found in other search targets. Major among them is that each transcript word requires its own metadata (start and stop times). Solr, a web application that derives its search muscle from Apache Lucene, has a query interface that is both rich and […]

Filed Under: Developer Tagged With: Lucene, Search, Solr

Customizing the Google Search Appliance for Greenfield OCW

How Google Translate Works

Running the Baseline Recognizer

An interesting hack from Yahoo! Openhack India

Converting .sbv to .trans/continuous text

Caption File Formats

SpokenMedia at T4E 2010 Conference

Towards cross-video search

Making Progress

Using Lucene/Solr for Transcript Search

Automatic Transcription Available for Testing

What is SpokenMedia?

Recent Posts

Archives