Archives for August 2010

Peter Wilkins August 24, 2010

Customizing the Google Search Appliance for Greenfield OCW

MIT’s OpenCourseWare (OCW) uses MIT’s Google Search Appliance (GSA) to search its content. MIT supports customization of GSA results through XSL transformation. This post describes how we plan to use the GSA to search lecture transcripts and return results containing the lecture videos in which the search terms appear. Since OCW publishes static content, it has no built-in search engine. Search is provided through […]

Filed Under: Developer Tagged With: Developer, GoogleSearchAppliance, Greenfield, MIT, Search

Brandon Muramatsu August 12, 2010

How Google Translate Works

Google posted a high-level overview of how Google Translate works.

Source: Google

Filed Under: Developer, News Tagged With: Video, YouTube Video

Peter Wilkins August 6, 2010

Running the Baseline Recognizer

The software that processes lecture audio into a textual transcript consists of a series of scripts that marshal input files and parameters to a speech recognition engine. Interestingly, since the engine is data-driven, its code seldom changes; improvements in performance and accuracy are achieved by refining the data it uses to perform its tasks.

There are two steps to produce the transcript. The first creates an audio file in the correct format for speech recognition. The second processes that audio file into the transcript.
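The two steps can be sketched roughly as follows. This is only an illustration, not the project's actual scripts: the 16 kHz mono WAV target is an assumed format, ffmpeg is an assumed conversion tool, and `build_convert_command`, `transcribe_lecture`, `run`, and `recognize` are hypothetical names standing in for the real scripts and engine.

```python
from pathlib import Path
from typing import Callable, List


def build_convert_command(source: Path, target: Path,
                          sample_rate: int = 16000) -> List[str]:
    """Step 1: build an ffmpeg command that writes mono audio.

    The 16 kHz mono target is an assumption for illustration; the
    recognizer's real input format is not stated in the post.
    """
    return [
        "ffmpeg", "-y",          # overwrite the target if it exists
        "-i", str(source),       # input lecture media
        "-ac", "1",              # one audio channel (mono)
        "-ar", str(sample_rate), # resample to the assumed rate
        str(target),
    ]


def transcribe_lecture(source: Path, workdir: Path,
                       run: Callable[[List[str]], None],
                       recognize: Callable[[Path], str]) -> str:
    """Run both steps: convert the audio, then recognize it.

    `run` executes a command line; `recognize` is a hypothetical
    stand-in for the speech recognition engine.
    """
    wav = workdir / (source.stem + ".wav")
    run(build_convert_command(source, wav))  # step 1: audio conversion
    return recognize(wav)                    # step 2: audio to transcript
```

In practice `run` might be something like `lambda cmd: subprocess.run(cmd, check=True)`, while `recognize` would wrap whatever scripts drive the engine. Keeping both as parameters lets the pipeline be exercised without ffmpeg or the recognizer installed.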

Filed Under: Developer Tagged With: Baseline, Developer, Recognizer

