...
The technology requires, in addition to the input video, any number of documents (e.g., existing transcripts, text from slides, research papers, indexes from textbooks) that are used to create a domain model or augment an existing one. The domain model is critical for speech recognition, transcript development, and ultimately video segmentation for search and retrieval. (Research has shown poor overlap between existing speech recognition dictionaries and the terms used in typical science and engineering academic lectures.) The technology can also be used to create a speaker model.
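As a rough illustration of the vocabulary gap described above, the following sketch counts the terms in a set of supporting documents that a recognizer dictionary lacks, then adds the frequent ones to it. It is only a minimal sketch under stated assumptions: the function names (`tokenize`, `out_of_vocabulary`, `augment`), the word-level tokenization, and the `min_count` threshold are all hypothetical and are not taken from the technology itself, whose domain model is likely richer than a plain word list.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def out_of_vocabulary(documents, dictionary):
    """Count document terms that are missing from the dictionary.

    `documents` is an iterable of strings (e.g. slide text, transcripts);
    `dictionary` is an iterable of words the recognizer already knows.
    Both are assumptions made for this illustration.
    """
    known = {word.lower() for word in dictionary}
    counts = Counter()
    for doc in documents:
        counts.update(tokenize(doc))
    return Counter({term: n for term, n in counts.items() if term not in known})


def augment(dictionary, documents, min_count=1):
    """Return the dictionary extended with terms seen at least `min_count` times."""
    missing = out_of_vocabulary(documents, dictionary)
    known = {word.lower() for word in dictionary}
    return known | {term for term, n in missing.items() if n >= min_count}


if __name__ == "__main__":
    docs = ["The eigenvalue of the matrix", "Eigenvalue decomposition"]
    base = {"the", "of", "matrix"}
    print(out_of_vocabulary(docs, base))
    print(sorted(augment(base, docs, min_count=2)))
```

Raising `min_count` keeps one-off tokens (often typos or extraction noise in slide text) out of the augmented vocabulary, at the cost of missing rare but genuine technical terms.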
Project Plan
\[ProjectPlanMarch2009\]
When we begin to look at lecture transcription as a service, key aspects seem to be:
...