Internetvideos im Griff

Diese Idee hat das Potenzial, dem Internet auf dieser Stufe der Entwicklung einen kräftigen Schubs zu geben: Eine Suchmaschine (blinx.com), die gesprochene Worte aus im Internet veröffentlichten Videos durchsuchbar macht.

Außerdem gibt es mit „The Wall“ eine interessante kleine Sammlung jeweils aktueller Themen.

Bin gespannt, was aus der Idee wird, und ob Google & co demnächst mit einem ähnlichen Angebot nachziehen.

Hier noch ein paar Details:

Search engines — like Google — that were developed during the first, text-based era of the Web do a poor job of searching through this rising sea of video. That’s because they don’t search the videos themselves, but rather things associated with them, including the text of a Web page, the “metadata” that computers use to display or understand pages (like keywords or the semantic tags that describe different content), video-file suffixes (like .mpeg or .avi), or captions or subtitles. None of these methods are very satisfactory. Many Internet videos have little or obscure text, and clips often have no or misleading metadata. Modern video players do not reveal video-file suffixes, and captions and subtitles imperfectly capture the spoken words in a video.

Mr. Chandratillake’s solution does not reject any existing video search methods, but supplements them by transcribing the words uttered in a video, and searching them. This is an achievement: effective speech recognition is a “nontrivial problem,” in the language of computer scientists. Blinkx’s speech-recognition technology employs neural networks and machine learning using “hidden Markov models,” a method of statistical analysis in which the hidden characteristics of a thing are guessed from what is known. Mr. Chandratillake calls this method “contextual search,” and he says it works so well because the meanings of the sounds of speech are unclear when considered by themselves. “Consider the phrase ‘recognize speech,’ ” he wrote in an e-mail message. “Its phonemes (‘rek-un-nise-peach’) are incredibly similar to those contained in the phrase ‘wreck a nice beach.’ Our systems use our knowledge of which words typically appear in which contexts and everything we know about a given clip to improve our ability to guess what each phoneme actually means.”

Millions of Videos, and Now a Way to Search Inside Them — New York Times

Tags » , «

Autor:
Datum: Freitag, 2. März 2007 16:09
Trackback: Trackback-URL Themengebiet: Allgemein

Feed zum Beitrag: RSS 2.0 Kommentare und Pings geschlossen.

Keine weiteren Kommentare möglich.