MonkeySpeak
Audio Mining

I have a question related to "audio mining", the process of analyzing audio clips for the purpose of searching and other processing. I see references on the web (eg, this and this) that suggest that the technology exists and is in use, but only on large (and expensive) scales.

Most of these systems allow you search the audio database with text queries, which requires the database to have undergone speech-to-text analysis. Speech-to-text, particularly for arbitrary (and unfamiliar) speakers is incredibly difficult, and so it's not surprising that these systems are not mainstream or cheap yet.

But I'm interested in a subset of this problem: single-speaker, speech-only search. That is, I record a bunch of audio clips of me speaking, and then I submit a query--in the form of spoken words. So it's only one speaker, and the system never has to interpret the audio as text.

I know this simpler problem is not trivial, but it seems like it should be much simpler than the commercial systems I've read about. And I can't see any examples of this simpler system.

Anybody?

Comments

Posted by Dan Lyke on September 22, 2004 at 03:44 PM

Just a pointer to a class of technologies that you've already probably examined, but the various "recognize songs from short samples" technologies are being deployed (Gracenote's MusicID(sm) is the one I know of), it might be worth seeing if that gives you any search terms which are useful.

Posted by tdl on September 23, 2004 at 08:54 AM

Yeah, Muscle Fish has a sound-effect-clip-searchy-thingy, SoundFisher, which is similar. They say


It is also not adapted to speech recognition or speaker identification tasks, although occasionally and under very specific (usually impractical) situations, SoundFisher may provide some utility for music and speech-related sound management.


I conclude from that, and from other stuff I've read, that audio mining tools are depressingly domain-specific, and tend not to cross over well.

Posted by Eckberg on October 20, 2004 at 05:45 PM

I have no idea what you're talking about - but need to get your address again.

Hope you had a safe trip back and all is well in your world!

Peace,
Jason

Comments are now closed on this site. Too much comment spam. Sigh.


© 2002-2003 Tom Lokovic. Some rights reserved.
Except where indicated, all content on this site is governed by a Creative Commons License.
Version control and software project management provided by CVSDude.
Powered by Movable Type 2.63.
Validate XHTMLValidate CSS