Sven Behnke
 Multimodal Communication and Speech Processing

Multimodal communication via speech, mimics, eye gazes, and body language makes intuitive interaction between humans and machines possible.
In the project "Learning Humanoid Robots", I participate in the development of humanoid communication robots that will be used as museum guide.

I am interested in computational auditory scene analysis, the discovery of speech features, and noise robustness.
My goal is to transfer the hierarchical approach with local recurrent connectivity that proved to be useful for visual perception to the auditory domain.
Since a common fundamental is one of the main cues for auditory grouping, I am interested in finding and tracking the fundamental frequency of the speaker of interest in order to use the harmonic structure of voiced speech to separate the speaker from the background.

  • Publications


  • back to research projects