Abstract: Recovering sound sources from embedded repetition

Recovering sound sources from embedded repetition

J H McDermott, D Wrobleski and A J Oxenham

Published in Proceedings of the National Academy of Sciences, vol.108(3), pp. 1188--1193, Jan 2011.

DOI: 10.1073/pnas.1004765108


  • Reprint (pdf)
  • Sound Demonstrations

  • Cocktail parties and other natural auditory environments present organisms with mixtures of sounds. Segregating individual sound sources is thought to require prior knowledge of source properties, yet these presumably cannot be learned unless the sources are segregated first. Here we show that the auditory system can bootstrap its way around this problem by identifying sound sources as repeating patterns embedded in the acoustic input. Due to the presence of competing sounds, source repetition is not explicit in the input to the ear, but it produces temporal regularities that listeners detect and use for segregation. We used a simple generative model to synthesize novel sounds with naturalistic properties. We found that such sounds could be segregated and identified if they occurred more than once across different mixtures, even when the same sounds were impossible to segregate in single mixtures. Sensitivity to the repetition of sound sources can permit their recovery in the absence of other segregation cues or prior knowledge of sounds, and could help solve the cocktail party problem.
  • Listing of all publications