User:Jliu49/sandbox
Computational Auditory Scene Analysis
What is CASA?
(Brief Overview)
Goals
(Top edit on existing material with added application information)
Cocktail party processor
Applications
(Brief overview, will be expanded later in the article)
Compared with ASR
Difference between ASR (automatic speech recognition) and CASA [1]
Basics of CASA Systems
Human ASA
(The main neuroscience part of the article)[2]
Structure and Function of the Auditory System
Will include neuroanatomy and function (but with a focus of CASA application) of the following:
a. Auditory periphery
b. Cochlea
c. Movements of the basilar membrane
d. Auditory nerve responses
e. Auditory cortex
System Architecture
(brand new information to page)
Cochleagram
(brand new information to page)
Correlogram
(brand new information to page)[3]
Cross-Correlogram
(brand new information to page)
Time-Frequency Masks
(brand new information to page)
Resynthesis
(brand new information to page)
Evaluation of CASA systems
Will include the following:
a. Comparison with Clean Target Signal
b. Automatic Recognition Measure
c. Human Listening
d. Correspondence with Biological Data
Applications
(in this section, addition to current information)[4]
Monaural CASA
(historical)
Binaural CASA
(historical)
Neural CASA Models
(current)
Analysis of Musical Audio Signals
(current)[5]
Neural Perceptual Modeling
(current)
See also
References
- ^ Wang, DeLiang, “Computational Scene Analysis”, Challenges for Computational Intelligence, Springer, Berlin, pp. 163-191, 2007.
- ^ Wang, DeLiang, “Computational Scene Analysis”, Challenges for Computational Intelligence, Springer, Berlin, pp. 163-191, 2007.
- ^ Wang, DeLiang. Computational Auditory Scene Analysis: Principles, Algorithms, And Applications. Hoboken, N.J.: Wiley interscience , 2006.
- ^ Brown, G., Cooke, M., "Computational scene analysis", Computer Speech and Language", vol. 8, pp. 297-336, 1994.
- ^ Godsmark, D., Brown, G., "A blackboard architecture for computational auditory scene analysis", Speech Communication, vol. 27, pp. 351-366, 1999.