Get desktop application:
View/edit binary Protocol Buffers messages
Interface for controlling the audio feature extractor. Next available ID: 3
Stimulus configuration containing input-specific parameters such as sampling rate, model-agnostic analysis parameters and so on.
Auditory pipeline consisting of one or more models.
Model configuration. Next available ID: 4
Used in:
Model type.
Model stage type.
Concrete model configuration.
Auditory models currently supported. Next available ID: 12
Used in:
Placeholder for an unknown (unitialized) model.
Mock basical membrane model.
Cascade of Asymmetric Resonators with Fast-Acting Compression (CARFAC). See Richard F. Lyon, "Using a Cascade of Asymmetric Resonators with Fast-Acting Compression as a Cochlear Model for Machine-Hearing Applications", Autumn Meeting of the Acoustical Society of Japan (2011), pp. 509--512.
Peripheral ear model by Frank Baumgarte. See F. Baumgarte: "Ein psychophysiologisches Gehoermodell zur Nachbildung von Wahrnehmungsschwellen fuer die Audiocodierung", PhD Dissertation, University of Hannover, 2000.
Peripheral model up to and including the hair cell by Zilany, et. al. based on: Zilany, M.S.A., Bruce, I.C., and Carney, L.H. (2014). "Updated parameters and expanded simulation options for a model of the auditory periphery," Journal of the Acoustical Society of America. Zilany, M.S.A., Bruce, I.C., Nelson, P.C., and Carney, L.H. (2009). "A phenomenological model of the synapse between the inner hair cell and auditory nerve: Long-term adaptation with power-law dynamics," Journal of the Acoustical Society of America, 126(5): 2390-2412.
Gammatone filterbank model. Two implementations are available: 1. M. Slaney (1998): "Auditory Toolbox Version 2", Technical Report #1998-010, Interval Research Corporation, 1998; 2. Ning Ma's implementation of: M. Cooke (1993): "Modelling Auditory Processing and Organisation", Cambridge University Press, Series "Distinguished Dissertations in Computer Science", August. One can select either implementation via model-specific configuration.
Inner hair cell synapse model by Ray Meddis, et. al.: - Ray Meddis (1986): "Simulation of mechanical to neural transduction in the auditory receptor", Journal of the Acoustical Society of America 79(3), 702--711. - Ray Meddis, Michael J. Hewitt, and Trevor M. Shackleton (1990): "Implementation details of a computation model of the inner hair‐cell auditory‐nerve synapse", The Journal of the Acoustical Society of America 87, 1813.
Inner hair cell synaptic model from Carney, Bruce and Zilany labs: Bruce, I.C., Erfani, Y., and Zilany, M.S.A. (2018). "A Phenomenological model of the synapse between the inner hair cell and auditory nerve: Implications of limited neurotransmitter release sites", Hearing research, 360, 40--54, (Special Issue on "Computational Models in Hearing").
Synapse spike generator model from Carney, Bruce and Zilany labs: Bruce, I.C., Erfani, Y., and Zilany, M.S.A. (2018). "A Phenomenological model of the synapse between the inner hair cell and auditory nerve: Implications of limited neurotransmitter release sites", Hearing research, 360, 40--54, (Special Issue on "Computational Models in Hearing").
Synapse spike generator model from Scott Jackson: Jackson BS, Carney LH (2005), "The spontaneous-rate histogram of the auditory nerve can be explained by only two or three spontaneous rates and long-range dependence", J. Assoc. Res. Otolaryngol. 6:148-159.
Synapse spike generator model from Zhang, et. al: Zhang, X., Heinz, M. G., Bruce, I. C., & Carney, L. H. (2001): "A phenomenological model for the responses of auditory-nerve fibers: I. Nonlinear tuning with compression and suppression.", The Journal of the Acoustical Society of America, 109(2), 648-670.
Inner hair cell synapse model by Sumner, et. al.: Sumner, C. J, Lopez-Poveda, E. A., O'Mard, L. P. and Meddis, R. (2002): "A revised model of the inner-hair cell and auditory-nerve complex.", The Journal of the Acoustical Society of America (JASA), vol.111, no.5, pp. 2178--2188.
Types of outputs the model can produce. Next available ID: 5
Placeholder for an unknown output.
Displacement of a basilar membrane in response to the pressure stimuli. This forms an excitation for the inner hair cells.
Transmembrane potential across inner hair cells (afferents).
Auditory nerve synapse: Firing rate probabilities.
Auditory nerve synapse: Spike onset times.
Next available ID: 2
Used in:
"The auditory system transforms sound waves into distinct patterns of neural activity, which are then integrated with information from other sensory systems to guide behavior, including orienting movements to acoustical stimuli and intraspecies communication. The first stage of this transformation occurs at the external and middle ears, which collect sound waves and amplify their pressure, so that the sound energy in the air can be successfully transmitted to the fluid-filled cochlea of the inner ear. In the inner ear, a series of biomechanical processes occur that break up the signal into simpler, sinusoidal components, with the result that the frequency, amplitude, and phase of the original signal are all faithfully transduced by the sensory hair cells and encoded by the electrical activity of the auditory nerve fibers. One product of this process of acoustical decomposition is the systematic representation of sound frequency along the length of the cochlea, referred to as tonotopy." From "Neuroscience", Dale Purves, et. al. (2011). The following message defines several stages in this processing pipeline. A model may implement one or more of these stages. Next available ID: 4
Used in:
Placeholder for an unknown auditory stage.
Basilar membrane.
Hair cells: inner (afferent) and outer (efferent) hair cells.
Auditory nerve (AN) synapse.
This message consists of several sub-configurations that are parsed and converted to the configuration structs required by the CARFAC API. This is cumbersome and could be avoided if CARFAC implementation used protocol buffers. Next available ID: 6
The Cascade of Asymmetric Resonators (CAR).
Inner Hair Cell (IHC) filter params.
Automatic Gain Control (AGC) parameters.
Signal types to save.
Enabling <agc_open_loop> breaks the AGC feedback loop, making the filters linear; false is the normal value, using feedback from the output level to control filter damping, thereby giving a compressive amplitude characteristic to reduce the output dynamic range.
Automatic gain control (AGC) parameters for designing AGC filters. Next available ID: 5
Used in:
If zero, the AGC is disabled.
Parameters required to design the set of coefficients implementing 'The Cascade of Asymmetric Resonators' (CAR). Next available ID: 12
Used in:
Used for the velocity nonlinearity.
The offset gives us quadratic part.
The minimum damping factor in mid-freq channels.
The maximum damping factor in mid-freq channels.
This is how far zero is above the pole.
A range from 0 to 1 to compress
theta.
Inner hair cell (IHC) parameters, which are used to design the IHC filters. Next available ID: 9
Used in:
Message with the flags indicating which signals to store. Next available ID: 5
Used in:
Store Neural Activity Patterns (NAP).
Store Basilar Membrane (BM) displacements.
Store Outer Hair Cells (OHCs) signals.
Store Adaptive Gain Control (AGC).
Next available ID: 2
Type of the filterbank.
Used in:
Malcolm Slaney's filterbank. See - M. Slaney (Apple TR #35), "An Efficient Implementation of the Patterson-Holdsworth Cochlear Filter Bank.", 33-34.
This gammatone filter is based on the implementation by Ning Ma from University of Sheffield who, in turn, based his implementation on an original algorithm from Martin Cooke's Ph.D thesis (Cooke, 1993) using the base-band impulse invariant transformation. See - http://www.dcs.shef.ac.uk/~ning/resources/gammatone
Some of the parameters may be chosen to be ignored by the particular models. For example, CARFAC uses its own adaptive computation of number of output channels, hence ignoring the <num_channels> parameters in this message. Next available ID: 14
Used in:
Input sampling rate.
Audio scaling gain. By default we scale the waveform to [-1.0,1.0] range. To these values we can also apply a gain factor. Note from Dick Lyon on CARFAC: "be aware that the -1 to 1 range, if used fully, represents very loud sound. For "normal" level, you probably want to throw in a gain of 0.01 to 0.1 on top of the mapping from int16 to that range".
Downsample the output by sampling every <n> samples. Some implementations prefer this value over customizing the output sample rate.
Store outputs from the intermediate stages of processing. For example, if the pipeline consists of basilar membrane followed by a synaptic model, the outputs of both stages are stored.
Number of channels (frequency bands) for analysis corresponding to <n> equidistant locations along the cochlea. If unset, model-specific defaults will be used.
Lowest characteristic frequency (CF) for analysis (in Hz). If unset, model-specific defaults will be used.
Highest characteristic frequency (CF) for analysis (in Hz). If unset, model-specific defaults will be used.
Resample the response: Upsampling factor <p>. The signal is resampled by <p/q>, where <q> is the downsampling factor.
Resample the response: Downsampling factor <q>. The signal is resampled by <p/q>, where <p> is the upsampling factor.
If enabled, will apply windowing function to the response. By default no windowing is applied and the response contains original number of stimulus samples.
Window (also frame) duration (in seconds).
Frame shift (in seconds). After computing each frame, advance to the next frame by this amount.
Type of the windowing function (if applying windowing).
Calcium conductance mode as defined in DSAM. Next available ID: 2
Used in:
As defined in the paper.
Cleft replenishment mode as defined in DSAM. Next available ID: 2
Used in:
As defined in the paper.
Next available ID: 7
Calcium conductance mode.
Cleft replenishment mode.
If enabled, outputs spike rates, otherwise outputs probabilities.
Maximum calcium conductance (in Siemens units).
Calcium threshold concentration.
Maximum number of transmitter packets (quanta) in free pool.
Next available ID: 5
Number of channels.
Sample rate (in Hz).
Number of bits per sample.
Samples.
Type of the windowing function. For various types of functions see "Discrete-time Signal Processing" by Alan V. Oppenheim and Ronald W. Schafer (1989). Next available ID: 3
Used in:
No windowing function to apply.
Hann window (see https://en.wikipedia.org/wiki/Hann_function).
Hamming window. See: https://en.wikipedia.org/wiki/Window_function#Hamming_window.
Next available ID: 2
Collect responses for the fiber population of the supplied size.