Abe, Kobayashi, and Imai. 1995.
“Harmonics Tracking and Pitch Extraction Based on Instantaneous Frequency.” In
International Conference on Acoustics, Speech, and Signal Processing, 1995. ICASSP-95.
Benetos, Cherla, and Weyde. 2013. “An Efficient Shift-Invariant Model for Polyphonic Music Transcription.” In 6th International Workshop on Machine Learning and Music.
Bertin-Mahieux, Ellis, Whitman, et al. 2011. “The Million Song Dataset.” In 12th International Society for Music Information Retrieval Conference (ISMIR 2011).
Blackman, and Tukey. 1959. The Measurement of Power Spectra from the Point of View of Communications Engineering.
Bogert, Healy, and Tukey. 1963. “The Quefrency Alanysis of Time Series for Echoes: Cepstrum, Pseudo-Autocovariance, Cross-Cepstrum and Saphe Cracking.”
Box, Jenkins, Reinsel, et al. 2016. Time Series Analysis: Forecasting and Control. Wiley Series in Probability and Statistics.
Carabias-Orti, Virtanen, Vera-Candeas, et al. 2011.
“Musical Instrument Sound Multi-Excitation Model for Non-Negative Spectrogram Factorization.” IEEE Journal of Selected Topics in Signal Processing.
Carter. 1987.
“Coherence and Time Delay Estimation.” Proceedings of the IEEE.
Chen, and Wang. n.d. “High-Level Music Descriptor Extraction Algorithm Based on Combination of Multi-Channel CNNs and LSTM.” In Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR’2017), Suzhou, China.
Childers, Skinner, and Kemerait. 1977.
“The Cepstrum: A Guide to Processing.” Proceedings of the IEEE.
Choi, Keunwoo, and Cho. 2019. “Deep Unsupervised Drum Transcription.”
Choi, Keunwoo, Fazekas, Sandler, et al. 2017.
“Transfer Learning for Music Classification and Regression Tasks.” In
Proceeding of The 18th International Society of Music Information Retrieval (ISMIR) Conference 2017.
Choi, Jeong, Lee, Park, et al. 2019. “Zero-Shot Learning for Audio-Based Music Classification and Tagging.”
Cochran, Cooley, Favin, et al. 1967.
“What Is the Fast Fourier Transform?” Proceedings of the IEEE.
Defferrard, Benzi, Vandergheynst, et al. 2017.
“FMA: A Dataset For Music Analysis.” In
Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR’2017), Suzhou, China.
Dieleman, and Schrauwen. 2014.
“End to End Learning for Music Audio.” In
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Elbaz, and Zibulevsky. 2017.
“Perceptual Audio Loss Function for Deep Learning.” In
Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR’2017), Suzhou, China.
Flamary, Févotte, Courty, et al. 2016.
“Optimal Spectral Transportation with Application to Music Transcription.” In
arXiv:1609.09799 [Cs, Stat].
Fonseca, Pons, Favory, et al. 2017.
“Freesound Datasets: A Platform for the Creation of Open Audio Datasets.” In
Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR’2017), Suzhou, China.
Fuentes, Maia, Rocamora, et al. 2019. “Tracking Beats and Microtiming in Afro-Latin American Music Using Conditional Random Fields and Deep Learning.”
Fu, Lu, Ting, et al. 2011.
“A Survey of Audio-Based Music Classification and Annotation.” IEEE Transactions on Multimedia.
Glover, Lazzarini, and Timoney. 2009.
“Simpl: A Python Library for Sinusoidal Modelling.” In
DAFx 09 Proceedings of the 12th International Conference on Digital Audio Effects, Politecnico Di Milano, Como Campus, Sept. 1-4, Como, Italy.
Godsill, and Davy. 2005.
“Bayesian Computational Models for Inharmonicity in Musical Instruments.” In
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005.
Grosche, P., Müller, and Kurth. 2010.
“Cyclic Tempogram - a Mid-Level Tempo Representation for Music Signals.” In
2010 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP).
Grosche, Peter, Müller, and Sapp. 2010. “What Makes Beat Tracking Difficult? A Case Study on Chopin Mazurkas.” In Proceedings of the International Conference on Music Information Retrieval (ISMIR 2010).
Helmholtz. 1863. Die Lehre von Den Tonempfindungen Als Physiologische Grundlage Für Die Theorie Der Musik.
Hermes. 1988.
“Measurement of Pitch by Subharmonic Summation.” The Journal of the Acoustical Society of America.
Hershey, Chaudhuri, Ellis, et al. 2017.
“CNN Architectures for Large-Scale Audio Classification.” In
Proc. IEEE ICASSP 2017.
Hoffman, Matthew, Bach, and Blei. 2010.
“Online Learning for Latent Dirichlet Allocation.” In
Advances in Neural Information Processing Systems.
Hoffman, Matthew D, Blei, and Cook. 2010.
“Bayesian Nonparametric Matrix Factorization for Recorded Music.” In
International Conference on Machine Learning.
Irizarry. 2001.
“Local Harmonic Estimation in Musical Sound Signals.” Journal of the American Statistical Association.
Bensoam, and Roze. 2013.
“Solving Interactions Between Nonlinear Resonators.” In
Proceedings of the Sound and Music Computing Conference.
Kailath, Sayed, and Hassibi. 2000. Linear Estimation. Prentice Hall Information and System Sciences Series.
Kalouptsidis, Mileounis, Babadi, et al. 2011.
“Adaptive Algorithms for Sparse System Identification.” Signal Processing.
Lahat, Niederjohn, and Krubsack. 1987.
“A Spectral Autocorrelation Method for Measurement of the Fundamental Frequency of Noise-Corrupted Speech.” IEEE Transactions on Acoustics, Speech and Signal Processing.
Lattner, Dörfler, and Arzt. 2019.
“Learning Complex Basis Functions for Invariant Representations of Audio.” In
Proceedings of the 20th Conference of the International Society for Music Information Retrieval.
Lattner, and Grachten. 2019.
“High-Level Control of Drum Track Generation Using Learned Patterns of Rhythmic Interaction.” In
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2019).
Ljung. 1999. System Identification: Theory for the User. Prentice Hall Information and System Sciences Series.
Luo, Yin-Jyun, Agres, and Herremans. 2019.
“Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Variational Autoencoders.” In
Proceedings of the 20th Conference of the International Society for Music Information Retrieval.
MacKinlay, and Botev. 2019.
“Mosaic Style Transfer Using Sparse Autocorrelograms.” In
Proceedings of the 20th Conference of the International Society for Music Information Retrieval.
Makhoul, Kubala, Schwartz, et al. 1999. “Performance Measures For Information Extraction.” In Proceedings of DARPA Broadcast News Workshop.
Maxwell, Pasquier, and Whitman. 2009.
“Hierarchical Sequential Memory for Music: A Cognitive Model.” In
Proceedings of the Tenth International Society for Music Information Retrieval Conference (ISMIR 2009).
McFee, and Ellis. 2011.
“Analyzing Song Structure with Spectral Clustering.” In
IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Mesaros, Heittola, and Virtanen. 2016.
“Metrics for Polyphonic Sound Event Detection.” Applied Sciences.
Moorer. 1974.
“The Optimum Comb Method of Pitch Period Analysis of Continuous Digitized Speech.” IEEE Transactions on Acoustics, Speech and Signal Processing.
Müller, Meinard, and Driedger. 2012.
“Data-Driven Sound Track Generation.” In
Multimodal Music Processing.
Müller, M., Ellis, Klapuri, et al. 2011.
“Signal Processing for Music Analysis.” IEEE Journal of Selected Topics in Signal Processing.
Noll. 1967.
“Cepstrum Pitch Determination.” The Journal of the Acoustical Society of America.
Oppenheim, and Schafer. 2004.
“From Frequency to Quefrency: A History of the Cepstrum.” IEEE Signal Processing Magazine.
Parncutt. 1997.
“A Model of the Perceptual Root(s) of a Chord Accounting for Voicing and Prevailing Tonality.” In
Music, Gestalt, and Computing. Lecture Notes in Computer Science 1317.
Pati, Lerch, and Hadjeres. 2019.
“Learning to Traverse Latent Spaces for Musical Score Inpainting.” In
Proceedings of the 20th Conference of the International Society for Music Information Retrieval.
Paulus, Müller, and Klapuri. 2010.
“Audio-Based Music Structure Analysis.” In
ISMIR.
Plomp, and Levelt. 1965.
“Tonal Consonance and Critical Bandwidth.” The Journal of the Acoustical Society of America.
Pons, Lidy, and Serra. 2016.
“Experimenting with Musically Motivated Convolutional Neural Networks.” In
2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI).
Robertson, A. N., and Plumbley. 2006.
“Real-Time Interactive Musical Systems: An Overview.” Proc. Of the Digital Music Research Network, Goldsmiths University, London.
Robertson, Andrew, and Plumbley. 2007.
“B-Keeper: A Beat-Tracker for Live Performance.” In
Proceedings of the 7th International Conference on New Interfaces for Musical Expression. NIME ’07.
Robertson, Andrew, and Plumbley. 2013.
“Synchronizing Sequencing Software to a Live Drummer.” Computer Music Journal.
Robertson, Andrew, Stark, and Davies. 2013.
“Percussive Beat Tracking Using Real-Time Median Filtering.” In
Proceedings of European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases.
Robertson, Andrew, Stark, and Plumbley. 2011.
“Real-Time Visual Beat Tracking Using a Comb Filter Matrix.” In
Proceedings of the International Computer Music Conference 2011.
Rochebois, and Charbonneau. 1997.
“Cross-Synthesis Using Interverted Principal Harmonic Sub-Spaces.” In
Music, Gestalt, and Computing. Lecture Notes in Computer Science 1317.
Salamon, Serrà, and Gómez. 2013.
“Tonal Representations for Music Retrieval: From Version Identification to Query-by-Humming.” International Journal of Multimedia Information Retrieval.
Schlüter, and Böck. 2014.
“Improved Musical Onset Detection with Convolutional Neural Networks.” In
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Schmidt, and Kim. 2011.
“Learning Emotion-Based Acoustic Features with Deep Belief Networks.” In
2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA).
Scholler, and Purwins. 2011.
“Sparse Approximations for Drum Sound Classification.” IEEE Journal of Selected Topics in Signal Processing.
Smith, Evan C., and Lewicki. 2004.
“Learning Efficient Auditory Codes Using Spikes Predicts Cochlear Filters.” In
Advances in Neural Information Processing Systems.
Smith, Evan C., and Lewicki. 2006.
“Efficient Auditory Coding.” Nature.
Smyth, and Elmore. 2009.
“Explorations in Convolutional Synthesis.” In
Proceedings of the 6th Sound and Music Computing Conference, Porto, Portugal.
Southall, Wu, Lerch, et al. 2017.
“MDB Drums — An Annotated Subset of MedleyDB for Automatic Drum Transcription.” In
Late Breaking Demo (Extended Abstract), Proceedings of the International Society for Music Information Retrieval Conference (ISMIR).
Terhardt, Stoll, and Seewann. 1982.
“Algorithm for Extraction of Pitch and Pitch Salience from Complex Tonal Signals.” The Journal of the Acoustical Society of America.
Thickstun, Harchaoui, Foster, et al. 2017. “MIREX 2017: Frequency Domain Convolutions for Multiple F0 Estimation.”
Thickstun, Harchaoui, Foster, et al. 2018.
“Invariances and Data Augmentation for Supervised Music Transcription.” In
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Thickstun, Harchaoui, and Kakade. 2017.
“Learning Features of Music from Scratch.” In
Proceedings of International Conference on Learning Representations (ICLR) 2017.
Venkataramani, and Smaragdis. 2017.
“End to End Source Separation with Adaptive Front-Ends.” arXiv:1705.02514 [Cs].
Wu, and Lerch. 2017.
“Automatic Drum Transcription Using the Student-Teacher Learning Paradigm with Unlabeled Music Data.” In
Proceedings of the International Society for Music Information Retrieval Conference (ISMIR).
Yang, Chou, and Yang. 2017.
“MidiNet: A Convolutional Generative Adversarial Network for Symbolic-Domain Music Generation.” In
Proceedings of the 18th International Society for Music Information Retrieval Conference (ISMIR’2017), Suzhou, China.