Home 9 Conferences 9 ISMIR 2022

ISMIR 2022

Full Proceedings

Papers

Scaling Polyphonic Transcription with Mixtures of Monophonic Transcriptions 44-51
Ian Simon, Joshua Gardner, Curtis Hawthorne, Ethan Manilow, Jesse Engel
Tailed U-Net: Multi-Scale Music Representation Learning 67-75
Marcel A Vélez Vásquez, John Ashley Burgoyne
DDSP-based Singing Vocoders: A New Subtractive-based Synthesizer and A Comprehensive Evaluation 76-83
Da-Yi Wu, Wen-Yi Hsiao, Fu-Rong Yang, Oscar D Friedman, Warren Jackson, Scott Bruzenak, Yi-Wen Liu, Yi-Hsuan Yang
YM2413-MDB: A Multi-Instrumental FM Video Game Music Dataset with Emotion Annotations 100-108
Eunjin Choi, Yoonjin Chung, Seolhee Lee, Jongik Jeon, Taegyun Kwon, Juhan Nam
Pop Music Generation with Controllable Phrase Lengths 125-131
Daiki Naruse, Tomoyuki Takahata, Yusuke Mukuta, Tatsuya Harada
Modeling the rhythm from lyrics for melody generation of pop songs 141-148
Daiyu Zhang, Ju-Chiang Wang, Katerina Kosta, Jordan B. L. Smith, Shicen Zhou
Visualization for AI-Assisted Composing 151-159
Simeon Rau, Frank Heyen, Stefan Wagner, Michael Sedlmair
Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts 186-192
Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard
Learning Hierarchical Metrical Structure Beyond Measures 201-209
Junyan Jiang, Daniel Chin, Yixiao Zhang, Gus Xia
Mid-level Harmonic Audio Features for Musical Style Classification 210-217
Francisco C. F. Almeida, Gilberto Bernardes, Christof Weiss
Distortion Audio Effects: Learning How to Recover the Clean Signal 218-225
Johannes Imort, Giorgio Fabbro, Marco A Martinez Ramirez, Stefan Uhlich, Yuichiro Koyama, Yuki Mitsufuji
End-to-End Full-Page Optical Music Recognition for Mensural Notation 226-232
Antonio Ríos-Vila, Jose M. Inesta, Jorge Calvo-Zaragoza
Mel Spectrogram Inversion with Stable Pitch 233-239
Bruno Di Giorgi, Mark Levy, Richard Sharp
Latent feature augmentation for chorus detection 240-247
Xingjian Du, Huidong Liang, Yuan Wan, Yuheng Lin, Ke Chen, Bilei Zhu, Zejun Ma
Supervised and Unsupervised Learning of Audio Representations for Music Understanding 256-263
Matthew C Mccallum, Filip Korzeniowski, Sergio Oramas, Fabien Gouyon, Andreas Ehmann
Generating Coherent Drum Accompaniment with Fills and Improvisations 264-271
Rishabh A Dahale, Vaibhav Vinayak Talwadker, Preeti Rao, Prateek Verma
Raga Classification From Vocal Performances Using Multimodal Analysis 283-290
Martin Clayton, Preeti Rao, Nithya Shikarpur, Sujoy Roychowdhury, Jin Li
Traces of Globalization in Online Music Consumption Patterns and Results of Recommendation Algorithms 291-297
Oleg Lesota, Emilia Parada-Cabaleiro, Stefan Brandl, Elisabeth Lex, Navid Rekabsaz, Markus Schedl
Network Analyses for Cross-Cultural Music Popularity 298-305
Kongmeng Liew, Vipul Mishra, Yangyang Zhou, Elena V. Epure, Romain Hennequin, Shoko Wakamiya, Eiji Aramaki
Three related corpora in Middle Byzantine music notation and a preliminary comparative analysis 306-313
Polykarpos Polykarpidis, Dionysios Kalofonos, Dimitrios Balageorgos, Christina Anagnostopoulou
Playing Technique Detection by Fusing Note Onset Information in Guzheng Performance 314-320
Dichucheng Li, Yulun Wu, Qinyu Li, Jiahao Zhao, Yi Yu, Fan Xia, Wei Li
Automatic Chinese National Pentatonic Modes Recognition Using Convolutional Neural Network 345-352
Zhaowen Wang, Mingjin Che, Yue Yang, Wen Wu Meng, Qinyu Li, Fan Xia, Wei Li
Adapting meter tracking models to Latin American music 361-368
Lucas S Maia, Martín Rocamora, Luiz W P Biscainho, Magdalena Fuentes
A Dataset for Greek Traditional and Folk Music: Lyra 377-383
Charilaos Papaioannou, Ioannis Valiantzas, Theodore Giannakopoulos, Maximos Kaliakatsos-Papakostas, Alexandros Potamianos
Performance MIDI-to-score conversion by neural beat tracking 395-402
Lele Liu, Qiuqiang Kong, Veronica Morfi, Emmanouil Benetos
Automatic music mixing with deep learning and out-of-domain data 411-418
Marco A Martinez Ramirez, Weihsiang Liao, Chihiro Nagashima, Giorgio Fabbro, Stefan Uhlich, Yuki Mitsufuji
Learning Unsupervised Hierarchies of Audio Concepts 427-436
Darius Afchar, Romain Hennequin, Vincent Guigue
ATEPP: A Dataset of Automatically Transcribed Expressive Piano Performance 446-453
Huan Zhang, Jingjing Tang, Syed Rm Rafee, Simon Dixon, George Fazekas, Geraint A. Wiggins
PDAugment: Data Augmentation by Pitch and Duration Adjustments for Automatic Lyrics Transcription 454-461
Chen Zhang, Jiaxing Yu, Luchin Chang, Xu Tan, Jiawei Chen, Tao Qin, Kejun Zhang
Parameter Sensitivity of Deep-Feature based Evaluation Metrics for Audio Textures 462-468
Chitralekha Gupta, Yize Wei, Zequn Gong, Purnima Kamath, Zhuoyao Li, Lonce Wyse
Multi-pitch Estimation meets Microphone Mismatch: Applicability of Domain Adaptation 477-484
Franca Bittner, Marcel Gonzalez, Maike L Richter, Hanna Lukashevich, Jakob Abeßer
Symphony Generation with Permutation Invariant Language Model 551-558
Jiafeng Liu, Yuanliang Dong, Zehua Cheng, Xinran Zhang, Xiaobing Li, Feng Yu, Maosong Sun
MuLan: A Joint Embedding of Music Audio and Natural Language 559-566
Qingqing Huang, Aren Jansen, Joonseok Lee, Ravi Ganti, Judith Yue Li, Daniel P W Ellis
Learning Multi-Level Representations for Hierarchical Music Structure Analysis. 591-597
Morgan Buisson, Brian Mcfee, Slim Essid, Hélène C. Crayencour Crayencour
Multi-instrument Music Synthesis with Spectrogram Diffusion 598-607
Curtis Hawthorne, Ian Simon, Adam Roberts, Neil Zeghidour, Joshua Gardner, Ethan Manilow, Jesse Engel
Contrastive Audio-Language Learning for Music 640-649
Ilaria Manco, Emmanouil Benetos, Elio Quinton, George Fazekas
MusAV: A dataset of relative arousal-valence annotations for validation of audio models 650-658
Dmitry Bogdanov, Xavier Lizarraga-Seijas, Pablo Alonso-Jiménez, Xavier Serra
Heterogeneous Graph Neural Network for Music Emotion Recognition 667-674
Angelo Cesar Mendes Da Silva, Diego F Silva, Ricardo Marcondes Marcacini
A Model You Can Hear: Audio Identification with Playable Prototypes 694-700
Romain Loiseau, Baptiste Bouvier, Yann Teytaut, Elliot Vincent, Mathieu Aubry, Loic Landrieu
Generating music with sentiment using Transformer-GANs 717-725
Pedro L T Neves, José Fornari, João B Florindo
Improving Choral Music Separation through Expressive Synthesized Data from Sampled Instruments 726-732
Ke Chen, Hao-Wen Dong, Yi Luo, Julian Mcauley, Taylor Berg-Kirkpatrick, Miller Puckette, Shlomo Dubnov
Using Activation Functions for Improving Measure-Level Audio Synchronization 749-755
Yigitcan Özer, Matej Ištvánek, Vlora Arifi-Müller, Meinard Müller
A Reproducibility Study on User-centric MIR Research and Why it is Important 764-771
Peter Knees, Bruce Ferwerda, Andreas Rauber, Sebastian Strumbelj, Annabel Resch, Laurenz Tomandl, Valentin Bauer, Fung Yee Tang, Josip Bobinac, Amila Ceranic, Riad Dizdar
Music Separation Enhancement with Generative Modeling 772-780
Noah Schaffer, Boaz Cogan, Ethan Manilow, Max Morrison, Prem Seetharaman, Bryan Pardo
Semantic Control of Generative Musical Attributes 817-824
Stewart Greenhill, Majid Abdolshah, Vuong Le, Sunil Gupta, Svetha Venkatesh
Concept-Based Techniques for “Musicologist-Friendly” Explanations in Deep Music Classifiers 876-883
Francesco Foscarin, Katharina Hoedt, Verena Praher, Arthur Flexer, Gerhard Widmer
Verse versus Chorus: Structure-aware Feature Extraction for Lyrics-based Genre Recognition 884-890
Maximilian Mayerl, Stefan Brandl, Günther Specht, Markus Schedl, Eva Zangerle
BAF: An audio fingerprinting dataset for broadcast monitoring 908-916
Guillem Cortès, Alex Ciurana, Emilio Molina, Marius Miron, Owen Meyers, Joren Six, Xavier Serra
Modeling perceptual loudness of piano tone: theory and applications 933-940
Yang Qu, Yutian Qin, Lecheng Chao, Hangkai Qian, Ziyu Wang, Gus Xia