ISMIR 2021

Full Proceedings

Proceedings of the 22nd International Society for Music Information Retrieval Conference
Online, Nov 7-12, 2021 (ISBN: 978-1-7327299-0-2)

Papers

Four-way Classification of Tabla Strokes with Models Adapted from Automatic Drum Transcription 19-26
Rohit M A, Amitrajit Bhattacharjee, Preeti Rao

A Contextual Latent Space Model: Subsequence Modulation in Melodic Sequence 27-34
Taketo Akama

OMR-assisted transcription: a case study with early prints 35-41
María Alfaro-Contreras, David Rizo, Jose M. Inesta, Jorge Calvo-Zaragoza

Deeper Convolutional Neural Networks and Broad Augmentation Policies Improve Performance in Musical Key Estimation 42-49
Stefan A Baumann

The Music Performance Markup Format and Ecosystem 50-57
Axel Berndt

Identification of rhythm guitar sections in symbolic tablatures 58-65
Louis Bigo, David Regnier, Nicolas Martin

On-Line Audio-to-Lyrics Alignment Based on a Reference Performance 66-73
Charles Brazier, Gerhard Widmer

Visualizing Intertextual Form with Arc Diagrams: Contour and Schema-based Methods 74-80
Aaron Carter-Enyi, Gilad Rabinovitch, Nathaniel Condit-Schultz

Unsupervised Domain Adaptation for Document Analysis of Music Score Images 81-87
Francisco J. Castellanos, Antonio-Javier Gallego, Jorge Calvo-Zaragoza

Codified audio language modeling learns useful representations for music information retrieval 88-96
Rodrigo Castellon, Chris Donahue, Percy Liang

Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding 97-104
Chin-Jui Chang, Chun-Yi Lee, Yi-Hsuan Yang

SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours 105-112
Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang

Semi-supervised violin fingering generation using variational autoencoders 113-120
Vincent K.M. Cheung, Hsuan-Kai Kao, Li Su

Listen, Read, and Identify: Multimodal Singing Language Identification of Music 121-127
Keunwoo Choi, Yuxuan Wang

On Perceived Emotion in Expressive Piano Performance: Further Experimental Evidence for the Relevance of Mid-level Perceptual Features 128-134
Shreyan Chowdhury, Gerhard Widmer

Cosine Contours: a Multipurpose Representation for Melodies 135-142
Bas Cornelissen, Willem Zuidema, John Ashley Burgoyne

Controllable deep melody generation via hierarchical music structure representation 143-150
Shuqi Dai, Zeyu Jin, Celso Gomes, Roger Dannenberg

MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription 151-158
Emir Demirel, Sven Ahlbäck, Simon Dixon

Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music 159-166
Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick, Julian McAuley

An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition 167-173
Sachinda Edirisooriya, Hao-Wen Dong, Julian McAuley, Taylor Berg-Kirkpatrick

A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration 174-181
Anders Elowsson, Olivier Lartillot

Building the MetaMIDI Dataset: Linking Symbolic and Audio Musical Data 182-188
Jeffrey Ens, Philippe Pasquier

Modeling and Inferring Proto-Voice Structure in Free Polyphony 189-196
Christoph Finkensiep, Martin A Rohrmeier

PKSpell: Data-Driven Pitch Spelling and Key Signature Estimation 197-204
Francesco Foscarin, Nicolas Audebert, Raphael Fournier-S’Niehotta

Filosax: A Dataset of Annotated Jazz Saxophone Recordings 205-212
Dave Foster, Simon Dixon

An interpretable music similarity measure based on path interestingness 213-219
Giovanni Gabbolini, Derek Bridge

Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition 220-228
Hugo F Flores Garcia, Aldo Aguilar, Ethan Manilow, Bryan Pardo

What if the ‘When’ Implies the ‘What’?: Human harmonic analysis datasets clarify the relative role of the separate steps in automatic tonal analysis 229-236
Mark R H Gotham, Rainer Kleinertz, Christof Weiss, Meinard Müller, Stephanie Klauk

Let’s agree to disagree: Consensus Entropy Active Learning for Personalized Music Emotion Recognition 237-245
Juan S. Gómez-Cañón, Estefania Cano, Yi-Hsuan Yang, Perfecto Herrera, Emilia Gomez

Sequence-to-Sequence Piano Transcription with Transformers 246-253
Curtis Hawthorne, Ian Simon, Rigel Swavely, Ethan Manilow, Jesse Engel

Neural Waveshaping Synthesis 254-261
Ben Hayes, Charalampos Saitis, Gyorgy Fazekas

A semi-automated workflow paradigm for the distributed creation and curation of expert annotations 262-269
Johannes Hentschel, Fabian C. Moss, Markus Neuwirth, Martin A Rohrmeier

BeatNet: CRNN and Particle Filtering for Online Joint Beat, Downbeat and Meter Tracking 270-277
Mojtaba Heydari, Frank Cwitkowitz, Zhiyao Duan

Joint Estimation of Note Values and Voices for Audio-to-Score Piano Transcription 278-284
Yuki Hiramatsu, Eita Nakamura, Kazuyoshi Yoshii

Learning note-to-note affinity for voice segregation and melody line identification of symbolic music data 285-292
Yo-Wei Hsiao, Li Su

VOCANO: A note transcription framework for singing voice in polyphonic music 293-300
Jui-Yang Hsu, Li Su

De-centering the West: East Asian Philosophies and the Ethics of Applying Artificial Intelligence to Music 301-309
Rujing Huang, Bob L. T. Sturm, Andre Holzapfel

A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset 310-317
Tun Min Hung, Bo-Yu Chen, Yen Tung Yeh, Yi-Hsuan Yang

EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation 318-325
Hsiao-Tzu Hung, Joann Ching, Seungheon Doh, Nabin Kim, Juhan Nam, Yi-Hsuan Yang

Piano Sheet Music Identification Using Marketplace Fingerprinting 326-333
Kevin Ji, Daniel Yang, Timothy Tsai

Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss 334-341
Keunhyoung Kim, Jongpil Lee, Sangeun Kum, Juhan Nam

Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation 342-349
Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang

Artist Similarity Using Graph Neural Networks 350-357
Filip Korzeniowski, Sergio Oramas, Fabien Gouyon

“Finding Home”: Understanding How Music Supports Listeners’ Mental Health through a Case Study of BTS 358-365
Jin Ha Lee, Arpita Bhattacharya, Ria Antony, Nicole Santero, Anh Le

Cross-cultural Mood Perception in Pop Songs and its Alignment with Mood Detection Algorithms 366-373
Harin Lee, Frank Höger, Marc Schönwiesner, Minsu Park, Nori Jacoby

Reconsidering quantization in MIR 374-380
Jordan Lenchitz

A unified model for zero-shot music source separation, transcription and synthesis 381-388
Liwei Lin, Gus Xia, Qiuqiang Kong, Junyan Jiang

Pitch-Informed Instrument Assignment using a Deep Convolutional Network with Multiple Kernel Shapes 389-395
Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlbäck

SpecTNT: a Time-Frequency Transformer for Music Audio 396-403
Wei-Tsung Lu, Ju-Chiang Wang, Minz Won, Keunwoo Choi, Xuchen Song

AugmentedNet: A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal Tasks 404-411
Néstor Nápoles López, Mark R H Gotham, Ichiro Fujinaga

MINGUS: Melodic Improvisation Neural Generator Using Seq2Seq 412-419
Vincenzo Madaghiele, Pasquale Lisena, Raphael Troncy

User-centered evaluation of lyrics-to-audio alignment 420-427
Ninon Lizé Masclef, Andrea Vaglio, Manuel Moussallam

Synthesizer Sound Matching with Differentiable DSP 428-434
Naotake Masuda, Daisuke Saito

A Modular System for the Harmonic Analysis of Musical Scores using a Large Vocabulary 435-442
Andrew McLeod, Martin A Rohrmeier

A deep learning method for enforcing coherence in Automatic Chord Recognition 443-451
Gianluca Micchi, Katerina Kosta, Gabriele Medeot, Pierre Chanquion

Modeling beat uncertainty as a 2D distribution of period and phase: a MIR task proposal 452-459
Martin A Miguel, Diego Fernandez Slezak

A case study of deep enculturation and sensorimotor synchronization to real music 460-467
Olof Misgeld, Torbjörn L Gulz, Jūra Miniotaitė, Andre Holzapfel

Symbolic Music Generation with Diffusion Models 468-475
Gautam Mittal, Jesse Engel, Curtis Hawthorne, Ian Simon

Learning from Musical Feedback with Sonic the Hedgehog 476-483
Faraaz Nadeem

DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis With GANs 484-492
Javier Nistal, Stefan Lattner, Gaël Richard

Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure 493-499
Takehisa Oyama, Ryoto Ishizuka, Kazuyoshi Yoshii

Agreement Among Human and Automated Transcriptions of Global Songs 500-508
Yuto Ozaki, John M McBride, Emmanouil Benetos, Peter Pfordresher, Joren Six, Adam Tierney, Polina Proutskova, Emi Sakai, Haruka Kondo, Haruno Fukatsu, Shinya Fujii, Patrick E. Savage

Automatic Recognition of Texture in Renaissance Music 509-516
Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Bjorn W. Schuller, Markus Schedl

Is Disentanglement enough? On Latent Representations for Controllable Music Generation 517-524
Ashis Pati, Alexander Lerch

Pulse clarity metrics developed from a deep learning beat tracking model 525-530
Nicolás Pironio, Diego Fernandez Slezak, Martin A Miguel

On the Veracity of Local, Model-agnostic Explanations in Audio Classification: Targeted Investigations with Adversarial Examples 531-538
Verena Praher, Katharina Prinz, Arthur Flexer, Gerhard Widmer

Is there a “language of music-video clips”? A qualitative and quantitative study 539-546
Laure Prétet, Gaël Richard, Geoffroy Peeters

Tabla Gharana Recognition from Audio music recordings of Tabla Solo performances 547-554
Gowriprasad R, Venkatesh V, Hema A Murthy, R Aravind, Sri Rama Murty K

Navigating noise: Modeling perceptual correlates of noise-related semantic timbre categories with audio features 555-561
Lindsey Reymore, Emmanuelle Beauvais-Lacasse, Bennett Smith, Stephen McAdams

Quantitative User Perceptions of Music Recommendation List Diversity 562-568
Kyle Robinson, Dan Brown

A Formal Model of Extended Tonal Harmony 569-578
Martin A Rohrmeier, Fabian C. Moss

CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis 579-585
Simon Rouard, Gaëtan Hadjeres

Curriculum Learning for Imbalanced Classification in Large Vocabulary Automatic Chord Recognition 586-593
Luke O Rowe, George Tzanetakis

Deep Embeddings and Section Fusion Improve Music Segmentation 594-601
Justin Salamon, Oriol Nieto, Nicholas J. Bryan

Multi-Task Learning of Graph-based Inductive Representations of Music Content 602-609
Antonia Saravanou, Federico Tomasi, Rishabh Mehrotra, Mounia Lalmas

DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models 610-617
Pedro Pereira Sarmento, Adarsh Kumar, CJ Carr, Zack Zukowski, Mathieu Barthet, Yi-Hsuan Yang

Does Track Sequence in User-generated Playlists Matter? 618-625
Harald Victor Schweiger, Emilia Parada-Cabaleiro, Markus Schedl

A Differentiable Cost Measure for Intonation Processing in Polyphonic Music 626-633
Simon J Schwär, Sebastian Rosenzweig, Meinard Müller

Improving Music Performance Assessment With Contrastive Learning 634-641
Pavan M Seshadri, Alexander Lerch

Tracing Affordance and Item Adoption on Music Streaming Platforms 642-649
Dougal Shakespeare, Camille Roth

Computational analysis and modeling of expressive timing in Chopin’s Mazurkas 650-656
Zhengshan Shi

Computational analysis of melodic mode switching in raga performance 657-664
Nithya Nadig Shikarpur, Asawari Keskar, Preeti Rao

SinTra: Learning an inspiration model from a single multi-track music segment 665-672
Qingwei Song, Qiwei Sun, Dongsheng Guo, Haiyong Zheng

Contrastive Learning of Musical Representations 673-681
Janne Spijkervet, John Ashley Burgoyne

Musical Tempo Estimation Using a Multi-scale Network 682-689
Xiaoheng Sun, Qiqi He, Gao Yongwei, Wei Li

On the Integration of Language Models into Sequence to Sequence Architectures for Handwritten Music Recognition 690-696
Pau Torras, Arnau Baró, Lei Kang, Alicia Fornés

Kiite Cafe: A Web Service for Getting Together Virtually to Listen to Music 697-704
Kosetsu Tsukuda, Keisuke Ishida, Masahiro Hamasaki, Masataka Goto

Toward an Understanding of Lyrics-viewing Behavior While Listening to Music on a Smartphone 705-713
Kosetsu Tsukuda, Masahiro Hamasaki, Masataka Goto

The Words Remain the Same: Cover Detection with Lyrics Transcription 714-721
Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gael Richard

MuseBERT: Pre-training Music Representation for Music Understanding and Controllable Generation 722-729
Ziyu Wang, Gus Xia

Supervised Metric Learning For Music Structure Features 730-737
Ju-Chiang Wang, Jordan B. L. Smith, Wei-Tsung Lu, Xuchen Song

Learning long-term music representations via hierarchical contextual constraints 738-745
Shiqi Wei, Gus Xia

Learning Pitch-Class Representations from Score-Audio Pairs of Classical Music 746-753
Christof Weiss, Johannes Zeitler, Tim Zunner, Florian Schuberth, Meinard Müller

Training Deep Pitch-Class Representations With a Multi-Label CTC Loss 754-761
Christof Weiss, Geoffroy Peeters

Audio Defect Detection in Music with Deep Networks 762-768
Daniel Wolff, Remi Mignot, Axel Roebel

Semi-supervised Music Tagging Transformer 769-776
Minz Won, Keunwoo Choi, Xavier Serra

Emotion Embedding Spaces for Matching Music to Stories 777-785
Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham Mysore, Xavier Serra

CollageNet: Fusing arbitrary melody and accompaniment into a coherent song 786-793
Abudukelimu Wuerkaixi, Christodoulos Benetatos, Zhiyao Duan, Changshui Zhang

Human-in-the-Loop Adaptation for Interactive Musical Beat Tracking 794-801
Kazuhiko Yamamoto

Composer Classification With Cross-Modal Transfer Learning and Musically-Informed Augmentation 802-809
Daniel Yang, Timothy Tsai

Aligning Unsynchronized Part Recordings to a Full Mix Using Iterative Subtractive Alignment 810-817
Daniel Yang, Kevin Ji, Timothy Tsai

ADTOF: A large dataset of non-synthetic music for automatic drum transcription 818-824
Mickael Zehren, Marco Alunno, Paolo Bientinesi

Learn by Referencing: Towards Deep Metric Learning for Singing Assessment 825-832
Huan Zhang, Yiliang Jiang, Tao Jiang, Hu Peng

AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer 833-840
Jingwei Zhao, Gus Xia
