Home 9 Conferences 9 ISMIR 2024

ISMIR 2024

Full Proceedings

Proceedings of the 25th International Society for Music Information Retrieval Conference
San Francisco, California, USA and Online, November 10-14, 2024 (ISBN: 978-1-7327299-4-0)

Download pdf

Papers

Formal Modeling of Structural Repetition Using Tree Compression 53-60
Zeng Ren, Yannis Rammos, Martin A. Rohrmeier

Download pdf

Saraga Audiovisual: A Large Multimodal Open Data Collection for the Analysis of Carnatic Music 61-69
Adithi Shankar, Genís Plaja-Roglans, Thomas Nuttall, Martín Rocamora, Xavier Serra

Download pdf

X-Cover: Better Music Version Identification System by Integrating Pretrained ASR Model 70-77
Xingjian Du, Mingyu Liu, Pei Zou, Xia Liang, Zijie Wang, Huidong Liang, Bilei Zhu

Download pdf

Harmonic and Transposition Constraints Arising From the Use of the Roland TR-808 Bass Drum 78-85
Emmanuel Deruty

Download pdf

FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs 86-94
Hitoshi Suda, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, Jun Ogata

Download pdf

Classical Guitar Duet Separation Using GuitarDuets – A Dataset of Real and Synthesized Guitar Recordings 95-102
Marios Glytsos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos

Download pdf

Can LLMs “Reason” in Music? an Evaluation of LLMs’ Capability of Music Understanding and Generation 103-110
Ziya Zhou, Yuhang Wu, Zhiyue Wu, Xinyue Zhang, Ruibin Yuan, Yinghao Ma, Lu Wang, Emmanouil Benetos, Wei Xue, Yike Guo

Download pdf

Music2Latent: Consistency Autoencoders for Latent Audio Compression 111-119
Marco Pasini, Stefan Lattner, George Fazekas

Download pdf

Robust and Accurate Audio Synchronization Using Raw Features From Transcription Models 120-127
Johannes Zeitler, Ben Maman, Meinard Müller

Download pdf

Harnessing the Power of Distributions: Probabilistic Representation Learning on Hypersphere for Multimodal Music Information Retrieval 128-136
Takayuki Nakatsuka, Masahiro Hamasaki, Masataka Goto

Download pdf

Towards Automated Personal Value Estimation in Song Lyrics 137-145
Andrew M. Demetriou, Jaehun Kim, Sandy Manolios, Cynthia Liem

Download pdf

Audio Conditioning for Music Generation via Discrete Bottleneck Features 146-153
Simon Rouard, Yossi Adi, Jade Copet, Axel Roebel, Alexandre Defossez

Download pdf

Variation Transformer: New Datasets, Models, and Comparative Evaluation for Symbolic Music Variation Generation 154-163
Chenyu Gao, Federico Reuben, Tom Collins

Download pdf

Automatic Detection of Moral Values in Music Lyrics 164-172
Vjosa Preniqi, Iacopo Ghinassi, Julia Ive, Kyriaki Kalimeri, Charalampos Saitis

Download pdf

Semi-Supervised Piano Transcription Using Pseudo-Labeling Techniques 173-181
Sebastian Strahl, Meinard Müller

Download pdf

Note-Level Transcription of Choral Music 182-188
Huiran Yu, Zhiyao Duan

Download pdf

Learning Multifaceted Self-Similarity Over Time and Frequency for Music Structure Analysis 189-197
Tsung-Ping Chen, Kazuyoshi Yoshii

Download pdf

A Contrastive Self-Supervised Learning Scheme for Beat Tracking Amenable to Few-Shot Learning 198-206
Antonin Gagneré, Slim Essid, Geoffroy Peeters

Download pdf

Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis 207-214
Morgan Buisson, Brian McFee, Slim Essid

Download pdf

Six Dragons Fly Again: Reviving 15th-Century Korean Court Music With Transformers and Novel Encoding 217-224
Danbinaerin Han, Mark R. H. Gotham, DongMin Kim, Hannah Park, Sihun Lee, Dasaem Jeong

Download pdf

Lessons Learned From a Project to Encode Mensural Music on a Large Scale With Optical Music Recognition 225-231
David Rizo, Jorge Calvo-Zaragoza, Patricia García-Iasci, Teresa Delgado-Sánchez

Download pdf

The Changing Sound of Music: An Exploratory Corpus Study of Vocal Trends Over Time 232-239
Elena Georgieva, Pablo Ripollés, Brian McFee

Download pdf

Music Proofreading With RefinPaint: Where and How to Modify Compositions Given Context 240-247
Pedro Ramoneda, Martín Rocamora, Taketo Akama

Download pdf

Notewise Evaluation for Music Source Separation: A Case Study for Separated Piano Tracks 248-255
Yigitcan Özer, Hans-Ulrich Berendes, Vlora Arifi-Müller, Fabian-Robert Stöter, Meinard Müller

Download pdf

Automatic Estimation of Singing Voice Musical Dynamics 256-263
Jyoti Narang, Nazif Can Tamer, Viviana De La Vega, Xavier Serra

Download pdf

Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation 264-271
Or Tal, Alon Ziv, Itai Gat, Felix Kreuk, Yossi Adi

Download pdf

Diff-a-Riff: Musical Accompaniment Co-Creation via Latent Diffusion Models 272-280
Javier Nistal, Marco Pasini, Cyran Aouameur, Maarten Grachten, Stefan Lattner

Download pdf

Exploring Internet Radio Across the Globe With the MIRAGE Online Dashboard 281-287
Ngan V.T. Nguyen, Elizabeth Acosta, Tommy Dang, David Sears

Download pdf

MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling 288-294
Andrew C. Edwards, Xavier Riley, Pedro Pereira Sarmento, Simon Dixon

Download pdf

Transcription-Based Lyrics Embeddings: Simple Extraction of Effective Lyrics Embeddings From Audio 295-303
Jaehun Kim, Florian Henkel, Camilo Landau, Samuel E. Sandberg, Andreas F. Ehmann

Download pdf

A Method for MIDI Velocity Estimation for Piano Performance by a U-Net With Attention and FiLM 304-310
Hyon Kim, Xavier Serra

Download pdf

MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation 311-318
Yun-Han Lan, Wen-Yi Hsiao, Hao-Chung Cheng, Yi-Hsuan Yang

Download pdf

End-to-End Piano Performance-MIDI to Score Conversion With Transformers 319-326
Tim Beyer, Angela Dai

Download pdf

From Real to Cloned Singer Identification 327-334
Dorian Desblancs, Gabriel Meseguer-Brocal, Romain Hennequin, Manuel Moussallam

Download pdf

Emotion-Driven Piano Music Generation via Two-Stage Disentanglement and Functional Representation 335-342
Jingyue Huang, Ke Chen, Yi-Hsuan Yang

Download pdf

Efficient Adapter Tuning for Joint Singing Voice Beat and Downbeat Tracking With Self-Supervised Learning Features 343-351
Jiajun Deng, Yaolong Ju, Jing Yang, Simon Lui, Xunying Liu

Download pdf

Which Audio Features Can Predict the Dynamic Musical Emotions of Both Composers and Listeners? 352-359
Eun Ji Oh, Hyunjae Kim, Kyung Myun Lee

Download pdf

Exploring Musical Roots: Applying Audio Embeddings to Empower Influence Attribution for a Generative Music Model 360-368
Julia Barnett, Hugo Flores García, Bryan Pardo

Download pdf

Green MIR? Investigating Computational Cost of Recent Music-Ai Research in ISMIR 371-380
Andre Holzapfel, Anna-Kaisa Kaila, Petra Jääskeläinen

Download pdf

Field Study on Children’s Home Piano Practice: Developing a Comprehensive System for Enhanced Student-Teacher Engagement 381-388
Seikoh Fukuda, Yuko Fukuda, Masamichi Hosoda, Ami Motomura, Eri Sasao, Masaki Matsubara, Masahiro Niitsuma

Download pdf

Inner Metric Analysis as a Measure of Rhythmic Syncopation 389-396
Brian Bemman, Justin Christensen

Download pdf

HAISP: A Dataset of Human-AI Songwriting Processes From the AI Song Contest 397-404
Lidia J. Morris, Rebecca Leger, Michele Newman, John Ashley Burgoyne, Ryan Groves, Natasha Mangal, Jin Ha Lee

Download pdf

Cue Point Estimation Using Object Detection 405-412 2
Giulia Argüello, Luca A. Lanzendörfer, Roger Wattenhofer

Download pdf

The ListenBrainz Listens Dataset 413-419
Kartik Ohri, Robert Kaye

Download pdf

SpecMaskGIT: Masked Generative Modeling of Audio Spectrogram for Efficient Audio Synthesis and Beyond 420-428
Marco Comunità, Zhi Zhong, Akira Takahashi, Shiqi Yang, Mengjie Zhao, Koichi Saito, Yukara Ikemiya, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji

Download pdf

Long-Form Music Generation With Latent Diffusion 429-437
Zach Evans, Julian D. Parker, CJ Carr, Zachary Zukowski, Josiah Taylor, Jordi Pons

Download pdf

Composer’s Assistant 2: Interactive Multi-Track MIDI Infilling With Fine-Grained User Control 438-445
Martin E. Malandro

Download pdf

Towards Zero-Shot Amplifier Modeling: One-to-Many Amplifier Modeling via Tone Embedding Control 446-453
Yu-Hua Chen, Yen-Tung Yeh, Yuan-Chiao Cheng , Jui-Te Wu, Yu-Hsiang Ho, Jyh-Shing Roger Jang, Yi-Hsuan Yang

Download pdf

Mel-RoFormer for Vocal Separation and Vocal Melody Transcription 454-461
Ju-Chiang Wang, Wei-Tsung Lu, Jitong Chen

Download pdf

Unsupervised Synthetic-to-Real Adaptation for Optical Music Recognition 462-469
Noelia N. Luna-Barahona, Adrián Roselló, María Alfaro-Contreras, David Rizo, Jorge Calvo-Zaragoza

Download pdf

MMT-BERT: Chord-Aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT 470-477
Jinlong Zhu, Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama

Download pdf

Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata 478-485
Recep Oguz Araz, Xavier Serra, Dmitry Bogdanov

Download pdf

Who’s Afraid of the `Artyfyshall Byrd’? Historical Notions and Current Challenges of Musical Artificiality 486-492
Nicholas Cornia, Bruno Forment

Download pdf

End-to-End Automatic Singing Skill Evaluation Using Cross-Attention and Data Augmentation for Solo Singing and Singing With Accompaniment 493-500
Yaolong Ju, Chun Yat Wu, Betty Cortiñas Lorenzo, Jing Yang, Jiajun Deng, Fan Fan, Simon Lui

Download pdf

Cluster and Separate: A GNN Approach to Voice and Staff Prediction for Score Engraving 503-510
Francesco Foscarin, Emmanouil Karystinaios, Eita Nakamura, Gerhard Widmer

Download pdf

From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano 511-519
Huan Zhang, Jinhua Liang, Simon Dixon

Download pdf

Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-Efficient Approach 520-528
Pedro Ramoneda, Vsevolod E. Eremenko, Alexandre D’Hooge, Emilia Parada-Cabaleiro, Xavier Serra

Download pdf

Purposeful Play: Evaluation and Co-Design of Casual Music Creation Applications With Children 529-539
Michele Newman, Lidia J. Morris, Jun Kato, Masataka Goto, Jason Yip, Jin Ha Lee

Download pdf

El Bongosero: A Crowd-Sourced Symbolic Dataset of Improvised Hand Percussion Rhythms Paired With Drum Patterns 540-546
Nicholas Evans, Behzad Haki, Daniel Gómez-Marín, Sergi Jordà

Download pdf

Utilizing Listener-Provided Tags for Music Emotion Recognition: A Data-Driven Approach 547-554
Joanne Affolter, Martin A. Rohrmeier

Download pdf

PiCoGen2: Piano Cover Generation With Transfer Learning Approach and Weakly Aligned Data 555-562
Chih-Pin Tan, Hsin Ai, Yi-Hsin Chang, Shuen-Huei Guan, Yi-Hsuan Yang

Download pdf

Diff-MST: Differentiable Mixing Style Transfer 563-570
Soumya Sai Vanka, Christian J. Steinmetz, Jean-Baptiste Rolland, Joshua D. Reiss, George Fazekas

Download pdf

Semi-Supervised Contrastive Learning of Musical Representations 571-579
Julien PM Guinot, Elio Quinton, George Fazekas

Download pdf

Improved Symbolic Drum Style Classification With Grammar-Based Hierarchical Representations 580-587
Léo Géré, Nicolas Audebert, Philippe Rigaux

Download pdf

Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation 588-595
Jiwoo Ryu, Hao-Wen Dong, Jongmin Jung, Dasaem Jeong

Download pdf

Continual Learning for Music Classification 596-602
Pedro González-Barrachina, María Alfaro-Contreras, Jorge Calvo-Zaragoza

Download pdf

TheGlueNote: Learned Representations for Robust and Flexible Note Alignment 603-610
Silvan Peter, Gerhard Widmer

Download pdf

GAPS: A Large and Diverse Classical Guitar Dataset and Benchmark Transcription Model 611-617
Xavier Riley, Zixun Guo, Andrew C. Edwards, Simon Dixon

Download pdf

A Kalman Filter Model for Synchronization in Musical Ensembles 618-624
Hugo T. Carvalho, Min Susan Li, Massimiliano Di Luca, Alan M. Wing

Download pdf

Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation 625-633
Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Michael Anslow, Geoffroy Peeters

Download pdf

Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music With Lightweight Finetuning 634-641
Fang Duo Tsai, Shih-Lun Wu, Haven Kim, Bo-Yu Chen, Hao-Chung Cheng, Yi-Hsuan Yang

Download pdf

MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing 642-650
Shangda Wu, Yashan Wang, Xiaobing Li, Feng Yu, Maosong Sun

Download pdf

GraphMuse: A Library for Symbolic Music Graph Processing 651-658
Emmanouil Karystinaios, Gerhard Widmer

Download pdf

ST-ITO: Controlling Audio Effects for Style Transfer With Inference-Time Optimization 661-668
Christian J. Steinmetz, Shubhr Singh, Marco Comunità, Ilias Ibnyahya, Shanxin Yuan, Emmanouil Benetos, Joshua D. Reiss

Download pdf

ComposerX: Multi-Agent Symbolic Music Composition With LLMs 669-679
Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang , Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo

Download pdf

Do Music Generation Models Encode Music Theory? 680-687
Megan Wei, Michael Freeman, Chris Donahue, Chen Sun

Download pdf

PolySinger: Singing-Voice to Singing-Voice Translation From English to Japanese 688-696
Silas Antonisen, Iván López-Espejo

Download pdf

On the Validity of Employing ChatGPT for Distant Reading of Music Similarity 697-704
Arthur Flexer

Download pdf

Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music 705-712
Venkatakrishnan Vaidyanathapuram Krishnan, Noel Alben, Anish A. Nair, Nathaniel Condit-Schultz

Download pdf

Between the AI and Me: Analysing Listeners’ Perspectives on AI- and Human-Composed Progressive Metal Music 713-720
Pedro Pereira Sarmento, Jackson J. Loth, Mathieu Barthet

Download pdf

Combining Audio Control and Style Transfer Using Latent Diffusion 721-728
Nils Demerlé, Philippe Esling, Guillaume Doras, David Genova

Download pdf

Computational Analysis of Yaredawi YeZema Silt in Ethiopian Orthodox Tewahedo Church Chants 729-736
Mequanent Argaw Muluneh, Yan-Tsung Peng, Li Su

Download pdf

Lyrics Transcription for Humans: A Readability-Aware Benchmark 737-744
Ondřej Cífka, Hendrik Schreiber, Luke Miner, Fabian-Robert Stöter

Download pdf

A Critical Survey of Research in Music Genre Recognition 745-782
Owen Green, Bob L. T. Sturm, Georgina Born, Melanie Wald-Fuhrmann

Download pdf

Content-Based Controls for Music Large Language Modeling 783-790
Liwei Lin, Gus Xia, Junyan Jiang, Yixiao Zhang

Download pdf

Exploring the Inner Mechanisms of Large Generative Music Models 791-798
Marcel A. Vélez Vásquez, Charlotte Pouw, John Ashley Burgoyne, Willem Zuidema

Download pdf

Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases 799-806
Saebyul Park, Halla Kim, Jiye Jung, Juyong Park, Jeounghoon Kim, Juhan Nam

Download pdf

Robust Lossy Audio Compression Identification 807-813
Hendrik Vincent Koops, Gianluca Micchi, Elio Quinton

Download pdf

RNBert: Fine-Tuning a Masked Language Model for Roman Numeral Analysis 814-821
Malcolm Sailor

Download pdf

MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models 825-833
Benno Weck, Ilaria Manco, Emmanouil Benetos, Elio Quinton, George Fazekas, Dmitry Bogdanov

Download pdf

Human Pose Estimation for Expressive Movement Descriptors in Vocal Musical Performances 834-841
Sujoy Roychowdhury, Preeti Rao, Sharat Chandran

Download pdf

Enhancing Predictive Models of Music Familiarity With EEG: Insights From Fans and Non-Fans of K-Pop Group NCT127 842-849
Seokbeom Park, Hyunjae Kim, Kyung Myun Lee

Download pdf

Mosaikbox: Improving Fully Automatic DJ Mixing Through Rule-Based Stem Modification and Precise Beat-Grid Estimation 850-857
Robert Sowula, Peter Knees

Download pdf

MidiCaps: A Large-Scale MIDI Dataset With Text Captions 858-865
Jan Melechovsky, Abhinaba Roy, Dorien Herremans

Download pdf

A New Dataset, Notation Software, and Representation for Computational Schenkerian Analysis 866-873
Stephen Ni-Hahn, Weihan Xu, Zirui Yin, Rico Zhu, Simon Mak, Yue Jiang, Cynthia Rudin

Download pdf

DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation 874-881
Zachary Novack, Julian McAuley, Taylor Berg-Kirkpatrick, Nicholas J. Bryan

Download pdf

The Concatenator: A Bayesian Approach to Real Time Concatenative Musaicing 882-889
Christopher J. Tralie, Ben Cantil

Download pdf

Deep Recombinant Transformer: Enhancing Loop Compatibility in Digital Music Production 890-896
Muhammad Taimoor Haseeb, Ahmad Hammoudeh, Gus Xia

Download pdf

I Can Listen but Cannot Read: An Evaluation of Two-Tower Multimodal Systems for Instrument Recognition 897-905
Yannis Vasilakis, Rachel Bittner, Johan Pauwels

Download pdf

Streaming Piano Transcription Based on Consistent Onset and Offset Decoding With Sustain Pedal Detection 906-913
Weixing Wei, Jiahao Zhao, Yulun Wu, Kazuyoshi Yoshii

Download pdf

Towards Universal Optical Music Recognition: A Case Study on Notation Types 914-921
Juan Carlos Martinez-Sevilla, David Rizo, Jorge Calvo-Zaragoza

Download pdf

Controlling Surprisal in Music Generation via Information Content Curve Matching 922-929
Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer

Download pdf

Toward a More Complete OMR Solution 930-937
Guang Yang, Muru Zhang, Lin Qiu, Yanming Wan, Noah A. Smith

Download pdf

Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning 938-945
Ilaria Manco, Justin Salamon, Oriol Nieto

Download pdf

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models 946-953
Seungheon Doh, Keunwoo Choi, Daeyong Kwon, Taesoo Kim, Juhan Nam

Download pdf

STONE: Self-Supervised Tonality Estimator 954-961
Yuexuan Kong, Vincent Lostanlen, Gabriel Meseguer-Brocal, Stella Wong, Mathieu Lagrange, Romain Hennequin

Download pdf

Beat This! Accurate Beat Tracking Without DBN Postprocessing 962-969
Francesco Foscarin, Jan Schlüter, Gerhard Widmer

Download pdf

Scoring Time Intervals Using Non-Hierarchical Transformer for Automatic Piano Transcription 973-980
Yujia Yan, Zhiyao Duan

Download pdf

PerTok: Expressive Encoding and Modeling of Symbolic Musical Ideas and Variations 981-988
Julian Lenz, Anirudh Mani

Download pdf

Looking for Tactus in All the Wrong Places: Statistical Inference of Metric Alignment in Rap Flow 989-995
Nathaniel Condit-Schultz

Download pdf

Exploring GPT’s Ability as a Judge in Music Understanding 996-1003
Kun Fang, Ziyu Wang, Gus Xia, Ichiro Fujinaga

Download pdf

Towards Assessing Data Replication in Music Generation With Music Similarity Metrics on Raw Audio 1004-1011
Roser Batlle-Roca, Wei-Hsiang Liao, Xavier Serra, Yuki Mitsufuji, Emilia Gómez