ISMIR 2024
Full Proceedings
Proceedings of the 25th International Society for Music Information Retrieval Conference
San Francisco, California, USA and Online, November 10-14, 2024 (ISBN: 978-1-7327299-4-0)
Papers
Formal Modeling of Structural Repetition Using Tree Compression 53-60
Zeng Ren, Yannis Rammos, Martin A. Rohrmeier
Saraga Audiovisual: A Large Multimodal Open Data Collection for the Analysis of Carnatic Music 61-69
Adithi Shankar, Genís Plaja-Roglans, Thomas Nuttall, Martín Rocamora, Xavier Serra
X-Cover: Better Music Version Identification System by Integrating Pretrained ASR Model 70-77
Xingjian Du, Mingyu Liu, Pei Zou, Xia Liang, Zijie Wang, Huidong Liang, Bilei Zhu
FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs 86-94
Hitoshi Suda, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, Jun Ogata
Classical Guitar Duet Separation Using GuitarDuets – A Dataset of Real and Synthesized Guitar Recordings 95-102
Marios Glytsos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos
Can LLMs “Reason” in Music? An Evaluation of LLMs’ Capability of Music Understanding and Generation 103-110
Ziya Zhou, Yuhang Wu, Zhiyue Wu, Xinyue Zhang, Ruibin Yuan, Yinghao Ma, Lu Wang, Emmanouil Benetos, Wei Xue, Yike Guo
Music2Latent: Consistency Autoencoders for Latent Audio Compression 111-119
Marco Pasini, Stefan Lattner, George Fazekas
Robust and Accurate Audio Synchronization Using Raw Features From Transcription Models 120-127
Johannes Zeitler, Ben Maman, Meinard Müller
Harnessing the Power of Distributions: Probabilistic Representation Learning on Hypersphere for Multimodal Music Information Retrieval 128-136
Takayuki Nakatsuka, Masahiro Hamasaki, Masataka Goto
Towards Automated Personal Value Estimation in Song Lyrics 137-145
Andrew M. Demetriou, Jaehun Kim, Sandy Manolios, Cynthia Liem
Audio Conditioning for Music Generation via Discrete Bottleneck Features 146-153
Simon Rouard, Yossi Adi, Jade Copet, Axel Roebel, Alexandre Defossez
Variation Transformer: New Datasets, Models, and Comparative Evaluation for Symbolic Music Variation Generation 154-163
Chenyu Gao, Federico Reuben, Tom Collins
Automatic Detection of Moral Values in Music Lyrics 164-172
Vjosa Preniqi, Iacopo Ghinassi, Julia Ive, Kyriaki Kalimeri, Charalampos Saitis
Semi-Supervised Piano Transcription Using Pseudo-Labeling Techniques 173-181
Sebastian Strahl, Meinard Müller
Learning Multifaceted Self-Similarity Over Time and Frequency for Music Structure Analysis 189-197
Tsung-Ping Chen, Kazuyoshi Yoshii
A Contrastive Self-Supervised Learning Scheme for Beat Tracking Amenable to Few-Shot Learning 198-206
Antonin Gagneré, Slim Essid, Geoffroy Peeters
Using Pairwise Link Prediction and Graph Attention Networks for Music Structure Analysis 207-214
Morgan Buisson, Brian McFee, Slim Essid
Six Dragons Fly Again: Reviving 15th-Century Korean Court Music With Transformers and Novel Encoding 217-224
Danbinaerin Han, Mark R. H. Gotham, DongMin Kim, Hannah Park, Sihun Lee, Dasaem Jeong
Lessons Learned From a Project to Encode Mensural Music on a Large Scale With Optical Music Recognition 225-231
David Rizo, Jorge Calvo-Zaragoza, Patricia García-Iasci, Teresa Delgado-Sánchez
The Changing Sound of Music: An Exploratory Corpus Study of Vocal Trends Over Time 232-239
Elena Georgieva, Pablo Ripollés, Brian McFee
Music Proofreading With RefinPaint: Where and How to Modify Compositions Given Context 240-247
Pedro Ramoneda, Martín Rocamora, Taketo Akama
Notewise Evaluation for Music Source Separation: A Case Study for Separated Piano Tracks 248-255
Yigitcan Özer, Hans-Ulrich Berendes, Vlora Arifi-Müller, Fabian-Robert Stöter, Meinard Müller
Automatic Estimation of Singing Voice Musical Dynamics 256-263
Jyoti Narang, Nazif Can Tamer, Viviana De La Vega, Xavier Serra
Joint Audio and Symbolic Conditioning for Temporally Controlled Text-to-Music Generation 264-271
Or Tal, Alon Ziv, Itai Gat, Felix Kreuk, Yossi Adi
Diff-a-Riff: Musical Accompaniment Co-Creation via Latent Diffusion Models 272-280
Javier Nistal, Marco Pasini, Cyran Aouameur, Maarten Grachten, Stefan Lattner
Exploring Internet Radio Across the Globe With the MIRAGE Online Dashboard 281-287
Ngan V.T. Nguyen, Elizabeth Acosta, Tommy Dang, David Sears
MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling 288-294
Andrew C. Edwards, Xavier Riley, Pedro Pereira Sarmento, Simon Dixon
Transcription-Based Lyrics Embeddings: Simple Extraction of Effective Lyrics Embeddings From Audio 295-303
Jaehun Kim, Florian Henkel, Camilo Landau, Samuel E. Sandberg, Andreas F. Ehmann
MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation 311-318
Yun-Han Lan, Wen-Yi Hsiao, Hao-Chung Cheng, Yi-Hsuan Yang
End-to-End Piano Performance-MIDI to Score Conversion With Transformers 319-326
Tim Beyer, Angela Dai
From Real to Cloned Singer Identification 327-334
Dorian Desblancs, Gabriel Meseguer-Brocal, Romain Hennequin, Manuel Moussallam
Emotion-Driven Piano Music Generation via Two-Stage Disentanglement and Functional Representation 335-342
Jingyue Huang, Ke Chen, Yi-Hsuan Yang
Efficient Adapter Tuning for Joint Singing Voice Beat and Downbeat Tracking With Self-Supervised Learning Features 343-351
Jiajun Deng, Yaolong Ju, Jing Yang, Simon Lui, Xunying Liu
Which Audio Features Can Predict the Dynamic Musical Emotions of Both Composers and Listeners? 352-359
Eun Ji Oh, Hyunjae Kim, Kyung Myun Lee
Exploring Musical Roots: Applying Audio Embeddings to Empower Influence Attribution for a Generative Music Model 360-368
Julia Barnett, Hugo Flores García, Bryan Pardo
Green MIR? Investigating Computational Cost of Recent Music-AI Research in ISMIR 371-380
Andre Holzapfel, Anna-Kaisa Kaila, Petra Jääskeläinen
Field Study on Children’s Home Piano Practice: Developing a Comprehensive System for Enhanced Student-Teacher Engagement 381-388
Seikoh Fukuda, Yuko Fukuda, Masamichi Hosoda, Ami Motomura, Eri Sasao, Masaki Matsubara, Masahiro Niitsuma
HAISP: A Dataset of Human-AI Songwriting Processes From the AI Song Contest 397-404
Lidia J. Morris, Rebecca Leger, Michele Newman, John Ashley Burgoyne, Ryan Groves, Natasha Mangal, Jin Ha Lee
Cue Point Estimation Using Object Detection 405-412
Giulia Argüello, Luca A. Lanzendörfer, Roger Wattenhofer
SpecMaskGIT: Masked Generative Modeling of Audio Spectrogram for Efficient Audio Synthesis and Beyond 420-428
Marco Comunità, Zhi Zhong, Akira Takahashi, Shiqi Yang, Mengjie Zhao, Koichi Saito, Yukara Ikemiya, Takashi Shibuya, Shusuke Takahashi, Yuki Mitsufuji
Long-Form Music Generation With Latent Diffusion 429-437
Zach Evans, Julian D. Parker, CJ Carr, Zachary Zukowski, Josiah Taylor, Jordi Pons
Towards Zero-Shot Amplifier Modeling: One-to-Many Amplifier Modeling via Tone Embedding Control 446-453
Yu-Hua Chen, Yen-Tung Yeh, Yuan-Chiao Cheng, Jui-Te Wu, Yu-Hsiang Ho, Jyh-Shing Roger Jang, Yi-Hsuan Yang
Mel-RoFormer for Vocal Separation and Vocal Melody Transcription 454-461
Ju-Chiang Wang, Wei-Tsung Lu, Jitong Chen
Unsupervised Synthetic-to-Real Adaptation for Optical Music Recognition 462-469
Noelia N. Luna-Barahona, Adrián Roselló, María Alfaro-Contreras, David Rizo, Jorge Calvo-Zaragoza
MMT-BERT: Chord-Aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT 470-477
Jinlong Zhu, Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama
Discogs-VI: A Musical Version Identification Dataset Based on Public Editorial Metadata 478-485
Recep Oguz Araz, Xavier Serra, Dmitry Bogdanov
Who’s Afraid of the ‘Artyfyshall Byrd’? Historical Notions and Current Challenges of Musical Artificiality 486-492
Nicholas Cornia, Bruno Forment
End-to-End Automatic Singing Skill Evaluation Using Cross-Attention and Data Augmentation for Solo Singing and Singing With Accompaniment 493-500
Yaolong Ju, Chun Yat Wu, Betty Cortiñas Lorenzo, Jing Yang, Jiajun Deng, Fan Fan, Simon Lui
Cluster and Separate: A GNN Approach to Voice and Staff Prediction for Score Engraving 503-510
Francesco Foscarin, Emmanouil Karystinaios, Eita Nakamura, Gerhard Widmer
From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano 511-519
Huan Zhang, Jinhua Liang, Simon Dixon
Towards Explainable and Interpretable Musical Difficulty Estimation: A Parameter-Efficient Approach 520-528
Pedro Ramoneda, Vsevolod E. Eremenko, Alexandre D’Hooge, Emilia Parada-Cabaleiro, Xavier Serra
Purposeful Play: Evaluation and Co-Design of Casual Music Creation Applications With Children 529-539
Michele Newman, Lidia J. Morris, Jun Kato, Masataka Goto, Jason Yip, Jin Ha Lee
El Bongosero: A Crowd-Sourced Symbolic Dataset of Improvised Hand Percussion Rhythms Paired With Drum Patterns 540-546
Nicholas Evans, Behzad Haki, Daniel Gómez-Marín, Sergi Jordà
Utilizing Listener-Provided Tags for Music Emotion Recognition: A Data-Driven Approach 547-554
Joanne Affolter, Martin A. Rohrmeier
PiCoGen2: Piano Cover Generation With Transfer Learning Approach and Weakly Aligned Data 555-562
Chih-Pin Tan, Hsin Ai, Yi-Hsin Chang, Shuen-Huei Guan, Yi-Hsuan Yang
Diff-MST: Differentiable Mixing Style Transfer 563-570
Soumya Sai Vanka, Christian J. Steinmetz, Jean-Baptiste Rolland, Joshua D. Reiss, George Fazekas
Semi-Supervised Contrastive Learning of Musical Representations 571-579
Julien PM Guinot, Elio Quinton, George Fazekas
Improved Symbolic Drum Style Classification With Grammar-Based Hierarchical Representations 580-587
Léo Géré, Nicolas Audebert, Philippe Rigaux
Nested Music Transformer: Sequentially Decoding Compound Tokens in Symbolic Music and Audio Generation 588-595
Jiwoo Ryu, Hao-Wen Dong, Jongmin Jung, Dasaem Jeong
Continual Learning for Music Classification 596-602
Pedro González-Barrachina, María Alfaro-Contreras, Jorge Calvo-Zaragoza
TheGlueNote: Learned Representations for Robust and Flexible Note Alignment 603-610
Silvan Peter, Gerhard Widmer
GAPS: A Large and Diverse Classical Guitar Dataset and Benchmark Transcription Model 611-617
Xavier Riley, Zixun Guo, Andrew C. Edwards, Simon Dixon
A Kalman Filter Model for Synchronization in Musical Ensembles 618-624
Hugo T. Carvalho, Min Susan Li, Massimiliano Di Luca, Alan M. Wing
Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation 625-633
Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Michael Anslow, Geoffroy Peeters
Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music With Lightweight Finetuning 634-641
Fang Duo Tsai, Shih-Lun Wu, Haven Kim, Bo-Yu Chen, Hao-Chung Cheng, Yi-Hsuan Yang
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing 642-650
Shangda Wu, Yashan Wang, Xiaobing Li, Feng Yu, Maosong Sun
GraphMuse: A Library for Symbolic Music Graph Processing 651-658
Emmanouil Karystinaios, Gerhard Widmer
ST-ITO: Controlling Audio Effects for Style Transfer With Inference-Time Optimization 661-668
Christian J. Steinmetz, Shubhr Singh, Marco Comunità, Ilias Ibnyahya, Shanxin Yuan, Emmanouil Benetos, Joshua D. Reiss
ComposerX: Multi-Agent Symbolic Music Composition With LLMs 669-679
Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo
Do Music Generation Models Encode Music Theory? 680-687
Megan Wei, Michael Freeman, Chris Donahue, Chen Sun
PolySinger: Singing-Voice to Singing-Voice Translation From English to Japanese 688-696
Silas Antonisen, Iván López-Espejo
Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music 705-712
Venkatakrishnan Vaidyanathapuram Krishnan, Noel Alben, Anish A. Nair, Nathaniel Condit-Schultz
Between the AI and Me: Analysing Listeners’ Perspectives on AI- and Human-Composed Progressive Metal Music 713-720
Pedro Pereira Sarmento, Jackson J. Loth, Mathieu Barthet
Combining Audio Control and Style Transfer Using Latent Diffusion 721-728
Nils Demerlé, Philippe Esling, Guillaume Doras, David Genova
Computational Analysis of Yaredawi YeZema Silt in Ethiopian Orthodox Tewahedo Church Chants 729-736
Mequanent Argaw Muluneh, Yan-Tsung Peng, Li Su
Lyrics Transcription for Humans: A Readability-Aware Benchmark 737-744
Ondřej Cífka, Hendrik Schreiber, Luke Miner, Fabian-Robert Stöter
A Critical Survey of Research in Music Genre Recognition 745-782
Owen Green, Bob L. T. Sturm, Georgina Born, Melanie Wald-Fuhrmann
Content-Based Controls for Music Large Language Modeling 783-790
Liwei Lin, Gus Xia, Junyan Jiang, Yixiao Zhang
Exploring the Inner Mechanisms of Large Generative Music Models 791-798
Marcel A. Vélez Vásquez, Charlotte Pouw, John Ashley Burgoyne, Willem Zuidema
Quantitative Analysis of Melodic Similarity in Music Copyright Infringement Cases 799-806
Saebyul Park, Halla Kim, Jiye Jung, Juyong Park, Jeounghoon Kim, Juhan Nam
Robust Lossy Audio Compression Identification 807-813
Hendrik Vincent Koops, Gianluca Micchi, Elio Quinton
MuChoMusic: Evaluating Music Understanding in Multimodal Audio-Language Models 825-833
Benno Weck, Ilaria Manco, Emmanouil Benetos, Elio Quinton, George Fazekas, Dmitry Bogdanov
Human Pose Estimation for Expressive Movement Descriptors in Vocal Musical Performances 834-841
Sujoy Roychowdhury, Preeti Rao, Sharat Chandran
Enhancing Predictive Models of Music Familiarity With EEG: Insights From Fans and Non-Fans of K-Pop Group NCT127 842-849
Seokbeom Park, Hyunjae Kim, Kyung Myun Lee
MidiCaps: A Large-Scale MIDI Dataset With Text Captions 858-865
Jan Melechovsky, Abhinaba Roy, Dorien Herremans
A New Dataset, Notation Software, and Representation for Computational Schenkerian Analysis 866-873
Stephen Ni-Hahn, Weihan Xu, Zirui Yin, Rico Zhu, Simon Mak, Yue Jiang, Cynthia Rudin
DITTO-2: Distilled Diffusion Inference-Time T-Optimization for Music Generation 874-881
Zachary Novack, Julian McAuley, Taylor Berg-Kirkpatrick, Nicholas J. Bryan
The Concatenator: A Bayesian Approach to Real Time Concatenative Musaicing 882-889
Christopher J. Tralie, Ben Cantil
Deep Recombinant Transformer: Enhancing Loop Compatibility in Digital Music Production 890-896
Muhammad Taimoor Haseeb, Ahmad Hammoudeh, Gus Xia
I Can Listen but Cannot Read: An Evaluation of Two-Tower Multimodal Systems for Instrument Recognition 897-905
Yannis Vasilakis, Rachel Bittner, Johan Pauwels
Streaming Piano Transcription Based on Consistent Onset and Offset Decoding With Sustain Pedal Detection 906-913
Weixing Wei, Jiahao Zhao, Yulun Wu, Kazuyoshi Yoshii
Towards Universal Optical Music Recognition: A Case Study on Notation Types 914-921
Juan Carlos Martinez-Sevilla, David Rizo, Jorge Calvo-Zaragoza
Controlling Surprisal in Music Generation via Information Content Curve Matching 922-929
Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer
Toward a More Complete OMR Solution 930-937
Guang Yang, Muru Zhang, Lin Qiu, Yanming Wan, Noah A. Smith
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning 938-945
Ilaria Manco, Justin Salamon, Oriol Nieto
Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models 946-953
Seungheon Doh, Keunwoo Choi, Daeyong Kwon, Taesoo Kim, Juhan Nam
STONE: Self-Supervised Tonality Estimator 954-961
Yuexuan Kong, Vincent Lostanlen, Gabriel Meseguer-Brocal, Stella Wong, Mathieu Lagrange, Romain Hennequin
Beat This! Accurate Beat Tracking Without DBN Postprocessing 962-969
Francesco Foscarin, Jan Schlüter, Gerhard Widmer
PerTok: Expressive Encoding and Modeling of Symbolic Musical Ideas and Variations 981-988
Julian Lenz, Anirudh Mani
Exploring GPT’s Ability as a Judge in Music Understanding 996-1003
Kun Fang, Ziyu Wang, Gus Xia, Ichiro Fujinaga
Towards Assessing Data Replication in Music Generation With Music Similarity Metrics on Raw Audio 1004-1011
Roser Batlle-Roca, Wei-Hsiang Liao, Xavier Serra, Yuki Mitsufuji, Emilia Gómez
Generating Sample-Based Musical Instruments Using Neural Audio Codec Language Models 1012-1019
Shahan Nercessian, Johannes Imort, Ninon Devis, Frederik Blang
Hierarchical Generative Modeling of Melodic Vocal Contours in Hindustani Classical Music 1020-1028
Nithya Nadig Shikarpur, Krishna Maneesha Dendukuri, Yusong Wu, Antoine Caillon, Cheng-Zhi Anna Huang
SymPAC: Scalable Symbolic Music Generation With Prompts and Constraints 1029-1036
Haonan Chen, Jordan B. L. Smith, Janne Spijkervet, Ju-Chiang Wang, Pei Zou, Bochen Li, Qiuqiang Kong, Xingjian Du
Lyrically Speaking: Exploring the Link Between Lyrical Emotions, Themes and Depression Risk 1046-1050
Pavani B. Chowdary, Bhavyajeet Singh, Rajat Agarwal, Vinoo Alluri
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four Stems 1051-1059
Karn N. Watcharasupat, Alexander Lerch
In-Depth Performance Analysis of the ADTOF-Based Algorithm for Automatic Drum Transcription 1060-1067
Mickael Zehren, Marco Alunno, Paolo Bientinesi
Towards Musically Informed Evaluation of Piano Transcription Models 1068-1075
Patricia Hu, Lukáš Samuel Marták, Carlos Eduardo Cancino-Chacón, Gerhard Widmer
Using Item Response Theory to Aggregate Music Annotation Results of Multiple Annotators 1076-1084
Tomoyasu Nakano, Masataka Goto
Just Label the Repeats for In-the-Wild Audio-to-Score Alignment 1085-1092
Irmak Bukey, Michael Feffer, Chris Donahue
Investigating Time-Line-Based Music Traditions With Field Recordings: A Case Study of Candomblé Bell Patterns 1093-1100
Lucas S. Maia, Richa Namballa, Martín Rocamora, Magdalena Fuentes, Carlos Guedes
