ISMIR 2023
Full Proceedings
Milan, Italy, November 5-9, 2023 (ISBN: 978-1-7327299-3-3)
Papers
Shreyas Nadkarni, Sujoy Roychowdhury, Preeti Rao, Martin Clayton
Fabio Morreale, Megha Sharma, I-Chieh Wei
Bob L. T. Sturm, Arthur Flexer
Gowriprasad R, Srikrishnan Sridharan, R Aravind, Hema A. Murthy
Changhong Wang, Gaël Richard, Brian McFee
Michèle Duguay, Kate Mancey, Johanna Devaney
Michele Newman, Lidia Morris, Jin Ha Lee
Nathan Fradet, Nicolas Gutowski, Fabien Chhel, Jean-Pierre Briot
Francisco J. Castellanos, Antonio Javier Gallego, Ichiro Fujinaga
Behzad Haki, Błażej Kotowski, Cheuk Lun Isaac Lee, Sergi Jordà
Andrea Martelloni, Andrew P. McPherson, Mathieu Barthet
Mirco Pezzoli, Raffaele Malvermi, Fabio Antonacci, Augusto Sarti
Shangda Wu, Dingyao Yu, Xu Tan, Maosong Sun
Luca Marinelli, György Fazekas, Charalampos Saitis
A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability 174-181
Li-Yang Tseng, Tzu-Ling Lin, Hong-Han Shuai, Jen-Wei Huang, Wen-Whei Chang
Efficient Notation Assembly in Optical Music Recognition 182-189
Carlos Peñarrubia, Carlos Garrido-Munoz, Jose J. Valero-Mas, Jorge Calvo-Zaragoza
Decoding Drums, Instrumentals, Vocals, and Mixed Sources in Music Using Human Brain Activity With fMRI 197-206
Vincent K. M. Cheung, Lana Okuma, Kazuhisa Shibata, Kosetsu Tsukuda, Masataka Goto, Shinichi Furuya
Dual Attention-Based Multi-Scale Feature Fusion Approach for Dynamic Music Emotion Recognition 207-214
Liyue Zhang, Xinyu Yang, Yichi Zhang, Jing Luo
High-Resolution Violin Transcription Using Weak Labels 223-230
Nazif Can Tamer, Yigitcan Özer, Meinard Müller, Xavier Serra
Polyffusion: A Diffusion Model for Polyphonic Score Generation With Internal and External Controls 231-238
Lejun Min, Junyan Jiang, Gus Xia, Jingwei Zhao
The Coordinated Corpus of Popular Musics (CoCoPops): A Meta-Corpus of Melodic and Harmonic Transcriptions 239-246
Claire Arthur, Nathaniel Condit-Schultz
Towards Computational Music Analysis for Music Therapy 247-256
Anja Volk, Tinka Veldhuis, Katrien Foubert, Jos De Backer
Timbre Transfer Using Image-to-Image Denoising Diffusion Implicit Models 257-263
Luca Comanducci, Fabio Antonacci, Augusto Sarti
Correlation of EEG Responses Reflects Structural Similarity of Choruses in Popular Music 264-271
Neha Rajagopalan, Blair Kaneshiro
Chromatic Chords in Theory and Practice 272-278
Mark R. H. Gotham
BPS-Motif: A Dataset for Repeated Pattern Discovery of Polyphonic Symbolic Music 281-288
Yo-Wei Hsiao, Tzu-Yun Hung, Tsung-Ping Chen, Li Su
Weakly Supervised Multi-Pitch Estimation Using Cross-Version Alignment 289-296
Michael Krause, Sebastian Strahl, Meinard Müller
The Batik-Plays-Mozart Corpus: Linking Performance to Score to Musicological Annotations 297-303
Patricia Hu, Gerhard Widmer
Mono-to-Stereo Through Parametric Stereo Generation 304-310
Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle
From West to East: Who Can Understand the Music of the Others Better? 311-318
Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos
On the Performance of Optical Music Recognition in the Absence of Specific Training Data 319-326
Juan C. Martinez-Sevilla, Adrián Roselló, David Rizo, Jorge Calvo-Zaragoza
LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT 343-351
Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger B. Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo
Sounds Out of Pläce? Score-Independent Detection of Conspicuous Mistakes in Piano Performances 352-358
Alia Morsi, Kana Tatsumi, Akira Maezawa, Takuya Fujishima, Xavier Serra
VampNet: Music Generation via Masked Acoustic Token Modeling 359-366
Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo
Contrastive Learning for Cross-Modal Artist Retrieval 375-382
Andres Ferraro, Jaehun Kim, Sergio Oramas, Andreas Ehmann, Fabien Gouyon
Repetition-Structure Inference With Formal Prototypes 383-390
Christoph Finkensiep, Matthieu Haeberle, Friedrich Eisenbrand, Markus Neuwirth, Martin Rohrmeier
Algorithmic Harmonization of Tonal Melodies Using Weighted Pitch Context Vectors 391-397
Peter van Kranenburg, Eoin J. Kearns
Text-to-Lyrics Generation With Image-Based Semantics and Reduced Risk of Plagiarism 398-406
Kento Watanabe, Masataka Goto
LP-MusicCaps: LLM-Based Pseudo Music Captioning 409-416
SeungHeon Doh, Keunwoo Choi, Jongpil Lee, Juhan Nam
A Repetition-Based Triplet Mining Approach for Music Segmentation 417-424
Morgan Buisson, Brian McFee, Slim Essid, Helene C. Crayencour
Predicting Music Hierarchies With a Graph-Based Neural Decoder 425-432
Francesco Foscarin, Daniel Harasim, Gerhard Widmer
Stabilizing Training With Soft Dynamic Time Warping: A Case Study for Pitch Class Estimation With Weakly Aligned Targets 433-439
Johannes Zeitler, Simon Deniffel, Michael Krause, Meinard Müller
Finding Tori: Self-Supervised Learning for Analyzing Korean Folk Song 440-447
Danbinaerin Han, Rafael Caro Repetto, Dasaem Jeong
Singer Identity Representation Learning Using Self-Supervised Techniques 448-456
Bernardo Torres, Stefan Lattner, Gaël Richard
On the Effectiveness of Speech Self-Supervised Learning for Music 457-465
Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Chenghua Lin, Xingran Chen, Anton Ragni, Hanzhi Yin, Emmanouil Benetos, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger B. Dannenberg, Yike Guo, Jie Fu
Transformer-Based Beat Tracking With Low-Resolution Encoder and High-Resolution Decoder 466-473
Tian Cheng, Masataka Goto
Adding Descriptors to Melodies Improves Pattern Matching: A Study on Slovenian Folk Songs 474-481
Vanessa Nina Borsan, Mathieu Giraud, Richard Groult, Thierry Lecroq
How Control and Transparency for Users Could Improve Artist Fairness in Music Recommender Systems 482-491
Karlijn Dinnissen, Christine Bauer
Towards a New Interface for Music Listening: A User Experience Study on YouTube 492-499
Ahyeon Choi, Eunsik Shin, Haesun Joung, Joongseek Lee, Kyogu Lee
FiloBass: A Dataset and Corpus Based Study of Jazz Basslines 500-507
Xavier Riley, Simon Dixon
Comparing Texture in Piano Scores 508-515
Louis Couturier, Louis Bigo, Florence Levé
Introducing DiMCAT for Processing and Analyzing Notated Music on a Very Large Scale 516-523
Johannes Hentschel, Andrew McLeod, Yannis Rammos, Martin Rohrmeier
Sequence-to-Sequence Network Training Methods for Automatic Guitar Transcription With Tokenized Outputs 524-531
Sehun Kim, Kazuya Takeda, Tomoki Toda
PESTO: Pitch Estimation With Self-Supervised Transposition-Equivariant Objective 535-544
Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters
The Games We Play: Exploring the Impact of ISMIR on Musicology 545-552
Vanessa Nina Borsan, Mathieu Giraud, Richard Groult
Carnatic Singing Voice Separation Using Cold Diffusion on Training Data With Bleeding 553-560
Genís Plaja-Roglans, Marius Miron, Adithi Shankar, Xavier Serra
Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights From a User Survey 561-570
Kosetsu Tsukuda, Tomoyasu Nakano, Masahiro Hamasaki, Masataka Goto
Towards Building a Phylogeny of Gregorian Chant Melodies 571-578
Jan Hajič jr., Gustavo A. Ballen, Klára Hedvika Mühlová, Hana Vlhová-Wörner
Audio Embeddings as Teachers for Music Classification 579-587
Yiwei Ding, Alexander Lerch
AScorePerformer: Expressive Piano Performance Rendering With Fine-Grained Control 588-596
Ilya Borovik, Vladimir Viro
Roman Numeral Analysis With Graph Neural Networks: Onset-Wise Predictions From Note-Wise Features 597-604
Emmanouil Karystinaios, Gerhard Widmer
Semi-Automated Music Catalog Curation Using Audio and Metadata 605-611
Brian Regan, Desislava Hristova, Mariano Beguerisse-Díaz
Crowd’s Performance on Temporal Activity Detection of Musical Instruments in Polyphonic Music 612-618
Ioannis Petros Samiotis, Christoph Lofi, Alessandro Bozzon
MoisesDB: A Dataset for Source Separation Beyond 4-Stems 619-626
Igor Pereira, Felipe Araújo, Filip Korzeniowski, Richard Vogl
Music as Flow: A Formal Representation of Hierarchical Processes in Music 627-633
Zeng Ren, Wulfram Gerstner, Martin Rohrmeier
Inversynth II: Sound Matching via Self-Supervised Synthesizer-Proxy and Inference-Time Finetuning 642-648
Oren Barkan, Shlomi Shvartzman, Noy Uzrad, Moshe Laufer, Almog Elharar, Noam Koenigstein
A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task 649-656
Amantur Amatov, Dmitry Lamanov, Maksim Titov, Ivan Vovk, Ilya Makarov, Mikhail Kudinov
Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction 657-663
Keren Shao, Ke Chen, Taylor Berg-Kirkpatrick, Shlomo Dubnov
Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables 667-675
Chin-Yun Yu, György Fazekas
Harmonic Analysis With Neural Semi-CRF 676-683
Qiaoyu Yang, Frank Cwitkowitz, Zhiyao Duan
A Dataset and Baseline for Automated Assessment of Timbre Quality in Trumpet Sound 684-691
Alberto Acquilino, Ninad Puranik, Ichiro Fujinaga, Gary Scavone
Visual Overviews for Sheet Music Structure 692-699
Frank Heyen, Quynh Quang Ngo, Michael Sedlmair
Passage Summarization With Recurrent Models for Audio – Sheet Music Retrieval 700-707
Luís Carvalho, Gerhard Widmer
Predicting Performance Difficulty From Piano Sheet Music Images 708-715
Pedro Ramoneda, Jose J. Valero-Mas, Dasaem Jeong, Xavier Serra
Self-Refining of Pseudo Labels for Music Source Separation With Noisy Labeled Data 716-724
Junghyun Koo, Yunkee Chae, Chang-Bin Jeon, Kyogu Lee
Quantifying the Ease of Playing Song Chords on the Guitar 725-732
Marcel A. Vélez Vásquez, Mariëlle Baelemans, Jonathan Driedger, Willem Zuidema, John Ashley Burgoyne
FlexDTW: Dynamic Time Warping With Flexible Boundary Conditions 733-740
Irmak Bükey, Jason Zhang, TJ Tsai
Modeling Bends in Popular Music Guitar Tablatures 741-748
Alexandre D’Hooge, Louis Bigo, Ken Déguernel
Modeling Harmonic Similarity for Jazz Using Co-occurrence Vectors and the Membrane Area 757-764
Carey Bunks, Tillman Weyde, Simon Dixon, Bruno Di Giorgi
SingStyle111: A Multilingual Singing Dataset With Style Transfer 765-773
Shuqi Dai, Yuxuan Wu, Siqi Chen, Roy Huang, Roger B. Dannenberg
A Computational Evaluation Framework for Singable Lyric Translation 774-781
Haven Kim, Kento Watanabe, Masataka Goto, Juhan Nam
Chorus-Playlist: Exploring the Impact of Listening to Only Choruses in a Playlist 782-792
Kosetsu Tsukuda, Masahiro Hamasaki, Masataka Goto
Supporting Musicological Investigations With Information Retrieval Tools: An Iterative Approach to Data Collection 795-801
David Lewis, Elisabete Shibata, Andrew Hankinson, Johannes Kepper, Kevin R. Page, Lisa Rosendahl, Mark Saccomano, Christine Siegert
Optimizing Feature Extraction for Symbolic Music 802-809
Federico Simonetta, Ana Llorens, Martín Serrano, Eduardo García-Portugués, Álvaro Torrente
Exploring Sampling Techniques for Generating Melodies With a Transformer Language Model 810-816
Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer
Measuring the Eurovision Song Contest: A Living Dataset for Real-World MIR 817-823
John Ashley Burgoyne, Janne Spijkervet, David John Baker
Efficient Supervised Training of Audio Transformers for Music Representation Learning 824-831
Pablo Alonso-Jiménez, Xavier Serra, Dmitry Bogdanov
A Cross-Version Approach to Audio Representation Learning for Orchestral Music 832-839
Michael Krause, Christof Weiß, Meinard Müller
Music Source Separation With MLP Mixing of Time, Frequency, and Channel 840-847
Tomoyasu Nakano, Masataka Goto
Symbolic Music Representations for Classification Tasks: A Systematic Evaluation 848-858
Huan Zhang, Emmanouil Karystinaios, Simon Dixon, Gerhard Widmer, Carlos Eduardo Cancino-Chacón
The Music Meta Ontology: A Flexible Semantic Model for the Interoperability of Music Metadata 859-867
Jacopo de Berardinis, Valentina Anita Carriero, Albert Meroño-Peñuela, Andrea Poltronieri, Valentina Presutti
Polar Manhattan Displacement: Measuring Tonal Distances Between Chords Based on Intervallic Content 868-874
Jeff Miller, Johan Pauwels, Mark Sandler
