ISMIR 2018
Full Proceedings
Papers
Improved Chord Recognition by Combining Duration and Harmonic Language Models 10-17
Filip Korzeniowski, Gerhard Widmer
Using Musical Relationships Between Chord Labels in Automatic Chord Extraction Tasks 18-25
Tristan Carsault, Jerome Nika, Philippe Esling
A Predictive Model for Music based on Learned Interval Representations 26-33
Stefan Lattner, Maarten Grachten, Gerhard Widmer
An End-to-end Framework for Audio-to-Score Music Transcription on Monophonic Excerpts 34-41
Miguel A. Román, Antonio Pertusa, Jorge Calvo-Zaragoza
Onsets and Frames: Dual-Objective Piano Transcription 50-57
Curtis Hawthorne, Erich Elsen, Jialin Song, Adam Roberts, Ian Simon, Colin Raffel, Jesse Engel, Sageev Oore, Douglas Eck
Player Vs Transcriber: A Game Approach To Data Manipulation For Automatic Drum Transcription 58-65
Carl Southall, Ryan Stables, Jason Hockman
A Flexible Approach to Automated Harmonic Analysis: Multiple Annotations of Chorales by Bach and Prætorius 66-73
Nathaniel Condit-Schultz, Yaolong Ju, Ichiro Fujinaga
Evaluating a Collection of Sound-Tracing Data of Melodic Phrases 74-81
Tejaswinee Kelkar, Udit Roy, Alexander Refsum Jensenius
Main Melody Estimation with Source-Filter NMF and CRNN 82-89
Dogac Basaran, Slim Essid, Geoffroy Peeters
A Single-step Approach to Musical Tempo Estimation using a Convolutional Neural Network 98-105
Hendrik Schreiber, Meinard Müller
Analysis of Common Design Choices in Deep Learning Systems for Downbeat Tracking 106-112
Magdalena Fuentes, Brian McFee, Hélène C. Crayencour, Slim Essid, Juan Pablo Bello
A Timbre-based Approach to Estimate Key Velocity from Polyphonic Piano Recordings 120-127
Dasaem Jeong, Taegyun Kwon, Juhan Nam
Interactive Arrangement of Chords and Melodies Based on a Tree-Structured Generative Model 145-151
Hiroaki Tsushima, Eita Nakamura, Katsutoshi Itoyama, Kazuyoshi Yoshii
A Generalized Parsing Framework for Generative Models of Harmonic Syntax 152-159
Daniel Harasim, Martin Rohrmeier, Timothy J. O’Donnell
An Energy-based Generative Sequence Model for Testing Sensory Theories of Western Harmony 160-167
Peter M. C. Harrison, Marcus T. Pearce
Automatic, Personalized, and Flexible Playlist Generation using Reinforcement Learning 168-174
Shun-Yao Shih, Heng-Yu Chi
Bridging Audio Analysis, Perception and Synthesis with Perceptually-regularized Variational Timbre Spaces 175-181
Philippe Esling, Axel Chemla–Romeu-Santos, Adrien Bitton
Conditioning Deep Generative Raw Audio Models for Structured Automatic Music 182-189
Rachel Manzelli, Vijay Thakkar, Ali Siahkamari, Brian Kulis
Convolutional Generative Adversarial Networks with Binary Neurons for Polyphonic Music Generation 190-196
Hao-Wen Dong, Yi-Hsuan Yang
Part-invariant Model for Music Generation and Harmonization 204-210
Yujia Yan, Ethan Lustig, Joseph VanderStel, Zhiyao Duan
Skeleton Plays Piano: Online Generation of Pianist Body Movements from MIDI Performance 218-224
Bochen Li, Akira Maezawa, Zhiyao Duan
Towards Full-Pipeline Handwritten OMR with Musical Symbol Detection by U-Nets 225-232
Jan Hajič jr., Matthias Dorfer, Gerhard Widmer, Pavel Pecina
Searching Page-Images of Early Music Scanned with OMR: A Scalable Solution Using Minimal Absent Words 233-239
Tim Crawford, Golnaz Badkobeh, David Lewis
Optical Music Recognition in Mensural Notation with Region-based Convolutional Neural Networks 240-247
Alexander Pacha, Jorge Calvo-Zaragoza
Camera-PrIMuS: Neural End-to-End Optical Music Recognition on Realistic Monophonic Scores 248-255
Jorge Calvo-Zaragoza, David Rizo
Document Analysis of Music Score Images with Selectional Auto-Encoders 256-263
Francisco Castellanos, Jorge Calvo-Zaragoza, Gabriel Vigliensoni, Ichiro Fujinaga
Genre-Agnostic Key Classification With Convolutional Neural Networks 264-270
Filip Korzeniowski, Gerhard Widmer
Deep Watershed Detector for Music Object Recognition 271-278
Lukas Tuggener, Ismail Elezi, Jürgen Schmidhuber, Thilo Stadelmann
Music Source Separation Using Stacked Hourglass Networks 289-296
Sungheon Park, Taehoon Kim, Kyogu Lee, Nojun Kwak
The Northwestern University Source Separation Library 297-305
Ethan Manilow, Prem Seetharaman, Bryan Pardo
Improving Bass Saliency Estimation using Transfer Learning and Label Propagation 306-312
Jakob Abeßer, Stefan Balke, Meinard Müller
Improving Peak-picking Using Multiple Time-step Loss Functions 313-320
Carl Southall, Ryan Stables, Jason Hockman
Zero-Mean Convolutions for Level-Invariant Singing Voice Detection 321-326
Jan Schlüter, Bernhard Lehner
Music Generation and Transformation with Moment Matching-Scattering Inverse Networks 327-333
Mathieu Andreux, Stéphane Mallat
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation 334-340
Daniel Stoller, Sebastian Ewert, Simon Dixon
SE and SNL diagrams: Flexible data structures for MIR 341-347
Melissa R. McGuirl, Katherine M. Kinnaird, Claire Savard, Erin H. Bugbee
JSYMBOLIC 2.2: Extracting Features from Symbolic Music for use in Musicological and MIR Research 348-354
Cory McKay, Julie Cumming, Ichiro Fujinaga
Relevance of Musical Features for Cadence Detection 355-361
Louis Bigo, Laurent Feisthauer, Mathieu Giraud, Florence Levé
On the Relationships between Music-induced Emotion and Physiological Signals 362-369
Xiao Hu, Fanjie Li, Jeremy T. D. Ng
Music Mood Detection Based on Audio and Lyrics with Deep Neural Net 370-375
Rémi Delbouys, Romain Hennequin, Francesco Piccoli, Jimena Royo-Letelier, Manuel Moussallam
Identifying Emotions in Opera Singing: Implications of Adverse Acoustic Conditions 376-382
Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Simone Hantke, Giovanni Costantini, Klaus Scherer, Bjoern Schuller
Musical Texture and Expressivity Features for Music Emotion Recognition 383-391
Renato Panda, Ricardo Malheiro, Rui Pedro Paiva
Shared Generative Representation of Auditory Concepts and EEG to Reconstruct Perceived and Imagined Music 392-399
André Ofner, Sebastian Stober
Exploring Musical Relations Using Association Rule Networks 400-406
Renan de Padua, Verônica Oliveira de Carvalho, Solange Rezende, Diego Furtado Silva
A Crowdsourced Experiment for Tempo Estimation of Electronic Dance Music 409-415
Hendrik Schreiber, Meinard Müller
Computational Corpus Analysis: A Case Study on Jazz Solos 416-423
Christof Weiss, Stefan Balke, Jakob Abeßer, Meinard Müller
Controlled Vocabularies for Music Metadata 424-430
Pasquale Lisena, Konstantin Todorov, Cécile Cecconi, Françoise Leresche, Isabelle Canno, Frédéric Puyrenier, Martine Voisin, Thierry Le Meur, Raphaël Troncy
DALI: A Large Dataset of Synchronized Audio, Lyrics and notes, Automatically Created using Teacher-student Machine Learning Paradigm 431-437
Gabriel Meseguer-Brocal, Alice Cohen-Hadria, Geoffroy Peeters
OpenMIC-2018: An Open Data-set for Multiple Instrument Recognition 438-444
Eric Humphrey, Simon Durand, Brian McFee
From Labeled to Unlabeled Data – On the Data Challenge in Automatic Drum Transcription 445-452
Chih-Wei Wu, Alexander Lerch
GuitarSet: A Dataset for Guitar Transcription 453-460
Qingyang Xi, Rachel Bittner, Johan Pauwels, Xuzhou Ye, Juan Pablo Bello
Musical-Linguistic Annotations of Il Lauro Secco 461-467
Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Bjoern Schuller
The NES Music Database: A multi-instrumental dataset with expressive performance attributes 475-482
Chris Donahue, Huanru Henry Mao, Julian McAuley
Audio-Aligned Jazz Harmony Dataset for Automatic Chord Transcription and Corpus-based Research 483-490
Vsevolod Eremenko, Emir Demirel, Baris Bozkurt, Xavier Serra
Methodologies for Creating Symbolic Corpora of Western Music Before 1600 491-498
Julie Cumming, Cory McKay, Jonathan Stuchbery, Ichiro Fujinaga
Precision of Sung Notes in Carnatic Music 499-505
Venkata Viraraghavan, Rangarajan Aravind, Hema Murthy
Revisiting Singing Voice Detection: A quantitative review and the future outlook 506-513
Kyungyun Lee, Keunwoo Choi, Juhan Nam
Vocals in Music Matter: the Relevance of Vocals in the Minds of Listeners 514-520
Andrew Demetriou, Andreas Jansson, Aparna Kumar, Rachel Bittner
Empirically Weighting the Importance of Decision Factors for Singing Preference 529-536
Michael Barone, Karim Ibrahim, Chitralekha Gupta, Ye Wang
Analysis by Classification: A Comparative Study of Annotated and Algorithmically Extracted Patterns in Symbolic Music Data 539-546
Iris Yuping Ren, Anja Volk, Wouter Swierstra, Remco Veltkamp
Generalized Skipgrams for Pattern Discovery in Polyphonic Streams 547-553
Christoph Finkensiep, Markus Neuwirth, Martin Rohrmeier
Comparison of Audio Features for Recognition of Western and Ethnic Instruments in Polyphonic Mixtures 554-560
Igor Vatolkin, Günter Rudolph
Instrudive: A Music Visualization System Based on Automatically Recognized Instrumentation 561-568
Takumi Takahashi, Satoru Fukayama, Masataka Goto
Instrument Activity Detection in Polyphonic Music using Deep Neural Networks 569-576
Siddharth Gururani, Cameron Summers, Alexander Lerch
Jazz Solo Instrument Classification with Convolutional Neural Networks, Source Separation, and Transfer Learning 577-584
Juan S. Gómez, Jakob Abeßer, Estefanía Cano
Aligned Sub-Hierarchies: A Structure-based Approach to the Cover Song Task 585-591
Katherine M. Kinnaird
Semi-supervised Lyrics and Solo-singing Alignment 600-607
Chitralekha Gupta, Rong Tong, Haizhou Li, Ye Wang
Concert Stitch: Organization and Synchronization of Crowd Sourced Recordings 608-614
Vinod Subramanian, Alexander Lerch
A Data-driven Approach to Mid-level Perceptual Musical Feature Modeling 615-621
Anna Aljanaki, Mohammad Soleymani
Disambiguating Music Artists at Scale with Audio Metric Learning 622-629
Jimena Royo-Letelier, Romain Hennequin, Viet-Anh Tran, Manuel Moussallam
Driftin’ Down the Scale: Dynamic Time Warping in the Presence of Pitch Drift and Transpositions 630-636
Simon Waloschek, Aristotelis Hadjakos
End-to-end Learning for Music Audio Tagging at Scale 637-644
Jordi Pons, Oriol Nieto, Matthew Prockup, Erik M. Schmidt, Andreas F. Ehmann, Xavier Serra
Audio Based Disambiguation of Music Genre Tags 645-652
Romain Hennequin, Jimena Royo-Letelier, Manuel Moussallam
Learning Interval Representations from Polyphonic Music Sequences 661-668
Stefan Lattner, Maarten Grachten, Gerhard Widmer
Influences on the Social Practices Surrounding Commercial Music Services: A Model for Rich Interactions 671-677
Louis Spinelli, Josephine Lau, Liz Pritchard, Jin Ha Lee
Investigating Cross-Country Relationship between Users’ Social Ties and Music Mainstreaminess 678-686
Christine Bauer, Markus Schedl
Listener Anonymizer: Camouflaging Play Logs to Preserve User’s Demographic Anonymity 687-694
Kosetsu Tsukuda, Satoru Fukayama, Masataka Goto
On the Impact of Music on Decision Making in Cooperative Tasks 695-701
Elad Liebman, Corey N. White, Peter Stone
VenueRank: Identifying Venues that Contribute to Artist Popularity 702-708
Emmanouil Krasanakis, Emmanouil Schinas, Symeon Papadopoulos, Yiannis Kompatsiaris, Pericles Mitkas
Representation Learning of Music Using Artist Labels 717-724
Jiyoung Park, Jongpil Lee, Jangyeon Park, Jung-Woo Ha, Juhan Nam
StructureNet: Inducing Structure in Generated Melodies 725-731
Gabriele Medeot, Srikanth Cherla, Katerina Kosta, Matt McVicar, Samer Abdallah, Marco Selvi, Ed Newton-Rex, Kevin Webster
Summarizing and Comparing Music Data and Its Application on Cover Song Identification 732-739
Diego Furtado Silva, Felipe Falcão, Nazareno Andrade
MIDI-VAE: Modeling Dynamics and Instrumentation of Music with Applications to Style Transfer 747-754
Gino Brunner, Andres Konrad, Yuyi Wang, Roger Wattenhofer
Understanding a Deep Machine Listening Model Through Feature Inversion 755-762
Saumitra Mishra, Bob L. Sturm, Simon Dixon
Visualization of Audio Data Using Stacked Graphs 771-776
Mathieu Lagrange, Mathias Rossignol, Grégoire Lafay
Two Web Applications for Exploring Melodic Patterns in Jazz Solos 777-783
Klaus Frieler, Frank Höger, Martin Pfleiderer, Simon Dixon
Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game 784-791
Matthias Dorfer, Florian Henkel, Gerhard Widmer
Matrix Co-Factorization for Cold-Start Recommendation 792-798
Olivier Gouvert, Thomas Oberlin, Cédric Févotte
