Home 9 Past Conferences 9 ISMIR 2025

ISMIR 2025

Full Proceedings

Papers

GlobalMood: A Cross-Cultural Benchmark for Music Emotion Recognition 11-19
Harin Lee, Elif Celen, Peter Harrison, Manuel Anglada-Tort, Pol van Rijn, Minsu Park, Marc Schönwiesner, Nori Jacoby
Expanding the HAISP Dataset: AI’s Impact on Songwriting Across Two AI Song Contests 28-35
Lidia Morris, Michele Newman, Xinya Tang, Renee Singh, Marcel Vélez Vásquez, Rebecca Leger, Jin Ha Lee
On the De-Duplication of the Lakh MIDI Dataset 44-51
Eunjin Choi, Hyerin Kim, Jiwoo Ryu, Juhan Nam, Dasaem Jeong
A Systematic Evaluation of Real-Time Audio Score Following for Piano Performance 91-99
Jiyun Park, Carlos Eduardo Cancino-Chacón, Suhit Chiruthapudi, Juhan Nam
AI-Generated Song Detection via Lyrics Transcripts 107-116
Markus Frohmann, Elena Epure, Gabriel Meseguer Brocal, Markus Schedl, Romain Hennequin
ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors 134-141
Junghyun Koo, Marco Martinez-Ramirez, WeiHsiang Liao, Giorgio Fabbro, Michele Mancusi, Yuki Mitsufuji
Aligning Text-to-Music Evaluation With Human Preferences 174-181
Yichen Huang, Zachary Novack, Koichi Saito, Jiatong Shi, Shinji Watanabe, Yuki Mitsufuji, John Thickstun, Chris Donahue
Investigating Music Track Liking in the Halo of Album Covers 182-189
Oleg Lesota, Anna Hausberger, Ivanna Pshenychna, Oleksandr Shvydanenko, Olha Yehorova, Markus Schedl
Phylo-Analysis of Folk Traditions: A Methodology for the Hierarchical Musical Similarity Analysis 190-197
Hilda Romero-Velo, Gilberto Bernardes, Susana Ladra, José R. Paramá, Fernando Silva
PeakNetFP: Peak-Based Neural Audio Fingerprinting Robust to Extreme Time Stretching 206-214
Guillem Cortès-Sebastià, Benjamin Martin, Emilio Molina, Xavier Serra, Romain Hennequin
Generating Symbolic Music From Natural Language Prompts Using an LLM-Enhanced Dataset 215-222
Weihan Xu, Julian McAuley, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Hao-Wen Dong
A Survey on Vision-to-Music Generation: Methods, Datasets, Evaluation, and Challenges 223-234
Zhaokai Wang, Chenxi Bao, Le Zhuo, Jingrui Han, Yang Yue, Yihong Tang, Victor Shea-Jay Huang, Yue Liao
Emergent Musical Properties of a Transformer Under Contrastive Self-Supervised Learning 235-246
Yuexuan KONG, Gabriel Mesegues-Brocal, Vincent Lostanlen, Mathieu Lagrange, Romain Hennequin
Are You Really Listening? Boosting Perceptual Awareness in Music-QA Benchmarks 247-261
Yongyi Zang, Sean O’Brien, Taylor Berg-Kirkpatrick, Julian McAuley, Zachary Novack
What Song Now? Personalized Rhythm Guitar Learning in Western Popular Music 296-302
Zakaria Hassein-Bey, Yohann Abbou, Alexandre d’Hooge, Mathieu Giraud, Gilles Guillemain, Aurélien Jeanneau
Towards Human-in-the-Loop Onset Detection: A Transfer Learning Approach for Maracatu 320-327
António Pinto (INESC TEC, University of Porto -. Faculty of Engineering)
Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning 328-336
Yixiao Zhang, Yukara Ikemiya, Woosung Choi, Naoki Murata, Marco Martínez-Ramírez, Liwei Lin, Gus Xia, Wei-Hsiang Liao, Yuki Mitsufuji, Simon Dixon
Automatic Melody Reduction via Shortest Path Finding 346-353
Ziyu Wang, Yuxuan Wu, Roger Dannenberg, Gus Xia
Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification 399-406
Recep Oguz Araz, Guillem Cortès-Sebastià, Emilio Molina, Joan Serra, Xavier Serra, Yuhki Mitsufuji, Dmitry Bogdanov
CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following 416-425
Yinghao MA, Siyou Li, Juntao Yu, Emmanouil Benetos, Akira Maezawa
Scaling Self-Supervised Representation Learning for Symbolic Piano Performance 451-459
Louis Bradshaw, Alexander Spangher, Honglu Fan, Stella Biderman, Simon Colton
The Rhythm In Anything: Audio-Prompted Drums Generation With Masked Language Modeling 460-468
Patrick O’Reilly, Julia Barnett, Hugo Flores Garcia, Annie Chu, Nathan Pruyne, Prem Seetharaman, Bryan Pardo
Enabling Empirical Analysis of Piano Performance Rehearsal With the Rach3 MIDI Dataset 484-491
Alia Morsi, Suhit Chiruthapudi, Silvan Peter, Ivan Pilkov, Laura Bishop, Akira Maezawa, Xavier Serra, Carlos Eduardo Cancino-Chacón
Keyboard Temperament Estimation From Symbolic Data: A Case Study on Bach’s Well-Tempered Clavier 503-510
Peter Van Kranenburg (Utrecht University, Meertens Institute), Gerben Bisschop
Refining Music Sample Identification With a Self-Supervised Graph Neural Network 511-517
Aditya Bhattacharjee, Ivan Meresman Higgs, Mark Sandler, Emmanouil Benetos
Video-Guided Text-to-Music Generation Using Public Domain Movie Collections 518-527
Haven Kim, Zachary Novack, Weihan Xu, Julian McAuley, Hao-Wen Dong
PianoVAM: A Multimodal Piano Performance Dataset 528-535
Yonghyun Kim, Junhyung Park, Joonhyung Bae, Kirak Kim, Taegyun Kwon, Alexander Lerch, Juhan Nam
LoopGen: Training-Free Loopable Music Generation 536-546
Davide Marincione, Giorgio Strano, Donato Crisostomi, Roberto Ribuoli, Emanuele Rodolà
Enhancing Music Recommender Systems With Multimedia Content: A Context-Aware Approach 547-554
Oleg Lesota, Veronica Clavijo, Attia Rizwani, Markus Schedl, Bruce Ferwerda
CultureMERT: Continual Pre-Training for Cross-Cultural Music Representation Learning 555-564
Angelos-Nikolaos Kanatas, Charilaos Papaioannou, Alexandros Potamianos
Versatile Music-for-Music Modeling via Function Alignment 573-581
Junyan Jiang, Daniel Chin, Xuanjie Liu, Liwei Lin, Gus Xia
Understanding Performance Limitations in Automatic Drum Transcription 582-588
Philipp Weyers, Christian Uhle, Meinard Müller, Matthias Lang
Sheet Music Benchmark: Standardized Optical Music Recognition Evaluation 604-611
Juan C. Martinez-Sevilla, Joan Cerveto-Serrano, Noelia Luna-Barahona, Greg Chapman, Craig Sapp, David Rizo, Jorge Calvo-Zaragoza
Fx-Encoder++: Extracting Instrument-Wise Audio Effect Representations From Mixtures 612-622
Yen-Tung Yeh, Junghyun Koo, Marco Martínez-Ramírez, Wei-Hsiang Liao, Yi-Hsuan Yang, Yuki Mitsufuji
MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling 623-630
Jingjing Tang, Xin Wang, Zhe Zhang, Junichi Yamagish, Geraint Wiggins, George Fazekas
IdolSongsJp Corpus: A Multi-Singer Song Corpus in the Style of Japanese Idol Groups 647-654
Hitoshi Suda, Junya Koguchi, Shunsuke Yoshida, Tomohiko Nakamura, Satoru Fukayama, Jun Ogata
GOAT: A Large Dataset of Paired Guitar Audio Recordings and Tablatures 655-662
Jackson Loth, Pedro Sarmento, Saurjya Sarkar, Zixun Guo, Mathieu Barthet, Mark Sandler
STAGE: Stemmed Accompaniment Generation Through Prefix-Based Conditioning 663-670
Giorgio Strano, Chiara Ballanti, Donato Crisostomi, Michele Mancusi, Luca Cosmo, Emanuele Rodolà
Optical Music Recognition of Jazz Lead Sheets 696-702
Juan Carlos Martinez-Sevilla, Francesco Foscarin, Patricia Garcia-Iasci, David Rizo, Jorge Calvo-Zaragoza, Gerhard Widmer
Human Vs. Machine: Comparing Selection Strategies in Active Learning for Optical Music Recognition 703-709
Juan Pedro Martinez-Esteso, Alejandro Galan-Cuenca, Carlos Pérez-Sancho, Francisco J. Castellanos, Antonio Javier Gallego
MusGO: A Community-Driven Framework for Assessing Openness in Music-Generative AI 727-738
Roser Batlle-Roca, Laura Ibáñez-Martínez, Xavier Serra, Emilia Gómez, Martín Rocamora
A Fourier Explanation of AI-Music Artifacts 739-746
Darius Afchar, Gabriel Meseguer Brocal, Kamil Akesbi, Romain Hennequin
The Jam_bot, a Real-Time System for Collaborative Free Improvisation With Music Language Models 755-762
Lancelot Blanchard, Perry Naseck, Stephen Brade, Kimaya Lecamwasam, Jordan Rudess, Cheng-Zhi Anna Huang, Joseph Paradiso
Fretboardflow: A Dual-Model Approach to Optimize Chord Voicings on the Guitar Fretboard 763-770
Marcel Vélez Vásquez, Mariëlle Baelemans, Jonathan Driedger, John Ashley Burgoyne
Adding Temporal Musical Controls on Top of Pretrained Generative Models 779-786
Sarah Nabi, Nils Demerlé, Geoffroy Peeters, Frederic Bevilacqua, Philippe Esling
Identification and Clustering of Unseen Ragas in Indian Art Music 797-804
Parampreet Singh, Adwik Gupta, Aakarsh Mishra, Vipul Arora
MAIA: An Inpainting-Based Approach for Music Adversarial Attacks 805-812
Yuxuan Liu, Peihong Zhang, Rui Sang, Zhixin Li, Shengchen Li
Joint Object Detection and Sound Source Separation 813-820
Sunyoo Kim, Yunjeong Choi, Doyeon Lee, Seoyoung Lee, Eunyi Lyou, Seungju Kim, Junhyug Noh, Joonseok Lee
User-Guided Generative Source Separation 821-829
Yutong Wen, Minje Kim, Paris Smaragdis
Looking Beyond Averaged Metrics in Music Source Separation 839-846
Saurjya Sarkar, Victoria Moomijan, Basil Woods, Emmanouil Benetos, Mark Sandler