Multi-Modal Instrument Performances (MMIP): A Musical Database

EUROGRAPHICS 2025 (London)
1University of Cyprus, 2CYENS Centre of Excellence
DRUMS System Overview

Performance Recording: The top section shows the drum recording, the middle features the digital piano, and the bottom displays the guitar recording. On the left side, there is a top-down view of the drums and digital piano, and the optical MoCap data for the guitar. Four snapshots from the video recording, taken from different camera angles, are also presented for each instrument. On the right, a 3D motion reconstruction is displayed, using the default actor model from Rokoko Studio software to accurately represent the performer's movements.

Abstract

Musical instrument performances are multimodal creative art forms that integrate audiovisual elements, resulting from musicians' interactions with instruments through body movements, finger actions, and facial expressions. Digitizing such performances for archiving, streaming, analysis, or synthesis requires capturing every element that shapes the overall experience, which is crucial for preserving the performance's essence. In this work, following current trends in large-scale dataset development for deep learning analysis and generative models, we introduce the Multi-Modal Instrument Performances (MMIP) database (\href{https://mmip.cs.ucy.ac.cy}{https://mmip.cs.ucy.ac.cy}). This is the first dataset to incorporate synchronized high-quality 3D motion capture data for the body, fingers, facial expressions, and instruments, along with audio, multi-angle videos, and MIDI data. The database currently includes 3.5 hours of performances featuring three instruments: guitar, piano, and drums. Additionally, we discuss the challenges of acquiring these multi-modal data, detailing our approach to data collection, signal synchronization, annotation, and metadata management. Our data formats align with industry standards for ease of use, and we have developed an open-access online repository that offers a user-friendly environment for data exploration, supporting data organization, search capabilities, and custom visualization tools. Notable features include a MIDI-to-instrument animation project for visualizing the instruments and a script for playing back FBX files with synchronized audio in a web environment.

Video Presentation

BibTeX

@article{Kyriakou:2025:MMIP,
 author    	= {Kyriakou, Theodoros and Aristidou, Andreas and Charalambous, Panayiotis},
 title     	= {{M}ulti-{M}odal {I}nstrument {P}erformance: A musical database},
 doi 	  	= {10.1111/cgf.70025},
 journal   	= {Comput. Graph. Forum},
 volume    	= {44},
 number    	= {2},
 pages     	= {e70025},
 year 	  	= {2025},
 publisher 	= {The Eurographics Association and John Wiley & Sons Ltd.},
 ISSN 	  	= {},
}