FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models
Luca Comanducci1, Paolo Bestagini1, and Stefano Tubaro1
1 Dipartimento di Elettronica, Informazione e Bioingegneria - Politecnico di Milano
Text-To-Music (TTM) models have recently revolutionized the automatic music generation research field by outperforming all previous state-of-the-art approaches and by lowering the technical proficiency needed to use them. For these reasons, they have quickly been adopted in commercial applications and music production practices. This widespread diffusion of TTMs raises several concerns regarding copyright violation and rightful attribution, which deserve serious consideration by the audio forensics community. In this paper, we tackle the problem of detection and attribution of TTM-generated data. We propose FakeMusicCaps, a dataset that contains several versions of the music-caption pairs dataset MusicCaps, re-generated via several state-of-the-art TTM techniques. We evaluate the proposed dataset by performing initial experiments on the detection and attribution of TTM-generated audio.
The full FakeMusicCaps dataset can be downloaded from the companion website.
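As a quick-start illustration, the sketch below shows one way to iterate over the dataset for a detection (real vs. fake) or attribution (which generator) experiment. The folder names, file layout, and the soundfile dependency are assumptions for illustration only, not the actual repository structure; consult the companion website for the real layout.

```python
# Minimal sketch of loading FakeMusicCaps for detection/attribution.
# NOTE: the directory layout and class-folder names below are assumptions
# for illustration; check the companion website for the actual structure.
from pathlib import Path

import soundfile as sf  # pip install soundfile

DATASET_ROOT = Path("FakeMusicCaps")  # hypothetical local path

# Hypothetical class folders: the real MusicCaps audio plus one folder
# per TTM model used to re-generate the captions. Label 0 = real,
# labels 1..N = the individual generators (for attribution).
CLASS_DIRS = ["MusicCaps", "TTM_model_1", "TTM_model_2", "TTM_model_3"]


def iter_labeled_clips(root: Path, class_dirs):
    """Yield (waveform, sample_rate, label) for every WAV clip found."""
    for label, class_dir in enumerate(class_dirs):
        for wav_path in sorted((root / class_dir).glob("*.wav")):
            audio, sr = sf.read(wav_path)
            yield audio, sr, label


if __name__ == "__main__":
    # For binary detection, collapse labels 1..N into a single "fake" class;
    # for attribution, keep the per-generator labels as they are.
    for audio, sr, label in iter_labeled_clips(DATASET_ROOT, CLASS_DIRS):
        print(audio.shape, sr, label)
        break
```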
For more details, see the paper: "FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models"
If you use the code or data from this work, please cite our paper:
@misc{comanducci2024fakemusiccapsdatasetdetectionattribution,
  title={FakeMusicCaps: a Dataset for Detection and Attribution of Synthetic Music Generated via Text-to-Music Models},
  author={Luca Comanducci and Paolo Bestagini and Stefano Tubaro},
  year={2024},
  eprint={2409.10684},
  archivePrefix={arXiv},
  primaryClass={eess.AS},
  url={https://arxiv.org/abs/2409.10684},
}