Galan-Cuenca, A.; Valero-Mas, J. J.; Martinez-Sevilla, J. C.; Hidalgo-Centeno, A.; Pertusa, A.; Calvo-Zaragoza, J.

MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores Conference

Proceedings of the 32nd ACM International Conference on Multimedia, Association for Computing Machinery, 2024, ISBN: 979-8-4007-0686-8.

Abstract | Links | BibTeX | Tags:

@conference{nokey,

title = {MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores},

author = {A. Galan-Cuenca and J. J. Valero-Mas and J. C. Martinez-Sevilla and A. Hidalgo-Centeno and A. Pertusa and J. Calvo-Zaragoza},

doi = {https://doi.org/10.1145/3664647.3681572},

isbn = {979-8-4007-0686-8},

year  = {2024},

date = {2024-10-28},

booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia},

pages = {583-591},

publisher = {Association for Computing Machinery},

abstract = {Multimodal audio-image music transcription has been recently posed as a means of retrieving a digital score representation by leveraging the individual estimations from Automatic Music Transcription (AMT)---acoustic recordings---and Optical Music Recognition (OMR)---image scores---systems. Nevertheless, while proven to outperform single-modality recognition rates, this approach has been exclusively validated under controlled scenarios---monotimbral and monophonic synthetic data---mainly due to a lack of collections with symbolic score-level annotations for both recordings and graphical sheets. To promote research on this topic, this work presents the Multimodal mUSic Collection for Automatic Transcription (MUSCAT) assortment of acoustic recordings, image sheets, and their score-level annotations in several notation formats. This dataset comprises almost 80 hours of real recordings with varied instrumentation and polyphony degrees---ranging from piano to orchestral music---, 1251 scanned sheets, and 880 symbolic scores from 37 composers, which may also be used in other tasks involving metadata such as instrument identification or composer recognition. A fragmented subset of this collection solely focused on acoustic data for score-level AMT---the MUSic Collection for aUtomatic Transcription - fragmented Subset (MUSCUTS) assortment---is also presented together with a baseline experimentation, concluding the need to foster research on this field with real recordings. Finally, a web-based service is also provided to increase the size of the collections collaboratively.},

keywords = {},

pubstate = {published},

tppubtype = {conference}

}

Close

Penarrubia, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Contrastive Self-Supervised Learning for Optical Music Recognition Conference

International Workshop on Document Analysis Systems, 2024, ISBN: 978-3-031-70442-0.

Abstract | Links | BibTeX | Tags:

Ríos-Vila, A.; Calvo-Zaragoza, J.; Paquet, T.

Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription Conference

Document Analysis and Recognition - ICDAR 2024, vol. 1, Springer Nature Switzerland, 2024, ISBN: 978-3-031-70552-6.

BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Rios-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

A Transformer Approach for Polyphonic Audio-to-Score Transcription Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), Seul (Korea), 2024.

Links | BibTeX | Tags: MultiScore

Penarrubia, C.; Garrido-Munoz, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Efficient notation assembly in optical music recognition Conference

Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.

BibTeX | Tags:

Martínez-Sevilla, J. C.; Ríos-Vila, A.; Castellanos, F. J.; Calvo-Zaragoza, J.

A Holistic Approach for Aligned Music and Lyrics Transcription Conference

Document Analysis and Recognition - ICDAR 2023, vol. 1, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-41676-7.

Abstract | Links | BibTeX | Tags: REPERTORIUM

Martínez-Sevilla, J. C.; Alfaro-Contreras, M.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works Proceedings Article

In: INTERSPEECH Conference, pp. 2793-2797, Dublin, Ireland, 2023.

Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.

Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study Proceedings Article

In: Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science, pp. 64-77, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-37731-0.

Links | BibTeX | Tags: MultiScore

Garrido-Munoz, C.; Alfaro-Contreras, M.; Calvo-Zaragoza, J.

Evaluating Domain Generalization in Kitchen Utensils Classification Proceedings Article

In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 108-118, 2023.

Links | BibTeX | Tags: MultiScore

González-Barrachina, P.; Alfaro-Contreras, M.; Nieto-Hidalgo, M.; Calvo-Zaragoza, J.

Lifelong Learning for Document Image Binarization: An Experimental Study Proceedings Article

In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 146-157, 2023.

Links | BibTeX | Tags: MultiScore

Penarrubia, C.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.

Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification Conference

Iberian Conference on Pattern Recognition and Image Analysis, Alicante, Spain, 2023, ISBN: 978-3-031-36616-1.

Abstract | Links | BibTeX | Tags: DOREMI

Alfaro-Contreras, M.; Iñesta, J. M.; Calvo-Zaragoza, J.

Optical Music Recognition for Homophonic Scores with Neural Networks and Synthetic Music Generation Journal Article

In: International Journal of Multimedia Information Retrieval, vol. 12, pp. 12-24, 2023.

Links | BibTeX | Tags: MultiScore

Ríos-Vila, A.; Rizo, D.; Iñesta, J. M.; Calvo-Zaragoza, J.

End-to-end optical music recognition for pianoform sheet music Journal Article

In: International Journal on Document Analysis and Recognition (IJDAR), iss. ICDAR 2023, 2023, ISSN: 1433-2825.

Abstract | Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Few-Shot Symbol Classification via Self-Supervised Learning and Nearest Neighbor Journal Article

In: Pattern Recognition Letters, vol. 167, pp. 1-8, 2023.

Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.

Late multimodal fusion for image and audio music transcription Journal Article

In: Expert Systems With Applications, vol. 216, pp. 119491-119500, 2023.

Links | BibTeX | Tags: MultiScore

Sánchez-Ferrer, A.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.

An Experimental Study on Marine Debris Location and Recognition using Object Detection Journal Article

In: Pattern Recognition Letters, 2023, ISSN: 0167-8655.

Abstract | Links | BibTeX | Tags: TADMar

Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.

End-to-End Full-Page Optical Music Recognition for Mensural Notation Proceedings Article

In: Proceedings of the 23rd International Society for Music Information Retrieval Conference, pp. 226-232, 2022, ISBN: 978-1-7327299-2-6.

Abstract | Links | BibTeX | Tags: Leonardo2021, MultiScore

Rizo, D.; Delgado, T.; Calvo-Zaragoza, J.; Madueño, A.; García-Iasci, P.

Speeding-up the encoding of mensural collections from Spanish libraries Journal Article

In: IAML 2022 Prague, 2022.

BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.

Decoupling music notation to improve end-to-end Optical Music Recognition Journal Article

In: Pattern Recognition Letters, vol. 158, pp. 157-163, 2022, ISSN: 0167-8655.

Abstract | Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.

Insights into transfer learning between image and audio music transcription Proceedings Article

In: Sound and Music Computing Conference, pp. 295-301, Zenodo, Saint-Étienne, France, 2022.

Abstract | Links | BibTeX | Tags: MultiScore

Arroyo, V.; Valero-Mas, J. J.; Calvo-Zaragoza, J.; Pertusa, A.

Neural audio-to-score music transcription for unconstrained polyphony using compact output representations Proceedings Article

In: Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Singapur, Singapur, 2022.

BibTeX | Tags: MultiScore

Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.

Retrieval of Music-Notation Primitives via Image-to-Sequence Approaches Proceedings Article

In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 482-492, Aveiro, Portugal, 2022, ISBN: 978-3-031-04880-7.

BibTeX | Tags: Leonardo2021

Mas-Candela, E.; Ríos-Vila, A.; Calvo-Zaragoza, J.

A First Approach to Image Transformation Sequence Retrieval Proceedings Article

In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 321-332, Aveiro, Portugal, 2022.

BibTeX | Tags: MultiScore

Sánchez-Ferrer, A.; Gallego, A. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

The CleanSea Set: A Benchmark Corpus for Underwater Debris Detection and Recognition Proceedings Article

In: 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), pp. 616–628, Aveiro, Portugal, 2022, ISBN: 978-3-031-04881-4.

Abstract | BibTeX | Tags:

Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.

On the Use of Transformers for End-to-End Optical Music Recognition Proceedings Article

In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 470-481, Aveiro, Portugal, 2022, ISBN: 978-3-031-04880-7.

BibTeX | Tags: MultiScore

Fuente, C.; Valero-Mas, J. J.; Castellanos, F. J.; Calvo-Zaragoza, J.

Multimodal Image and Audio Music Transcription Journal Article

In: International Journal of Multimedia Information Retrieval, vol. 11, pp. 77-84, 2022.

BibTeX | Tags: MultiScore

Rosello, A.; Ayllon, E.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Test Sample Selection for Handwriting Recognition Through Language Modeling Proceedings Article

In: Pattern Recognition and Image Analysis - 10th Iberian Conference, IbPRIA 2022, Aveiro, Portugal, May 4-6, 2022, Proceedings, 2022.

BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Few-Shot Music Symbol Classification via Self-Supervised Learning and Nearest Neighbor Proceedings Article

In: Pattern Recognition. ICPR International Workshops and Challenges, 2022.

BibTeX | Tags: MultiScore

Sáez-Pérez, J.; Gallego, A. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Domain Adaptation in Robotics: A Study Case on Kitchen Utensil Recognition Proceedings Article

In: 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), 2022.

BibTeX | Tags: ROMA

Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.

A holistic approach for image-to-graph: application to optical music recognition Journal Article

In: International Journal on Document Analysis and Recognition, 2022.

BibTeX | Tags: Leonardo2021

Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.

End-to-End Full-Page Optical Music Recognition for Mensural Notation Proceedings Article

In: Proceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR, Bangalore, India, 2022.

Abstract | BibTeX | Tags: Leonardo2021, MultiScore

Castellanos, F. J.; Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.

Region-based Layout Analysis of Music Score Images Journal Article

In: Expert Systems with Applications, pp. 118211, 2022, ISSN: 0957-4174.

BibTeX | Tags: MultiScore

de la Fuente, C.; Castellanos, F. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Multimodal Recognition of Frustration during Game-Play with Deep Neural Networks Journal Article

In: Multimedia Tools and Applications, 2022.

BibTeX | Tags: ACIF/2019/042, APOSTD/2020/256

Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.; Fujinaga, I.

Domain Adaptation for Staff-Region Retrieval of Music Score Images Journal Article

In: International Journal on Document Analysis and Recognition, vol. 25, iss. Special Issue: ICFHR 2022, pp. 281-292, 2022, ISSN: 1433-2825.

BibTeX | Tags: MultiScore

Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.

An Unsupervised Domain Adaptation framework for Layout Analysis of Music Score Images Proceedings Article

In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 6, 2021.

BibTeX | Tags: GRE19-04, ROMA

Ríos-Vila, A.; Calvo-Zaragoza, J.; Iñesta, J. M.

CTC-based end-to-end approach for full page Optical Music Recognition Proceedings Article

In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 11, 2021.

BibTeX | Tags: MultiScore

Calvo-Zaragoza, J.; Pertusa, A.; Gallego, A. J.; Iñesta, J. M.; Micó, L.; Oncina, J.; Perez-Sancho, C.; de León, P. J. Ponce; Rizo, D.

MultiScore Project: Multimodal Transcription of Music Scores Proceedings Article

In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 3, 2021.

Links | BibTeX | Tags: MultiScore

Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.

Unsupervised Neural Domain Adaptation for Document Image Binarization Journal Article

In: Pattern Recognition, vol. 119, pp. 108099, 2021.

BibTeX | Tags: GRE19-04, HispaMus

Alfaro-Contreras, M.; Rizo, D.; Iñesta, J. M.; Calvo-Zaragoza, J.

OMR-assisted transcription: a case study with early prints Proceedings Article

In: Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR, pp. 35-41, 2021, ISBN: 978-1-7327299-0-2.

BibTeX | Tags: MultiScore

Calvo-Zaragoza, J.; Rizo, D.; Iñesta, J. M.

Reconocimiento Óptico de Partituras (OMR) aplicado al Fonde de Música Tradicional IMF-CSIC Book Chapter

In: Gambero-Ustárroz, M.; Ros-Fábregas, E. (Ed.): Musicología en Web. Patrimonio musical y Humanidades Digitales, Chapter 4, pp. 87-109, Edition Reichenberger, 2021, ISBN: 978-3-967280-14-2.

BibTeX | Tags: HispaMus

Garrido-Munoz, C.; Sánchez-Hernández, A.; Castellanos, F. J.; Calvo-Zaragoza, J.

Domain Adaptation for Document Image Binarization via Domain Classification Proceedings Article

In: Tallón-Ballesteros, A. J. (Ed.): Frontiers in Artificial Intelligence and Applications, pp. 569-582, IOS Press, 2021, ISBN: 978-1-64368-224-2.

BibTeX | Tags: GRE19-04, GV/2020/030

Gallego, A. J.; Calvo-Zaragoza, J.; Fisher, R. B.

Incremental Unsupervised Domain-Adversarial Training of Neural Networks Journal Article

In: IEEE Transactions on Neural Networks and Learning Systems, vol. 32, no. 11, pp. 4864-4878, 2021, ISSN: 2162-2388.

Abstract | Links | BibTeX | Tags: GRE19-04, HispaMus

@article{k455,

title = {Incremental Unsupervised Domain-Adversarial Training of Neural Networks},

author = {A. J. Gallego and J. Calvo-Zaragoza and R. B. Fisher},

url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/455/2001.04129.pdf},

issn = {2162-2388},

year  = {2021},

date = {2021-01-01},

urldate = {2021-01-01},

journal = {IEEE Transactions on Neural Networks and Learning Systems},

volume = {32},

number = {11},

pages = {4864-4878},

abstract = {In the context of supervised statistical learning, it is typically assumed that the training set comes from the same distribution that draws the test samples. When this is not the case, the behavior of the learned model is unpredictable and becomes dependent upon the degree of similarity between the distribution of the training set and the distribution of the test set. One of the research topics that investigates this scenario is referred to as Domain Adaptation (DA). Deep neural networks brought dramatic advances in pattern recognition and that is why there have been many attempts to provide good domain adaptation algorithms for these models. Here we take a different avenue and approach the problem from an incremental point of view, where the model is adapted to the new domain iteratively. We make use of an existing unsupervised domain-adaptation algorithm to identify the target samples on which there is greater confidence about their true label. The output of the model is analyzed in different ways to determine the candidate samples. The selected samples are then added to the source training set by self-labeling, and the process is repeated until all target samples are labeled. This approach implements a form of adversarial training in which, by moving the self-labeled samples from the target to the source set, the DA algorithm is forced to look for new features after each iteration. Our results report a clear improvement with respect to the non-incremental case in several datasets, also outperforming other state-of-the-art domain adaptation algorithms.},

keywords = {GRE19-04, HispaMus},

pubstate = {published},

tppubtype = {article}

}

Close

Ríos-Vila, A.; Rizo, D.; Calvo-Zaragoza, J.

Complete Optical Music Recognition via Agnostic Transcription and Machine Translation Proceedings Article

In: 16th International Conference on Document Analysis and Recognition, pp. 661-675, 2021.

BibTeX | Tags: GV/2020/030

Castellanos, F. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Prototype Generation in the String Space via Approximate Median for Data Reduction in Nearest Neighbor classification Journal Article

In: Soft Computing, vol. 25, 2021, ISSN: 15403-15415.

BibTeX | Tags: GRE19-04, HispaMus

Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.

Unsupervised Domain Adaptation for Document Analysis of Music Score Images Proceedings Article

In: Proc. of the 22nd International Society for Music Information Retrieval Conference, 2021.

BibTeX | Tags: GRE19-04

Román, M. A.

An End-to-End Framework for Audio-to-Score Music Transcription PhD Thesis

2021.

BibTeX | Tags: HispaMus

Mas-Candela, E.; Alfaro-Contreras, M.; Calvo-Zaragoza, J.

Sequential Next-Symbol Prediction for Optical Music Recognition Proceedings Article

In: 16th International Conference on Document Analysis and Recognition, pp. 708-722, 2021, ISBN: 978-3-030-86334-0.

Links | BibTeX | Tags: GV/2020/030

López-Gutiérrez, J. C.; Valero-Mas, J. J.; Castellanos, F. J.; Calvo-Zaragoza, J.

Data Augmentation for End-to-End Optical Music Recognition Proceedings Article

In: Proceedings of the 14th IAPR International Workshop on Graphics Recognition (GREC), pp. 59-73, Springer, 2021.

BibTeX | Tags: GV/2020/030

Fuente, C; Valero-Mas, J. J.; Castellanos, F. J.; Calvo-Zaragoza, J.

Multimodal Audio and Image Music Transcription Proceedings Article

In: Proc. of the 3rd International Workshop on Reading Music Systems, pp. 18-22, 2021.

BibTeX | Tags: ACIF/2019/042, APOSTD/2020/256, MultiScore

Ríos-Vila, A.; Calvo-Zaragoza, J.; Rizo, D.

Evaluating Simultaneous Recognition and Encoding for Optical Music Recognition Proceedings Article

In: DLfM 2020: 7th International Conference on Digital Libraries for Musicology, pp. 10-17, Association for Computing Machinery, 2020, ISBN: 978-1-4503-8760-6.

BibTeX | Tags: HispaMus

PUBLICATIONS

2024

2023

2022

2021

2020