2025
Kim, D.; Han, D.; Jeong, D.; Valero-Mas, J. J.
On the automatic recognition of Jeongganbo music notation: dataset and approach Journal Article
In: Journal on Computing and Cultural Heritage, 2025, ISSN: 1556-4673.
@article{nokey,
title = {On the automatic recognition of Jeongganbo music notation: dataset and approach},
author = {D. Kim and D. Han and D. Jeong and J. J. Valero-Mas},
issn = {1556-4673},
year = {2025},
date = {2025-01-16},
urldate = {2025-01-16},
journal = {Journal on Computing and Cultural Heritage},
abstract = {The Jeongganbo notation, the first music representation system in East Asia capable of jointly expressing pitch and duration, has been extensively used---and still is---in the Korean music tradition since its inception in the 15th century. In this regard, there exists a plethora of music works that exclusively endure as physical sheets, which not only constitutes a heritage preservation challenge due to the inherent degradation of this format but also impedes the use of computational tools to study and exploit this music tradition. While the Optical Music Recognition (OMR) field, which represents the research area devoted to devising methods capable of automatically transcribing music sheets into digital formats, has addressed this issue in a number of music notations from the Western tradition, no previous research has considered the preservation of Jeongganbo scores. In this context, this work presents the following contributions: (i) the first data assortment of real Jeongganbo scores for OMR tasks; (ii) a collection of synthetic data generation and augmentation mechanisms to alleviate the scarcity of manual annotation; and (iii) a neural-based transcription scheme based on state-of-the-art OMR strategies specifically tailored to Jeongganbo scores. The experiments performed prove the validity of the approach---performance rates close to 90% success---and open new research avenues for under-resourced yet challenging music notations.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
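The augmentation mechanisms mentioned in the abstract above can be pictured with a minimal image-augmentation pipeline. The sketch below uses torchvision; the specific transforms and their magnitudes are assumptions for illustration, not the synthetic-data pipeline of the paper.

# Generic augmentation sketch for score-page images (illustrative only; the transforms
# and parameter values are assumptions, not the paper's synthetic-data pipeline).
from torchvision import transforms

score_augment = transforms.Compose([
    transforms.Grayscale(num_output_channels=1),   # printed scores are near-binary
    transforms.RandomRotation(degrees=2),          # slight skew, as in scanned pages
    transforms.RandomAffine(degrees=0, translate=(0.02, 0.02), scale=(0.95, 1.05)),
    transforms.ColorJitter(brightness=0.2, contrast=0.2),
    transforms.GaussianBlur(kernel_size=3),
    transforms.ToTensor(),
])
# Usage (hypothetical): augmented = score_augment(pil_page_image)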
2024
Galan-Cuenca, A.; Valero-Mas, J. J.; Martinez-Sevilla, J. C.; Hidalgo-Centeno, A.; Pertusa, A.; Calvo-Zaragoza, J.
MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores Conference
Proceedings of the 32nd ACM International Conference on Multimedia, Association for Computing Machinery, 2024, ISBN: 979-8-4007-0686-8.
@conference{nokey,
title = {MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores},
author = {A. Galan-Cuenca and J. J. Valero-Mas and J. C. Martinez-Sevilla and A. Hidalgo-Centeno and A. Pertusa and J. Calvo-Zaragoza},
doi = {10.1145/3664647.3681572},
isbn = {979-8-4007-0686-8},
year = {2024},
date = {2024-10-28},
booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia},
pages = {583-591},
publisher = {Association for Computing Machinery},
abstract = {Multimodal audio-image music transcription has been recently posed as a means of retrieving a digital score representation by leveraging the individual estimations from Automatic Music Transcription (AMT)---acoustic recordings---and Optical Music Recognition (OMR)---image scores---systems. Nevertheless, while proven to outperform single-modality recognition rates, this approach has been exclusively validated under controlled scenarios---monotimbral and monophonic synthetic data---mainly due to a lack of collections with symbolic score-level annotations for both recordings and graphical sheets. To promote research on this topic, this work presents the Multimodal mUSic Collection for Automatic Transcription (MUSCAT) assortment of acoustic recordings, image sheets, and their score-level annotations in several notation formats. This dataset comprises almost 80 hours of real recordings with varied instrumentation and polyphony degrees---ranging from piano to orchestral music---, 1251 scanned sheets, and 880 symbolic scores from 37 composers, which may also be used in other tasks involving metadata such as instrument identification or composer recognition. A fragmented subset of this collection solely focused on acoustic data for score-level AMT---the MUSic Collection for aUtomatic Transcription - fragmented Subset (MUSCUTS) assortment---is also presented together with a baseline experimentation, concluding the need to foster research on this field with real recordings. Finally, a web-based service is also provided to increase the size of the collections collaboratively.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Penarrubia, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Contrastive Self-Supervised Learning for Optical Music Recognition Conference
International Workshop on Document Analysis Systems, 2024, ISBN: 978-3-031-70442-0.
@conference{nokey,
title = {Contrastive Self-Supervised Learning for Optical Music Recognition},
author = {C. Penarrubia and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {10.1007/978-3-031-70442-0_19},
isbn = {978-3-031-70442-0},
year = {2024},
date = {2024-09-11},
urldate = {2024-09-11},
booktitle = {International Workshop on Document Analysis Systems},
pages = {312-326},
abstract = {Optical Music Recognition (OMR) is the research area focused on transcribing images of musical scores. In recent years, this field has seen great development thanks to the emergence of Deep Learning. However, these types of solutions require large volumes of labeled data. To alleviate this problem, Contrastive Self-Supervised Learning (SSL) has emerged as a paradigm that leverages large amounts of unlabeled data to train neural networks, yielding meaningful and robust representations. In this work, we explore its first application to the field of OMR. By utilizing three datasets that represent the heterogeneity of musical scores in notations and graphic styles, and through multiple evaluation protocols, we demonstrate that contrastive SSL delivers promising results, significantly reducing data scarcity challenges in OMR. To the best of our knowledge, this is the first study that integrates these two fields. We hope this research serves as a baseline and stimulates further exploration.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
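For reference, the contrastive objective behind SimCLR-style self-supervision summarized above can be written as an NT-Xent loss. The sketch below is a generic PyTorch formulation under assumed embedding shapes; it is not the training code of the paper.

# Generic NT-Xent (SimCLR-style) contrastive loss; illustrative, not the paper's code.
import torch
import torch.nn.functional as F

def nt_xent(z1, z2, temperature=0.5):
    # z1, z2: (N, D) embeddings of two augmented views of the same N score crops.
    z = F.normalize(torch.cat([z1, z2], dim=0), dim=1)      # (2N, D)
    sim = z @ z.t() / temperature                           # scaled cosine similarities
    n = z1.size(0)
    mask = torch.eye(2 * n, dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(mask, float("-inf"))              # discard self-similarity
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
    return F.cross_entropy(sim, targets)                    # positives are the paired views

# Usage (hypothetical): loss = nt_xent(encoder(view_a), encoder(view_b))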
Ríos-Vila, A.; Calvo-Zaragoza, J.; Paquet, T.
Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription Conference
Document Analysis and Recognition - ICDAR 2024, vol. 1, Springer Nature Switzerland, 2024, ISBN: 978-3-031-70552-6.
@conference{RiosVila:ICDAR:2024,
title = {Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription},
author = {A. Ríos-Vila and J. Calvo-Zaragoza and T. Paquet},
isbn = {978-3-031-70552-6},
year = {2024},
date = {2024-09-02},
urldate = {2024-09-02},
booktitle = {Document Analysis and Recognition - ICDAR 2024},
volume = {1},
pages = {20-37},
publisher = {Springer Nature Switzerland},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {conference}
}
Maciá, M.; Rizo, D.
The Impact of UX/UI on Piano-Assisted Learning in Extended Reality Conference
Computer Supported Music Education, Angers, France, 2024.
@conference{macia2024,
title = {The Impact of UX/UI on Piano-Assisted Learning in Extended Reality},
author = {M. Maciá and D. Rizo},
year = {2024},
date = {2024-05-04},
urldate = {2024-05-04},
booktitle = {Computer Supported Music Education. Angers, France.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Alfaro-Contreras, M.; Rios-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
A Transformer Approach for Polyphonic Audio-to-Score Transcription Proceedings Article
In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), Seoul (Korea), 2024.
@inproceedings{Alfaro-Contreras:ICASSP24,
title = {A Transformer Approach for Polyphonic Audio-to-Score Transcription},
author = {M. Alfaro-Contreras and A. Rios-Vila and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {10.1109/ICASSP48485.2024.10447162},
year = {2024},
date = {2024-04-19},
urldate = {2024-04-19},
booktitle = {Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)},
address = {Seoul (Korea)},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Valero-Mas, J. J.; Gallego, A. J.; Rico-Juan, J. R.
An overview of ensemble and feature learning in few-shot image classification using siamese networks Journal Article
In: Multimedia Tools and Applications, vol. 83, pp. 19929–19952, 2024, ISSN: 1380-7501.
@article{nokey,
title = {An overview of ensemble and feature learning in few-shot image classification using siamese networks},
author = {J. J. Valero-Mas and A. J. Gallego and J. R. Rico-Juan },
doi = {10.1007/s11042-023-15607-3},
issn = {1380-7501},
year = {2024},
date = {2024-02-01},
urldate = {2023-07-29},
journal = {Multimedia Tools and Applications},
volume = {83},
pages = {19929–19952},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2023
Ramoneda, P.; Jeong, D.; Valero-Mas, J. J.; Serra, X.
Predicting performance difficulty from piano sheet music images Conference
Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.
@conference{nokey,
title = {Predicting performance difficulty from piano sheet music images},
author = {P. Ramoneda and D. Jeong and J. J. Valero-Mas and X. Serra},
doi = {10.5281/zenodo.10265386},
isbn = {978-1-7327299-3-3},
year = {2023},
date = {2023-11-04},
urldate = {2023-11-04},
booktitle = {Proceedings of the 24th International Society for Music Information Retrieval Conference},
pages = {708-715},
address = {Milan, Italy},
abstract = {Estimating the performance difficulty of a musical score is crucial in music education for adequately designing the learning curriculum of the students. Although the music information retrieval community has recently shown interest in this task, existing approaches mainly use machine-readable scores, leaving the broader case of sheet music images unaddressed. Based on previous works involving sheet music images, we use a mid-level representation, bootleg score, describing notehead positions relative to staff lines coupled with a transformer model. This architecture is adapted to our task by introducing a different encoding scheme that reduces the encoded sequence length to one-eighth of the original size. In terms of evaluation, we consider five datasets---more than 7500 scores with up to 9 difficulty levels---, two being mainly compiled for this work. The results obtained when pretraining the scheme on the IMSLP corpus and fine-tuning it on the considered datasets prove the proposal's validity, achieving the best-performing model with a balanced accuracy of 40.3% and a mean square error of 1.3. Finally, we provide access to our code, data, and models for transparency and reproducibility.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
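The abstract above mentions an encoding that shrinks the bootleg-score sequence to one-eighth of its length before the transformer. One generic way to obtain such a factor is to merge groups of consecutive columns into single input vectors; the grouping below is an assumption for illustration, not necessarily the paper's exact scheme.

# Illustrative only: shorten a binary bootleg-score matrix by feeding 8 consecutive
# columns per model step. The grouping factor and layout are assumptions.
import numpy as np

def group_columns(bootleg, group=8):
    # bootleg: (H, T) binary matrix (staff positions x time).
    # Returns (T // group, H * group): one merged feature vector per group of columns.
    h, t = bootleg.shape
    t_trim = (t // group) * group
    return bootleg[:, :t_trim].T.reshape(-1, group * h)

seq = group_columns(np.random.randint(0, 2, size=(62, 160)))
print(seq.shape)   # (20, 496): 160 time steps reduced to 20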
Penarrubia, C.; Garrido-Munoz, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Efficient notation assembly in optical music recognition Conference
Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.
@conference{nokey,
title = {Efficient notation assembly in optical music recognition},
author = {C. Penarrubia and C. Garrido-Munoz and J. J. Valero-Mas and J. Calvo-Zaragoza},
isbn = {978-1-7327299-3-3},
year = {2023},
date = {2023-10-30},
booktitle = {Proceedings of the 24th International Society for Music Information Retrieval Conference},
pages = {182-189},
address = {Milan, Italy},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Martínez-Sevilla, J. C.; Ríos-Vila, A.; Castellanos, F. J.; Calvo-Zaragoza, J.
A Holistic Approach for Aligned Music and Lyrics Transcription Conference
Document Analysis and Recognition - ICDAR 2023, vol. 1, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-41676-7.
@conference{MartinezSevilla:ICDAR:2023,
title = {A Holistic Approach for Aligned Music and Lyrics Transcription},
author = {J.C. Martínez-Sevilla and A. Ríos-Vila and F. J. Castellanos and J. Calvo-Zaragoza },
editor = {Fink, Gernot A. and Jain, Rajiv and Kise, Koichi and Zanibbi, Richard},
doi = {10.1007/978-3-031-41676-7_11},
isbn = {978-3-031-41676-7},
year = {2023},
date = {2023-08-28},
urldate = {2023-08-28},
booktitle = {Document Analysis and Recognition - ICDAR 2023},
volume = {1},
pages = {185--201},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {In this paper, we present the Aligned Music Notation and Lyrics Transcription (AMNLT) challenge, whose goal is to retrieve the content from document images of vocal music. This new research area arises from the need to automatically transcribe notes and lyrics from music scores and align both sources of information conveniently. Although existing methods are able to deal with music notation and text, they work without providing their proper alignment, which is crucial to actually retrieve the content of the piece of vocal music. To overcome this challenge, we consider holistic neural approaches that transcribe music and text in one step, along with an encoding that implicitly aligns the sources of information. The methodology is evaluated on a benchmark specifically designed for AMNLT. The results report that existing methods can obtain high-quality text and music transcriptions, but posterior alignment errors are inevitably found. However, our formulation achieves relative improvements of over 80{%} in the metric that considers both transcription and alignment. We hope that this work will establish itself as a future reference for further research on AMNLT.},
keywords = {REPERTORIUM},
pubstate = {published},
tppubtype = {conference}
}
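The "encoding that implicitly aligns the sources of information" described above can be pictured as a single target sequence that interleaves music tokens with the syllables sung on them. The token names below are invented for illustration and are not the AMNLT vocabulary.

# Hypothetical aligned encoding: each vocal unit emits its note token(s) followed by the
# syllable they carry, so a single output sequence conveys both sources already aligned.
units = [
    {"notes": ["note-C4_quarter"], "syllable": "Ky"},
    {"notes": ["note-D4_eighth", "note-E4_eighth"], "syllable": "ri"},
    {"notes": ["note-F4_half"], "syllable": "e"},
]

def encode_aligned(units):
    sequence = []
    for unit in units:
        sequence.extend(unit["notes"])
        sequence.append("lyric:" + unit["syllable"])
    return sequence

print(encode_aligned(units))
# ['note-C4_quarter', 'lyric:Ky', 'note-D4_eighth', 'note-E4_eighth', 'lyric:ri', ...]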
Martínez-Sevilla, J. C.; Alfaro-Contreras, M.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works Proceedings Article
In: INTERSPEECH Conference, pp. 2793-2797, Dublin, Ireland, 2023.
@inproceedings{Martínez-Sevilla2023,
title = {Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works},
author = {J.C. Martínez-Sevilla and M. Alfaro-Contreras and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {10.21437/Interspeech.2023-88},
year = {2023},
date = {2023-08-20},
urldate = {2023-08-20},
booktitle = {INTERSPEECH Conference},
pages = {2793-2797},
address = {Dublin, Ireland},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study Proceedings Article
In: Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science, pp. 64-77, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-37731-0.
@inproceedings{k505,
title = {Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study},
author = {M. Alfaro-Contreras and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1007/978-3-031-37731-0_6},
isbn = {978-3-031-37731-0},
year = {2023},
date = {2023-08-10},
urldate = {2022-01-01},
booktitle = {Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science},
volume = {13645},
pages = {64-77},
publisher = {Springer Nature Switzerland},
address = {Cham},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Garrido-Munoz, C.; Alfaro-Contreras, M.; Calvo-Zaragoza, J.
Evaluating Domain Generalization in Kitchen Utensils Classification Proceedings Article
In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 108-118, 2023.
@inproceedings{Garrido-Munoz2023,
title = {Evaluating Domain Generalization in Kitchen Utensils Classification},
author = {C. Garrido-Munoz and M. Alfaro-Contreras and J. Calvo-Zaragoza},
doi = {10.1007/978-3-031-36616-1_9},
year = {2023},
date = {2023-06-25},
booktitle = {Iberian Conference on Pattern Recognition and Image Analysis},
pages = {108-118},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
González-Barrachina, P.; Alfaro-Contreras, M.; Nieto-Hidalgo, M.; Calvo-Zaragoza, J.
Lifelong Learning for Document Image Binarization: An Experimental Study Proceedings Article
In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 146-157, 2023.
@inproceedings{González-Barrachina2023,
title = {Lifelong Learning for Document Image Binarization: An Experimental Study},
author = {P. González-Barrachina and M. Alfaro-Contreras and M. Nieto-Hidalgo and J. Calvo-Zaragoza },
doi = {10.1007/978-3-031-36616-1_12},
year = {2023},
date = {2023-06-25},
urldate = {2023-06-25},
booktitle = {Iberian Conference on Pattern Recognition and Image Analysis},
pages = {146-157},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Penarrubia, C.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.
Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification Conference
Iberian Conference on Pattern Recognition and Image Analysis, Alicante, Spain, 2023, ISBN: 978-3-031-36616-1.
@conference{nokey,
title = {Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification},
author = {C. Penarrubia and J. J. Valero-Mas and A. J. Gallego and J. Calvo-Zaragoza},
doi = {10.1007/978-3-031-36616-1_2},
isbn = {978-3-031-36616-1},
year = {2023},
date = {2023-06-25},
booktitle = {Iberian Conference on Pattern Recognition and Image Analysis},
pages = {15-27},
address = {Alicante, Spain},
abstract = {Prototype Generation (PG) methods seek to improve the efficiency of the k-Nearest Neighbor (kNN) classifier by obtaining a reduced version of a given reference dataset following certain heuristics. Despite being largely addressed topic in multiclass scenarios, few works deal with PG in multilabel environments. Hence, the existing proposals exhibit a number of limitations, being label imbalance one of paramount relevance as it constitutes a typical challenge of multilabel datasets. This work proposes two novel merging policies for multilabel PG schemes specifically devised for label imbalance, as well as a mechanism to prevent inappropriate samples from undergoing a reduction process. These proposals are applied to three existing multilabel PG methods—Multilabel Reduction through Homogeneous Clustering, Multilabel Chen, and Multilabel Reduction through Space Partitioning—and evaluated on 12 different data assortments with different degrees of label imbalance. The results prove that the proposals overcome—in some cases in a significant manner—those obtained with the original methods, hence validating the presented approaches and enabling further research lines on this topic.},
keywords = {DOREMI},
pubstate = {published},
tppubtype = {conference}
}
Alfaro-Contreras, M.; Iñesta, J. M.; Calvo-Zaragoza, J.
Optical Music Recognition for Homophonic Scores with Neural Networks and Synthetic Music Generation Journal Article
In: International Journal of Multimedia Information Retrieval, vol. 12, pp. 12-24, 2023.
@article{Alfaro-Contreras2023b,
title = {Optical Music Recognition for Homophonic Scores with Neural Networks and Synthetic Music Generation},
author = {M. Alfaro-Contreras and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1007/s13735-023-00278-5},
year = {2023},
date = {2023-05-26},
urldate = {2023-05-26},
journal = {International Journal of Multimedia Information Retrieval},
volume = {12},
pages = {12-24},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ríos-Vila, A.; Rizo, D.; Iñesta, J. M.; Calvo-Zaragoza, J.
End-to-end optical music recognition for pianoform sheet music Journal Article
In: International Journal on Document Analysis and Recognition (IJDAR), iss. ICDAR 2023, 2023, ISSN: 1433-2825.
@article{Ríos-Vila2023,
title = {End-to-end optical music recognition for pianoform sheet music},
author = {A. Ríos-Vila and D. Rizo and J. M. Iñesta and J. Calvo-Zaragoza},
url = {https://link.springer.com/content/pdf/10.1007/s10032-023-00432-z.pdf},
doi = {10.1007/s10032-023-00432-z},
issn = {1433-2825},
year = {2023},
date = {2023-05-12},
urldate = {2023-05-12},
journal = {International Journal on Document Analysis and Recognition (IJDAR)},
issue = {ICDAR 2023},
abstract = {End-to-end solutions have brought about significant advances in the field of Optical Music Recognition. These approaches directly provide the symbolic representation of a given image of a musical score. Despite this, several documents, such as pianoform musical scores, cannot yet benefit from these solutions since their structural complexity does not allow their effective transcription. This paper presents a neural method whose objective is to transcribe these musical scores in an end-to-end fashion. We also introduce the GrandStaff dataset, which contains 53,882 single-system piano scores in common western modern notation. The sources are encoded in both a standard digital music representation and its adaptation for current transcription technologies. The method proposed in this paper is trained and evaluated using this dataset. The results show that the approach presented is, for the first time, able to effectively transcribe pianoform notation in an end-to-end manner.},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Few-Shot Symbol Classification via Self-Supervised Learning and Nearest Neighbor Journal Article
In: Pattern Recognition Letters, vol. 167, pp. 1-8, 2023.
@article{Alfaro-Contreras2023,
title = {Few-Shot Symbol Classification via Self-Supervised Learning and Nearest Neighbor},
author = {M. Alfaro-Contreras and A. Ríos-Vila and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {10.1016/j.patrec.2023.01.014},
year = {2023},
date = {2023-03-01},
journal = {Pattern Recognition Letters},
volume = {167},
pages = {1-8},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Rico-Juan, J. R.; Sánchez-Cartagena, V. M.; Valero-Mas, J. J.; Gallego, A. J.
Identifying student profiles within online judge systems using explainable artificial intelligence Journal Article
In: IEEE Transactions on Learning Technologies, vol. 16, no. 6, pp. 955-969, 2023, ISSN: 1939-1382.
@article{nokey,
title = {Identifying student profiles within online judge systems using explainable artificial intelligence},
author = {J. R. Rico-Juan and V. M. Sánchez-Cartagena and J. J. Valero-Mas and A. J. Gallego},
doi = {10.1109/TLT.2023.3239110},
issn = {1939-1382},
year = {2023},
date = {2023-01-23},
urldate = {2023-01-23},
journal = {IEEE Transactions on Learning Technologies},
volume = {16},
number = {6},
pages = {955-969},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Valero-Mas, J. J.; Gallego, A. J.; Alonso-Jiménez, P.; Serra, X.
Multilabel Prototype Generation for Data Reduction in k-Nearest Neighbour classification Journal Article
In: Pattern Recognition, vol. 135, pp. 109190, 2023, ISSN: 0031-3203.
@article{k519,
title = {Multilabel Prototype Generation for Data Reduction in k-Nearest Neighbour classification},
author = {J. J. Valero-Mas and A. J. Gallego and P. Alonso-Jiménez and X. Serra},
doi = {10.1016/j.patcog.2022.109190},
issn = {0031-3203},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Pattern Recognition},
volume = {135},
pages = {109190},
abstract = {Prototype Generation (PG) methods are typically considered for improving the efficiency of the k-Nearest Neighbour (kNN) classifier when tackling high-size corpora. Such approaches aim at generating a reduced version of the corpus without decreasing the classification performance when compared to the initial set. Despite their large application in multiclass scenarios, very few works have addressed the proposal of PG methods for the multilabel space. In this regard, this work presents the novel adaptation of four multiclass PG strategies to the multilabel case. These proposals are evaluated with three multilabel kNN-based classifiers, 12 corpora comprising a varied range of domains and corpus sizes, and different noise scenarios artificially induced in the data. The results obtained show that the proposed adaptations are capable of significantly improving—both in terms of efficiency and classification performance—the only reference multilabel PG work in the literature as well as the case in which no PG method is applied, also presenting statistically superior robustness in noisy scenarios. Moreover, these novel PG strategies allow prioritising either the efficiency or efficacy criteria through its configuration depending on the target scenario, hence covering a wide area in the solution space not previously filled by other works.},
keywords = {DOREMI, MultiScore},
pubstate = {published},
tppubtype = {article}
}
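As a rough picture of clustering-based Prototype Generation in a multilabel setting, as discussed above, the sketch below clusters the training features and keeps one prototype per cluster whose label set aggregates its members. It is a generic reduction sketch with assumed parameters and merging rule, not one of the four adapted strategies evaluated in the paper.

# Illustrative multilabel prototype generation: cluster the features and emit one centroid
# per cluster, labelled with every label present in at least half of its members.
import numpy as np
from sklearn.cluster import KMeans

def reduce_multilabel(X, Y, n_prototypes=50, label_ratio=0.5):
    # X: (N, D) features; Y: (N, L) binary label matrix. Returns (P, D) and (P, L).
    km = KMeans(n_clusters=n_prototypes, n_init=10, random_state=0).fit(X)
    prototypes, labels = [], []
    for c in range(n_prototypes):
        members = km.labels_ == c
        if not members.any():
            continue
        prototypes.append(X[members].mean(axis=0))
        labels.append((Y[members].mean(axis=0) >= label_ratio).astype(int))
    return np.vstack(prototypes), np.vstack(labels)

Xr, Yr = reduce_multilabel(np.random.rand(1000, 16), np.random.randint(0, 2, (1000, 5)))
print(Xr.shape, Yr.shape)   # e.g. (50, 16) (50, 5): 1000 samples reduced to 50 prototypes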
Sánchez-Ferrer, A.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.
An Experimental Study on Marine Debris Location and Recognition using Object Detection Journal Article
In: Pattern Recognition Letters, 2023, ISSN: 0167-8655.
@article{k521,
title = {An Experimental Study on Marine Debris Location and Recognition using Object Detection},
author = {A. Sánchez-Ferrer and J. J. Valero-Mas and A. J. Gallego and J. Calvo-Zaragoza},
doi = {10.1016/j.patrec.2022.12.019},
issn = {0167-8655},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Pattern Recognition Letters},
abstract = {The large amount of debris in our oceans is a global problem that dramatically impacts marine fauna and flora. While a large number of human-based campaigns have been proposed to tackle this issue, these efforts have been deemed insufficient due to the insurmountable amount of existing litter. In response to that, there exists a high interest in the use of autonomous underwater vehicles (AUV) that may locate, identify, and collect this garbage automatically. To perform such a task, AUVs consider state-of-the-art object detection techniques based on deep neural networks due to their reported high performance. Nevertheless, these techniques generally require large amounts of data with fine-grained annotations. In this work, we explore the capabilities of the reference object detector Mask Region-based Convolutional Neural Networks for automatic marine debris location and classification in the context of limited data availability. Considering the recent CleanSea corpus, we pose several scenarios regarding the amount of available train data and study the possibility of mitigating the adverse effects of data scarcity with synthetic marine scenes. Our results achieve a new state of the art in the task, establishing a new reference for future research. In addition, it is shown that the task still has room for improvement and that the lack of data can be somehow alleviated, yet to a limited extent.},
keywords = {TADMar},
pubstate = {published},
tppubtype = {article}
}
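The reference detector named above belongs to the Mask R-CNN family. A hedged fine-tuning setup with torchvision is sketched below; the number of debris classes, the pretrained weights, and the head sizes are assumptions, not the configuration used in the study.

# Illustrative Mask R-CNN fine-tuning with torchvision (assumed class count and settings;
# requires torchvision >= 0.13 for the weights= argument, older versions use pretrained=True).
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor
from torchvision.models.detection.mask_rcnn import MaskRCNNPredictor

num_classes = 20  # assumed: debris categories + background

model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")

# Replace the box-classification head for the debris classes.
in_features = model.roi_heads.box_predictor.cls_score.in_features
model.roi_heads.box_predictor = FastRCNNPredictor(in_features, num_classes)

# Replace the mask head as well, since instance masks are also predicted.
in_channels = model.roi_heads.mask_predictor.conv5_mask.in_channels
model.roi_heads.mask_predictor = MaskRCNNPredictor(in_channels, 256, num_classes)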
Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Late multimodal fusion for image and audio music transcription Journal Article
In: Expert Systems With Applications, vol. 216, pp. 119491-119500, 2023.
@article{Alfaro-Contreras2023c,
title = {Late multimodal fusion for image and audio music transcription},
author = {M. Alfaro-Contreras and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1016/j.eswa.2022.119491},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Expert Systems With Applications},
volume = {216},
pages = {119491-119500},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
2022
Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.
End-to-End Full-Page Optical Music Recognition for Mensural Notation Proceedings Article
In: Proceedings of the 23rd International Society for Music Information Retrieval Conference, pp. 226-232, 2022, ISBN: 978-1-7327299-2-6.
@inproceedings{Ríos-Vila2022,
title = {End-to-End Full-Page Optical Music Recognition for Mensural Notation},
author = {A. Ríos-Vila and J. M. Iñesta and J. Calvo-Zaragoza},
url = {https://zenodo.org/record/7342678/files/000026.pdf?download=1},
doi = {10.5281/zenodo.7342678},
isbn = {978-1-7327299-2-6},
year = {2022},
date = {2022-12-04},
urldate = {2022-12-04},
booktitle = {Proceedings of the 23rd International Society for Music Information Retrieval Conference},
journal = {Proceedings of the 23rd International Society for Music Information Retrieval Conference},
pages = {226-232},
abstract = {Optical Music Recognition (OMR) systems typically consider workflows that include several steps, such as staff detection, symbol recognition, and semantic reconstruction. However, fine-tuning these systems is costly due to the specific data labeling process that has to be performed to train models for each of these steps. In this paper, we present the first segmentation-free full-page OMR system that receives a page image and directly outputs the transcription in a single step. This model requires only the annotations of full score pages, which greatly alleviates the task of manual labeling. The model has been tested with early music written in mensural notation, for which the presented approach is especially beneficial. Results show that this methodology provides a solution with promising results and establishes a new line of research for holistic transcription of music score pages.},
keywords = {Leonardo2021, MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Rizo, D.; Delgado, T.; Calvo-Zaragoza, J.; Madueño, A.; García-Iasci, P.
Speeding-up the encoding of mensural collections from Spanish libraries Journal Article
In: IAML 2022 Prague, 2022.
@article{k502,
title = {Speeding-up the encoding of mensural collections from Spanish libraries},
author = {D. Rizo and T. Delgado and J. Calvo-Zaragoza and A. Madueño and P. García-Iasci},
year = {2022},
date = {2022-07-01},
booktitle = {IAML 2022 Prague},
journal = {IAML 2022 Prague},
organization = {IAML},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Insights into transfer learning between image and audio music transcription Proceedings Article
In: Sound and Music Computing Conference, pp. 295-301, Zenodo, Saint-Étienne, France, 2022.
@inproceedings{Alfaro-Contreras2022b,
title = {Insights into transfer learning between image and audio music transcription},
author = {M. Alfaro-Contreras and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.5281/zenodo.6797870},
year = {2022},
date = {2022-06-01},
urldate = {2022-06-01},
booktitle = {Sound and Music Computing Conference},
pages = {295-301},
publisher = {Zenodo},
address = {Saint-Étienne, France},
abstract = {Optical Music Recognition (OMR) and Automatic Music Transcription (AMT) stand for the research fields that devise methods to transcribe music sources---documents or audio signals, respectively---into a structured digital format. Historically, they have followed different approaches to achieve the same goal. However, their recent definition in terms of sequence labeling tasks gathers them under a common formulation framework. Under this premise, one may wonder if there exist any synergies between the two fields that could be exploited to improve the individual recognition rates in their respective domains. In this work, we aim to further explore this question from a Transfer Learning (TL) point of view in the context of neural end-to-end recognition models. More precisely, we consider a music transcription system, trained on either image or audio data, and adapt its performance to the unseen domain during the training phase using different TL schemes. Results show that knowledge transfer slightly boosts model performance with sufficient available data, but it is not properly leveraged when the latter condition is not met. This opens up a new promising, yet challenging, research path towards building an effective bridge between two solutions of the same problem.},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
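One of the transfer learning schemes compared above amounts to initialising the target-modality model with source-modality weights and freezing part of it before fine-tuning. The sketch below illustrates that idea with a toy CRNN; the architecture, layer names, and freezing policy are placeholders, not the models used in the paper.

# Generic cross-modality transfer sketch (placeholder architecture and freezing policy).
import torch
import torch.nn as nn

class CRNN(nn.Module):
    def __init__(self, vocab_size, in_channels=1):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(in_channels, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        self.recurrent = nn.LSTM(64, 128, batch_first=True, bidirectional=True)
        self.head = nn.Linear(256, vocab_size + 1)   # +1 for the CTC blank symbol

source = CRNN(vocab_size=200)   # stands in for a model trained on the source modality
target = CRNN(vocab_size=200)
target.load_state_dict(source.state_dict())          # transfer the learned weights
for p in target.encoder.parameters():                 # freeze the convolutional front-end
    p.requires_grad = False
optimizer = torch.optim.Adam((p for p in target.parameters() if p.requires_grad), lr=1e-4)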
Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Decoupling music notation to improve end-to-end Optical Music Recognition Journal Article
In: Pattern Recognition Letters, vol. 158, pp. 157-163, 2022, ISSN: 0167-8655.
@article{Alfaro-Contreras2022,
title = {Decoupling music notation to improve end-to-end Optical Music Recognition},
author = {M. Alfaro-Contreras and A. Ríos-Vila and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1016/j.patrec.2022.04.032},
issn = {0167-8655},
year = {2022},
date = {2022-06-01},
urldate = {2022-06-01},
journal = {Pattern Recognition Letters},
volume = {158},
pages = {157-163},
abstract = {Inspired by the Text Recognition field, end-to-end schemes based on Convolutional Recurrent Neural Networks (CRNN) trained with the Connectionist Temporal Classification (CTC) loss function are considered one of the current state-of-the-art techniques for staff-level Optical Music Recognition (OMR). Unlike text symbols, music-notation elements may be defined as a combination of (i) a shape primitive located in (ii) a certain position in a staff. However, this double nature is generally neglected in the learning process, as each combination is treated as a single token. In this work, we study whether exploiting such particularity of music notation actually benefits the recognition performance and, if so, which approach is the most appropriate. For that, we thoroughly review existing specific approaches that explore this premise and propose different combinations of them. Furthermore, considering the limitations observed in such approaches, a novel decoding strategy specifically designed for OMR is proposed. The results obtained with four different corpora of historical manuscripts show the relevance of leveraging this double nature of music notation since it outperforms the standard approaches where it is ignored. In addition, the proposed decoding leads to significant reductions in the error rates with respect to the other cases.},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
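The "double nature" discussed above (a shape primitive placed at a staff position) can be made explicit by splitting each combined agnostic token into two sub-tokens. The token syntax below is hypothetical and only illustrates the idea, not the corpora's exact encoding.

# Hypothetical agnostic tokens of the form "<shape>-<position>"; splitting them yields two
# small vocabularies (shapes and positions) instead of one large combined vocabulary.
combined = ["clef.G-L2", "note.quarter-S4", "note.eighth-L3", "rest.quarter-L3"]

def decouple(tokens):
    shapes, positions = [], []
    for token in tokens:
        shape, position = token.rsplit("-", 1)
        shapes.append(shape)
        positions.append(position)
    return shapes, positions

print(decouple(combined))
# (['clef.G', 'note.quarter', 'note.eighth', 'rest.quarter'], ['L2', 'S4', 'L3', 'L3'])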
Arroyo, V.; Valero-Mas, J. J.; Calvo-Zaragoza, J.; Pertusa, A.
Neural audio-to-score music transcription for unconstrained polyphony using compact output representations Proceedings Article
In: Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Singapore, Singapore, 2022.
@inproceedings{k487,
title = {Neural audio-to-score music transcription for unconstrained polyphony using compact output representations},
author = {V. Arroyo and J. J. Valero-Mas and J. Calvo-Zaragoza and A. Pertusa},
year = {2022},
date = {2022-05-01},
booktitle = {Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
publisher = {IEEE},
address = {Singapore, Singapore},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.
On the Use of Transformers for End-to-End Optical Music Recognition Proceedings Article
In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 470-481, Aveiro, Portugal, 2022, ISBN: 978-3-031-04880-7.
@inproceedings{k492,
title = {On the Use of Transformers for End-to-End Optical Music Recognition},
author = {A. Ríos-Vila and J. M. Iñesta and J. Calvo-Zaragoza},
isbn = {978-3-031-04880-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {Iberian Pattern Recognition and Image Analysis, IbPRIA 2022.},
pages = {470-481},
address = {Aveiro, Portugal},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.
Retrieval of Music-Notation Primitives via Image-to-Sequence Approaches Proceedings Article
In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 482-492, Aveiro, Portugal, 2022, ISBN: 978-3-031-04880-7.
@inproceedings{k493,
title = {Retrieval of Music-Notation Primitives via Image-to-Sequence Approaches},
author = {C. Garrido-Munoz and A. Ríos-Vila and J. Calvo-Zaragoza},
isbn = {978-3-031-04880-7},
year = {2022},
date = {2022-05-01},
booktitle = {Iberian Pattern Recognition and Image Analysis, IbPRIA 2022.},
pages = {482-492},
address = {Aveiro, Portugal},
keywords = {Leonardo2021},
pubstate = {published},
tppubtype = {inproceedings}
}
Mas-Candela, E.; Ríos-Vila, A.; Calvo-Zaragoza, J.
A First Approach to Image Transformation Sequence Retrieval Proceedings Article
In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 321-332, Aveiro, Portugal, 2022.
@inproceedings{k494,
title = {A First Approach to Image Transformation Sequence Retrieval},
author = {E. Mas-Candela and A. Ríos-Vila and J. Calvo-Zaragoza},
year = {2022},
date = {2022-05-01},
booktitle = {Iberian Pattern Recognition and Image Analysis, IbPRIA 2022.},
pages = {321-332},
address = {Aveiro, Portugal},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Münnich, S.; Rizo, D.
Music Encoding Conference Proceedings 2022. Book
Humanities Commons, 2022, ISBN: 978-84-1302-173-7.
@book{k495,
title = {Music Encoding Conference Proceedings 2022.},
author = {S. Münnich and D. Rizo},
editor = {S. Münnich and D. Rizo},
isbn = {978-84-1302-173-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
publisher = {Humanities Commons},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {book}
}
Münnich, S.; Rizo, D.
Foreword Proceedings Article
In: Münnich, S.; Rizo, D. (Ed.): Music Encoding Conference Proceedings 2021, pp. vii–viii, Humanities Commons, 2022, ISBN: 978-84-1302-173-7.
@inproceedings{k496,
title = {Foreword},
author = {S. Münnich and D. Rizo},
editor = {S. Münnich and D. Rizo},
isbn = {978-84-1302-173-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {Music Encoding Conference Proceedings 2021},
pages = {vii–viii},
publisher = {Humanities Commons},
chapter = {1},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Desmond, K.; Pugin, L.; Regimbal, J.; Rizo, D.; Sapp, C. S.; Thomae, M. E.
Encoding Polyphony from Medieval Manuscripts Notated in Mensural Notation Proceedings Article
In: Music Encoding Conference Proceedings 2021, pp. 197–219, Humanities Commons, 2022, ISBN: 978-84-1302-173-7.
@inproceedings{k497,
title = {Encoding Polyphony from Medieval Manuscripts Notated in Mensural Notation},
author = {K. Desmond and L. Pugin and J. Regimbal and D. Rizo and C. S. Sapp and M. E. Thomae},
isbn = {978-84-1302-173-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {Music Encoding Conference Proceedings 2021},
pages = {197–219},
publisher = {Humanities Commons},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Sánchez-Ferrer, A.; Gallego, A. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
The CleanSea Set: A Benchmark Corpus for Underwater Debris Detection and Recognition Proceedings Article
In: 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), pp. 616–628, Aveiro, Portugal, 2022, ISBN: 978-3-031-04881-4.
@inproceedings{k512,
title = {The CleanSea Set: A Benchmark Corpus for Underwater Debris Detection and Recognition},
author = {A. Sánchez-Ferrer and A. J. Gallego and J. J. Valero-Mas and J. Calvo-Zaragoza},
isbn = {978-3-031-04881-4},
year = {2022},
date = {2022-05-01},
booktitle = {10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA)},
pages = {616--628},
address = {Aveiro, Portugal},
abstract = {In recent years, the large amount of debris scattered throughout the ocean is becoming one of the major pollution problems, causing extinction of species and accelerating the degradation of our planet, among other environmental issues. Since the manual treatment of this waste represents a considerably tedious task, autonomous frameworks are gaining attention. Due to their reported good performance, such frameworks generally rely on Deep Learning techniques. However, the scarcity of data coupled with the inherent difficulties of the field---debris with different shapes and colors due to long-lasting exposure to the ocean, illumination variability or sea conditions---makes detecting underwater objects a particularly challenging task. The contribution of this work to the field is double: on the one hand, we introduce a novel data collection for supervised learning---the CleanSea corpus---annotated at both the bound box and contour levels of the objects to contribute with the research and progress in the field and on the other hand, we devise and optimize a recognition model based on the reference Mask Object-Based Convolutional Neural Network for this set to establish a benchmark for future comparison and assess its performance in both simulated and real-world scenarios. Results show the relevance of the contributions as the devised model is capable of properly addressing the detection and recognition of general debris when trained with the introduced CleanSea corpus.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
In recent years, the large amount of debris scattered throughout the ocean is becoming one of the major pollution problems, causing extinction of species and accelerating the degradation of our planet, among other environmental issues. Since the manual treatment of this waste represents a considerably tedious task, autonomous frameworks are gaining attention. Due to their reported good performance, such frameworks generally rely on Deep Learning techniques. However, the scarcity of data coupled with the inherent difficulties of the field---debris with different shapes and colors due to long-lasting exposure to the ocean, illumination variability or sea conditions---makes detecting underwater objects a particularly challenging task. The contribution of this work to the field is double: on the one hand, we introduce a novel data collection for supervised learning---the CleanSea corpus---annotated at both the bound box and contour levels of the objects to contribute with the research and progress in the field and on the other hand, we devise and optimize a recognition model based on the reference Mask Object-Based Convolutional Neural Network for this set to establish a benchmark for future comparison and assess its performance in both simulated and real-world scenarios. Results show the relevance of the contributions as the devised model is capable of properly addressing the detection and recognition of general debris when trained with the introduced CleanSea corpus.
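The detector described above builds on Mask R-CNN. Purely as an illustrative sketch of that general recipe, and not the authors' actual code, the snippet below adapts an off-the-shelf torchvision Mask R-CNN to a hypothetical number of debris classes and runs it on a dummy image; the class count, the confidence threshold and the input tensor are invented placeholders.

import torch
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor
from torchvision.models.detection.mask_rcnn import MaskRCNNPredictor

NUM_CLASSES = 20  # hypothetical: 19 debris categories + background

# Start from a COCO-pretrained Mask R-CNN and swap its prediction heads.
model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")
in_feats = model.roi_heads.box_predictor.cls_score.in_features
model.roi_heads.box_predictor = FastRCNNPredictor(in_feats, NUM_CLASSES)
mask_feats = model.roi_heads.mask_predictor.conv5_mask.in_channels
model.roi_heads.mask_predictor = MaskRCNNPredictor(mask_feats, 256, NUM_CLASSES)

# After fine-tuning on annotated images, inference returns boxes, labels, scores and masks.
model.eval()
with torch.no_grad():
    image = torch.rand(3, 480, 640)      # stand-in for a real underwater photo
    pred = model([image])[0]
    keep = pred["scores"] > 0.5          # simple confidence threshold
    print(pred["boxes"][keep], pred["labels"][keep])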
Iñesta, J. M.; Thomae, M. E.
An On-line Tool for Transcription of Music Scores: MuRET Presentation
Montreal (Canada), 01.05.2022.
Abstract | Links | BibTeX | Tags: HispaMus
@misc{k520,
title = {An On-line Tool for Transcription of Music Scores: MuRET},
author = {J. M. Iñesta and M. E. Thomae},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {1st Int. Conf. The Sound of Future/The Future of Sound},
address = {Montreal (Canada)},
organization = {CIRMMT},
abstract = {MuRET is a Machine-Learning Optical Music Recognition (OMR) research tool that runs in the browser. It has been created to help in the transcription of music collections and to experiment with machine learning algorithms for OMR, and it is capable of working well with different notations and writing styles. Why use Machine Learning? Instead of designing a system to solve the task, we have designed a system that learns how to solve the task from sets of labeled (solved) images. This way it is adaptable to new (previously unseen) collections.},
key = {OMR, Machine Learning},
keywords = {HispaMus},
pubstate = {published},
tppubtype = {presentation}
}
MuRET is a Machine-Learning Optical Music Recognition (OMR) research tool that runs in the browser. It has been created to help in the transcription of music collections and to experiment with machine learning algorithms for OMR, and it is capable of working well with different notations and writing styles. Why use Machine Learning? Instead of designing a system to solve the task, we have designed a system that learns how to solve the task from sets of labeled (solved) images. This way it is adaptable to new (previously unseen) collections.
Fuente, C.; Valero-Mas, J. J.; Castellanos, F. J.; Calvo-Zaragoza, J.
Multimodal Image and Audio Music Transcription Journal Article
In: International Journal of Multimedia Information Retrieval, vol. 11, pp. 77-84, 2022.
BibTeX | Tags: MultiScore
@article{k479,
title = {Multimodal Image and Audio Music Transcription},
author = {C. Fuente and J. J. Valero-Mas and F. J. Castellanos and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
journal = {International Journal of Multimedia Information Retrieval},
volume = {11},
pages = {77-84},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Castellanos, F. J.; Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.
Region-based Layout Analysis of Music Score Images Journal Article
In: Expert Systems with Applications, pp. 118211, 2022, ISSN: 0957-4174.
BibTeX | Tags: MultiScore
@article{k486,
title = {Region-based Layout Analysis of Music Score Images},
author = {F. J. Castellanos and C. Garrido-Munoz and A. Ríos-Vila and J. Calvo-Zaragoza},
issn = {0957-4174},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Expert Systems with Applications},
pages = {118211},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Rosello, A.; Ayllon, E.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Test Sample Selection for Handwriting Recognition Through Language Modeling Proceedings Article
In: Pattern Recognition and Image Analysis - 10th Iberian Conference, IbPRIA 2022, Aveiro, Portugal, May 4-6, 2022, Proceedings, 2022.
BibTeX | Tags: MultiScore
@inproceedings{k498,
title = {Test Sample Selection for Handwriting Recognition Through Language Modeling},
author = {A. Rosello and E. Ayllon and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
booktitle = {Pattern Recognition and Image Analysis - 10th Iberian Conference, IbPRIA 2022, Aveiro, Portugal, May 4-6, 2022, Proceedings},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.
End-to-End Full-Page Optical Music Recognition for Mensural Notation Proceedings Article
In: Proceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR, Bangalore, India, 2022.
Abstract | BibTeX | Tags: Leonardo2021, MultiScore
@inproceedings{k499,
title = {End-to-End Full-Page Optical Music Recognition for Mensural Notation},
author = {A. Ríos-Vila and J. M. Iñesta and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR},
address = {Bangalore, India},
abstract = {Optical Music Recognition (OMR) systems typically consider workflows that include several steps, such as staff detection, symbol recognition, and semantic reconstruction. However, fine-tuning these systems is costly due to the specific data labeling process that has to be performed to train models for each of these steps. In this paper, we present the first segmentation-free full-page OMR system that receives a page image and directly outputs the transcription in a single step. This model requires only the annotations of full score pages, which greatly alleviates the task of manual labeling. The model has been tested with early music written in mensural notation, for which the presented approach is especially beneficial. Results show that this methodology provides a solution with promising results and establishes a new line of research for holistic transcription of music score pages.},
keywords = {Leonardo2021, MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Optical Music Recognition (OMR) systems typically consider workflows that include several steps, such as staff detection, symbol recognition, and semantic reconstruction. However, fine-tuning these systems is costly due to the specific data labeling process that has to be performed to train models for each of these steps. In this paper, we present the first segmentation-free full-page OMR system that receives a page image and directly outputs the transcription in a single step. This model requires only the annotations of full score pages, which greatly alleviates the task of manual labeling. The model has been tested with early music written in mensural notation, for which the presented approach is especially beneficial. Results show that this methodology provides a solution with promising results and establishes a new line of research for holistic transcription of music score pages.
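Full-page systems such as this one extend the staff-level image-to-sequence pipelines commonly trained with the CTC loss. The sketch below is only a generic, minimal PyTorch illustration of that kind of pipeline; the layer sizes, vocabulary size and dummy data are invented, and it is not the architecture reported in the paper.

import torch
import torch.nn as nn

class TinyCRNN(nn.Module):
    """Toy image-to-sequence model: CNN features -> BiLSTM -> per-frame symbol logits."""
    def __init__(self, num_symbols, img_height=64):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
        )
        feat = 64 * (img_height // 4)                # channels x remaining height
        self.rnn = nn.LSTM(feat, 128, bidirectional=True, batch_first=True)
        self.fc = nn.Linear(256, num_symbols + 1)    # +1 for the CTC blank symbol

    def forward(self, x):                            # x: (batch, 1, H, W)
        f = self.cnn(x)                              # (batch, 64, H/4, W/4)
        b, c, h, w = f.shape
        f = f.permute(0, 3, 1, 2).reshape(b, w, c * h)   # one feature vector per image column
        out, _ = self.rnn(f)
        return self.fc(out).log_softmax(-1)          # (batch, W/4, num_symbols + 1)

model = TinyCRNN(num_symbols=100)
images = torch.rand(2, 1, 64, 256)                   # dummy score strips
log_probs = model(images).permute(1, 0, 2)           # CTC expects (time, batch, classes)
targets = torch.randint(1, 101, (2, 10))             # dummy symbol sequences
loss = nn.CTCLoss(blank=0)(log_probs, targets,
                           input_lengths=torch.full((2,), log_probs.size(0)),
                           target_lengths=torch.full((2,), 10))
loss.backward()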
Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.; Fujinaga, I.
Domain Adaptation for Staff-Region Retrieval of Music Score Images Journal Article
In: International Journal on Document Analysis and Recognition, 2022, ISSN: 1433-2825.
BibTeX | Tags: MultiScore
@article{k500,
title = {Domain Adaptation for Staff-Region Retrieval of Music Score Images},
author = {F. J. Castellanos and A. J. Gallego and J. Calvo-Zaragoza and I. Fujinaga},
issn = {1433-2825},
year = {2022},
date = {2022-01-01},
journal = {International Journal on Document Analysis and Recognition},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
de la Fuente, C.; Castellanos, F. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Multimodal Recognition of Frustration during Game-Play with Deep Neural Networks Journal Article
In: Multimedia Tools and Applications, 2022.
BibTeX | Tags:
@article{k501,
title = {Multimodal Recognition of Frustration during Game-Play with Deep Neural Networks},
author = {C. de la Fuente and F. J. Castellanos and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Multimedia Tools and Applications},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Few-Shot Music Symbol Classification via Self-Supervised Learning and Nearest Neighbor Proceedings Article
In: Pattern Recognition. ICPR International Workshops and Challenges, 2022.
BibTeX | Tags: MultiScore
@inproceedings{k504,
title = {Few-Shot Music Symbol Classification via Self-Supervised Learning and Nearest Neighbor},
author = {M. Alfaro-Contreras and A. Ríos-Vila and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
booktitle = {Pattern Recognition. ICPR International Workshops and Challenges},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Sáez-Pérez, J.; Gallego, A. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Domain Adaptation in Robotics: A Study Case on Kitchen Utensil Recognition Proceedings Article
In: 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), 2022.
@inproceedings{k506,
title = {Domain Adaptation in Robotics: A Study Case on Kitchen Utensil Recognition},
author = {J. Sáez-Pérez and A. J. Gallego and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
booktitle = {10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA)},
keywords = {ROMA},
pubstate = {published},
tppubtype = {inproceedings}
}
Alashhab, S.
Aplicaciones de visión artificial para ayuda a personas con dificultades visuales PhD Thesis
2022.
BibTeX | Tags:
@phdthesis{k508,
title = {Aplicaciones de visión artificial para ayuda a personas con dificultades visuales},
author = {S. Alashhab},
editor = {Miguel Angel Lozano and Antonio Javier Gallego},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
organization = {Universidad de Alicante},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Bernabeu, M.
Búsqueda de imágenes similares usando técnicas de aprendizaje automático PhD Thesis
2022.
BibTeX | Tags:
@phdthesis{k509,
title = {Búsqueda de imágenes similares usando técnicas de aprendizaje automático},
author = {M. Bernabeu},
editor = {Antonio Pertusa and Antonio Javier Gallego},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
organization = {Universidad de Alicante},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Gallego, A. J.; Rico-Juan, J. R.; Valero-Mas, J. J.
Efficient k-nearest neighbor search based on clustering and adaptive k values Journal Article
In: Pattern Recognition, vol. 122, pp. 108356, 2022, ISSN: 0031-3203.
@article{k510,
title = {Efficient k-nearest neighbor search based on clustering and adaptive k values},
author = {A. J. Gallego and J. R. Rico-Juan and J. J. Valero-Mas},
issn = {0031-3203},
year = {2022},
date = {2022-01-01},
journal = {Pattern Recognition},
volume = {122},
pages = {108356},
abstract = {The k-Nearest Neighbor (kNN) algorithm is widely used in the supervised learning field and, particularly, in search and classification tasks, owing to its simplicity, competitive performance, and good statistical properties. However, its inherent inefficiency prevents its use in most modern applications due to the vast amount of data that the current technological evolution generates, being thus the optimization of kNN-based search strategies of particular interest. This paper introduces the caKD+ algorithm, which tackles this limitation by combining the use of feature learning techniques, clustering methods, adaptive search parameters per cluster, and the use of pre-calculated K-Dimensional Tree structures, and results in a highly efficient search method. This proposal has been evaluated using 10 datasets and the results show that caKD+ significantly outperforms 16 state-of-the-art efficient search methods while still depicting such an accurate performance as the one by the exhaustive kNN search.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
The k-Nearest Neighbor (kNN) algorithm is widely used in the supervised learning field and, particularly, in search and classification tasks, owing to its simplicity, competitive performance, and good statistical properties. However, its inherent inefficiency prevents its use in most modern applications due to the vast amount of data that the current technological evolution generates, being thus the optimization of kNN-based search strategies of particular interest. This paper introduces the caKD+ algorithm, which tackles this limitation by combining the use of feature learning techniques, clustering methods, adaptive search parameters per cluster, and the use of pre-calculated K-Dimensional Tree structures, and results in a highly efficient search method. This proposal has been evaluated using 10 datasets and the results show that caKD+ significantly outperforms 16 state-of-the-art efficient search methods while still depicting such an accurate performance as the one by the exhaustive kNN search.
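As a loose, simplified illustration of the general strategy combined in caKD+ (clustering plus per-cluster K-Dimensional Trees), and not the published algorithm itself, the sketch below partitions a synthetic reference set offline so that each query only searches the KD-tree of its nearest cluster; the data, the number of clusters and the fixed k are all made-up placeholders.

import numpy as np
from sklearn.cluster import KMeans
from sklearn.neighbors import KDTree

rng = np.random.default_rng(0)
X = rng.normal(size=(10_000, 16))            # hypothetical reference set
y = rng.integers(0, 5, size=len(X))          # hypothetical class labels

# Offline stage: partition the data and build one KD-tree per cluster.
kmeans = KMeans(n_clusters=32, n_init=10, random_state=0).fit(X)
trees = {c: KDTree(X[kmeans.labels_ == c]) for c in range(32)}
labels = {c: y[kmeans.labels_ == c] for c in range(32)}

def knn_classify(q, k=5):
    # Online stage: search only the KD-tree of the closest cluster centroid.
    c = int(np.argmin(np.linalg.norm(kmeans.cluster_centers_ - q, axis=1)))
    _, idx = trees[c].query(q.reshape(1, -1), k=min(k, len(labels[c])))
    votes = labels[c][idx[0]]
    return np.bincount(votes).argmax()

print(knn_classify(rng.normal(size=16)))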
Alashhab, S.; Gallego, A. J.; Lozano, M. Á.
Efficient gesture recognition for the assistance of visually impaired people using multi-head neural networks Journal Article
In: Engineering Applications of Artificial Intelligence, vol. 114, pp. 105188, 2022, ISSN: 0952-1976.
@article{k511,
title = {Efficient gesture recognition for the assistance of visually impaired people using multi-head neural networks},
author = {S. Alashhab and A. J. Gallego and M. Á. Lozano},
issn = {0952-1976},
year = {2022},
date = {2022-01-01},
journal = {Engineering Applications of Artificial Intelligence},
volume = {114},
pages = {105188},
abstract = {Existing research for the assistance of visually impaired people mainly focus on solving a single task (such as reading a text or detecting an obstacle), hence forcing the user to switch applications to perform other actions. This paper proposes an interactive system for mobile devices controlled by hand gestures that allow the user to control the device and use several assistance tools by making simple static and dynamic hand gestures (e.g., pointing a finger at an object will show a description of it). The system is based on a multi-head neural network, which initially detects and classifies the gestures, and subsequently, depending on the gesture detected, performs a second stage that carries out the corresponding action. This architecture optimizes the resources required to perform different tasks, it takes advantage of the information obtained from an initial backbone to perform different processes in a second stage. To train and evaluate the system, a dataset with about 40k images was manually compiled and labeled including different types of hand gestures, backgrounds (indoors and outdoors), lighting conditions, etc. This dataset contains synthetic gestures (whose objective is to pre-train the system to improve the results) and real images captured using different mobile phones. The comparison made with nearly 50 state-of-the-art methods shows competitive results as regards the different actions performed by the system, such as the accuracy of classification and localization of gestures, or the generation of descriptions for objects and scenes.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Existing research for the assistance of visually impaired people mainly focus on solving a single task (such as reading a text or detecting an obstacle), hence forcing the user to switch applications to perform other actions. This paper proposes an interactive system for mobile devices controlled by hand gestures that allow the user to control the device and use several assistance tools by making simple static and dynamic hand gestures (e.g., pointing a finger at an object will show a description of it). The system is based on a multi-head neural network, which initially detects and classifies the gestures, and subsequently, depending on the gesture detected, performs a second stage that carries out the corresponding action. This architecture optimizes the resources required to perform different tasks, it takes advantage of the information obtained from an initial backbone to perform different processes in a second stage. To train and evaluate the system, a dataset with about 40k images was manually compiled and labeled including different types of hand gestures, backgrounds (indoors and outdoors), lighting conditions, etc. This dataset contains synthetic gestures (whose objective is to pre-train the system to improve the results) and real images captured using different mobile phones. The comparison made with nearly 50 state-of-the-art methods shows competitive results as regards the different actions performed by the system, such as the accuracy of classification and localization of gestures, or the generation of descriptions for objects and scenes.
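The multi-head design mentioned above amounts to a single backbone whose features are computed once and then reused by task-specific heads. The following schematic PyTorch sketch illustrates that pattern only; the backbone, the layer sizes and the two heads (gesture classification plus a single box regression) are hypothetical and not the authors' architecture.

import torch
import torch.nn as nn

class MultiHeadGestureNet(nn.Module):
    """Shared CNN backbone with one head per task (classification + localization)."""
    def __init__(self, num_gestures=10):
        super().__init__()
        self.backbone = nn.Sequential(                     # shared feature extractor
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.gesture_head = nn.Linear(64, num_gestures)    # which gesture is shown
        self.location_head = nn.Linear(64, 4)              # where it is: (x, y, w, h)

    def forward(self, x):
        features = self.backbone(x)                        # computed once, reused by both heads
        return self.gesture_head(features), self.location_head(features)

model = MultiHeadGestureNet()
logits, boxes = model(torch.rand(4, 3, 224, 224))          # dummy batch of phone frames
print(logits.shape, boxes.shape)                           # torch.Size([4, 10]) torch.Size([4, 4])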
Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.
A holistic approach for image-to-graph: application to optical music recognition Journal Article
In: International Journal on Document Analysis and Recognition, 2022.
BibTeX | Tags: Leonardo2021
@article{k522,
title = {A holistic approach for image-to-graph: application to optical music recognition},
author = {C. Garrido-Munoz and A. Ríos-Vila and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {International Journal on Document Analysis and Recognition},
keywords = {Leonardo2021},
pubstate = {published},
tppubtype = {article}
}
2021
Calvo-Zaragoza, J.; Pertusa, A.; Gallego, A. J.; Iñesta, J. M.; Micó, L.; Oncina, J.; Perez-Sancho, C.; de León, P. J. Ponce; Rizo, D.
MultiScore Project: Multimodal Transcription of Music Scores Proceedings Article
In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 3, 2021.
Links | BibTeX | Tags: MultiScore
@inproceedings{k481,
title = {MultiScore Project: Multimodal Transcription of Music Scores},
author = {J. Calvo-Zaragoza and A. Pertusa and A. J. Gallego and J. M. Iñesta and L. Micó and J. Oncina and C. Perez-Sancho and P. J. Ponce de León and D. Rizo},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/481/MML2021__MultiScore_Final.pdf},
year = {2021},
date = {2021-12-01},
urldate = {2021-12-01},
booktitle = {Proceedings of the 14th Machine Learning and Music Workshop},
pages = {3},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.
An Unsupervised Domain Adaptation framework for Layout Analysis of Music Score Images Proceedings Article
In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 6, 2021.
@inproceedings{k482,
title = {An Unsupervised Domain Adaptation framework for Layout Analysis of Music Score Images},
author = {F. J. Castellanos and A. J. Gallego and J. Calvo-Zaragoza},
year = {2021},
date = {2021-12-01},
booktitle = {Proceedings of the 14th Machine Learning and Music Workshop},
pages = {6},
keywords = {GRE19-04, ROMA},
pubstate = {published},
tppubtype = {inproceedings}
}
2024
Galan-Cuenca, A.; Valero-Mas, J. J.; Martinez-Sevilla, J. C.; Hidalgo-Centeno, A.; Pertusa, A.; Calvo-Zaragoza, J.
Proceedings of the 32nd ACM International Conference on Multimedia, Association for Computing Machinery, 2024, ISBN: 979-8-4007-0686-8.
Abstract | Links | BibTeX | Tags:
@conference{nokey,
title = {MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores},
author = {A. Galan-Cuenca and J. J. Valero-Mas and J. C. Martinez-Sevilla and A. Hidalgo-Centeno and A. Pertusa and J. Calvo-Zaragoza},
doi = {https://doi.org/10.1145/3664647.3681572},
isbn = {979-8-4007-0686-8},
year = {2024},
date = {2024-10-28},
booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia},
pages = {583-591},
publisher = {Association for Computing Machinery},
abstract = {Multimodal audio-image music transcription has been recently posed as a means of retrieving a digital score representation by leveraging the individual estimations from Automatic Music Transcription (AMT)---acoustic recordings---and Optical Music Recognition (OMR)---image scores---systems. Nevertheless, while proven to outperform single-modality recognition rates, this approach has been exclusively validated under controlled scenarios---monotimbral and monophonic synthetic data---mainly due to a lack of collections with symbolic score-level annotations for both recordings and graphical sheets. To promote research on this topic, this work presents the Multimodal mUSic Collection for Automatic Transcription (MUSCAT) assortment of acoustic recordings, image sheets, and their score-level annotations in several notation formats. This dataset comprises almost 80 hours of real recordings with varied instrumentation and polyphony degrees---ranging from piano to orchestral music---, 1251 scanned sheets, and 880 symbolic scores from 37 composers, which may also be used in other tasks involving metadata such as instrument identification or composer recognition. A fragmented subset of this collection solely focused on acoustic data for score-level AMT---the MUSic Collection for aUtomatic Transcription - fragmented Subset (MUSCUTS) assortment---is also presented together with a baseline experimentation, concluding the need to foster research on this field with real recordings. Finally, a web-based service is also provided to increase the size of the collections collaboratively.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Penarrubia, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Contrastive Self-Supervised Learning for Optical Music Recognition Conference
International Workshop on Document Analysis Systems, 2024, ISBN: 978-3-031-70442-0.
Abstract | Links | BibTeX | Tags:
@conference{nokey,
title = {Contrastive Self-Supervised Learning for Optical Music Recognition},
author = {C. Penarrubia and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {https://doi.org/10.1007/978-3-031-70442-0_19},
isbn = {978-3-031-70442-0},
year = {2024},
date = {2024-09-11},
urldate = {2024-09-11},
booktitle = {International Workshop on Document Analysis Systems},
pages = {312-326},
abstract = {Optical Music Recognition (OMR) is the research area focused on transcribing images of musical scores. In recent years, this field has seen great development thanks to the emergence of Deep Learning. However, these types of solutions require large volumes of labeled data. To alleviate this problem, Contrastive Self-Supervised Learning (SSL) has emerged as a paradigm that leverages large amounts of unlabeled data to train neural networks, yielding meaningful and robust representations. In this work, we explore its first application to the field of OMR. By utilizing three datasets that represent the heterogeneity of musical scores in notations and graphic styles, and through multiple evaluation protocols, we demonstrate that contrastive SSL delivers promising results, significantly reducing data scarcity challenges in OMR. To the best of our knowledge, this is the first study that integrates these two fields. We hope this research serves as a baseline and stimulates further exploration.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Ríos-Vila, A.; Calvo-Zaragoza, J.; Paquet, T.
Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription Conference
Document Analysis and Recognition - ICDAR 2024, vol. 1, Springer Nature Switzerland, 2024, ISBN: 978-3-031-70552-6.
BibTeX | Tags: MultiScore
@conference{RiosVila:ICDAR:2024,
title = {Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription},
author = {A. Ríos-Vila and J. Calvo-Zaragoza and T. Paquet},
isbn = {978-3-031-70552-6},
year = {2024},
date = {2024-09-02},
urldate = {2024-09-02},
booktitle = {Document Analysis and Recognition - ICDAR 2024},
volume = {1},
pages = {20-37},
publisher = {Springer Nature Switzerland},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {conference}
}
Maciá, M.; Rizo, D.
The Impact of UX/UI on Piano-Assisted Learning in Extended Reality Conference
Computer Supported Music Education. Angers, France., 2024.
BibTeX | Tags:
@conference{macia2024,
title = {The Impact of UX/UI on Piano-Assisted Learning in Extended Reality},
author = {M. Maciá and D. Rizo},
year = {2024},
date = {2024-05-04},
urldate = {2024-05-04},
booktitle = {Computer Supported Music Education. Angers, France.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Alfaro-Contreras, M.; Rios-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
A Transformer Approach for Polyphonic Audio-to-Score Transcription Proceedings Article
In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), Seul (Korea), 2024.
Links | BibTeX | Tags: MultiScore
@inproceedings{Alfaro-Contreras:ICASSP24,
title = {A Transformer Approach for Polyphonic Audio-to-Score Transcription},
author = {M. Alfaro-Contreras and A. Rios-Vila and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {10.1109/ICASSP48485.2024.10447162},
year = {2024},
date = {2024-04-19},
urldate = {2024-04-19},
booktitle = {Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024)},
address = {Seul (Korea)},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Valero-Mas, J. J.; Gallego, A. J.; Rico-Juan, J. R.
An overview of ensemble and feature learning in few-shot image classification using siamese networks Journal Article
In: Multimedia Tools and Applications, vol. 83, pp. 19929–19952, 2024, ISSN: 1380-7501.
@article{nokey,
title = {An overview of ensemble and feature learning in few-shot image classification using siamese networks},
author = {J. J. Valero-Mas and A. J. Gallego and J. R. Rico-Juan },
doi = {https://doi.org/10.1007/s11042-023-15607-3},
issn = {1380-7501},
year = {2024},
date = {2024-02-01},
urldate = {2023-07-29},
journal = {Multimedia Tools and Applications},
volume = {83},
pages = {19929–19952},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2023
Ramoneda, P.; Jeong, D.; Valero-Mas, J. J.; Serra, X.
Predicting performance difficulty from piano sheet music images Conference
Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.
Abstract | Links | BibTeX | Tags:
@conference{nokey,
title = {Predicting performance difficulty from piano sheet music images},
author = {P. Ramoneda and D. Jeong and J. J. Valero-Mas and X. Serra},
doi = {10.5281/zenodo.10265386},
isbn = {978-1-7327299-3-3},
year = {2023},
date = {2023-11-04},
urldate = {2023-11-04},
booktitle = {Proceedings of the 24th International Society for Music Information Retrieval Conference},
pages = {708-715},
address = {Milan, Italy},
abstract = {Estimating the performance difficulty of a musical score is crucial in music education for adequately designing the learning curriculum of the students. Although the music information retrieval community has recently shown interest in this task, existing approaches mainly use machine-readable scores, leaving the broader case of sheet music images unaddressed. Based on previous works involving sheet music images, we use a mid-level representation, bootleg score, describing notehead positions relative to staff lines coupled with a transformer model. This architecture is adapted to our task by introducing a different encoding scheme that reduces the encoded sequence length to one-eighth of the original size. In terms of evaluation, we consider five datasets---more than 7500 scores with up to 9 difficulty levels---, two being mainly compiled for this work. The results obtained when pretraining the scheme on the IMSLP corpus and fine-tuning it on the considered datasets prove the proposal's validity, achieving the best-performing model with a balanced accuracy of 40.3% and a mean square error of 1.3. Finally, we provide access to our code, data, and models for transparency and reproducibility.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Penarrubia, C.; Garrido-Munoz, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Efficient notation assembly in optical music recognition Conference
Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.
BibTeX | Tags:
@conference{nokey,
title = {Efficient notation assembly in optical music recognition},
author = {C. Penarrubia and C. Garrido-Munoz and J. J. Valero-Mas and J. Calvo-Zaragoza},
isbn = {978-1-7327299-3-3},
year = {2023},
date = {2023-10-30},
booktitle = {Proceedings of the 24th International Society for Music Information Retrieval Conference},
pages = {182-189},
address = {Milan, Italy},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Martínez-Sevilla, J. C.; Ríos-Vila, A.; Castellanos, F. J.; Calvo-Zaragoza, J.
A Holistic Approach for Aligned Music and Lyrics Transcription Conference
Document Analysis and Recognition - ICDAR 2023, vol. 1, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-41676-7.
Abstract | Links | BibTeX | Tags: REPERTORIUM
@conference{MartinezSevilla:ICDAR:2023,
title = {A Holistic Approach for Aligned Music and Lyrics Transcription},
author = {J.C. Martínez-Sevilla and A. Ríos-Vila and F. J. Castellanos and J. Calvo-Zaragoza },
editor = {Fink, Gernot A. and Jain, Rajiv and Kise, Koichi and Zanibbi, Richard},
doi = {https://doi.org/10.1007/978-3-031-41676-7_11},
isbn = {978-3-031-41676-7},
year = {2023},
date = {2023-08-28},
urldate = {2023-08-28},
booktitle = {Document Analysis and Recognition - ICDAR 2023},
volume = {1},
pages = {185--201},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {In this paper, we present the Aligned Music Notation and Lyrics Transcription (AMNLT) challenge, whose goal is to retrieve the content from document images of vocal music. This new research area arises from the need to automatically transcribe notes and lyrics from music scores and align both sources of information conveniently. Although existing methods are able to deal with music notation and text, they work without providing their proper alignment, which is crucial to actually retrieve the content of the piece of vocal music. To overcome this challenge, we consider holistic neural approaches that transcribe music and text in one step, along with an encoding that implicitly aligns the sources of information. The methodology is evaluated on a benchmark specifically designed for AMNLT. The results report that existing methods can obtain high-quality text and music transcriptions, but posterior alignment errors are inevitably found. However, our formulation achieves relative improvements of over 80{%} in the metric that considers both transcription and alignment. We hope that this work will establish itself as a future reference for further research on AMNLT.},
keywords = {REPERTORIUM},
pubstate = {published},
tppubtype = {conference}
}
Martínez-Sevilla, J. C.; Alfaro-Contreras, M.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works Proceedings Article
In: INTERSPEECH Conference, pp. 2793-2797, Dublin, Ireland, 2023.
Links | BibTeX | Tags: MultiScore
@inproceedings{Martínez-Sevilla2023,
title = {Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works},
author = {J.C. Martínez-Sevilla and M. Alfaro-Contreras and J. J. Valero-Mas and J. Calvo-Zaragoza
},
doi = {10.21437/Interspeech.2023-88},
year = {2023},
date = {2023-08-20},
urldate = {2023-08-20},
booktitle = {INTERSPEECH Conference},
pages = {2793-2797},
address = {Dublin, Ireland},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study Proceedings Article
In: Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science, pp. 64-77, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-37731-0.
Links | BibTeX | Tags: MultiScore
@inproceedings{k505,
title = {Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study},
author = {M. Alfaro-Contreras and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1007/978-3-031-37731-0_6},
isbn = {978-3-031-37731-0},
year = {2023},
date = {2023-08-10},
urldate = {2022-01-01},
booktitle = {Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science},
volume = {13645},
pages = {64-77},
publisher = {Springer Nature Switzerland},
address = {Cham},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Garrido-Munoz, C.; Alfaro-Contreras, M.; Calvo-Zaragoza, J.
Evaluating Domain Generalization in Kitchen Utensils Classification Proceedings Article
In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 108-118, 2023.
Links | BibTeX | Tags: MultiScore
@inproceedings{Garrido-Munoz2023,
title = {Evaluating Domain Generalization in Kitchen Utensils Classification},
author = {C. Garrido-Munoz and M. Alfaro-Contreras and J. Calvo-Zaragoza},
doi = {10.1007/978-3-031-36616-1_9},
year = {2023},
date = {2023-06-25},
booktitle = {Iberian Conference on Pattern Recognition and Image Analysis},
pages = {108-118},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
González-Barrachina, P.; Alfaro-Contreras, M.; Nieto-Hidalgo, M.; Calvo-Zaragoza, J.
Lifelong Learning for Document Image Binarization: An Experimental Study Proceedings Article
In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 146-157, 2023.
Links | BibTeX | Tags: MultiScore
@inproceedings{González-Barrachina2023,
title = {Lifelong Learning for Document Image Binarization: An Experimental Study},
author = {P. González-Barrachina and M. Alfaro-Contreras and M. Nieto-Hidalgo and J. Calvo-Zaragoza },
doi = {10.1007/978-3-031-36616-1_12},
year = {2023},
date = {2023-06-25},
urldate = {2023-06-25},
booktitle = {Iberian Conference on Pattern Recognition and Image Analysis},
pages = {146-157},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Penarrubia, C.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.
Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification Conference
Iberian Conference on Pattern Recognition and Image Analysis, Alicante, Spain, 2023, ISBN: 978-3-031-36616-1.
Abstract | Links | BibTeX | Tags: DOREMI
@conference{nokey,
title = {Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification},
author = {C. Penarrubia and J. J. Valero-Mas and A. J. Gallego and J. Calvo-Zaragoza},
doi = {https://doi.org/10.1007/978-3-031-36616-1_2},
isbn = {978-3-031-36616-1},
year = {2023},
date = {2023-06-25},
booktitle = {Iberian Conference on Pattern Recognition and Image Analysis},
pages = {15.27},
address = {Alicante, Spain},
abstract = {Prototype Generation (PG) methods seek to improve the efficiency of the k-Nearest Neighbor (kNN) classifier by obtaining a reduced version of a given reference dataset following certain heuristics. Despite being largely addressed topic in multiclass scenarios, few works deal with PG in multilabel environments. Hence, the existing proposals exhibit a number of limitations, being label imbalance one of paramount relevance as it constitutes a typical challenge of multilabel datasets. This work proposes two novel merging policies for multilabel PG schemes specifically devised for label imbalance, as well as a mechanism to prevent inappropriate samples from undergoing a reduction process. These proposals are applied to three existing multilabel PG methods—Multilabel Reduction through Homogeneous Clustering, Multilabel Chen, and Multilabel Reduction through Space Partitioning—and evaluated on 12 different data assortments with different degrees of label imbalance. The results prove that the proposals overcome—in some cases in a significant manner—those obtained with the original methods, hence validating the presented approaches and enabling further research lines on this topic.},
keywords = {DOREMI},
pubstate = {published},
tppubtype = {conference}
}
Alfaro-Contreras, M.; Iñesta, J. M.; Calvo-Zaragoza, J.
Optical Music Recognition for Homophonic Scores with Neural Networks and Synthetic Music Generation Journal Article
In: International Journal of Multimedia Information Retrieval, vol. 12, pp. 12-24, 2023.
@article{Alfaro-Contreras2023b,
title = {Optical Music Recognition for Homophonic Scores with Neural Networks and Synthetic Music Generation},
author = {M. Alfaro-Contreras and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1007/s13735-023-00278-5},
year = {2023},
date = {2023-05-26},
urldate = {2023-05-26},
journal = {International Journal of Multimedia Information Retrieval},
volume = {12},
pages = {12-24},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Ríos-Vila, A.; Rizo, D.; Iñesta, J. M.; Calvo-Zaragoza, J.
End-to-end optical music recognition for pianoform sheet music Journal Article
In: International Journal on Document Analysis and Recognition (IJDAR), iss. ICDAR 2023, 2023, ISSN: 1433-2825.
Abstract | Links | BibTeX | Tags: MultiScore
@article{Ríos-Vila2023,
title = {End-to-end optical music recognition for pianoform sheet music},
author = {A. Ríos-Vila and D. Rizo and J. M. Iñesta and J. Calvo-Zaragoza},
url = {https://link.springer.com/content/pdf/10.1007/s10032-023-00432-z.pdf},
doi = {10.1007/s10032-023-00432-z},
issn = {1433-2825},
year = {2023},
date = {2023-05-12},
urldate = {2023-05-12},
journal = {International Journal on Document Analysis and Recognition (IJDAR)},
issue = {ICDAR 2023},
abstract = {End-to-end solutions have brought about significant advances in the field of Optical Music Recognition. These approaches directly provide the symbolic representation of a given image of a musical score. Despite this, several documents, such as pianoform musical scores, cannot yet benefit from these solutions since their structural complexity does not allow their effective transcription. This paper presents a neural method whose objective is to transcribe these musical scores in an end-to-end fashion. We also introduce the GrandStaff dataset, which contains 53,882 single-system piano scores in common western modern notation. The sources are encoded in both a standard digital music representation and its adaptation for current transcription technologies. The method proposed in this paper is trained and evaluated using this dataset. The results show that the approach presented is, for the first time, able to effectively transcribe pianoform notation in an end-to-end manner.},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Few-Shot Symbol Classification via Self-Supervised Learning and Nearest Neighbor Journal Article
In: Pattern Recognition Letters, vol. 167, pp. 1-8, 2023.
Links | BibTeX | Tags: MultiScore
@article{Alfaro-Contreras2023,
title = {Few-Shot Symbol Classification via Self-Supervised Learning and Nearest Neighbor},
author = {M. Alfaro-Contreras and A. Ríos-Vila and J. J. Valero-Mas and J. Calvo-Zaragoza},
doi = {10.1016/j.patrec.2023.01.014},
year = {2023},
date = {2023-03-01},
journal = {Pattern Recognition Letters},
volume = {167},
pages = {1-8},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Rico-Juan, J. R.; Sánchez-Cartagena, V. M.; Valero-Mas, J. J.; Gallego, A. J.
Identifying student profiles within online judge systems using explainable artificial intelligence Journal Article
In: IEEE Transactions on Learning Technologies, vol. 16, no. 6, pp. 955-969, 2023, ISSN: 1939-1382.
@article{nokey,
title = {Identifying student profiles within online judge systems using explainable artificial intelligence},
author = {J. R. Rico-Juan and V. M. Sánchez-Cartagena and J. J. Valero-Mas and A. J. Gallego},
doi = {10.1109/TLT.2023.3239110},
issn = {1939-1382},
year = {2023},
date = {2023-01-23},
urldate = {2023-01-23},
journal = {IEEE Transactions on Learning Technologies},
volume = {16},
number = {6},
pages = {955-969},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Valero-Mas, J. J.; Gallego, A. J.; Alonso-Jiménez, P.; Serra, X.
Multilabel Prototype Generation for Data Reduction in k-Nearest Neighbour classification Journal Article
In: Pattern Recognition, vol. 135, pp. 109190, 2023, ISSN: 0031-3203.
Abstract | Links | BibTeX | Tags: DOREMI, MultiScore
@article{k519,
title = {Multilabel Prototype Generation for Data Reduction in k-Nearest Neighbour classification},
author = {J. J. Valero-Mas and A. J. Gallego and P. Alonso-Jiménez and X. Serra},
doi = {https://doi.org/10.1016/j.patcog.2022.109190},
issn = {0031-3203},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Pattern Recognition},
volume = {135},
pages = {109190},
abstract = {Prototype Generation (PG) methods are typically considered for improving the efficiency of the k-Nearest Neighbour (kNN) classifier when tackling high-size corpora. Such approaches aim at generating a reduced version of the corpus without decreasing the classification performance when compared to the initial set. Despite their large application in multiclass scenarios, very few works have addressed the proposal of PG methods for the multilabel space. In this regard, this work presents the novel adaptation of four multiclass PG strategies to the multilabel case. These proposals are evaluated with three multilabel kNN-based classifiers, 12 corpora comprising a varied range of domains and corpus sizes, and different noise scenarios artificially induced in the data. The results obtained show that the proposed adaptations are capable of significantly improving—both in terms of efficiency and classification performance—the only reference multilabel PG work in the literature as well as the case in which no PG method is applied, also presenting statistically superior robustness in noisy scenarios. Moreover, these novel PG strategies allow prioritising either the efficiency or efficacy criteria through its configuration depending on the target scenario, hence covering a wide area in the solution space not previously filled by other works.},
keywords = {DOREMI, MultiScore},
pubstate = {published},
tppubtype = {article}
}
Sánchez-Ferrer, A.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.
An Experimental Study on Marine Debris Location and Recognition using Object Detection Journal Article
In: Pattern Recognition Letters, 2023, ISSN: 0167-8655.
Abstract | Links | BibTeX | Tags: TADMar
@article{k521,
title = {An Experimental Study on Marine Debris Location and Recognition using Object Detection},
author = {A. Sánchez-Ferrer and J. J. Valero-Mas and A. J. Gallego and J. Calvo-Zaragoza},
doi = {https://doi.org/10.1016/j.patrec.2022.12.019},
issn = {0167-8655},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Pattern Recognition Letters},
abstract = {The large amount of debris in our oceans is a global problem that dramatically impacts marine fauna and flora. While a large number of human-based campaigns have been proposed to tackle this issue, these efforts have been deemed insufficient due to the insurmountable amount of existing litter. In response to that, there exists a high interest in the use of autonomous underwater vehicles (AUV) that may locate, identify, and collect this garbage automatically. To perform such a task, AUVs consider state-of-the-art object detection techniques based on deep neural networks due to their reported high performance. Nevertheless, these techniques generally require large amounts of data with fine-grained annotations. In this work, we explore the capabilities of the reference object detector Mask Region-based Convolutional Neural Networks for automatic marine debris location and classification in the context of limited data availability. Considering the recent CleanSea corpus, we pose several scenarios regarding the amount of available train data and study the possibility of mitigating the adverse effects of data scarcity with synthetic marine scenes. Our results achieve a new state of the art in the task, establishing a new reference for future research. In addition, it is shown that the task still has room for improvement and that the lack of data can be somehow alleviated, yet to a limited extent.},
keywords = {TADMar},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Late multimodal fusion for image and audio music transcription Journal Article
In: Expert Systems With Applications, vol. 216, pp. 119491-119500, 2023.
Links | BibTeX | Tags: MultiScore
@article{Alfaro-Contreras2023c,
title = {Late multimodal fusion for image and audio music transcription},
author = {M. Alfaro-Contreras and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1016/j.eswa.2022.119491},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Expert Systems With Applications},
volume = {216},
pages = {119491-119500},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
2022
Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.
End-to-End Full-Page Optical Music Recognition for Mensural Notation Proceedings Article
In: Proceedings of the 23rd International Society for Music Information Retrieval Conference, pp. 226-232, 2022, ISBN: 978-1-7327299-2-6.
Abstract | Links | BibTeX | Tags: Leonardo2021, MultiScore
@inproceedings{Ríos-Vila2022,
title = {End-to-End Full-Page Optical Music Recognition for Mensural Notation},
author = {A. Ríos-Vila and J. M. Iñesta and J. Calvo-Zaragoza},
url = {https://zenodo.org/record/7342678/files/000026.pdf?download=1},
doi = {https://doi.org/10.5281/zenodo.7342678},
isbn = {978-1-7327299-2-6},
year = {2022},
date = {2022-12-04},
urldate = {2022-12-04},
booktitle = {Proceedings of the 23rd International Society for Music Information Retrieval Conference},
journal = {Proceedings of the 23nd International Society for Music Information Retrieval Conference},
pages = {226-232},
abstract = {Optical Music Recognition (OMR) systems typically consider workflows that include several steps, such as staff detection, symbol recognition, and semantic reconstruction. However, fine-tuning these systems is costly due to the specific data labeling process that has to be performed to train models for each of these steps. In this paper, we present the first segmentation-free full-page OMR system that receives a page image and directly outputs the transcription in a single step. This model requires only the annotations of full score pages, which greatly alleviates the task of manual labeling. The model has been tested with early music written in mensural notation, for which the presented approach is especially beneficial. Results show that this methodology provides a solution with promising results and establishes a new line of research for holistic transcription of music score pages.},
keywords = {Leonardo2021, MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Rizo, D.; Delgado, T.; Calvo-Zaragoza, J.; Madueño, A.; García-Iasci, P.
Speeding-up the encoding of mensural collections from Spanish libraries Journal Article
In: IAML 2022 Prague, 2022.
BibTeX | Tags: MultiScore
@article{k502,
title = {Speeding-up the encoding of mensural collections from Spanish libraries},
author = {D. Rizo and T. Delgado and J. Calvo-Zaragoza and A. Madueño and P. García-Iasci},
year = {2022},
date = {2022-07-01},
booktitle = {IAML 2022 Prague},
journal = {IAML 2022 Prague},
organization = {IAML},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Insights into transfer learning between image and audio music transcription Proceedings Article
In: Sound and Music Computing Conference, pp. 295-301, Zenodo, Saint-Étienne, France, 2022.
Abstract | Links | BibTeX | Tags: MultiScore
@inproceedings{Alfaro-Contreras2022b,
title = {Insights into transfer learning between image and audio music transcription},
author = {M. Alfaro-Contreras and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.5281/zenodo.6797870},
year = {2022},
date = {2022-06-01},
urldate = {2022-06-01},
booktitle = {Sound and Music Computing Conference},
pages = {295-301},
publisher = {Zenodo},
address = {Saint-Étienne, France},
abstract = {Optical Music Recognition (OMR) and Automatic Music Transcription (AMT) stand for the research fields that devise methods to transcribe music sources---documents or audio signals, respectively---into a structured digital format. Historically, they have followed different approaches to achieve the same goal. However, their recent definition in terms of sequence labeling tasks gathers them under a common formulation framework. Under this premise, one may wonder if there exist any synergies between the two fields that could be exploited to improve the individual recognition rates in their respective domains. In this work, we aim to further explore this question from a Transfer Learning (TL) point of view in the context of neural end-to-end recognition models. More precisely, we consider a music transcription system, trained on either image or audio data, and adapt its performance to the unseen domain during the training phase using different TL schemes. Results show that knowledge transfer slightly boosts model performance with sufficient available data, but it is not properly leveraged when the latter condition is not met. This opens up a new promising, yet challenging, research path towards building an effective bridge between two solutions of the same problem.},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.
Decoupling music notation to improve end-to-end Optical Music Recognition Journal Article
In: Pattern Recognition Letters, vol. 158, pp. 157-163, 2022, ISSN: 0167-8655.
Abstract | Links | BibTeX | Tags: MultiScore
@article{Alfaro-Contreras2022,
title = {Decoupling music notation to improve end-to-end Optical Music Recognition},
author = {M. Alfaro-Contreras and A. Ríos-Vila and J. J. Valero-Mas and J. M. Iñesta and J. Calvo-Zaragoza},
doi = {10.1016/j.patrec.2022.04.032},
issn = {0167-8655},
year = {2022},
date = {2022-06-01},
urldate = {2022-06-01},
journal = {Pattern Recognition Letters},
volume = {158},
pages = {157-163},
abstract = {Inspired by the Text Recognition field, end-to-end schemes based on Convolutional Recurrent Neural Networks (CRNN) trained with the Connectionist Temporal Classification (CTC) loss function are considered one of the current state-of-the-art techniques for staff-level Optical Music Recognition (OMR). Unlike text symbols, music-notation elements may be defined as a combination of (i) a shape primitive located in (ii) a certain position in a staff. However, this double nature is generally neglected in the learning process, as each combination is treated as a single token. In this work, we study whether exploiting such particularity of music notation actually benefits the recognition performance and, if so, which approach is the most appropriate. For that, we thoroughly review existing specific approaches that explore this premise and propose different combinations of them. Furthermore, considering the limitations observed in such approaches, a novel decoding strategy specifically designed for OMR is proposed. The results obtained with four different corpora of historical manuscripts show the relevance of leveraging this double nature of music notation since it outperforms the standard approaches where it is ignored. In addition, the proposed decoding leads to significant reductions in the error rates with respect to the other cases.},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Arroyo, V.; Valero-Mas, J. J.; Calvo-Zaragoza, J.; Pertusa, A.
Neural audio-to-score music transcription for unconstrained polyphony using compact output representations Proceedings Article
In: Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Singapur, Singapur, 2022.
BibTeX | Tags: MultiScore
@inproceedings{k487,
title = {Neural audio-to-score music transcription for unconstrained polyphony using compact output representations},
author = {V. Arroyo and J. J. Valero-Mas and J. Calvo-Zaragoza and A. Pertusa},
year = {2022},
date = {2022-05-01},
booktitle = {Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
publisher = {IEEE},
address = {Singapur, Singapur},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.
On the Use of Transformers for End-to-End Optical Music Recognition Proceedings Article
In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 470-481, Aveiro, Portugal, 2022, ISBN: 978-3-031-04880-7.
BibTeX | Tags: MultiScore
@inproceedings{k492,
title = {On the Use of Transformers for End-to-End Optical Music Recognition},
author = {A. Ríos-Vila and J. M. Iñesta and J. Calvo-Zaragoza},
isbn = {978-3-031-04880-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {Iberian Pattern Recognition and Image Analysis, IbPRIA 2022.},
pages = {470-481},
address = {Aveiro, Portugal},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.
Retrieval of Music-Notation Primitives via Image-to-Sequence Approaches Proceedings Article
In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 482-492, Aveiro, Portugal, 2022, ISBN: 978-3-031-04880-7.
BibTeX | Tags: Leonardo2021
@inproceedings{k493,
title = {Retrieval of Music-Notation Primitives via Image-to-Sequence Approaches},
author = {C. Garrido-Munoz and A. Ríos-Vila and J. Calvo-Zaragoza},
isbn = {978-3-031-04880-7},
year = {2022},
date = {2022-05-01},
booktitle = {Iberian Pattern Recognition and Image Analysis, IbPRIA 2022.},
pages = {482-492},
address = {Aveiro, Portugal},
keywords = {Leonardo2021},
pubstate = {published},
tppubtype = {inproceedings}
}
Mas-Candela, E.; Ríos-Vila, A.; Calvo-Zaragoza, J.
A First Approach to Image Transformation Sequence Retrieval Proceedings Article
In: Iberian Pattern Recognition and Image Analysis, IbPRIA 2022., pp. 321-332, Aveiro, Portugal, 2022.
BibTeX | Tags: MultiScore
@inproceedings{k494,
title = {A First Approach to Image Transformation Sequence Retrieval},
author = {E. Mas-Candela and A. Ríos-Vila and J. Calvo-Zaragoza},
year = {2022},
date = {2022-05-01},
booktitle = {Iberian Pattern Recognition and Image Analysis, IbPRIA 2022.},
pages = {321-332},
address = {Aveiro, Portugal},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Münnich, S.; Rizo, D.
Music Encoding Conference Proceedings 2021 Book
Humanities Commons, 2022, ISBN: 978-84-1302-173-7.
BibTeX | Tags: MultiScore
@book{k495,
title = {Music Encoding Conference Proceedings 2021},
author = {S. Münnich and D. Rizo},
editor = {S. Münnich and D. Rizo},
isbn = {978-84-1302-173-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
publisher = {Humanities Commons},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {book}
}
Münnich, S.; Rizo, D.
Foreword Proceedings Article
In: Münnich, S.; Rizo, D. (Ed.): Music Encoding Conference Proceedings 2021, pp. vii–viii, Humanities Commons, 2022, ISBN: 978-84-1302-173-7.
BibTeX | Tags: MultiScore
@inproceedings{k496,
title = {Foreword},
author = {S. Münnich and D. Rizo},
editor = {S. Münnich and D. Rizo},
isbn = {978-84-1302-173-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {Music Encoding Conference Proceedings 2021},
pages = {vii–viii},
publisher = {Humanities Commons},
chapter = {1},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Desmond, K.; Pugin, L.; Regimbal, J.; Rizo, D.; Sapp, C. S.; Thomae, M. E.
Encoding Polyphony from Medieval Manuscripts Notated in Mensural Notation Proceedings Article
In: Music Encoding Conference Proceedings 2021, pp. 197–219, Humanities Commons, 2022, ISBN: 978-84-1302-173-7.
BibTeX | Tags: MultiScore
@inproceedings{k497,
title = {Encoding Polyphony from Medieval Manuscripts Notated in Mensural Notation},
author = {K. Desmond and L. Pugin and J. Regimbal and D. Rizo and C. S. Sapp and M. E. Thomae},
isbn = {978-84-1302-173-7},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {Music Encoding Conference Proceedings 2021},
pages = {197–219},
publisher = {Humanities Commons},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Sánchez-Ferrer, A.; Gallego, A. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
The CleanSea Set: A Benchmark Corpus for Underwater Debris Detection and Recognition Proceedings Article
In: 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), pp. 616–628, Aveiro, Portugal, 2022, ISBN: 978-3-031-04881-4.
@inproceedings{k512,
title = {The CleanSea Set: A Benchmark Corpus for Underwater Debris Detection and Recognition},
author = {A. Sánchez-Ferrer and A. J. Gallego and J. J. Valero-Mas and J. Calvo-Zaragoza},
isbn = {978-3-031-04881-4},
year = {2022},
date = {2022-05-01},
booktitle = {10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA)},
pages = {616--628},
address = {Aveiro, Portugal},
abstract = {In recent years, the large amount of debris scattered throughout the ocean is becoming one of the major pollution problems, causing the extinction of species and accelerating the degradation of our planet, among other environmental issues. Since the manual treatment of this waste represents a considerably tedious task, autonomous frameworks are gaining attention. Due to their reported good performance, such frameworks generally rely on Deep Learning techniques. However, the scarcity of data coupled with the inherent difficulties of the field---debris with different shapes and colors due to long-lasting exposure to the ocean, illumination variability, or sea conditions---makes detecting underwater objects a particularly challenging task. The contribution of this work to the field is twofold: on the one hand, we introduce a novel data collection for supervised learning---the CleanSea corpus---annotated at both the bounding-box and contour levels of the objects to contribute to the research and progress in the field; on the other hand, we devise and optimize a recognition model based on the reference Mask Object-Based Convolutional Neural Network for this set to establish a benchmark for future comparison and assess its performance in both simulated and real-world scenarios. Results show the relevance of the contributions, as the devised model is capable of properly addressing the detection and recognition of general debris when trained with the introduced CleanSea corpus.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
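As a rough illustration of the kind of recognition model benchmarked in the entry above, the snippet below adapts torchvision's off-the-shelf Mask R-CNN to a custom number of debris classes. The class count and the dataset wiring are placeholders, not the paper's setup, and the recipe assumes torchvision 0.13 or later.
# Adapting torchvision's off-the-shelf Mask R-CNN to a custom set of debris
# classes (placeholder class count; requires torchvision >= 0.13 for the
# "weights" argument). This is a generic recipe, not the paper's exact setup.
import torch
import torchvision
from torchvision.models.detection.faster_rcnn import FastRCNNPredictor
from torchvision.models.detection.mask_rcnn import MaskRCNNPredictor

def build_maskrcnn(num_classes):
    model = torchvision.models.detection.maskrcnn_resnet50_fpn(weights="DEFAULT")
    # Swap the box-classification head for the new label set (background included).
    in_feats = model.roi_heads.box_predictor.cls_score.in_features
    model.roi_heads.box_predictor = FastRCNNPredictor(in_feats, num_classes)
    # Swap the mask head as well.
    in_feats_mask = model.roi_heads.mask_predictor.conv5_mask.in_channels
    model.roi_heads.mask_predictor = MaskRCNNPredictor(in_feats_mask, 256, num_classes)
    return model

model = build_maskrcnn(num_classes=20)         # e.g. 19 debris categories + background
model.eval()
with torch.no_grad():
    preds = model([torch.rand(3, 480, 640)])   # dicts with boxes, labels, scores, masks
print(preds[0]["boxes"].shape)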
Iñesta, J. M.; Thomae, M. E.
An On-line Tool for Transcription of Music Scores: MuRET Presentation
Montreal (Canada), 01.05.2022.
Abstract | BibTeX | Tags: HispaMus
@misc{k520,
title = {An On-line Tool for Transcription of Music Scores: MuRET},
author = {J. M. Iñesta and M. E. Thomae},
year = {2022},
date = {2022-05-01},
urldate = {2022-05-01},
booktitle = {1st Int. Conf. The Sound of Future/The Future of Sound},
address = {Montreal (Canada)},
organization = {CIRMMT},
abstract = {MuRET is a machine-learning Optical Music Recognition (OMR) research tool that runs in the browser. It has been created to help in the transcription of music collections and to experiment with machine learning algorithms for OMR, and it is capable of working with different notations and writing styles. Why use machine learning? Instead of designing a system to solve the task, we have designed a system that learns how to solve the task from sets of labeled (solved) images. In this way, it is adaptable to new (previously unseen) collections.},
key = {OMR, Machine Learning},
keywords = {HispaMus},
pubstate = {published},
tppubtype = {presentation}
}
de la Fuente, C.; Valero-Mas, J. J.; Castellanos, F. J.; Calvo-Zaragoza, J.
Multimodal Image and Audio Music Transcription Journal Article
In: International Journal of Multimedia Information Retrieval, vol. 11, pp. 77-84, 2022.
BibTeX | Tags: MultiScore
@article{k479,
title = {Multimodal Image and Audio Music Transcription},
author = {C. de la Fuente and J. J. Valero-Mas and F. J. Castellanos and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
journal = {International Journal of Multimedia Information Retrieval},
volume = {11},
pages = {77-84},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Castellanos, F. J.; Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.
Region-based Layout Analysis of Music Score Images Journal Article
In: Expert Systems with Applications, pp. 118211, 2022, ISSN: 0957-4174.
BibTeX | Tags: MultiScore
@article{k486,
title = {Region-based Layout Analysis of Music Score Images},
author = {F. J. Castellanos and C. Garrido-Munoz and A. Ríos-Vila and J. Calvo-Zaragoza},
issn = {0957-4174},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Expert Systems with Applications},
pages = {118211},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
Rosello, A.; Ayllon, E.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Test Sample Selection for Handwriting Recognition Through Language Modeling Proceedings Article
In: Pattern Recognition and Image Analysis - 10th Iberian Conference, IbPRIA 2022, Aveiro, Portugal, May 4-6, 2022, Proceedings, 2022.
BibTeX | Tags: MultiScore
@inproceedings{k498,
title = {Test Sample Selection for Handwriting Recognition Through Language Modeling},
author = {A. Rosello and E. Ayllon and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
booktitle = {Pattern Recognition and Image Analysis - 10th Iberian Conference, IbPRIA 2022, Aveiro, Portugal, May 4-6, 2022, Proceedings},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Ríos-Vila, A.; Iñesta, J. M.; Calvo-Zaragoza, J.
End-to-End Full-Page Optical Music Recognition for Mensural Notation Proceedings Article
In: Proceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR, Bangalore, India, 2022.
Abstract | BibTeX | Tags: Leonardo2021, MultiScore
@inproceedings{k499,
title = {End-to-End Full-Page Optical Music Recognition for Mensural Notation},
author = {A. Ríos-Vila and J. M. Iñesta and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 23rd International Society for Music Information Retrieval Conference, ISMIR},
address = {Bangalore, India},
abstract = {Optical Music Recognition (OMR) systems typically consider workflows that include several steps, such as staff detection, symbol recognition, and semantic reconstruction. However, fine-tuning these systems is costly due to the specific data labeling process that has to be performed to train models for each of these steps. In this paper, we present the first segmentation-free full-page OMR system that receives a page image and directly outputs the transcription in a single step. This model requires only the annotations of full score pages, which greatly alleviates the task of manual labeling. The model has been tested with early music written in mensural notation, for which the presented approach is especially beneficial. Results show that this methodology provides a solution with promising results and establishes a new line of research for holistic transcription of music score pages.},
keywords = {Leonardo2021, MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
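The segmentation-free idea described in the abstract above can be sketched as a plain image-to-sequence model: a convolutional encoder flattens the page into visual tokens and an autoregressive Transformer decoder emits the transcription. The sketch below assumes PyTorch; the sizes and vocabulary are illustrative and do not reproduce the model of the paper.
# Minimal image-to-sequence sketch (CNN encoder + autoregressive Transformer
# decoder) of the segmentation-free, page-level idea; sizes and vocabulary are
# illustrative and do not reproduce the model of the paper.
import torch
import torch.nn as nn

class Page2Seq(nn.Module):
    def __init__(self, vocab_size, d_model=256):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, d_model, 3, stride=2, padding=1), nn.ReLU(),
        )
        self.embed = nn.Embedding(vocab_size, d_model)
        layer = nn.TransformerDecoderLayer(d_model, nhead=8, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers=3)
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, page, tokens):
        # Flatten the 2-D feature map into a sequence of "visual tokens".
        mem = self.encoder(page).flatten(2).transpose(1, 2)   # (B, H'*W', d)
        tgt = self.embed(tokens)                              # (B, L, d)
        L = tokens.size(1)
        causal = torch.triu(torch.ones(L, L, dtype=torch.bool,
                                       device=tokens.device), diagonal=1)
        dec = self.decoder(tgt, mem, tgt_mask=causal)
        return self.out(dec)                                  # (B, L, vocab)

model = Page2Seq(vocab_size=100)
logits = model(torch.rand(2, 1, 128, 96), torch.randint(0, 100, (2, 20)))
print(logits.shape)    # torch.Size([2, 20, 100])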
Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.; Fujinaga, I.
Domain Adaptation for Staff-Region Retrieval of Music Score Images Journal Article
In: International Journal on Document Analysis and Recognition, 2022, ISSN: 1433-2825.
BibTeX | Tags: MultiScore
@article{k500,
title = {Domain Adaptation for Staff-Region Retrieval of Music Score Images},
author = {F. J. Castellanos and A. J. Gallego and J. Calvo-Zaragoza and I. Fujinaga},
issn = {1433-2825},
year = {2022},
date = {2022-01-01},
journal = {International Journal on Document Analysis and Recognition},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {article}
}
de la Fuente, C.; Castellanos, F. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Multimodal Recognition of Frustration during Game-Play with Deep Neural Networks Journal Article
In: Multimedia Tools and Applications, 2022.
BibTeX | Tags:
@article{k501,
title = {Multimodal Recognition of Frustration during Game-Play with Deep Neural Networks},
author = {C. de la Fuente and F. J. Castellanos and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Multimedia Tools and Applications},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Few-Shot Music Symbol Classification via Self-Supervised Learning and Nearest Neighbor Proceedings Article
In: Pattern Recognition. ICPR International Workshops and Challenges, 2022.
BibTeX | Tags: MultiScore
@inproceedings{k504,
title = {Few-Shot Music Symbol Classification via Self-Supervised Learning and Nearest Neighbor},
author = {M. Alfaro-Contreras and A. Ríos-Vila and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
booktitle = {Pattern Recognition. ICPR International Workshops and Challenges},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Sáez-Pérez, J.; Gallego, A. J.; Valero-Mas, J. J.; Calvo-Zaragoza, J.
Domain Adaptation in Robotics: A Study Case on Kitchen Utensil Recognition Proceedings Article
In: 10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA), 2022.
@inproceedings{k506,
title = {Domain Adaptation in Robotics: A Study Case on Kitchen Utensil Recognition},
author = {J. Sáez-Pérez and A. J. Gallego and J. J. Valero-Mas and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
booktitle = {10th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA)},
keywords = {ROMA},
pubstate = {published},
tppubtype = {inproceedings}
}
Alashhab, S.
Aplicaciones de visión artificial para ayuda a personas con dificultades visuales [Computer vision applications to assist people with visual impairments] PhD Thesis
2022.
BibTeX | Tags:
@phdthesis{k508,
title = {Aplicaciones de visión artificial para ayuda a personas con dificultades visuales},
author = {S. Alashhab},
editor = {Miguel Angel Lozano and Antonio Javier Gallego},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
organization = {Universidad de Alicante},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Bernabeu, M.
Búsqueda de imágenes similares usando técnicas de aprendizaje automático [Similar-image search using machine learning techniques] PhD Thesis
2022.
BibTeX | Tags:
@phdthesis{k509,
title = {Búsqueda de imágenes similares usando técnicas de aprendizaje automático},
author = {M. Bernabeu},
editor = {Antonio Pertusa and Antonio Javier Gallego},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
organization = {Universidad de Alicante},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Gallego, A. J.; Rico-Juan, J. R.; Valero-Mas, J. J.
Efficient k-nearest neighbor search based on clustering and adaptive k values Journal Article
In: Pattern Recognition, vol. 122, pp. 108356, 2022, ISSN: 0031-3203.
@article{k510,
title = {Efficient k-nearest neighbor search based on clustering and adaptive k values},
author = {A. J. Gallego and J. R. Rico-Juan and J. J. Valero-Mas},
issn = {0031-3203},
year = {2022},
date = {2022-01-01},
journal = {Pattern Recognition},
volume = {122},
pages = {108356},
abstract = {The k-Nearest Neighbor (kNN) algorithm is widely used in the supervised learning field and, particularly, in search and classification tasks, owing to its simplicity, competitive performance, and good statistical properties. However, its inherent inefficiency prevents its use in most modern applications due to the vast amount of data that the current technological evolution generates, which makes the optimization of kNN-based search strategies of particular interest. This paper introduces the caKD+ algorithm, which tackles this limitation by combining feature learning techniques, clustering methods, adaptive search parameters per cluster, and pre-calculated K-Dimensional Tree structures, resulting in a highly efficient search method. This proposal has been evaluated using 10 datasets, and the results show that caKD+ significantly outperforms 16 state-of-the-art efficient search methods while remaining as accurate as the exhaustive kNN search.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
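To make the general strategy of the abstract above concrete, the sketch below combines clustering with one pre-built KD-tree per cluster and routes each query to its nearest centroid. The adaptive per-cluster k values and the feature-learning stage of caKD+ are deliberately omitted, so this only approximates the idea rather than reproducing the published algorithm.
# Rough sketch of kNN search accelerated with clustering plus one pre-built
# KD-tree per cluster; the adaptive per-cluster k values and the feature
# learning of caKD+ are omitted, so this only approximates the idea.
import numpy as np
from scipy.spatial import cKDTree
from sklearn.cluster import KMeans

class ClusteredKNN:
    def __init__(self, n_clusters=16):
        self.n_clusters = n_clusters

    def fit(self, X, y):
        self.kmeans = KMeans(n_clusters=self.n_clusters, n_init=10).fit(X)
        self.trees, self.labels = [], []
        for c in range(self.n_clusters):
            idx = np.where(self.kmeans.labels_ == c)[0]
            self.trees.append(cKDTree(X[idx]))          # pre-calculated tree
            self.labels.append(y[idx])
        return self

    def predict(self, X, k=5):
        # Route each query to its closest centroid and search only that tree.
        clusters = self.kmeans.predict(X)
        out = np.empty(len(X), dtype=self.labels[0].dtype)
        for i, (x, c) in enumerate(zip(X, clusters)):
            kk = min(k, self.trees[c].n)                # guard against tiny clusters
            _, neigh = self.trees[c].query(x, k=kk)
            votes = self.labels[c][np.atleast_1d(neigh)]
            out[i] = np.bincount(votes).argmax()        # majority vote
        return out

# Toy usage with random data.
rng = np.random.default_rng(0)
X, y = rng.normal(size=(2000, 8)), rng.integers(0, 3, 2000)
clf = ClusteredKNN().fit(X[:1500], y[:1500])
print((clf.predict(X[1500:]) == y[1500:]).mean())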
Alashhab, S.; Gallego, A. J.; Lozano, M. Á.
Efficient gesture recognition for the assistance of visually impaired people using multi-head neural networks Journal Article
In: Engineering Applications of Artificial Intelligence, vol. 114, pp. 105188, 2022, ISSN: 0952-1976.
@article{k511,
title = {Efficient gesture recognition for the assistance of visually impaired people using multi-head neural networks},
author = {S. Alashhab and A. J. Gallego and M. Á. Lozano},
issn = {0952-1976},
year = {2022},
date = {2022-01-01},
journal = {Engineering Applications of Artificial Intelligence},
volume = {114},
pages = {105188},
abstract = {Existing research for the assistance of visually impaired people mainly focuses on solving a single task (such as reading a text or detecting an obstacle), hence forcing the user to switch applications to perform other actions. This paper proposes an interactive system for mobile devices controlled by hand gestures that allows the user to control the device and use several assistance tools by making simple static and dynamic hand gestures (e.g., pointing a finger at an object will show a description of it). The system is based on a multi-head neural network, which initially detects and classifies the gestures and, subsequently, depending on the gesture detected, performs a second stage that carries out the corresponding action. This architecture optimizes the resources required to perform different tasks, as it takes advantage of the information obtained from an initial backbone to perform different processes in a second stage. To train and evaluate the system, a dataset with about 40k images was manually compiled and labeled, including different types of hand gestures, backgrounds (indoors and outdoors), lighting conditions, etc. This dataset contains synthetic gestures (whose objective is to pre-train the system to improve the results) and real images captured using different mobile phones. The comparison made with nearly 50 state-of-the-art methods shows competitive results regarding the different actions performed by the system, such as the accuracy of classification and localization of gestures, or the generation of descriptions for objects and scenes.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
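A multi-head architecture of the kind described above can be summarized as a shared backbone whose features feed several task-specific heads, so the expensive computation is done once per image. The toy model below assumes PyTorch; the head names, class counts, and layer sizes are invented for illustration and are not the architecture of the paper.
# Toy multi-head network: a shared backbone computed once per image feeds
# several task-specific heads; head names, class counts and layer sizes are
# invented for illustration and are not the architecture of the paper.
import torch
import torch.nn as nn

class MultiHeadNet(nn.Module):
    def __init__(self, n_gestures=10, n_aux_classes=5):
        super().__init__()
        self.backbone = nn.Sequential(                 # shared feature extractor
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.gesture_head = nn.Linear(64, n_gestures)  # first-stage classification
        self.aux_head = nn.Linear(64, n_aux_classes)   # second-stage task head

    def forward(self, x):
        feats = self.backbone(x)                       # reused by every head
        return self.gesture_head(feats), self.aux_head(feats)

model = MultiHeadNet()
gesture_logits, aux_logits = model(torch.rand(4, 3, 224, 224))
print(gesture_logits.shape, aux_logits.shape)          # [4, 10] and [4, 5]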
Garrido-Munoz, C.; Ríos-Vila, A.; Calvo-Zaragoza, J.
A holistic approach for image-to-graph: application to optical music recognition Journal Article
In: International Journal on Document Analysis and Recognition, 2022.
BibTeX | Tags: Leonardo2021
@article{k522,
title = {A holistic approach for image-to-graph: application to optical music recognition},
author = {C. Garrido-Munoz and A. Ríos-Vila and J. Calvo-Zaragoza},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {International Journal on Document Analysis and Recognition},
keywords = {Leonardo2021},
pubstate = {published},
tppubtype = {article}
}
2021
Calvo-Zaragoza, J.; Pertusa, A.; Gallego, A. J.; Iñesta, J. M.; Micó, L.; Oncina, J.; Perez-Sancho, C.; de León, P. J. Ponce; Rizo, D.
MultiScore Project: Multimodal Transcription of Music Scores Proceedings Article
In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 3, 2021.
Links | BibTeX | Tags: MultiScore
@inproceedings{k481,
title = {MultiScore Project: Multimodal Transcription of Music Scores},
author = {J. Calvo-Zaragoza and A. Pertusa and A. J. Gallego and J. M. Iñesta and L. Micó and J. Oncina and C. Perez-Sancho and P. J. Ponce de León and D. Rizo},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/481/MML2021__MultiScore_Final.pdf},
year = {2021},
date = {2021-12-01},
urldate = {2021-12-01},
booktitle = {Proceedings of the 14th Machine Learning and Music Workshop},
pages = {3},
keywords = {MultiScore},
pubstate = {published},
tppubtype = {inproceedings}
}
Castellanos, F. J.; Gallego, A. J.; Calvo-Zaragoza, J.
An Unsupervised Domain Adaptation framework for Layout Analysis of Music Score Images Proceedings Article
In: Proceedings of the 14th Machine Learning and Music Workshop, pp. 6, 2021.
@inproceedings{k482,
title = {An Unsupervised Domain Adaptation framework for Layout Analysis of Music Score Images},
author = {F. J. Castellanos and A. J. Gallego and J. Calvo-Zaragoza},
year = {2021},
date = {2021-12-01},
booktitle = {Proceedings of the 14th Machine Learning and Music Workshop},
pages = {6},
keywords = {GRE19-04, ROMA},
pubstate = {published},
tppubtype = {inproceedings}
}