Oliva-Bulpitt, Samuel B.; Martinez-Esteso, Juan P.; Galan-Cuenca, Alejandro; Castellanos, Francisco J.; Gallego, Antonio Javier

Enhancing Music Score Analysis with Monte Carlo Dropout: A Probabilistic Approach to Staff-Region Detection Journal Article

In: International Journal on Document Analysis and Recognition, iss. Special Issue: ICDAR 2025, 2025, ISSN: 1433-2825.

BibTeX | Tags: CIACIF/2023/090, SmallOMR

F. J. Castellanos J. P. Martinez-Esteso, J. Calvo-Zaragoza

Maritime search and rescue missions with aerial images: A survey Journal Article

In: Computer Science Review, vol. 57, pp. 100736, 2025, ISSN: 1574-0137.

Links | BibTeX | Tags: TADMar

Kim, D.; Han, D.; Jeong, D.; Valero-Mas, J. J.

On the automatic recognition of Jeongganbo music notation: dataset and approach Journal Article

In: Journal on Computing and Cultural Heritage, 2025, ISSN: 1556-4673.

Abstract | BibTeX | Tags:

F. J. Castellanos J. P. Martinez-Esteso, A. Rosello

On the use of synthetic data for body detection in maritime search and rescue operations Journal Article

In: Engineering Applications of Artificial Intelligence, vol. 139, pp. 109586, 2024, ISSN: 0952-1976.

Links | BibTeX | Tags: TADMar

J. J Valero-Mas A. Galan-Cuenca, J. C. Martinez-Sevilla

MUSCAT: A Multimodal mUSic Collection for Automatic Transcription of Real Recordings and Image Scores Conference

MM '24: Proceedings of the 32nd ACM International Conference on Multimedia, Association for Computing Machinery, New York, NY, USA, 2024, ISBN: 979-8-400-70686-8.

Abstract | Links | BibTeX | Tags: MultiScore

@conference{nokey,

title = {MUSCAT: A Multimodal mUSic Collection for Automatic Transcription of Real Recordings and Image Scores},

author = {A. Galan-Cuenca, J. J Valero-Mas, J. C. Martinez-Sevilla, A. Hidalgo-Centeno, A. Pertusa, J. Calvo-Zaragoza},

url = {https://doi.org/10.1145/3664647.3681572},

doi = {10.1145/3664647.3681572},

isbn = {979-8-400-70686-8},

year  = {2024},

date = {2024-10-28},

urldate = {2024-10-28},

booktitle = {MM '24: Proceedings of the 32nd ACM International Conference on Multimedia},

pages = {583-591},

publisher = {Association for Computing Machinery},

address = {New York, NY, USA},

abstract = {Multimodal audio-image music transcription has been recently posed as a means of retrieving a digital score representation by leveraging the individual estimations from Automatic Music Transcription (AMT)---acoustic recordings---and Optical Music Recognition (OMR)---image scores---systems. Nevertheless, while proven to outperform single-modality recognition rates, this approach has been exclusively validated under controlled scenarios---monotimbral and monophonic synthetic data---mainly due to a lack of collections with symbolic score-level annotations for both recordings and graphical sheets. To promote research on this topic, this work presents the Multimodal mUSic Collection for Automatic Transcription (MUSCAT) assortment of acoustic recordings, image sheets, and their score-level annotations in several notation formats. This dataset comprises almost 80 hours of real recordings with varied instrumentation and polyphony degrees---ranging from piano to orchestral music---, 1251 scanned sheets, and 880 symbolic scores from 37 composers, which may also be used in other tasks involving metadata such as instrument identification or composer recognition. A fragmented subset of this collection solely focused on acoustic data for score-level AMT---the MUSic Collection for aUtomatic Transcription - fragmented Subset (MUSCUTS) assortment---is also presented together with a baseline experimentation, concluding the need to foster research on this field with real recordings. Finally, a web-based service is also provided to increase the size of the collections collaboratively.},

keywords = {MultiScore},

pubstate = {published},

tppubtype = {conference}

}

Close

Galan-Cuenca, A.; Valero-Mas, J. J.; Martinez-Sevilla, J. C.; Hidalgo-Centeno, A.; Pertusa, A.; Calvo-Zaragoza, J.

MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores Conference

Proceedings of the 32nd ACM International Conference on Multimedia, Association for Computing Machinery, 2024, ISBN: 979-8-4007-0686-8.

Abstract | Links | BibTeX | Tags:

@conference{nokey,

title = {MUSCAT: a Multimodal mUSic Collection for Automatic Transcription of real recordings and image scores},

author = {A. Galan-Cuenca and J. J. Valero-Mas and J. C. Martinez-Sevilla and A. Hidalgo-Centeno and A. Pertusa and J. Calvo-Zaragoza},

doi = {https://doi.org/10.1145/3664647.3681572},

isbn = {979-8-4007-0686-8},

year  = {2024},

date = {2024-10-28},

booktitle = {Proceedings of the 32nd ACM International Conference on Multimedia},

pages = {583-591},

publisher = {Association for Computing Machinery},

abstract = {Multimodal audio-image music transcription has been recently posed as a means of retrieving a digital score representation by leveraging the individual estimations from Automatic Music Transcription (AMT)---acoustic recordings---and Optical Music Recognition (OMR)---image scores---systems. Nevertheless, while proven to outperform single-modality recognition rates, this approach has been exclusively validated under controlled scenarios---monotimbral and monophonic synthetic data---mainly due to a lack of collections with symbolic score-level annotations for both recordings and graphical sheets. To promote research on this topic, this work presents the Multimodal mUSic Collection for Automatic Transcription (MUSCAT) assortment of acoustic recordings, image sheets, and their score-level annotations in several notation formats. This dataset comprises almost 80 hours of real recordings with varied instrumentation and polyphony degrees---ranging from piano to orchestral music---, 1251 scanned sheets, and 880 symbolic scores from 37 composers, which may also be used in other tasks involving metadata such as instrument identification or composer recognition. A fragmented subset of this collection solely focused on acoustic data for score-level AMT---the MUSic Collection for aUtomatic Transcription - fragmented Subset (MUSCUTS) assortment---is also presented together with a baseline experimentation, concluding the need to foster research on this field with real recordings. Finally, a web-based service is also provided to increase the size of the collections collaboratively.},

keywords = {},

pubstate = {published},

tppubtype = {conference}

}

Close

J. P. Martinez-Esteso F. J. Castellanos, A. Galán-Cuenca

A Region-Based Approach for Layout Analysis of Music Score Images in Scarce Data Scenarios Proceedings Article

In: Lecture Notes in Computer Science - International Conference on Document Analysis and Recognition, pp. 58-75, Springer, Athenes, Greece, 2024, ISBN: 978-3-031-70545-8.

Links | BibTeX | Tags: DOREMI

Penarrubia, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Contrastive Self-Supervised Learning for Optical Music Recognition Conference

International Workshop on Document Analysis Systems, 2024, ISBN: 978-3-031-70442-0.

Abstract | Links | BibTeX | Tags:

F. J. Castellanos E. Ayllon, J. Calvo-Zaragoza

Analysis of the Calibration of Handwriting Text Recognition Models Proceedings Article

In: Lecture Notes in Computer Science - International Conference on Document Analysis and Recognition, pp. 139-155, Springer, Athenes, Greece, 2024, ISBN: 978-3-031-70535-9.

Links | BibTeX | Tags: PolifonIA

Ríos-Vila, A.; Calvo-Zaragoza, J.; Paquet, T.

Sheet Music Transformer: End-To-End Optical Music Recognition Beyond Monophonic Transcription Conference

Document Analysis and Recognition - ICDAR 2024, vol. 1, Springer Nature Switzerland, 2024, ISBN: 978-3-031-70552-6.

BibTeX | Tags: MultiScore

Maciá, M.; Rizo, D.

The Impact of UX/UI on Piano-Assisted Learning in Extended Reality Conference

Computer Supported Music Education. Angers, France., 2024.

BibTeX | Tags:

Alfaro-Contreras, M.; Rios-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

A Transformer Approach for Polyphonic Audio-to-Score Transcription Proceedings Article

In: Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2024), Seul (Korea), 2024.

Links | BibTeX | Tags: MultiScore

Valero-Mas, J. J.; Gallego, A. J.; Rico-Juan, J. R.

An overview of ensemble and feature learning in few-shot image classification using siamese networks Journal Article

In: Multimedia Tools and Applications, vol. 83, pp. 19929–19952, 2024, ISSN: 1380-7501.

Links | BibTeX | Tags:

María Alfaro-Contreras Pedro González-Barrachina,; Calvo-Zaragoza, Jorge

Continual Learning for Music Classification Proceedings Article

In: International Society for Music Information Retrieval Conference, ISMIR, pp. 596-602, 2024.

Abstract | Links | BibTeX | Tags:

Rizo, David; Delgado-Sánchez, Teresa; Calvo-Zaragoza, Jorge; García-Iasci, Patricia; Madueño-Madueño, Antonio

Insights into AI to encode a whole mensural collection with limited resources Proceedings Article

In: International Association of Music Libraries, 2024.

BibTeX | Tags:

García-Iasci, Patricia; Rizo, David

EA-Digifolk: Digitizing and encoding Iris Traditional Music at ITMA Proceedings Article

In: International Association of Music Libraries, 2024.

BibTeX | Tags: PolifonIA

Rizo, David; López-Rocamora, Pablo; Pardo-Cayuela, Antonio

A workflow for Attribution Issues using Language Models Proceedings Article

In: International Association of Music Libraries, 2024.

BibTeX | Tags:

García-Iasci, Patricia; Martínez-Sevilla, Juan Carlos; Rizo, David; Calvo-Zaragoza, Jorge

Towards a standardization of lead sheet encoding: an experience in OMR Proceedings Article

In: Music Encoding Conference 2024, unknown, 2024.

Links | BibTeX | Tags:

Fuentes-Martínez, Eliseo; Ríos-Vila, Antonio; Martinez-Sevilla, Juan Carlos; Rizo, David; Calvo-Zaragoza, Jorge

Aligned Music Notation and Lyrics Transcription Journal Article

In: CoRR, vol. abs/2412.04217, 2024.

Links | BibTeX | Tags:

Ríos-Vila, Antonio; Calvo-Zaragoza, Jorge; Rizo, David; Paquet, Thierry

Sheet Music Transformer ++: End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music Journal Article

In: CoRR, vol. abs/2405.12105, 2024.

Links | BibTeX | Tags:

Martinez-Sevilla, Juan Carlos; Rizo, David; Calvo-Zaragoza, Jorge

Towards Universal Optical Music Recognition: A Case Study on Notation Types Proceedings Article

In: Kaneshiro, Blair; Mysore, Gautham J; Nieto, Oriol; Donahue, Chris; Huang, Cheng-Zhi Anna; Lee, Jin Ha; McFee, Brian; McCallum, Matthew C (Ed.): Proceedings of the 25th International Society for Music Information Retrieval Conference, ISMIR 2024, San Francisco, California, USA and Online, November 10-14, 2024, pp. 914-921, 2024.

Links | BibTeX | Tags:

Luna-Barahona, Noelia N; Rosello, Adrian; Alfaro-Contreras, Mar'ıa; Rizo, David; Calvo-Zaragoza, Jorge

Unsupervised Synthetic-to-Real Adaptation for Optical Music Recognition Proceedings Article

In: Kaneshiro, Blair; Mysore, Gautham J; Nieto, Oriol; Donahue, Chris; Huang, Cheng-Zhi Anna; Lee, Jin Ha; McFee, Brian; McCallum, Matthew C (Ed.): Proceedings of the 25th International Society for Music Information Retrieval Conference, ISMIR 2024, San Francisco, California, USA and Online, November 10-14, 2024, pp. 462-469, 2024.

Links | BibTeX | Tags:

Rizo, David; Calvo-Zaragoza, Jorge; Garc'ıa-Iasci, Patricia; Delgado-Sánchez, Teresa

Lessons Learned From a Project to Encode Mensural Music on a Large Scale With Optical Music Recognition Proceedings Article

In: Kaneshiro, Blair; Mysore, Gautham J; Nieto, Oriol; Donahue, Chris; Huang, Cheng-Zhi Anna; Lee, Jin Ha; McFee, Brian; McCallum, Matthew C (Ed.): Proceedings of the 25th International Society for Music Information Retrieval Conference, ISMIR 2024, San Francisco, California, USA and Online, November 10-14, 2024, pp. 225-231, 2024.

Links | BibTeX | Tags:

Roselló, Adrián; Fuentes-Martínez, Eliseo; Alfaro-Contreras, María; Rizo, David; Calvo-Zaragoza, Jorge

Source-Free Domain Adaptation for Optical Music Recognition Book Chapter

In: pp. 3-19, 2024, ISSN: 16113349.

Links | BibTeX | Tags:

Thomae, Martha E.; Rizo, David; Fuentes-Martínez, Eliseo; Raurich, Cristina Alís; Luca, Elsa De; Calvo-Zaragoza, Jorge

A Preliminary Proposal for a Systematic GABC Encoding of Gregorian Chant Proceedings Article

In: ACM International Conference Proceeding Series, pp. 45-53, Association for Computing Machinery, 2024, ISBN: 9798400717208.

Abstract | Links | BibTeX | Tags: Aquitanian neumes, GABC, Gregorian chant, MEI, music encoding, Plainchant, REPERTORIUM, square notation

@inproceedings{Thomae2024,

title = {A Preliminary Proposal for a Systematic GABC Encoding of Gregorian Chant},

author = {Martha E. Thomae and David Rizo and Eliseo Fuentes-Martínez and Cristina Alís Raurich and Elsa De Luca and Jorge Calvo-Zaragoza},

doi = {10.1145/3660570.3660581},

isbn = {9798400717208},

year  = {2024},

date = {2024-01-01},

urldate = {2024-01-01},

booktitle = {ACM International Conference Proceeding Series},

pages = {45-53},

publisher = {Association for Computing Machinery},

abstract = {In the last years, several approaches have addressed the encoding of the different music scripts used for plainchant. One of these approaches is the GABC format. While being a comprehensive symbolic representation of square notation, the lack of a formal specification for GABC usually leads to ambiguities, which must be avoided in the specification of any encoding format. Sometimes, the simple trial-and-error approach of entering the GABC code into an engraving system - such as Illuminare, Scrib.io, or GABC Transcription Tool - can solve this ambiguity. However, these engraving systems have shown some inconsistency among themselves when rendering GABC, sometimes displaying different music for the same code snippet. This paper presents a systematic approach to encoding Gregorian chant originally written in Aquitanian neumes and square notation to eliminate ambiguities inherent in the GABC specification. By formalizing the grammar of GABC, we address the challenges of inaccurate renderings in current music notation software. Our methodology includes developing a "Systematic GABC"(S-GABC) following a critical and scientific mentality to ensure the endurance of the notation. This paper demonstrates our system's effectiveness in standardizing Gregorian chant encoding, offering significant contributions to digital musicology and enhancing the accuracy of musical heritage digitization.},

keywords = {Aquitanian neumes, GABC, Gregorian chant, MEI, music encoding, Plainchant, REPERTORIUM, square notation},

pubstate = {published},

tppubtype = {inproceedings}

}

Close

Martinez-Esteso, Juan Pedro; Galan-Cuenca, Alejandro; Pérez-Sancho, Carlos; Castellanos, Francisco J.; Gallego, Antonio Javier

Human vs. Machine: Comparing Selection Strategies in Active Learning for Optical Music Recognition Proceedings Article

In: Proceedings of the 26th International Society for Music Information Retrieval Conference, 2023.

BibTeX | Tags: CIACIF/2023/090, LEMUR, SmallOMR

A. J. Gallego F. J. Castellanos, I. Fujinaga

A Few-Shot Neural Approach for Layout Analysis of Music Score Images Proceedings Article

In: Proceedings of the 24th International Society for Music Information Retrieval Conference, pp. 106-113, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.

Links | BibTeX | Tags: DOREMI

A. J. Gallego F. J. Castellanos, I. Fujinaga

A Preliminary Study of Few-shot Learning for Layout Analysis of Music Scores Working paper

2023.

Abstract | Links | BibTeX | Tags: DOREMI

A Rios-Vila Juan C Martinez-Sevilla, FJ Castellanos

Towards Music Notation and Lyrics Alignment: Gregorian Chants as Case Study Working paper

2023.

Abstract | Links | BibTeX | Tags: REPERTORIUM

Ramoneda, P.; Jeong, D.; Valero-Mas, J. J.; Serra, X.

Predicting performance difficulty from piano sheet music images Conference

Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.

Abstract | Links | BibTeX | Tags:

Penarrubia, C.; Garrido-Munoz, C.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Efficient notation assembly in optical music recognition Conference

Proceedings of the 24th International Society for Music Information Retrieval Conference, Milan, Italy, 2023, ISBN: 978-1-7327299-3-3.

BibTeX | Tags:

Martínez-Sevilla, J. C.; Ríos-Vila, A.; Castellanos, F. J.; Calvo-Zaragoza, J.

A Holistic Approach for Aligned Music and Lyrics Transcription Conference

Document Analysis and Recognition - ICDAR 2023, vol. 1, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-41676-7.

Abstract | Links | BibTeX | Tags: REPERTORIUM

Martínez-Sevilla, J. C.; Alfaro-Contreras, M.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Insights into end-to-end audio-to-score transcription with real recordings: A case study with saxophone works Proceedings Article

In: INTERSPEECH Conference, pp. 2793-2797, Dublin, Ireland, 2023.

Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Valero-Mas, J. J.; Iñesta, J. M.; Calvo-Zaragoza, J.

Multimodal Strategies for Image and Audio Music Transcription: A Comparative Study Proceedings Article

In: Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges. ICPR 2022. Lecture Notes in Computer Science, pp. 64-77, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-37731-0.

Links | BibTeX | Tags: MultiScore

F. J. Castellanos A. Rosello, J. P. Martinez-Esteso

Test-Time Augmentation for Document Image Binarization Proceedings Article

In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 158-169, Springer Nature Switzerland, 2023, ISBN: 978-3-031-36615-4.

Links | BibTeX | Tags: DOREMI

F. J. Castellanos E. Ayllon, J. Calvo-Zaragoza

A Weakly-Supervised Approach for Layout Analysis in Music Score Images Proceedings Article

In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 170-181, Springer Nature Switzerland, 2023, ISBN: 978-3-031-36615-4.

Links | BibTeX | Tags: MultiScore

Penarrubia, C.; Valero-Mas, J. J.; Gallego, A. J.; Calvo-Zaragoza, J.

Addressing Class Imbalance in Multilabel Prototype Generation for k-Nearest Neighbor Classification Conference

Iberian Conference on Pattern Recognition and Image Analysis, Alicante, Spain, 2023, ISBN: 978-3-031-36616-1.

Abstract | Links | BibTeX | Tags: DOREMI

González-Barrachina, P.; Alfaro-Contreras, M.; Nieto-Hidalgo, M.; Calvo-Zaragoza, J.

Lifelong Learning for Document Image Binarization: An Experimental Study Proceedings Article

In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 146-157, 2023.

Links | BibTeX | Tags: MultiScore

Garrido-Munoz, C.; Alfaro-Contreras, M.; Calvo-Zaragoza, J.

Evaluating Domain Generalization in Kitchen Utensils Classification Proceedings Article

In: Iberian Conference on Pattern Recognition and Image Analysis, pp. 108-118, 2023.

Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Iñesta, J. M.; Calvo-Zaragoza, J.

Optical Music Recognition for Homophonic Scores with Neural Networks and Synthetic Music Generation Journal Article

In: International Journal of Multimedia Information Retrieval, vol. 12, pp. 12-24, 2023.

Links | BibTeX | Tags: MultiScore

Ríos-Vila, A.; Rizo, D.; Iñesta, J. M.; Calvo-Zaragoza, J.

End-to-end optical music recognition for pianoform sheet music Journal Article

In: International Journal on Document Analysis and Recognition (IJDAR), iss. ICDAR 2023, 2023, ISSN: 1433-2825.

Abstract | Links | BibTeX | Tags: MultiScore

Alfaro-Contreras, M.; Ríos-Vila, A.; Valero-Mas, J. J.; Calvo-Zaragoza, J.

Few-Shot Symbol Classification via Self-Supervised Learning and Nearest Neighbor Journal Article

In: Pattern Recognition Letters, vol. 167, pp. 1-8, 2023.

Links | BibTeX | Tags: MultiScore

Rico-Juan, J. R.; Sánchez-Cartagena, V. M.; Valero-Mas, J. J.; Gallego, A. J.

Identifying student profiles within online judge systems using explainable artificial intelligence Journal Article

In: IEEE Transactions on Learning Technologies, vol. 16, no. 6, pp. 955-969, 2023, ISSN: 1939-1382.

Links | BibTeX | Tags:

Rizo, David; Delgado-Sánchez, Teresa; Calvo-Zaragoza, Jorge

Self-organization of sheet music through graphical patterns Proceedings Article

In: International Association of Music Libraries Congress, 2023.

BibTeX | Tags:

Rizo, David; Calvo-Zaragoza, Jorge; Martínez-Sevilla, Juan Carlos; Madueño, Antonio; García-Iasci, Patricia; Delgado-Sánchez, Teresa

Encoding in human centered machine learning workflows: case study on mensural ligature recognition Proceedings Article

In: Joint MEC TEI conference 2023, 2023.

BibTeX | Tags:

Martínez-Sevilla, Juan Carlos; Roselló, Adrián; Rizo, David; Calvo-Zaragoza, Jorge

On the Performance of Optical Music Recognition in the Absence of Specific Training Data Proceedings Article

In: Sarti, Augusto; Antonacci, Fabio; Sandler, Mark; Bestagini, Paolo; Dixon, Simon; Liang, Beici; Richard, Gaël; Pauwels, Johan (Ed.): Proceedings of the 24th International Society for Music Information Retrieval Conference, ISMIR 2023, Milan, Italy, November 5-9, 2023, pp. 319-326, 2023.

Links | BibTeX | Tags: