2014
@article{k304,
title = {The most probable string: an algorithmic study},
author = {C. De La Higuera and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/304/J+Logic+Computation-2013.pdf},
issn = {0955-792X},
year = {2014},
date = {2014-01-01},
urldate = {2014-01-01},
journal = {Journal of Logic and Computation},
volume = {24},
number = {2},
pages = {311-330},
abstract = {The problem of finding the consensus (most probable string) for a distribution generated by a weighted finite automaton or a probabilistic grammar is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is NP-hard if the automaton is probabilistic. We give a pseudo-polynomial algorithm that solves a decision problem directly associated with the consensus string and answers whether there is a (reasonably short) string whose probability is larger than a given bound, in time polynomial in the size of this bound, both for probabilistic finite automata and probabilistic context-free grammars. We also study a randomized algorithm solving the same problem. Finally, we report links between the length of the consensus string and the probability of this string.},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {article}
}
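For intuition, the decision problem described in this entry's abstract (is there a string whose probability exceeds a bound p?) can be read as a pruned breadth-first search over prefixes of a probabilistic finite automaton: a string's probability never exceeds the forward mass of any of its prefixes, and distinct same-length prefixes carry disjoint mass, so at most 1/p prefixes survive per length. The sketch below is only one illustrative reading, not the paper's algorithm; the PFA encoding, the function name `exists_string_above`, and the length cap are assumptions.

```python
def exists_string_above(init, trans, final, p, max_len=100):
    """Decide whether some string of length <= max_len has probability > p.
    init:  {state: initial probability}
    trans: {state: {symbol: [(next_state, probability), ...]}}
    final: {state: stopping probability}
    All names and the encoding are illustrative assumptions."""
    # Each frontier entry maps a prefix to its forward vector (mass per state).
    frontier = {(): dict(init)}
    for _ in range(max_len + 1):
        nxt = {}
        for prefix, fwd in frontier.items():
            # Probability that the automaton emits exactly this prefix.
            if sum(fwd[q] * final.get(q, 0.0) for q in fwd) > p:
                return True
            # Extend by one symbol, accumulating forward mass per next state.
            by_symbol = {}
            for q, mass in fwd.items():
                for sym, arcs in trans.get(q, {}).items():
                    for q2, pr in arcs:
                        vec = by_symbol.setdefault(sym, {})
                        vec[q2] = vec.get(q2, 0.0) + mass * pr
            # Prune: no completion of a prefix can beat its total mass.
            for sym, vec in by_symbol.items():
                if sum(vec.values()) > p:
                    nxt[prefix + (sym,)] = vec
        if not nxt:
            return False
        frontier = nxt
    return False
```

As a usage sketch: a one-state automaton that stops with probability 0.5 or loops on `a` with probability 0.5 assigns probability 0.5 to the empty string, so the answer is positive for p = 0.4 and negative for p = 0.6.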
@article{k308,
title = {A New Iterative Algorithm for Computing a Quality Approximated Median of Strings based on Edit Operations},
author = {J. Abreu and J. R. Rico-Juan},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/308/PRL+-+A+New+Iterative+Algorithm+for+Computing+a+Quality+Median.pdf},
year = {2014},
date = {2014-01-01},
journal = {Pattern Recognition Letters},
volume = {36},
pages = {74--80},
abstract = {This paper presents a new algorithm that can be used to compute an approximation to the median of a set of strings. The approximate median is obtained through the successive improvements of a partial solution. The edit distance from the partial solution to all the strings in the set is computed in each iteration, thus accounting for the frequency of each of the edit operations in all the positions of the approximate median. A goodness index for edit operations is later computed by multiplying their frequency by the cost. Each operation is tested, starting from that with the highest index, in order to verify whether applying it to the partial solution leads to an improvement. If successful, a new iteration begins from the new approximate median. The algorithm finishes when all the operations have been examined without a better solution being found. Comparative experiments involving Freeman chain codes encoding 2D shapes and the Copenhagen chromosome database show that the quality of the approximate median string is similar to benchmark approaches but achieves a much faster convergence.},
keywords = {TIASA},
pubstate = {published},
tppubtype = {article}
}
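The iterative scheme described in this entry's abstract can be sketched in a few lines, assuming unit edit costs (so the goodness index reduces to the operation's frequency). Function names (`edit_ops`, `approximate_median`) and the seeding choice are illustrative, not the authors' implementation.

```python
def edit_ops(a, b):
    """One minimal-cost edit script turning a into b, as
    (position_in_a, op, symbol) triples; unit costs, matches omitted."""
    n, m = len(a), len(b)
    d = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        d[i][0] = i
    for j in range(m + 1):
        d[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # delete a[i-1]
                          d[i][j - 1] + 1,          # insert b[j-1]
                          d[i - 1][j - 1] + cost)   # substitute / match
    ops, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and d[i][j] == d[i - 1][j - 1] + (a[i - 1] != b[j - 1]):
            if a[i - 1] != b[j - 1]:
                ops.append((i - 1, 'sub', b[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and d[i][j] == d[i - 1][j] + 1:
            ops.append((i - 1, 'del', None))
            i -= 1
        else:
            ops.append((i, 'ins', b[j - 1]))
            j -= 1
    return ops

def total_distance(cand, strings):
    return sum(len(edit_ops(cand, s)) for s in strings)

def approximate_median(strings):
    median = strings[0]                    # any member can seed the search
    best = total_distance(median, strings)
    improved = True
    while improved:
        improved = False
        # Tally how often each edit operation occurs across the whole set.
        freq = {}
        for s in strings:
            for op in edit_ops(median, s):
                freq[op] = freq.get(op, 0) + 1
        # Test operations by descending goodness (frequency x unit cost).
        for (pos, kind, sym), _ in sorted(freq.items(), key=lambda kv: -kv[1]):
            if kind == 'sub':
                cand = median[:pos] + sym + median[pos + 1:]
            elif kind == 'del':
                cand = median[:pos] + median[pos + 1:]
            else:
                cand = median[:pos] + sym + median[pos:]
            score = total_distance(cand, strings)
            if score < best:                # improvement: restart iteration
                median, best, improved = cand, score, True
                break
    return median
```

For example, on {"kitten", "sitten", "sittin"} the search moves from the seed "kitten" to a string whose summed distance to the set is 2, at which point no single edit improves it and the algorithm stops.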
2013
@inproceedings{k305,
title = {A Multimodal Genre Recognition Prototype},
author = {C. Pérez-Sancho and J. F. Bernabeu},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/305/wsrfai2013_submission_4.pdf},
isbn = {978-84-695-8332-6},
year = {2013},
date = {2013-09-01},
urldate = {2013-09-01},
booktitle = {Actas del III Workshop de Reconocimiento de Formas y Análisis de Imágenes},
pages = {13-16},
address = {Madrid, Spain},
abstract = {In this paper, a multimodal and interactive prototype to perform music genre classification is presented. The system is oriented to multi-part files in symbolic format but it can be adapted using a transcription system to transform audio content in music scores. This prototype uses different sources of information to give a possible answer to the user. It has been developed to allow a human expert to interact with the system to improve its results. In its current implementation, it offers a limited range of interaction and multimodality. Further development aimed at full interactivity and multimodal interactions is discussed.},
keywords = {DRIMS, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{k306,
title = {Human-Computer Interaction for Optical Music Recognition tasks},
author = {J. Calvo-Zaragoza and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/306/wsrfai2013_submission_3.pdf},
isbn = {978-84-695-8332-6},
year = {2013},
date = {2013-09-01},
urldate = {2013-09-01},
booktitle = {Actas del III Workshop de Reconocimiento de Formas y Análisis de Imágenes},
pages = {9-12},
address = {Madrid, Spain},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
@article{k301,
title = {Which fast nearest neighbour search algorithm to use?},
author = {A. Serrano and L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/301/IbPria13.pdf},
year = {2013},
date = {2013-06-01},
journal = {Lecture Notes in Computer Science},
volume = {7887},
pages = {567-574},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {article}
}
@book{k302,
title = {Pattern Recognition and Image Analysis: 6th Iberian Conference, IbPRIA 2013},
author = {J. M. Sanches and L. Micó and J. S. Cardoso},
editor = {J. M. Sanches and L. Micó and J. S. Cardoso},
year = {2013},
date = {2013-01-01},
publisher = {Springer},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {book}
}
@inproceedings{k307,
title = {Recognition of Online Handwritten Music Symbols},
author = {J. Calvo-Zaragoza and J. Oncina and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/307/calvozaragoza-mml13.pdf},
year = {2013},
date = {2013-01-01},
urldate = {2013-01-01},
booktitle = {Proceedings of the 6th International Workshop on Machine Learning and Music},
address = {Prague, Czech Republic},
abstract = {An effective way of digitizing a new musical composition is to use an e-pen and tablet application in which the user's pen strokes are recognized online and the digital score is created with the sole effort of the composition itself. This work aims to be a starting point for research on the recognition of online handwritten music notation. To this end, different alternatives within the two modalities of recognition resulting from this data are presented: online recognition, which uses the strokes marked by a pen, and offline recognition, which uses the image generated after drawing the symbol. A comparative experiment with common machine learning algorithms over a dataset of 3800 samples and 32 different music symbols is presented. Results show that samples of the actual user are needed if good classification rates are pursued. Moreover, algorithms using the online data, on average, achieve better classification results than the others.},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
2012
@article{k283,
title = {New rank methods for reducing the size of the training set using the nearest neighbor rule},
author = {J. R. Rico-Juan and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/283/rankTrainingSet.pdf},
issn = {0167-8655},
year = {2012},
date = {2012-04-01},
journal = {Pattern Recognition Letters},
volume = {33},
number = {5},
pages = {654--660},
doi = {10.1016/j.patrec.2011.07.019},
abstract = {Some new rank methods to select the best prototypes from a training set are proposed in this paper in order to establish its size according to an external parameter, while maintaining the classification accuracy. The traditional methods that filter the training set in a classification task like editing or condensing have some rules that apply to the set in order to remove outliers or keep some prototypes that help in the classification. In our approach, new voting methods are proposed to compute the prototype probability and help to classify correctly a new sample. This probability is the key to sorting the training set out, so a relevance factor from 0 to 1 is used to select the best candidates for each class whose accumulated probabilities are less than that parameter. This approach makes it possible to select the number of prototypes necessary to maintain or even increase the classification accuracy. The results obtained in different high dimensional databases show that these methods maintain the final error rate while reducing the size of the training set.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
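One hypothetical reading of the rank-and-select scheme sketched in this entry's abstract: each training sample votes for a prototype that helps classify it (here, its nearest same-class neighbour), votes are normalised into per-class probabilities, and the best-ranked prototypes of each class are kept until their accumulated probability reaches the relevance factor. The voting rule and all names are illustrative assumptions, not the paper's exact method.

```python
def reduce_training_set(X, y, dist, relevance):
    """Return indices of the prototypes kept for a given relevance in [0, 1].
    X: samples; y: class labels; dist: distance function (assumptions)."""
    votes = [0.0] * len(X)
    for i in range(len(X)):
        # Each sample votes for its nearest same-class neighbour.
        best, best_d = None, float('inf')
        for j in range(len(X)):
            if j != i and y[j] == y[i] and dist(X[i], X[j]) < best_d:
                best, best_d = j, dist(X[i], X[j])
        if best is not None:
            votes[best] += 1.0
    kept = []
    for label in set(y):
        idx = [i for i in range(len(X)) if y[i] == label]
        total = sum(votes[i] for i in idx) or 1.0
        acc = 0.0
        # Keep best-ranked prototypes while accumulated probability < relevance.
        for i in sorted(idx, key=lambda i: -votes[i]):
            if acc >= relevance:
                break
            kept.append(i)
            acc += votes[i] / total
    return kept
```

With a small one-dimensional toy set of two well-separated classes and relevance 0.5, only the most-voted prototype of each class survives, illustrating how the external parameter trades set size against coverage.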
@article{k290,
title = {Confidence voting method ensemble applied to off-line signature verification},
author = {J. R. Rico-Juan and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/290/rico_juanConfidenceVotingMethodEmsembleOffLineSignatureVerification.pdf},
issn = {1433-7541},
year = {2012},
date = {2012-04-01},
journal = {Pattern Analysis and Applications},
volume = {15},
number = {2},
pages = {113--120},
abstract = {In this paper, a new approach to off-line signature verification is proposed based on two-class classifiers using an ensemble of expert decisions. Different methods to extract sets of local and global features from the target sample are detailed. A normalisation by confidence voting method is also used in order to decrease the final equal error rate (EER). In one approach, each set of features is processed by a single expert; in the other approach proposed, the decisions of the individual classifiers are combined using weighted votes. Experimental results are given using a subcorpus of the large MCYT signature database for random and skilled forgeries. The results show that the weighted combination outperforms the individual classifiers significantly. The best EERs obtained were 6.3% in the case of skilled forgeries and 2.3% in the case of random forgeries.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
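The weighted-vote combination mentioned in this entry's abstract can be sketched minimally as below; the normalisation, the weights, and the function name are illustrative assumptions, not the paper's exact confidence-voting scheme.

```python
def weighted_vote(expert_scores, weights, threshold=0.5):
    """Combine per-expert confidences in (0, 1) that a signature is genuine,
    using reliability weights (both illustrative). Returns decision and the
    normalised ensemble confidence."""
    total = sum(weights)
    confidence = sum(w * s for w, s in zip(weights, expert_scores)) / total
    return ('genuine' if confidence >= threshold else 'forgery'), confidence
```

For instance, three experts scoring 0.9, 0.8 and 0.3 with weights 2, 1 and 1 yield an ensemble confidence of 0.725, so the weighted decision is "genuine" even though one expert disagrees.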
@inproceedings{k282,
title = {Restructuring Versus non Restructuring Insertions in MDF Indexes},
author = {A. Serrano and L. Micó and J. Oncina},
editor = {Pedro Latorre Carmona and J. Salvador Sánchez and Ana Fred},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/282/ICPRAM12.pdf},
isbn = {978-989-8425-98-0},
year = {2012},
date = {2012-02-01},
booktitle = {ICPRAM 2012: 1st International Conference on Pattern Recognition Applications and Methods},
pages = {474--480},
publisher = {SciTePress},
address = {Vilamoura, Portugal},
organization = {INSTICC},
abstract = {An MDF tree is a data structure (index) used to speed up similarity searches in huge databases. To achieve this goal, indexes should exploit some property of the dissimilarity measure. MDF indexes assume that the dissimilarity measure can be viewed as a distance in a metric space. Moreover, in this framework it is assumed that the distance is computationally very expensive, so counting distance computations is a good measure of time complexity.
To cope with a changing world, a problem arises when new points must be inserted into the index. Efficient algorithms must choose between being efficient in search, maintaining the “ideal” structure of the index, or being efficient when inserting but worsening the search time.
In this work we propose an insertion algorithm for MDF trees that focuses on optimizing insertion times. The worst-case time complexity of the algorithm depends only on the depth of the MDF tree. We compare this algorithm with a similar one that focuses on search time performance. We also study the range of applicability of each one.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
@article{k287,
title = {A log square average case algorithm to make insertions in fast similarity search},
author = {L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/287/prl.pdf},
year = {2012},
date = {2012-01-01},
journal = {Pattern Recognition Letters},
volume = {33},
number = {9},
pages = {1060--1065},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
@inproceedings{k289,
title = {Structural Graph Extraction from Images},
author = {A. J. Gallego-Sánchez and J. Calera-Rubio and D. López},
editor = {S. Omatu and Juan F. De Paz Santana and S. Rodríguez González and J. M. Molina and A. M. Bernardos},
isbn = {978-3-642-28764-0},
year = {2012},
date = {2012-01-01},
urldate = {2012-01-01},
booktitle = {Distributed Computing and Artificial Intelligence},
pages = {717-724},
publisher = {Springer Berlin / Heidelberg},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
@phdthesis{k293,
title = {Detección de regularidades en contornos 2D, cálculo aproximado de medianas y su aplicación en tareas de clasificación},
author = {J. Abreu},
editor = {J. R. Rico},
year = {2012},
date = {2012-01-01},
urldate = {2012-01-01},
organization = {Universidad de Alicante},
abstract = {In this work, we address two main problems: identifying regularities in contours encoded by Freeman chain codes, and constructing average contours from such contours. The proposed solutions rely on information gathered from the Levenshtein edit distance computation. We describe a new method for quantifying the regularity of contours and comparing them, when encoded by Freeman chain codes, in terms of a similarity criterion. The criterion used allows subsequences to be found from the minimal-cost edit sequence that specifies an alignment of contour segments which are similar. Two external parameters adjust the similarity criterion. The information about each similar part is encoded by strings that represent an average contour region. An explanation of how to construct a prototype based on the identified regularities is also reviewed. The reliability of the prototypes is evaluated by replacing groups of contours (samples) with new prototypes used as the training set in a classification task. This way, the size of the data set can be reduced without sensibly affecting its representational power for classification purposes. Experimental results show that this scheme achieves a reduction in the size of the training data set of about 80% while the classification error only increases by 0.45% in one of the three data sets studied. This thesis also presents a new fast algorithm for computing an approximation to the mean between two strings of characters representing a 2D shape, and its application to a new Wilson-based editing procedure. The approximate mean is built by including some symbols from the two original strings. In addition, a greedy variant of this algorithm is studied which reduces the time required to compute an approximate mean. The new dataset editing scheme relaxes the criterion for deleting instances proposed by the Wilson editing procedure.
In practice, not all instances misclassified by their near neighbors are pruned. Instead, an artificial instance is added to the dataset in the hope of successfully classifying the instance in the future. The new artificial instance is the approximate mean of the misclassified sample and its same-class nearest neighbor. Experiments carried out over three widely known databases of contours show that the proposed algorithms perform very well in computing the mean of two strings, outperforming methods proposed by other authors. In particular, the low computational time required by the heuristic approach makes it very suitable when dealing with long strings. Results also show that the proposed preprocessing scheme can reduce the classification error in about 83% of trials. There is empirical evidence that using the greedy approximation to compute the approximate mean does not affect the editing procedure's performance. Finally, a new algorithm with which to compute an approximation to the mean of a set of strings is presented. The approximate mean is computed through successive improvements of a partial solution. In each iteration, the edit distances from the partial solution to all the strings in the set are computed, thus accounting for the frequency of each edit operation in every position of the approximate mean. A goodness index for edit operations is then computed by multiplying their frequency by the cost. Each operation is tested, starting from that with the highest index, in order to verify whether applying it to the partial solution leads to an improvement. If successful, a new iteration begins from the new approximate mean. The algorithm finishes after all the operations have been examined without a better solution being found.
Comparative experiments involving Freeman chain codes encoding 2D shapes show that the quality of the approximate median string is similar to that of other approaches but achieves a much faster convergence.},
keywords = {ISIC 2010, TIASA},
pubstate = {published},
tppubtype = {phdthesis}
}
@inproceedings{k296,
title = {Inference of k-Testable Directed Acyclic Graph Languages},
author = {D. López and J. Calera-Rubio and A. J. Gallego-Sánchez},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/296/lopez12a.pdf},
year = {2012},
date = {2012-01-01},
booktitle = {Journal of Machine Learning Research: Workshop and Conference Proceedings, Vol. 21: ICGI 2012},
pages = {149-163},
abstract = {In this paper, we tackle the task of graph language learning. We first extend the well-known classes of k-testability and k-testability in the strict sense languages to directed graph languages. Second, we propose a graph automata model for directed acyclic graph languages. This graph automata model is used to propose a grammatical inference algorithm to learn the class of directed acyclic k-testable in the strict sense graph languages. The algorithm runs in polynomial time and identifies this class of languages from positive data.},
keywords = {PASCAL2, Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
In this paper, we tackle the task of graph language learning. We first extend the well-known classes of k-testability and k-testability in the strict sense languages to directed graph languages. Second, we propose a graph automata model for directed acyclic graph languages. This graph automata model is used to propose a grammatical inference algorithm to learn the class of directed acyclic k-testable in the strict sense graph languages. The algorithm runs in polynomial time and identifies this class of languages from positive data.
2011
Higuera, C. De La; Oncina, J.
Finding the most probable string and the consensus string: an algorithmic study Proceedings Article
In: 12th International Conference on Parsing Technologies (IWPT 2011), pp. 26-36, Dublin, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k288,
title = {Finding the most probable string and the consensus string: an algorithmic study},
author = {C. De La Higuera and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/288/iwpt2011.pdf},
year = {2011},
date = {2011-10-01},
urldate = {2011-10-01},
booktitle = {12th International Conference on Parsing Technologies (IWPT 2011)},
pages = {26-36},
address = {Dublin},
abstract = {The problem of finding the most probable string for a distribution generated by a weighted finite automaton is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is $NP$-hard if the automaton is probabilistic. In this paper we give a pseudo-polynomial algorithm which computes the most probable string in time polynomial in the inverse of the probability of this string itself. We also give a randomised algorithm solving the same problem and discuss the case where the distribution is generated by other types of machines.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
The problem of finding the most probable string for a distribution generated by a weighted finite automaton is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is $NP$-hard if the automaton is probabilistic. In this paper we give a pseudo-polynomial algorithm which computes the most probable string in time polynomial in the inverse of the probability of this string itself. We also give a randomised algorithm solving the same problem and discuss the case where the distribution is generated by other types of machines. Socorro, R.; Micó, L.; Oncina, J.
A fast pivot-based indexing algorithm for metric spaces Journal Article
In: Pattern Recognition Letters, vol. 32, no. 11, pp. 1511-1516, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k266,
title = {A fast pivot-based indexing algorithm for metric spaces},
author = {R. Socorro and L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/266/piaesa-prl.pdf},
year = {2011},
date = {2011-08-01},
urldate = {2011-08-01},
journal = {Pattern Recognition Letters},
volume = {32},
number = {11},
pages = {1511-1516},
abstract = {This work focuses on fast nearest neighbor (NN) search algorithms that can work in any metric space (not just with the Euclidean distance) and where the distance computation is very time consuming. One of the best known methods in this field is the AESA algorithm, used as a baseline for performance measurement for over twenty years. AESA repeats two steps: first it searches for a promising NN candidate and computes its distance (approximation step); next it eliminates all the unsuitable NN candidates in view of the new information acquired in the previous calculation (elimination step).
This work introduces the PiAESA algorithm. This algorithm improves the performance of AESA by splitting the approximation criterion: in the first iterations, when there is not enough information to find good NN candidates, it uses a list of pivots (objects in the database) to obtain a cheap approximation of the distance function. Once a good approximation is obtained, it switches to the usual AESA behavior. As the pivot list is built at preprocessing time, the run time of PiAESA is almost the same as that of AESA.
In this work, we report experiments comparing this approach with some competing methods. Our empirical results show that this new approach obtains a significant reduction of distance computations with no execution time penalty.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
This work focuses on fast nearest neighbor (NN) search algorithms that can work in any metric space (not just with the Euclidean distance) and where the distance computation is very time consuming. One of the best known methods in this field is the AESA algorithm, used as a baseline for performance measurement for over twenty years. AESA repeats two steps: first it searches for a promising NN candidate and computes its distance (approximation step); next it eliminates all the unsuitable NN candidates in view of the new information acquired in the previous calculation (elimination step).
This work introduces the PiAESA algorithm. This algorithm improves the performance of AESA by splitting the approximation criterion: in the first iterations, when there is not enough information to find good NN candidates, it uses a list of pivots (objects in the database) to obtain a cheap approximation of the distance function. Once a good approximation is obtained, it switches to the usual AESA behavior. As the pivot list is built at preprocessing time, the run time of PiAESA is almost the same as that of AESA.
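The approximation/elimination loop at the heart of AESA can be sketched as follows. This is a schematic illustration under the usual assumptions (all pairwise distances precomputed, triangle-inequality lower bounds), not the PiAESA code itself; all names are hypothetical:

```python
import math
from itertools import combinations

def euclid(u, v):
    return math.hypot(u[0] - v[0], u[1] - v[1])

def build_index(db, dist):
    """AESA preprocessing: store every pairwise distance (quadratic memory)."""
    table = {}
    for a, b in combinations(db, 2):
        table[(a, b)] = table[(b, a)] = dist(a, b)
    return table

def nn_search(db, table, q, dist):
    """Alternate approximation (pick the candidate with the smallest lower bound)
    and elimination (prune points whose triangle-inequality lower bound
    |d(q,s) - d(s,x)| already exceeds the best distance found so far)."""
    alive = {x: 0.0 for x in db}   # x -> current lower bound on dist(q, x)
    best, best_d, n_dist = None, float('inf'), 0
    while alive:
        s = min(alive, key=alive.get)      # approximation step
        del alive[s]
        d_qs = dist(q, s)
        n_dist += 1
        if d_qs < best_d:
            best, best_d = s, d_qs
        for x in list(alive):              # elimination step, s acts as a pivot
            lb = max(alive[x], abs(d_qs - table[(s, x)]))
            if lb >= best_d:
                del alive[x]               # cannot beat the current best
            else:
                alive[x] = lb              # keep the tightened bound
    return best, best_d, n_dist
```

The count `n_dist` is the figure of merit in this literature: the search typically computes far fewer distances to the query than the brute-force scan of the whole database.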
In this work, we report experiments comparing this approach with some competing methods. Our empirical results show that this new approach obtains a significant reduction of distance computations with no execution time penalty. Oncina, J.; Vidal, E.
Interactive Structured Output Prediction: Application to Chromosome Classification Journal Article
In: Pattern Recognition and Image Analysis (LNCS), vol. 6669, pp. 256-264, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k267,
title = {Interactive Structured Output Prediction: Application to Chromosome Classification},
author = {J. Oncina and E. Vidal},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/267/karyo.pdf},
year = {2011},
date = {2011-06-01},
urldate = {2011-06-01},
journal = {Pattern Recognition and Image Analysis (LNCS)},
volume = {6669},
pages = {256-264},
abstract = {Interactive Pattern Recognition concepts and techniques are applied to problems with structured output, i.e., problems in which the result is not just a simple class label, but a suitable structure of labels. For illustration purposes, (a simplification of) the problem of Human Karyotyping is considered. Results show that a) taking into account label dependencies in a karyogram significantly reduces the classical (non-interactive) chromosome label prediction error rate and b) results are further improved when interactive processing is adopted.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Interactive Pattern Recognition concepts and techniques are applied to problems with structured output, i.e., problems in which the result is not just a simple class label, but a suitable structure of labels. For illustration purposes, (a simplification of) the problem of Human Karyotyping is considered. Results show that a) taking into account label dependencies in a karyogram significantly reduces the classical (non-interactive) chromosome label prediction error rate and b) results are further improved when interactive processing is adopted. Bernabeu, J. F.; Calera-Rubio, J.; Iñesta, J. M.; Rizo, D.
Melodic Identification Using Probabilistic Tree Automata Journal Article
In: Journal of New Music Research, vol. 40, no. 2, pp. 93-103, 2011, ISSN: 0929-8215.
Abstract | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@article{k270,
title = {Melodic Identification Using Probabilistic Tree Automata},
author = {J. F. Bernabeu and J. Calera-Rubio and J. M. Iñesta and D. Rizo},
issn = {0929-8215},
year = {2011},
date = {2011-06-01},
urldate = {2011-06-01},
journal = {Journal of New Music Research},
volume = {40},
number = {2},
pages = {93-103},
abstract = {Similarity computation is a difficult issue in music information retrieval tasks, because it tries to emulate the special ability that humans show for pattern recognition in general, and particularly in the presence of noisy data. A number of works have addressed the problem of what is the best representation for symbolic music in this context. The tree representation, using rhythm for defining the tree structure and pitch information for leaf and node labelling has proven to be effective in melodic similarity computation. One of the main drawbacks of this approach is that the tree comparison algorithms are of a high time complexity. In this paper, stochastic k-testable tree-models are applied for computing the similarity between two melodies as a probability. The results are compared to those achieved by tree edit distances, showing that k-testable tree-models outperform other reference methods in both recognition rate and efficiency. The case study in this paper is to identify a snippet query among a set of songs stored in symbolic format. For it, the utilized method must be able to deal with inexact queries and with efficiency for scalability issues.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Similarity computation is a difficult issue in music information retrieval tasks, because it tries to emulate the special ability that humans show for pattern recognition in general, and particularly in the presence of noisy data. A number of works have addressed the problem of what is the best representation for symbolic music in this context. The tree representation, using rhythm for defining the tree structure and pitch information for leaf and node labelling has proven to be effective in melodic similarity computation. One of the main drawbacks of this approach is that the tree comparison algorithms are of a high time complexity. In this paper, stochastic k-testable tree-models are applied for computing the similarity between two melodies as a probability. The results are compared to those achieved by tree edit distances, showing that k-testable tree-models outperform other reference methods in both recognition rate and efficiency. The case study in this paper is to identify a snippet query among a set of songs stored in symbolic format. For it, the utilized method must be able to deal with inexact queries and with efficiency for scalability issues. Socorro, R.; Micó, L.; Oncina, J.
Efficient search supporting several similarity queries by reordering pivots Proceedings Article
In: Signal Processing, Pattern Recognition, and Applications (SPPRA 2011), pp. 114-120, ACTA Press, Innsbruck, Austria, 2011, ISBN: 978-0-88986-865-6.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k260,
title = {Efficient search supporting several similarity queries by reordering pivots},
author = {R. Socorro and L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/260/sppra.pdf},
isbn = {978-0-88986-865-6},
year = {2011},
date = {2011-02-01},
booktitle = {Signal Processing, Pattern Recognition, and Applications (SPPRA 2011)},
pages = {114-120},
publisher = {ACTA Press},
address = {Innsbruck, Austria},
abstract = {Effective similarity search indexing in general metric spaces has traditionally received special attention in several areas of interest like pattern recognition, computer vision or information retrieval. A typical method is based on the use of a distance as a dissimilarity function (not restricting to Euclidean distance) where the main objective is to speed up the search of the most similar object in a database by
minimising the number of distance computations. Several types of search can be defined, the most common being the k-nearest neighbour and the range search. AESA is one of the best known of such algorithms due to its performance (measured in distance computations). PiAESA is an AESA variant where the main objective has changed: instead of trying to find the best nearest neighbour candidate at each step, it tries to find the object that contributes most to tightening the lower bound function, that is, to a better estimation of the distance. In this paper we extend and test PiAESA to support several similarity queries. Our empirical results show that this approach obtains a significant improvement in performance when compared with competing algorithms.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Effective similarity search indexing in general metric spaces has traditionally received special attention in several areas of interest like pattern recognition, computer vision or information retrieval. A typical method is based on the use of a distance as a dissimilarity function (not restricting to Euclidean distance) where the main objective is to speed up the search of the most similar object in a database by
minimising the number of distance computations. Several types of search can be defined, the most common being the k-nearest neighbour and the range search. AESA is one of the best known of such algorithms due to its performance (measured in distance computations). PiAESA is an AESA variant where the main objective has changed: instead of trying to find the best nearest neighbour candidate at each step, it tries to find the object that contributes most to tightening the lower bound function, that is, to a better estimation of the distance. In this paper we extend and test PiAESA to support several similarity queries. Our empirical results show that this approach obtains a significant improvement in performance when compared with competing algorithms. Abreu, J.; Rico-Juan, J. R.
Characterization of contour regularities based on the Levenshtein edit distance Journal Article
In: Pattern Recognition Letters, vol. 32, pp. 1421-1427, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k263,
title = {Characterization of contour regularities based on the Levenshtein edit distance},
author = {J. Abreu and J. R. Rico-Juan},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/263/2009_J_IbPRIA.pdf},
year = {2011},
date = {2011-01-01},
urldate = {2011-01-01},
journal = {Pattern Recognition Letters},
volume = {32},
pages = {1421-1427},
abstract = {This paper describes a new method for quantifying the regularity of contours and comparing them (when encoded by Freeman chain codes) in terms of a similarity criterion which relies on information gathered from Levenshtein edit distance computation. The criterion used allows subsequences to be found from the minimal cost edit sequence that specifies an alignment of contour segments which are similar. Two external parameters adjust the similarity criterion. The information about each similar part is encoded by strings that represent an average contour region. An explanation of how to construct a prototype based on the identified regularities is also reviewed. The reliability of the prototypes is evaluated by replacing contour groups (samples) by new prototypes used as the training set in a classification task. This way, the size of the data set can be reduced without sensibly affecting its representational power for classification purposes. Experimental results show that this scheme achieves a reduction in the size of the training data set of about 80% while the classification error only increases by 0.45% in one of the three data sets studied.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
This paper describes a new method for quantifying the regularity of contours and comparing them (when encoded by Freeman chain codes) in terms of a similarity criterion which relies on information gathered from Levenshtein edit distance computation. The criterion used allows subsequences to be found from the minimal cost edit sequence that specifies an alignment of contour segments which are similar. Two external parameters adjust the similarity criterion. The information about each similar part is encoded by strings that represent an average contour region. An explanation of how to construct a prototype based on the identified regularities is also reviewed. The reliability of the prototypes is evaluated by replacing contour groups (samples) by new prototypes used as the training set in a classification task. This way, the size of the data set can be reduced without sensibly affecting its representational power for classification purposes. Experimental results show that this scheme achieves a reduction in the size of the training data set of about 80% while the classification error only increases by 0.45% in one of the three data sets studied. Bernabeu, J. F.; Calera-Rubio, J.; Iñesta, J. M.
Classifying melodies using tree grammars Journal Article
In: Lecture Notes in Computer Science, vol. 6669, pp. 572–579, 2011, ISSN: 0302-9743.
Abstract | Links | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@article{k264,
title = {Classifying melodies using tree grammars},
author = {J. F. Bernabeu and J. Calera-Rubio and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/264/ibpria2011-bernabeu.pdf},
issn = {0302-9743},
year = {2011},
date = {2011-01-01},
journal = {Lecture Notes in Computer Science},
volume = {6669},
pages = {572--579},
abstract = {Similarity computation is a difficult issue in music information retrieval, because it tries to emulate the special ability that humans show
for pattern recognition in general, and particularly in the presence of noisy data. A number of works have addressed the problem of what
is the best representation for symbolic music in this context. The tree representation, using rhythm for defining the tree structure and pitch information for leaf and node labeling, has proven to be effective in melodic similarity computation. In this paper we propose a solution for the case in which training melodies are represented by trees but duration information is not available for the input data. For that, we infer a probabilistic context-free grammar using the information in the trees (duration and pitch) and classify new melodies represented by strings using only the pitch. The case study in this paper is to identify a snippet query among a set of songs stored in symbolic format. For it, the utilized method must be able to deal with inexact queries and be efficient for scalability issues.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Similarity computation is a difficult issue in music information retrieval, because it tries to emulate the special ability that humans show for pattern recognition in general, and particularly in the presence of noisy data. A number of works have addressed the problem of what is the best representation for symbolic music in this context. The tree representation, using rhythm for defining the tree structure and pitch information for leaf and node labeling, has proven to be effective in melodic similarity computation. In this paper we propose a solution for the case in which training melodies are represented by trees but duration information is not available for the input data. For that, we infer a probabilistic context-free grammar using the information in the trees (duration and pitch) and classify new melodies represented by strings using only the pitch. The case study in this paper is to identify a snippet query among a set of songs stored in symbolic format. For it, the utilized method must be able to deal with inexact queries and be efficient for scalability issues. Serrano, A.; Micó, L.; Oncina, J.
Impact of the Initialization in Tree-Based Fast Similarity Search Techniques Proceedings Article
In: Pelillo, M.; Hancock, E. R. (Ed.): SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition, pp. 163-176, Springer, Venecia, Italia, 2011, ISBN: 978-3-642-24470-4.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k272,
title = {Impact of the Initialization in Tree-Based Fast Similarity Search Techniques},
author = {A. Serrano and L. Micó and J. Oncina},
editor = {M. Pelillo and E. R. Hancock},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/272/simbad11.pdf},
isbn = {978-3-642-24470-4},
year = {2011},
date = {2011-01-01},
booktitle = {SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition},
pages = {163-176},
publisher = {Springer},
address = {Venecia, Italia},
abstract = {Many fast similarity search techniques rely on the use of pivots (specially selected points in the data set). Using these points, specific structures (indexes) are built, speeding up the search when querying. Usually, pivot selection techniques are incremental, with the first pivot chosen at random.
This article explores several techniques to choose the first pivot in a tree-based fast similarity search technique. We provide experimental results showing that an adequate choice of this pivot leads to significant reductions in distance computations and time complexity.
Moreover, most pivot tree-based indexes emphasize building balanced trees. We provide experimental and theoretical support that very unbalanced trees can be a better choice than balanced ones.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Many fast similarity search techniques rely on the use of pivots (specially selected points in the data set). Using these points, specific structures (indexes) are built, speeding up the search when querying. Usually, pivot selection techniques are incremental, with the first pivot chosen at random.
This article explores several techniques to choose the first pivot in a tree-based fast similarity search technique. We provide experimental results showing that an adequate choice of this pivot leads to significant reductions in distance computations and time complexity.
Moreover, most pivot tree-based indexes emphasize building balanced trees. We provide experimental and theoretical support that very unbalanced trees can be a better choice than balanced ones.
2010
Calera-Rubio, J.; Bernabeu, J. F.
Tree language automata for melody recognition Proceedings Article
In: Pérez, Juan Carlos (Ed.): Actas del II Workshop de Reconocimiento de Formas y Análisis de Imágenes (AERFAI), pp. 17-22, AERFAI IBERGARCETA PUBLICACIONES, S.L., Valencia, Spain, 2010, ISBN: 978-84-92812-66-0.
Abstract | Links | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@inproceedings{k251,
title = {Tree language automata for melody recognition},
author = {J. Calera-Rubio and J. F. Bernabeu},
editor = {Juan Carlos Pérez},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/251/bernabeuCEDI2010Final.pdf},
isbn = {978-84-92812-66-0},
year = {2010},
date = {2010-09-01},
urldate = {2010-09-01},
booktitle = {Actas del II Workshop de Reconocimiento de Formas y Análisis de Imágenes (AERFAI)},
pages = {17-22},
publisher = {IBERGARCETA PUBLICACIONES, S.L.},
address = {Valencia, Spain},
organization = {AERFAI},
abstract = {The representation of symbolic music by
means of trees has shown to be suitable in
melodic similarity computation. In order to
compare trees, different tree edit distances
have been previously used, their complexity being a main drawback. In this paper, the application of stochastic k-testable tree-models for computing the similarity between two melodies as a probability, compared with the classical edit distance, has been addressed. The results show that k-testable tree-models seem to be adequate for the task, since they outperform other reference methods in both recognition rate and efficiency. The case study in this paper is to identify a snippet query among a set of songs. For it, the utilized method must be able to deal with inexact queries and be efficient for scalability issues.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
The representation of symbolic music by means of trees has shown to be suitable in melodic similarity computation. In order to compare trees, different tree edit distances have been previously used, their complexity being a main drawback. In this paper, the application of stochastic k-testable tree-models for computing the similarity between two melodies as a probability, compared with the classical edit distance, has been addressed. The results show that k-testable tree-models seem to be adequate for the task, since they outperform other reference methods in both recognition rate and efficiency. The case study in this paper is to identify a snippet query among a set of songs. For it, the utilized method must be able to deal with inexact queries and be efficient for scalability issues. Rico-Juan, J. R.; Abreu, J.
A new editing scheme based on a fast two-string median computation applied to OCR Proceedings Article
In: Hancok, E. R.; Wilson, R. C.; Ilkay, T. W.; Escolano, F. (Ed.): Structural, Syntactic, and Statistical Pattern Recognition, pp. 748–756, Springer, Cesme, Izmir, Turkey, 2010, ISBN: 978-3-642-14979-5.
Abstract | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k247,
title = {A new editing scheme based on a fast two-string median computation applied to OCR},
author = {J. R. Rico-Juan and J. Abreu},
editor = {E. R. Hancok and R. C. Wilson and T. W. Ilkay and F. Escolano},
isbn = {978-3-642-14979-5},
year = {2010},
date = {2010-08-01},
urldate = {2010-08-01},
booktitle = {Structural, Syntactic, and Statistical Pattern Recognition},
pages = {748--756},
publisher = {Springer},
address = {Cesme, Izmir, Turkey},
abstract = {This paper presents a new fast algorithm to compute an approximation to the median between two strings of characters representing a 2D shape, and its application to a new classification scheme to decrease its error rate. The median string results from the application of certain edit operations from the minimum cost edit sequence to one of the original strings. The new dataset editing scheme relaxes the criterion to delete instances proposed by the Wilson Editing Procedure. In practice, not all instances misclassified by their near neighbors are pruned. Instead, an artificial instance is added to the dataset in the hope of successfully classifying the instance in the future. The new artificial instance is the median of the misclassified sample and its same-class nearest neighbor. The experiments over two widely used datasets of handwritten characters show that this preprocessing scheme can reduce the classification error in about 78% of trials.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
This paper presents a new fast algorithm to compute an approximation to the median between two strings of characters representing a 2D shape, and its application to a new classification scheme to decrease its error rate. The median string results from the application of certain edit operations from the minimum cost edit sequence to one of the original strings. The new dataset editing scheme relaxes the criterion to delete instances proposed by the Wilson Editing Procedure. In practice, not all instances misclassified by their near neighbors are pruned. Instead, an artificial instance is added to the dataset in the hope of successfully classifying the instance in the future. The new artificial instance is the median of the misclassified sample and its same-class nearest neighbor. The experiments over two widely used datasets of handwritten characters show that this preprocessing scheme can reduce the classification error in about 78% of trials. Gómez-Ballester, E.; Micó, L.; Thollard, F.; Oncina, J.; Moreno-Seco, F.
Combining Elimination Rules in Tree-Based Nearest Neighbor Search Algorithms Proceedings Article
In: Hancok, E. R.; Wilson, R. C.; Ilkay, T. W.; Escolano, F. (Ed.): Structural, Syntactic, and Statistical Pattern Recognition, pp. 80–89, Springer, Cesme, Turkey, 2010, ISBN: 978-3-642-14979-5.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k249,
title = {Combining Elimination Rules in Tree-Based Nearest Neighbor Search Algorithms},
author = {E. Gómez-Ballester and L. Micó and F. Thollard and J. Oncina and F. Moreno-Seco},
editor = {E. R. Hancok and R. C. Wilson and T. W. Ilkay and F. Escolano},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/249/tr-ssspr2010.pdf},
isbn = {978-3-642-14979-5},
year = {2010},
date = {2010-08-01},
booktitle = {Structural, Syntactic, and Statistical Pattern Recognition},
pages = {80--89},
publisher = {Springer},
address = {Cesme, Turkey},
abstract = {A common activity in many pattern recognition tasks, image processing or clustering techniques involves searching a labeled data set looking for the nearest point to a given unlabelled sample. To reduce the computational overhead when the naive exhaustive search is applied, some fast nearest neighbor search (NNS) algorithms have appeared in the last years. Depending on the structure used to store the training set (usually a tree), different strategies to speed up the search have been defined. In this paper, a new algorithm based on the combination of different pruning rules is proposed. An experimental evaluation and comparison of its behavior with respect to other techniques has been performed, using both real and artificial data.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
A common activity in many pattern recognition tasks, image processing or clustering techniques involves searching a labeled data set looking for the nearest point to a given unlabelled sample. To reduce the computational overhead when the naive exhaustive search is applied, some fast nearest neighbor search (NNS) algorithms have appeared in the last years. Depending on the structure used to store the training set (usually a tree), different strategies to speed up the search have been defined. In this paper, a new algorithm based on the combination of different pruning rules is proposed. An experimental evaluation and comparison of its behavior with respect to other techniques has been performed, using both real and artificial data. Micó, L.; Oncina, J.
A Constant Average Time Algorithm to Allow Insertions in the LAESA Fast Nearest Neighbour Search Index Proceedings Article
In: Proc. of the 20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey, pp. 23–26, 2010.
Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k257,
title = {A Constant Average Time Algorithm to Allow Insertions in the LAESA Fast Nearest Neighbour Search Index},
author = {L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/257/icpr-2010.pdf},
year = {2010},
date = {2010-08-01},
booktitle = {Proc. of the 20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey},
pages = {23--26},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Verdú-Mas, J. L.
Gramáticas probabilisticas para la desambiguación sintáctica PhD Thesis
2010.
@phdthesis{k262,
title = {Gramáticas probabilisticas para la desambiguación sintáctica},
author = {J. L. Verdú-Mas},
editor = {Jorge Calera and Rafael Carrasco},
year = {2010},
date = {2010-01-01},
organization = {Univ. Alicante},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {phdthesis}
}
2009
Calera-Rubio, J.; Bernabeu, J. F.
A probabilistic approach to melodic similarity Proceedings Article
In: Proceedings of MML 2009, pp. 48-53, 2009.
Abstract | Links | BibTeX | Tags: ARFAI, DRIMS, MIPRCV, PROSEMUS, TIASA
@inproceedings{k231,
title = {A probabilistic approach to melodic similarity},
author = {J. Calera-Rubio and J. F. Bernabeu},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/231/mml2009Bernabeu.pdf},
year = {2009},
date = {2009-01-01},
urldate = {2009-01-01},
booktitle = {Proceedings of MML 2009},
pages = {48-53},
abstract = {Melodic similarity is an important research topic in music information retrieval.
The representation of symbolic music by means of trees has proven suitable
for melodic similarity computation, because trees are able to encode rhythm in their
structure, leaving only pitch representations as a degree of freedom for coding.
In order to compare trees, different edit distances have been previously used.
In this paper, stochastic k-testable tree-models, formerly used in other domains
like structured document compression or natural language processing, have been
used for computing a similarity measure between melody trees as a probability
and their performance has been compared to a classical tree edit distance.},
keywords = {ARFAI, DRIMS, MIPRCV, PROSEMUS, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
2014
Higuera, C. De La; Oncina, J.
The most probable string: an algorithmic study Journal Article
In: Journal of Logic and Computation, vol. 24, no. 2, pp. 311-330, 2014, ISSN: 0955-792X.
Abstract | Links | BibTeX | Tags: Prometeo 2012, TIASA
@article{k304,
title = {The most probable string: an algorithmic study},
author = {C. De La Higuera and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/304/J+Logic+Computation-2013.pdf},
issn = {0955-792X},
year = {2014},
date = {2014-01-01},
urldate = {2014-01-01},
journal = {Journal of Logic and Computation},
volume = {24},
number = {2},
pages = {311-330},
abstract = {The problem of finding the consensus (most probable string) for a distribution generated by a weighted finite automaton or a probabilistic grammar is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is NP-hard if the automaton is probabilistic. We give a pseudo-polynomial algorithm that solves a decision problem directly associated with the consensus string and answers if there is a (reasonably short) string whose probability is larger than a given bound, in time polynomial in the size of this bound, both for probabilistic finite automata and probabilistic context-free grammars. We also study a randomized algorithm solving the same problem. Finally, we report links between the length of the consensus string and the probability of this string.},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {article}
}
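The search strategy sketched in the abstract above can be illustrated in miniature. The following is a hypothetical best-first search over a toy probabilistic finite automaton (not the authors' published algorithm): since the probability of a prefix never increases when the prefix is extended, prefix probability is both a valid search priority and a pruning bound.

```python
import heapq

def most_probable_string(trans, final, start=0, bound=0.0):
    """Best-first search for the consensus (most probable) string of a PFA.

    trans: dict state -> {symbol: (next_state, prob)}
    final: dict state -> stopping probability
    Returns (string, probability) for the most probable string whose
    probability exceeds `bound`, or (None, bound) if there is none.
    """
    best_string, best_prob = None, bound
    heap = [(-1.0, "", start)]          # (-prefix_prob, prefix, state)
    while heap:
        neg, prefix, q = heapq.heappop(heap)
        prefix_prob = -neg
        if prefix_prob <= best_prob:    # no remaining prefix can do better
            break
        string_prob = prefix_prob * final.get(q, 0.0)
        if string_prob > best_prob:
            best_string, best_prob = prefix, string_prob
        for sym, (nxt, p) in trans.get(q, {}).items():
            if prefix_prob * p > best_prob:
                heapq.heappush(heap, (-(prefix_prob * p), prefix + sym, nxt))
    return best_string, best_prob
```

Termination mirrors the pseudo-polynomial bound discussed in the paper: the number of prefixes whose probability exceeds the bound is finite, so the heap is exhausted in time related to the inverse of that bound.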
Abreu, J.; Rico-Juan, J. R.
A New Iterative Algorithm for Computing a Quality Approximated Median of Strings based on Edit Operations Journal Article
In: Pattern Recognition Letters, vol. 36, pp. 74–80, 2014.
Abstract | Links | BibTeX | Tags: TIASA
@article{k308,
title = {A New Iterative Algorithm for Computing a Quality Approximated Median of Strings based on Edit Operations},
author = {J. Abreu and J. R. Rico-Juan},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/308/PRL+-+A+New+Iterative+Algorithm+for+Computing+a+Quality+Median.pdf},
year = {2014},
date = {2014-01-01},
journal = {Pattern Recognition Letters},
volume = {36},
pages = {74--80},
abstract = {This paper presents a new algorithm that can be used to compute an approximation to the median of a set of strings. The approximate median is obtained through the successive improvements of a partial solution. The edit distance from the partial solution to all the strings in the set is computed in each iteration, thus accounting for the frequency of each of the edit operations in all the positions of the approximate median. A goodness index for edit operations is later computed by multiplying their frequency by the cost. Each operation is tested, starting from that with the highest index, in order to verify whether applying it to the partial solution leads to an improvement. If successful, a new iteration begins from the new approximate median. The algorithm finishes when all the operations have been examined without a better solution being found. Comparative experiments involving Freeman chain codes encoding 2D shapes and the Copenhagen chromosome database show that the quality of the approximate median string is similar to benchmark approaches but achieves a much faster convergence.},
keywords = {TIASA},
pubstate = {published},
tppubtype = {article}
}
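The iterative-improvement idea in the abstract above can be sketched as follows. This is a simplified hill-climbing variant (it tests every single edit operation rather than ordering them by the paper's frequency-times-cost goodness index), shown only to illustrate the scheme:

```python
def levenshtein(a, b):
    # Standard dynamic-programming edit distance with unit costs.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1,
                           prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]

def total_distance(candidate, strings):
    return sum(levenshtein(candidate, s) for s in strings)

def approximate_median(strings):
    """Iteratively improve a partial solution by single edit operations."""
    alphabet = sorted({c for s in strings for c in s})
    median = strings[0]
    best = total_distance(median, strings)
    improved = True
    while improved:
        improved = False
        candidates = []
        for i in range(len(median) + 1):
            if i < len(median):                                 # deletion
                candidates.append(median[:i] + median[i + 1:])
            for c in alphabet:
                candidates.append(median[:i] + c + median[i:])  # insertion
                if i < len(median):                             # substitution
                    candidates.append(median[:i] + c + median[i + 1:])
        for cand in candidates:
            d = total_distance(cand, strings)
            if d < best:
                median, best, improved = cand, d, True
                break       # restart from the improved partial solution
    return median, best
```

The algorithm stops, as in the paper, when no tested edit operation yields a better solution; the goodness-index ordering only changes which improving operation is tried first.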
2013
Pérez-Sancho, C.; Bernabeu, J. F.
A Multimodal Genre Recognition Prototype Proceedings Article
In: Actas del III Workshop de Reconocimiento de Formas y Análisis de Imágenes, pp. 13-16, Madrid, Spain, 2013, ISBN: 978-84-695-8332-6.
Abstract | Links | BibTeX | Tags: DRIMS, TIASA
@inproceedings{k305,
title = {A Multimodal Genre Recognition Prototype},
author = {C. Pérez-Sancho and J. F. Bernabeu},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/305/wsrfai2013_submission_4.pdf},
isbn = {978-84-695-8332-6},
year = {2013},
date = {2013-09-01},
urldate = {2013-09-01},
booktitle = {Actas del III Workshop de Reconocimiento de Formas y Análisis de Imágenes},
pages = {13-16},
address = {Madrid, Spain},
abstract = {In this paper, a multimodal and interactive prototype to perform music genre classification is presented. The system is oriented to multi-part files in symbolic format, but it can be adapted using a transcription system to transform audio content into music scores. This prototype uses different sources of information to give a possible answer to the user. It has been developed to allow a human expert to interact with the system to improve its results. In its current implementation, it offers a limited range of interaction and multimodality. Further development aimed at full interactivity and multimodal interactions is discussed.},
keywords = {DRIMS, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Calvo-Zaragoza, J.; Oncina, J.
Human-Computer Interaction for Optical Music Recognition tasks Proceedings Article
In: Actas del III Workshop de Reconocimiento de Formas y Análisis de Imágenes, pp. 9-12, Madrid, Spain, 2013, ISBN: 978-84-695-8332-6.
Links | BibTeX | Tags: Prometeo 2012, TIASA
@inproceedings{k306,
title = {Human-Computer Interaction for Optical Music Recognition tasks},
author = {J. Calvo-Zaragoza and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/306/wsrfai2013_submission_3.pdf},
isbn = {978-84-695-8332-6},
year = {2013},
date = {2013-09-01},
urldate = {2013-09-01},
booktitle = {Actas del III Workshop de Reconocimiento de Formas y Análisis de Imágenes},
pages = {9-12},
address = {Madrid, Spain},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Serrano, A.; Micó, L.; Oncina, J.
Which fast nearest neighbour search algorithm to use? Journal Article
In: Lecture Notes in Computer Science, vol. 7887, pp. 567-574, 2013.
Links | BibTeX | Tags: Prometeo 2012, TIASA
@article{k301,
title = {Which fast nearest neighbour search algorithm to use?},
author = {A. Serrano and L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/301/IbPria13.pdf},
year = {2013},
date = {2013-06-01},
journal = {Lecture Notes in Computer Science},
volume = {7887},
pages = {567-574},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {article}
}
Sanches, J. M.; Micó, L.; Cardoso, J. S.
Pattern Recognition and Image Analysis 6th Iberian Conference, IbPRIA 2013 Book
Springer, 2013.
BibTeX | Tags: Prometeo 2012, TIASA
@book{k302,
title = {Pattern Recognition and Image Analysis 6th Iberian Conference, IbPRIA 2013},
author = {J. M. Sanches and L. Micó and J. S. Cardoso},
editor = {J. M. Sanches and L. Micó and J. S. Cardoso},
year = {2013},
date = {2013-01-01},
publisher = {Springer},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {book}
}
Calvo-Zaragoza, J.; Oncina, J.; Iñesta, J. M.
Recognition of Online Handwritten Music Symbols Proceedings Article
In: Proceedings of the 6th International Workshop on Machine Learning and Music, Prague, Czech Republic, 2013.
Abstract | Links | BibTeX | Tags: Prometeo 2012, TIASA
@inproceedings{k307,
title = {Recognition of Online Handwritten Music Symbols},
author = {J. Calvo-Zaragoza and J. Oncina and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/307/calvozaragoza-mml13.pdf},
year = {2013},
date = {2013-01-01},
urldate = {2013-01-01},
booktitle = {Proceedings of the 6th International Workshop on Machine Learning and Music},
address = {Prague, Czech Republic},
abstract = {An effective way of digitizing a new musical composition is to use an e-pen and tablet application in which the user's pen strokes are recognized online and the digital score is created with the sole effort of the composition itself. This work aims to be a starting point for research on the recognition of online handwritten music notation. To this end, different alternatives within the two modalities of recognition resulting from this data are presented: online recognition, which uses the strokes marked by a pen, and offline recognition, which uses the image generated after drawing the symbol. A comparative experiment with common machine learning algorithms over a dataset of 3800 samples and 32 different music symbols is presented. Results show that samples of the actual user are needed if good classification rates are pursued. Moreover, algorithms using the online data, on average, achieve better classification results than the others.},
keywords = {Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
2012
Rico-Juan, J. R.; Iñesta, J. M.
New rank methods for reducing the size of the training set using the nearest neighbor rule Journal Article
In: Pattern Recognition Letters, vol. 33, no. 5, pp. 654–660, 2012, ISSN: 0167-8655.
Abstract | Links | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@article{k283,
title = {New rank methods for reducing the size of the training set using the nearest neighbor rule},
author = {J. R. Rico-Juan and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/283/rankTrainingSet.pdf},
issn = {0167-8655},
year = {2012},
date = {2012-04-01},
journal = {Pattern Recognition Letters},
volume = {33},
number = {5},
pages = {654--660},
doi = {10.1016/j.patrec.2011.07.019},
abstract = {Some new rank methods to select the best prototypes from a training set are proposed in this paper in order to establish its size according to an external parameter, while maintaining the classification accuracy. Traditional methods that filter the training set in a classification task, like editing or condensing, apply rules to the set in order to remove outliers or keep prototypes that help in the classification. In our approach, new voting methods are proposed to compute the prototype probability and help to correctly classify a new sample. This probability is the key to sorting the training set, so a relevance factor from 0 to 1 is used to select the best candidates for each class whose accumulated probabilities are less than that parameter. This approach makes it possible to select the number of prototypes necessary to maintain or even increase the classification accuracy. The results obtained in different high-dimensional databases show that these methods maintain the final error rate while reducing the size of the training set.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
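One concrete voting scheme in the spirit of the abstract above can be sketched as follows. This is a hypothetical illustration, not the paper's exact method: each prototype earns a vote whenever it is the nearest same-class neighbour of another sample, votes are normalised into per-class probabilities, and prototypes are kept per class until the accumulated probability reaches the relevance factor r.

```python
import math

def select_prototypes(points, labels, r=0.9):
    """Rank prototypes by nearest-same-class-neighbour votes, then keep,
    per class, the top-ranked ones while accumulated probability < r."""
    n = len(points)
    votes = [0] * n
    for i in range(n):
        best, best_d = None, math.inf
        for j in range(n):
            if j != i and labels[j] == labels[i]:
                d = math.dist(points[i], points[j])
                if d < best_d:
                    best, best_d = j, d
        if best is not None:
            votes[best] += 1          # i's nearest same-class neighbour
    selected = []
    for cls in set(labels):
        idx = [i for i in range(n) if labels[i] == cls]
        total = sum(votes[i] for i in idx) or 1
        idx.sort(key=lambda i: votes[i], reverse=True)
        acc = 0.0
        for i in idx:
            if acc >= r:              # relevance factor reached for this class
                break
            selected.append(i)
            acc += votes[i] / total
    return sorted(selected)
```

Lowering r shrinks the retained set further, trading training-set size against classification accuracy, which is the external parameter the abstract refers to.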
Rico-Juan, J. R.; Iñesta, J. M.
Confidence voting method ensemble applied to off-line signature verification Journal Article
In: Pattern Analysis and Applications, vol. 15, no. 2, pp. 113–120, 2012, ISSN: 1433-7541.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k290,
title = {Confidence voting method ensemble applied to off-line signature verification},
author = {J. R. Rico-Juan and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/290/rico_juanConfidenceVotingMethodEmsembleOffLineSignatureVerification.pdf},
issn = {1433-7541},
year = {2012},
date = {2012-04-01},
journal = {Pattern Analysis and Applications},
volume = {15},
number = {2},
pages = {113--120},
abstract = {In this paper, a new approach to off-line signature verification is proposed, based on two-class classifiers using an ensemble of expert decisions. Different methods to extract sets of local and global features from the target sample are detailed. A normalisation by confidence voting is also used in order to decrease the final equal error rate (EER). In one approach each set of features is processed by a single expert; in the other, the decisions of the individual classifiers are combined using weighted votes. Experimental results are given using a subcorpus of the large MCYT signature database for random and skilled forgeries. The results show that the weighted combination outperforms the individual classifiers significantly. The best EERs obtained were 6.3% in the case of skilled forgeries and 2.3% in the case of random forgeries.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Serrano, A.; Micó, L.; Oncina, J.
Restructuring Versus non Restructuring Insertions in MDF Indexes Proceedings Article
In: Carmona, Pedro Latorre; Sánchez, J. Salvador; Fred, Ana (Ed.): ICPRAM 2012: 1st International Conference on Pattern Recognition Applications and Methods, pp. 474–480, INSTICC SciTePress, Vilamoura, Portugal, 2012, ISBN: 978-989-8425-98-0.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k282,
title = {Restructuring Versus non Restructuring Insertions in MDF Indexes},
author = {A. Serrano and L. Micó and J. Oncina},
editor = {Pedro Latorre Carmona and J. Salvador Sánchez and Ana Fred},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/282/ICPRAM12.pdf},
isbn = {978-989-8425-98-0},
year = {2012},
date = {2012-02-01},
booktitle = {ICPRAM 2012: 1st International Conference on Pattern Recognition Applications and Methods},
pages = {474--480},
publisher = {SciTePress},
address = {Vilamoura, Portugal},
organization = {INSTICC},
abstract = {The MDF tree is a data structure (index) used to speed up similarity searches in huge databases. To achieve this goal, indexes should exploit some property of the dissimilarity measure. MDF indexes assume that the dissimilarity measure can be viewed as a distance in a metric space. Moreover, in this framework it is assumed that the distance is computationally very expensive, so counting distance computations is a good measure of the time complexity.
To cope with a changing world, a problem arises when new points must be inserted in the index. Efficient algorithms must choose between being efficient in search, maintaining the “ideal” structure of the index, or being efficient when inserting but worsening the search time.
In this work we propose an insertion algorithm for MDF trees that focuses on optimizing insertion time. The worst-case time complexity of the algorithm depends only on the depth of the MDF tree. We compare this algorithm with a similar one that focuses on search-time performance. We also study the range of applicability of each one.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Micó, L.; Oncina, J.
A log square average case algorithm to make insertions in fast similarity search Journal Article
In: Pattern Recognition Letters, vol. 33, no. 9, pp. 1060–1065, 2012.
Links | BibTeX | Tags: MIPRCV, TIASA
@article{k287,
title = {A log square average case algorithm to make insertions in fast similarity search},
author = {L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/287/prl.pdf},
year = {2012},
date = {2012-01-01},
journal = {Pattern Recognition Letters},
volume = {33},
number = {9},
pages = {1060–1065},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Gallego-Sánchez, A. J.; Calera-Rubio, J.; López, D.
Structural Graph Extraction from Images Proceedings Article
In: Omatu, S.; Santana, Juan F. De Paz; González, S. Rodríguez; Molina, J. M.; Bernardos, A. M. (Ed.): Distributed Computing and Artificial Intelligence, pp. 717-724, Springer Berlin / Heidelberg, 2012, ISBN: 978-3-642-28764-0.
@inproceedings{k289,
title = {Structural Graph Extraction from Images},
author = {A. J. Gallego-Sánchez and J. Calera-Rubio and D. López},
editor = {S. Omatu and Juan F. De Paz Santana and S. Rodríguez González and J. M. Molina and A. M. Bernardos},
isbn = {978-3-642-28764-0},
year = {2012},
date = {2012-01-01},
urldate = {2012-01-01},
booktitle = {Distributed Computing and Artificial Intelligence},
pages = {717-724},
publisher = {Springer Berlin / Heidelberg},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Abreu, J.
Detección de regularidades en contornos 2D, cálculo aproximado de medianas y su aplicación en tareas de clasificación PhD Thesis
2012.
Abstract | BibTeX | Tags: ISIC 2010, TIASA
@phdthesis{k293,
title = {Detección de regularidades en contornos 2D, cálculo aproximado de medianas y su aplicación en tareas de clasificación},
author = {J. Abreu},
editor = {J. R. Rico},
year = {2012},
date = {2012-01-01},
urldate = {2012-01-01},
organization = {Universidad de Alicante},
abstract = {In this work, we address two main problems: identifying regularities in, and constructing average contours from, contours encoded by Freeman chain codes. Solutions for these problems are proposed which rely on the information gathered from the Levenshtein edit distance computation. We describe a new method for quantifying the regularity of contours and comparing them, when encoded by Freeman chain codes, in terms of a similarity criterion. The criterion used allows subsequences to be found from the minimal-cost edit sequence that specifies an alignment of contour segments which are similar. Two external parameters adjust the similarity criterion. The information about each similar part is encoded by strings that represent an average contour region. An explanation of how to construct a prototype based on the identified regularities is also reviewed. The reliability of the prototypes is evaluated by replacing groups of contour samples with new prototypes used as the training set in a classification task. This way, the size of the data set can be reduced without sensibly affecting its representational power for classification purposes. Experimental results show that this scheme achieves a reduction in the size of the training data set of about 80% while the classification error only increases by 0.45% in one of the three data sets studied. This thesis also presents a new fast algorithm for computing an approximation to the mean between two strings of characters representing a 2D shape, and its application to a new Wilson-based editing procedure. The approximate mean is built by including some symbols from the two original strings. Besides, a greedy variant of this algorithm is studied which reduces the time required to compute an approximate mean. The new dataset editing scheme relaxes the criterion for deleting instances proposed by the Wilson editing procedure.
In practice, not all instances misclassified by their nearest neighbors are pruned. Instead, an artificial instance is added to the dataset in the hope of successfully classifying the instance in the future. The new artificial instance is the approximate mean of the misclassified sample and its same-class nearest neighbor. Experiments carried out over three widely known databases of contours show that the proposed algorithm performs very well in computing the mean of two strings, outperforming methods proposed by other authors. In particular, the low computational time required by the heuristic approach makes it very suitable when dealing with long strings. Results also show the proposed preprocessing scheme can reduce the classification error in about 83% of trials. There is empirical evidence that using the greedy approximation to compute the approximate mean does not affect the editing procedure's performance. Finally, a new algorithm with which to compute an approximation to the mean of a set of strings is presented. The approximate mean is computed through successive improvements of a partial solution. In each iteration, the edit distances from the partial solution to all the strings in the set are computed, thus accounting for the frequency of each of the edit operations in every position of the approximate mean. A goodness index for edit operations is later computed by multiplying their frequency by the cost. Each operation is tested, starting from that with the highest index, in order to verify whether applying it to the partial solution leads to an improvement. If successful, a new iteration begins from the new approximate mean. The algorithm finishes after all the operations have been examined without a better solution being found.
Comparative experiments involving Freeman chain codes encoding 2D shapes show that the quality of the approximate median string is similar to other approaches but achieves a much faster convergence.},
keywords = {ISIC 2010, TIASA},
pubstate = {published},
tppubtype = {phdthesis}
}
López, D.; Calera-Rubio, J.; Gallego-Sánchez, A. J.
Inference of k-Testable Directed Acyclic Graph Languages Proceedings Article
In: Journal of Machine Learning Research: Workshop and Conference Proceedings, Vol. 21: ICGI 2012, pp. 149-163, 2012.
Abstract | Links | BibTeX | Tags: PASCAL2, Prometeo 2012, TIASA
@inproceedings{k296,
title = {Inference of k-Testable Directed Acyclic Graph Languages},
author = {D. López and J. Calera-Rubio and A. J. Gallego-Sánchez},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/296/lopez12a.pdf},
year = {2012},
date = {2012-01-01},
booktitle = {Journal of Machine Learning Research: Workshop and Conference Proceedings, Vol. 21: ICGI 2012},
pages = {149-163},
abstract = {In this paper, we tackle the task of graph language learning. We first extend the well-known classes of k-testability and k-testability in the strict sense languages to directed graph languages. Second, we propose a graph automata model for directed acyclic graph languages. This graph automata model is used to propose a grammatical inference algorithm to learn the class of directed acyclic k-testable in the strict sense graph languages. The algorithm runs in polynomial time and identifies this class of languages from positive data.},
keywords = {PASCAL2, Prometeo 2012, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
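For intuition about the class of languages involved, the string analogue of k-testable-in-the-strict-sense inference (not the paper's construction, which generalizes this to directed acyclic graphs) simply collects the allowed prefixes, suffixes and length-k segments from positive data:

```python
def learn_k_tss(sample, k=2):
    """Infer a k-testable-in-the-strict-sense string language from
    positive data: allowed short strings, prefixes/suffixes of length
    k-1, and segments (infixes) of length k."""
    short = {w for w in sample if len(w) < k}
    prefixes = {w[:k - 1] for w in sample if len(w) >= k}
    suffixes = {w[-(k - 1):] for w in sample if len(w) >= k}
    infixes = {w[i:i + k] for w in sample for i in range(len(w) - k + 1)}
    return short, prefixes, suffixes, infixes

def accepts(model, w, k=2):
    short, prefixes, suffixes, infixes = model
    if len(w) < k:
        return w in short
    return (w[:k - 1] in prefixes and w[-(k - 1):] in suffixes
            and all(w[i:i + k] in infixes for i in range(len(w) - k + 1)))
```

As with the graph case in the paper, inference runs in polynomial time and identifies the class in the limit from positive data: once every allowed prefix, suffix and segment has appeared in the sample, the hypothesis is exact.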
2011
Higuera, C. De La; Oncina, J.
Finding the most probable string and the consensus string: an algorithmic study Proceedings Article
In: 12th International Conference on Parsing Technologies (IWPT 2011), pp. 26-36, Dublin, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k288,
title = {Finding the most probable string and the consensus string: an algorithmic study},
author = {C. De La Higuera and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/288/iwpt2011.pdf},
year = {2011},
date = {2011-10-01},
urldate = {2011-10-01},
booktitle = {12th International Conference on Parsing Technologies (IWPT 2011)},
pages = {26-36},
address = {Dublin},
abstract = {The problem of finding the most probable string for a distribution generated by a weighted finite automaton is related to a number of important questions: computing the distance between two distributions or finding the best translation (the most probable one) given a probabilistic finite state transducer. The problem is undecidable with general weights and is $NP$-hard if the automaton is probabilistic. In this paper we give a pseudo-polynomial algorithm which computes the most probable string in time polynomial in the inverse of the probability of this string itself. We also give a randomised algorithm solving the same problem and discuss the case where the distribution is generated by other types of machines.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Socorro, R.; Micó, L.; Oncina, J.
A fast pivot-based indexing algorithm for metric spaces Journal Article
In: Pattern Recognition Letters, vol. 32, no. 11, pp. 1511-1516, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k266,
title = {A fast pivot-based indexing algorithm for metric spaces},
author = {R. Socorro and L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/266/piaesa-prl.pdf},
year = {2011},
date = {2011-08-01},
urldate = {2011-08-01},
journal = {Pattern Recognition Letters},
volume = {32},
number = {11},
pages = {1511-1516},
abstract = {This work focuses on fast nearest neighbor (NN) search algorithms that can work in any metric space (not just the Euclidean distance) and where the distance computation is very time consuming. One of the best known methods in this field is the AESA algorithm, used as a baseline for performance measurement for over twenty years. The AESA repeats two steps: first it searches for a promising NN candidate and computes its distance (approximation step), then it eliminates all the unsuitable NN candidates in view of the new information acquired in the previous calculation (elimination step).
This work introduces the PiAESA algorithm. This algorithm improves the performance of the AESA algorithm by splitting the approximation criterion: in the first iterations, when there is not enough information to find good NN candidates, it uses a list of pivots (objects in the database) to obtain a cheap approximation of the distance function. Once a good approximation is obtained, it switches to the usual AESA behavior. As the pivot list is built at preprocessing time, the running time of PiAESA is almost the same as that of AESA.
In this work, we report experiments comparing with some competing methods. Our empirical results show that this new approach obtains a significant reduction in distance computations with no execution-time penalty.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
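The pivot idea underlying this family of algorithms can be sketched in a generic, simplified form (a hypothetical LAESA-style search, not the published PiAESA): distances from a few pivots to every database object are precomputed, and the triangle inequality yields a cheap lower bound |d(q,p) − d(p,x)| that eliminates candidates without computing d(q,x).

```python
def nn_search(query, data, pivots, dist):
    """Pivot-based NN search: prune x when a triangle-inequality lower
    bound on dist(query, x) already exceeds the best distance found."""
    # Preprocessing (normally done once, offline): pivot-to-object distances.
    table = [[dist(p, x) for x in data] for p in pivots]
    qp = [dist(query, p) for p in pivots]   # query-to-pivot distances
    best_i, best_d = None, float("inf")
    for i, x in enumerate(data):
        # Tightest lower bound over all pivots.
        lb = max((abs(q - row[i]) for q, row in zip(qp, table)), default=0.0)
        if lb >= best_d:
            continue    # eliminated without computing dist(query, x)
        d = dist(query, x)
        if d < best_d:
            best_i, best_d = i, d
    return best_i, best_d
```

The payoff matches the setting the abstract describes: when `dist` is far more expensive than the bookkeeping, every pruned candidate is a saved distance computation.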
Oncina, J.; Vidal, E.
Interactive Structured Output Prediction: Application to Chromosome Classification Journal Article
In: Pattern Recognition and Image Analysis (LNCS), vol. 6669, pp. 256-264, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k267,
title = {Interactive Structured Output Prediction: Application to Chromosome Classification},
author = {J. Oncina and E. Vidal},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/267/karyo.pdf},
year = {2011},
date = {2011-06-01},
urldate = {2011-06-01},
journal = {Pattern Recognition and Image Analysis (LNCS)},
volume = {6669},
pages = {256-264},
abstract = {Interactive Pattern Recognition concepts and techniques are applied to problems with structured output, i.e. problems in which the result is not just a simple class label, but a suitable structure of labels. For illustration purposes, (a simplification of) the problem of Human Karyotyping is considered. Results show that a) taking into account label dependencies in a karyogram significantly reduces the classical (non-interactive) chromosome label prediction error rate and b) results are further improved when interactive processing is adopted.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Bernabeu, J. F.; Calera-Rubio, J.; Iñesta, J. M.; Rizo, D.
Melodic Identification Using Probabilistic Tree Automata Journal Article
In: Journal of New Music Research, vol. 40, no. 2, pp. 93-103, 2011, ISSN: 0929-8215.
Abstract | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@article{k270,
title = {Melodic Identification Using Probabilistic Tree Automata},
author = {J. F. Bernabeu and J. Calera-Rubio and J. M. Iñesta and D. Rizo},
issn = {0929-8215},
year = {2011},
date = {2011-06-01},
urldate = {2011-06-01},
journal = {Journal of New Music Research},
volume = {40},
number = {2},
pages = {93-103},
abstract = {Similarity computation is a difficult issue in music information retrieval tasks, because it tries to emulate the special ability that humans show for pattern recognition in general, and particularly in the presence of noisy data. A number of works have addressed the problem of finding the best representation for symbolic music in this context. The tree representation, using rhythm to define the tree structure and pitch information for leaf and node labelling, has proven to be effective in melodic similarity computation. One of the main drawbacks of this approach is the high time complexity of tree comparison algorithms. In this paper, stochastic k-testable tree-models are applied to compute the similarity between two melodies as a probability. The results are compared to those achieved by tree edit distances, showing that k-testable tree-models outperform other reference methods in both recognition rate and efficiency. The case study in this paper is to identify a snippet query among a set of songs stored in symbolic format. For this, the method must be able to deal with inexact queries and be efficient for scalability.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Socorro, R.; Micó, L.; Oncina, J.
Efficient search supporting several similarity queries by reordering pivots Proceedings Article
In: Signal Processing, Pattern Recognition, and Applications (SPPRA 2011), pp. 114-120, ACTA Press, Innsbruck, Austria, 2011, ISBN: 978-0-88986-865-6.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k260,
title = {Efficient search supporting several similarity queries by reordering pivots},
author = {R. Socorro and L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/260/sppra.pdf},
isbn = {978-0-88986-865-6},
year = {2011},
date = {2011-02-01},
booktitle = {Signal Processing, Pattern Recognition, and Applications (SPPRA 2011)},
pages = {114-120},
publisher = {ACTA Press},
address = {Innsbruck, Austria},
abstract = {Effective similarity search indexing in general metric spaces has traditionally received special attention in several areas of interest such as pattern recognition, computer vision or information retrieval. A typical method is based on the use of a distance as a dissimilarity function (not restricted to the Euclidean distance), where the main objective is to speed up the search for the most similar object in a database by minimising the number of distance computations. Several types of search can be defined, the k-nearest neighbour and range searches being the most common. AESA is one of the best-known such algorithms due to its performance (measured in distance computations). PiAESA is an AESA variant with a different main objective: instead of trying to find the best nearest neighbour candidate at each step, it tries to find the object that contributes the most to tightening the lower bound function, that is, to a better estimation of the distance. In this paper we extend and test PiAESA to support several similarity queries. Our empirical results show that this approach obtains a significant improvement in performance when compared with competing algorithms.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Abreu, J.; Rico-Juan, J. R.
Characterization of contour regularities based on the Levenshtein edit distance Journal Article
In: Pattern Recognition Letters, vol. 32, pp. 1421-1427, 2011.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@article{k263,
title = {Characterization of contour regularities based on the Levenshtein edit distance},
author = {J. Abreu and J. R. Rico-Juan},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/263/2009_J_IbPRIA.pdf},
year = {2011},
date = {2011-01-01},
urldate = {2011-01-01},
journal = {Pattern Recognition Letters},
volume = {32},
pages = {1421-1427},
abstract = {This paper describes a new method for quantifying the regularity of contours and comparing them (when encoded by Freeman chain codes) in terms of a similarity criterion which relies on information gathered from the Levenshtein edit distance computation. The criterion used allows subsequences to be found from the minimal-cost edit sequence that specify an alignment of contour segments which are similar. Two external parameters adjust the similarity criterion. The information about each similar part is encoded by strings that represent an average contour region. An explanation of how to construct a prototype based on the identified regularities is also reviewed. The reliability of the prototypes is evaluated by replacing contour groups (samples) with new prototypes used as the training set in a classification task. This way, the size of the data set can be reduced without significantly affecting its representational power for classification purposes. Experimental results show that this scheme achieves a reduction in the size of the training data set of about 80% while the classification error only increases by 0.45% in one of the three data sets studied.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Bernabeu, J. F.; Calera-Rubio, J.; Iñesta, J. M.
Classifying melodies using tree grammars Journal Article
In: Lecture Notes in Computer Science, vol. 6669, pp. 572–579, 2011, ISSN: 0302-9743.
Abstract | Links | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@article{k264,
title = {Classifying melodies using tree grammars},
author = {J. F. Bernabeu and J. Calera-Rubio and J. M. Iñesta},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/264/ibpria2011-bernabeu.pdf},
issn = {0302-9743},
year = {2011},
date = {2011-01-01},
journal = {Lecture Notes in Computer Science},
volume = {6669},
pages = {572--579},
abstract = {Similarity computation is a difficult issue in music information retrieval, because it tries to emulate the special ability that humans show for pattern recognition in general, and particularly in the presence of noisy data. A number of works have addressed the problem of finding the best representation for symbolic music in this context. The tree representation, using rhythm to define the tree structure and pitch information for leaf and node labeling, has proven to be effective in melodic similarity computation. In this paper we propose a solution for the case in which melodies are represented by trees for training but duration information is not available for the input data. For that, we infer a probabilistic context-free grammar using the information in the trees (duration and pitch) and classify new melodies represented by strings using only the pitch. The case study in this paper is to identify a snippet query among a set of songs stored in symbolic format. For this, the method must be able to deal with inexact queries and be efficient for scalability.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {article}
}
Serrano, A.; Micó, L.; Oncina, J.
Impact of the Initialization in Tree-Based Fast Similarity Search Techniques Proceedings Article
In: Pelillo, M.; Hancock, E. R. (Ed.): SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition, pp. 163-176, Springer, Venecia, Italia, 2011, ISBN: 978-3-642-24470-4.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k272,
title = {Impact of the Initialization in Tree-Based Fast Similarity Search Techniques},
author = {A. Serrano and L. Micó and J. Oncina},
editor = {M. Pelillo and E. R. Hancock},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/272/simbad11.pdf},
isbn = {978-3-642-24470-4},
year = {2011},
date = {2011-01-01},
booktitle = {SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition},
pages = {163-176},
publisher = {Springer},
address = {Venecia, Italia},
abstract = {Many fast similarity search techniques rely on the use of pivots (specially selected points in the data set). Using these points, specific structures (indexes) are built that speed up the search when querying. Usually, pivot selection techniques are incremental, the first pivot being randomly chosen. This article explores several techniques to choose the first pivot in a tree-based fast similarity search technique. We provide experimental results showing that an adequate choice of this pivot leads to significant reductions in distance computations and time complexity. Moreover, most pivot tree-based indexes emphasize building balanced trees. We provide experimental and theoretical support that very unbalanced trees can be a better choice than balanced ones.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
2010
Calera-Rubio, J.; Bernabeu, J. F.
Tree language automata for melody recognition Proceedings Article
In: Pérez, Juan Carlos (Ed.): Actas del II Workshop de Reconocimiento de Formas y Análisis de Imágenes (AERFAI), pp. 17-22, AERFAI IBERGARCETA PUBLICACIONES, S.L., Valencia, Spain, 2010, ISBN: 978-84-92812-66-0.
Abstract | Links | BibTeX | Tags: DRIMS, MIPRCV, TIASA
@inproceedings{k251,
title = {Tree language automata for melody recognition},
author = {J. Calera-Rubio and J. F. Bernabeu},
editor = {Juan Carlos Pérez},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/251/bernabeuCEDI2010Final.pdf},
isbn = {978-84-92812-66-0},
year = {2010},
date = {2010-09-01},
urldate = {2010-09-01},
booktitle = {Actas del II Workshop de Reconocimiento de Formas y Análisis de Imágenes (AERFAI)},
pages = {17-22},
publisher = {IBERGARCETA PUBLICACIONES, S.L.},
address = {Valencia, Spain},
organization = {AERFAI},
abstract = {The representation of symbolic music by means of trees has been shown to be suitable for melodic similarity computation. In order to compare trees, different tree edit distances have been used previously, their complexity being a main drawback. In this paper, the application of stochastic k-testable tree-models for computing the similarity between two melodies as a probability, compared to the classical edit distance, has been addressed. The results show that k-testable tree-models seem to be adequate for the task, since they outperform other reference methods in both recognition rate and efficiency. The case study in this paper is to identify a snippet query among a set of songs. For this, the method must be able to deal with inexact queries and be efficient for scalability.},
keywords = {DRIMS, MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Rico-Juan, J. R.; Abreu, J.
A new editing scheme based on a fast two-string median computation applied to OCR Proceedings Article
In: Hancok, E. R.; Wilson, R. C.; Ilkay, T. W.; Escolano, F. (Ed.): Structural, Syntactic, and Statistical Pattern Recognition, pp. 748–756, Springer, Cesme, Izmir, Turkey, 2010, ISBN: 978-3-642-14979-5.
Abstract | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k247,
title = {A new editing scheme based on a fast two-string median computation applied to OCR},
author = {J. R. Rico-Juan and J. Abreu},
editor = {E. R. Hancok and R. C. Wilson and T. W. Ilkay and F. Escolano},
isbn = {978-3-642-14979-5},
year = {2010},
date = {2010-08-01},
urldate = {2010-08-01},
booktitle = {Structural, Syntactic, and Statistical Pattern Recognition},
pages = {748--756},
publisher = {Springer},
address = {Cesme, Izmir, Turkey},
abstract = {This paper presents a new fast algorithm to compute an approximation to the median between two strings of characters representing a 2D shape, and its application to a new classification scheme to decrease its error rate. The median string results from applying certain edit operations from the minimum-cost edit sequence to one of the original strings. The new dataset editing scheme relaxes the criterion to delete instances proposed by the Wilson Editing Procedure. In practice, not all instances misclassified by their near neighbours are pruned. Instead, an artificial instance is added to the dataset, expecting to successfully classify the instance in the future. The new artificial instance is the median of the misclassified sample and its same-class nearest neighbour. The experiments over two widely used datasets of handwritten characters show this preprocessing scheme can reduce the classification error in about 78% of trials.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Gómez-Ballester, E.; Micó, L.; Thollard, F.; Oncina, J.; Moreno-Seco, F.
Combining Elimination Rules in Tree-Based Nearest Neighbor Search Algorithms Proceedings Article
In: Hancok, E. R.; Wilson, R. C.; Ilkay, T. W.; Escolano, F. (Ed.): Structural, Syntactic, and Statistical Pattern Recognition, pp. 80–89, Springer, Cesme, Turkey, 2010, ISBN: 978-3-642-14979-5.
Abstract | Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k249,
title = {Combining Elimination Rules in Tree-Based Nearest Neighbor Search Algorithms},
author = {E. Gómez-Ballester and L. Micó and F. Thollard and J. Oncina and F. Moreno-Seco},
editor = {E. R. Hancok and R. C. Wilson and T. W. Ilkay and F. Escolano},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/249/tr-ssspr2010.pdf},
isbn = {978-3-642-14979-5},
year = {2010},
date = {2010-08-01},
booktitle = {Structural, Syntactic, and Statistical Pattern Recognition},
pages = {80--89},
publisher = {Springer},
address = {Cesme, Turkey},
abstract = {A common activity in many pattern recognition tasks, image processing or clustering techniques involves searching a labeled data set for the point nearest to a given unlabelled sample. To reduce the computational overhead of the naive exhaustive search, several fast nearest neighbour search (NNS) algorithms have appeared in recent years. Depending on the structure used to store the training set (usually a tree), different strategies to speed up the search have been defined. In this paper, a new algorithm based on the combination of different pruning rules is proposed. An experimental evaluation and comparison of its behaviour with respect to other techniques has been performed, using both real and artificial data.},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Micó, L.; Oncina, J.
A Constant Average Time Algorithm to Allow Insertions in the LAESA Fast Nearest Neighbour Search Index Proceedings Article
In: Proc. of the 20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey, pp. 23–26, 2010.
Links | BibTeX | Tags: MIPRCV, TIASA
@inproceedings{k257,
title = {A Constant Average Time Algorithm to Allow Insertions in the LAESA Fast Nearest Neighbour Search Index},
author = {L. Micó and J. Oncina},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/257/icpr-2010.pdf},
year = {2010},
date = {2010-08-01},
booktitle = {Proc. of the 20th International Conference on Pattern Recognition, ICPR 2010, Istanbul, Turkey},
pages = {23--26},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}
Verdú-Mas, J. L.
Gramáticas probabilísticas para la desambiguación sintáctica PhD Thesis
2010.
@phdthesis{k262,
title = {Gramáticas probabilísticas para la desambiguación sintáctica},
author = {J. L. Verdú-Mas},
editor = {Jorge Calera and Rafael Carrasco},
year = {2010},
date = {2010-01-01},
organization = {Univ. Alicante},
keywords = {MIPRCV, TIASA},
pubstate = {published},
tppubtype = {phdthesis}
}
2009
Calera-Rubio, J.; Bernabeu, J. F.
A probabilistic approach to melodic similarity Proceedings Article
In: Proceedings of MML 2009, pp. 48-53, 2009.
Abstract | Links | BibTeX | Tags: ARFAI, DRIMS, MIPRCV, PROSEMUS, TIASA
@inproceedings{k231,
title = {A probabilistic approach to melodic similarity},
author = {J. Calera-Rubio and J. F. Bernabeu},
url = {https://grfia.dlsi.ua.es/repositori/grfia/pubs/231/mml2009Bernabeu.pdf},
year = {2009},
date = {2009-01-01},
urldate = {2009-01-01},
booktitle = {Proceedings of MML 2009},
pages = {48-53},
abstract = {Melodic similarity is an important research topic in music information retrieval. The representation of symbolic music by means of trees has proven to be suitable in melodic similarity computation, because they are able to code rhythm in their structure, leaving only pitch representations as a degree of freedom for coding. In order to compare trees, different edit distances have been previously used. In this paper, stochastic k-testable tree-models, formerly used in other domains like structured document compression or natural language processing, have been used for computing a similarity measure between melody trees as a probability, and their performance has been compared to a classical tree edit distance.},
keywords = {ARFAI, DRIMS, MIPRCV, PROSEMUS, TIASA},
pubstate = {published},
tppubtype = {inproceedings}
}