MEDIDAS DE SIMILITUD BASADAS EN CARACTERÍSTICAS PARA LA EVALUACIÓN DE RELACIONES TAXONÓMICAS (SIMILARITY MEASURES BASED ON FEATURES FOR THE EVALUATION OF TAXONOMIC RELATIONSHIPS)
Resumen
En una ontología, la similitud semántica entre un par de conceptos es una forma de saber qué tan similares son en base a su significado, mediante el conocimiento de la distancia entre los conceptos o en base a las características de los conceptos. En esta investigación, se propone un algoritmo para la evaluación de relaciones taxonómicas en una ontología de dominio de Inteligencia Artificial (IA) a través de la medida de exactitud. Las medidas de similitud implementadas en este artículo se basan en conocimiento, y dentro de este grupo de medidas existen las medidas basadas en estructura: Path, Wu-Palmer y Li, y las medidas basadas en características: cmatch, RE y Sánchez. La exactitud de las relaciones taxonómicas de tipo “is-a” en las medidas implementadas es de un 92%. Con los resultados experimentales comparados con las respuestas de validación de un experto de dominio, el sistema coincide en un 90% de exactitud.
In an ontology, semantic similarity between a pair of concepts is a way to find out what so similar they are, this is based on their meaning by computing the distance between concepts or it is based on the characteristics of the concepts. In this paper, an algorithm is proposed for the evaluation of taxonomic relationships in a domain ontology of Artificial Intelligence (AI) through the accuracy measure. The measures of similarity implemented in this research are based on knowledge, and within this group of measures, there are measures based on structure: Path, Wu-Palmer and Li, and measures based on characteristics: cmatch, RE and Sánchez. The accuracy for the "is-a" taxonomic relationships for the measures implemented is 92%. With the experimental results compared to the validation responses of a domain expert, the system matches the 90% of accuracy.
Texto completo:
588-605 PDFReferencias
Berners-Lee, T., Helder, J. & Lassila, O. The Semantic Web. Scientific American, no. 5, vol. 284, pp. 34-43, 2001.
Bird, S., Klein, E. & Loper, E. Natural Language Processing with Python. 1st edn. O’Reilly Media, Inc. 2009.
Collins, A. M. & Loftus, E. F. A spreading-activation theory of semantic processing. Psychological Review, vol. 86, no. 6, pp. 407-428, 1975.
Gruber, T. R. Toward principles for the design of ontologies used for knowledge sharing. International Journal of Human-Computer Studies, vol. 43, issues 5-6, pp. 907-928, 1995.
Harispe, S., Ranwez, S., Janaqi, S. & Montmain, J. Semantic Similarity from Natural Language and Ontology Analysis. Morgan & Claypool Publishers. 2015.
J. Jiang, Jay & W. Conrath, David. Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy. Proceedings of the International Conference on Research in Computational Linguistics. 1997.
Li, Y., Bandar, Z. A. & McLean, D. An Approach for Measuring Semantic Similarity Between Words Using Multiple Information Sources. IEEE Trans. on Knowl. and Data Eng., pp. 871-882, 2003.
Lin, D. An Information-Theoretic Definition of Similarity. In Proceedings of the Fifteenth International Conference on Machine Learning (ICML '98), Jude W. Shavlik (Ed.). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, pp. 296-304. 1998.
Liskov, B., & Wing, J.M. A Behavioral Notion of Subtyping. ACM Trans. Program. Lang. Syst., 16, pp. 1811-1841.1994.
Meadche, A. & Staab, S. Measuring Similarity Between Ontologies. Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management. Ontologies and the Semantic Web. Springer Berlin Heidelberg, pp. 251-263, 2002.
Petrakis, E. G. M., Varelas, G., Hliaoutakis, A. & Raftopoulou, P. X-Similarity: Computing Semantic Similarity bewtween Concepts from Different Ontologies. Journal of Digiral Information Management, vol. 4, pp. 233-237, 2006.
Rada, R., Miki, H., Bicknell, E. & Blettner, M. Development and application of a metric on semantic nets. In IEEE Transactions on Systems, Man, and Cybernetics, vol. 19, no. 1, pp. 17-30, 1989.
Rodriguez, M. A. & Egenhofer, M. J. Determining semantic similarity among entity classes from different ontologies. IEEE Transactions on Knowledge and Data Engineering, vol. 15, no. 2, pp. 442-456, 2003.
Sánchez, D., Batet, M., Isern, D. & Valls, A. Ontology-based semantic similarity: A new feature-based approach. Experts Systems with Applications, vol. 39, no. 9, pp. 7718-7728, 2012.
Slimani, T. Description and Evaluation of Semantic Similarity Measures Approaches. International Journal of Computer Applications. Vol 80. pp. 25-33. 2013.
Tovar, M., Pinto, D., Montes, A., Serna, J. G. A metric for the evaluation of restricted domain ontologies. Computación y Sistemas, vol. 22, no. 1, pp. 147-162, 2018.
Tversky, A. Features of Similarity. Psychological Rev, vol. 84, pp. 327-352, 1977.
Wu, Z. & Palmer, M. Verbs semantics and lexical selection. In Proceedings of the 32nd annual meeting on Association for Computational Linguistics (ACL ‘94). Association for Computational Linguistics, Stroudsburg, PA, USA, pp. 133-138, 1994.
Zhu, G. & Iglesias, C. A. Computing Semantic Similarity of Concepts in Knowledge Graphs, in IEEE Transactions on Knowledge and Data Engineering, vol. 29., no. 1, pp. 72-85, 2017.
Zouaq, A., Gasevic, D. & Hatala, M. Linguistic patterns for information extraction in ontocmaps. In Proceedings of the 3rd International Conference on Ontology Patterns, Blomqvist, E., Gangemi, A., Hammar, K. & Suárez-Figueroa, M. C. (Eds.), vol. 929, pp. 61-72, 2012.
URL de la licencia: https://creativecommons.org/licenses/by/3.0/deed.es
Pistas Educativas está bajo la Licencia Creative Commons Atribución 3.0 No portada.
TECNOLÓGICO NACIONAL DE MÉXICO / INSTITUTO TECNOLÓGICO DE CELAYA
Antonio García Cubas Pte #600 esq. Av. Tecnológico, Celaya, Gto. México
Tel. 461 61 17575 Ext 5450 y 5146
pistaseducativas@itcelaya.edu.mx