Publications Repository - Gdańsk University of Technology

Page settings

polski
Publications Repository
Gdańsk University of Technology

Treść strony

Wikipedia and WordNet integration based on words co-occurrences

The article presents a method for automatic integration of two lexical resources: semantic dictionary WordNet and electronic encyclopaedia Wikipedia. Our goal is to add automatically an semantic tags - a WordNet synset identifier to the title of the Wikipedia article. We've analyze several different ap-proaches to these problem and implement our own solution, based on word occurrences in synsets descriptions and the article body. Application of our algorithm as a result gives Wikipedia articles automatically annotated with WordNet synsets, what gives semantic readability of the knowledge stored in encyclopaedia. The procedure results has been evaluated trough comparison with hand crafted golden standard. At the end of the article we introduce some possible modifications to improve our procedure and reach higher precision of disambiguation Wikipedia articles.

Authors

Additional information

Category
Publikacja monograficzna
Type
rozdział, artykuł w książce - dziele zbiorowym /podręczniku w języku o zasięgu międzynarodowym
Language
angielski
Publication year
2009

Source: MOSTWiedzy.pl - publication "Wikipedia and WordNet integration based on words co-occurrences" link open in new tab

Portal MOST Wiedzy link open in new tab