WordNet
WordNet это большая база данных слов английского языка. Nouns, verbs, adjectives and adverbs группируются into ряды of cognitive synonyms (synsets), each expressing a distinct concept. Synsets связаны между собой концептуально-семантическими и лексическими отношениями. В результате получается сеть взаимосвязанных между собой слов и концептов, навигация по которой осуществляется посредством окна браузера. Структура WordNet's делает его полезным инструментом в исследованиях по компьютерной лингвистике и обработке естественного языка.
WordNet отдаленно resembles a thesaurus, в том плане, что it groups words together based on their meanings. However, there are some important distinctions. First, WordNet interlinks not just сами слова—strings of letters—but specific senses of words. As a result, words that are found in close proximity to one another in the network are semantically однозначны. Second, WordNet принимает во внимание semantic relations among words, whereas the groupings of words in a thesaurus does not follow any explicit pattern other than meaning similarity.
Structure
The main relation among words in WordNet is synonymy, as between the words “shut” and “close” or “car” and “automobile”. Synonyms--words that denote the same concept and are interchangeable in many contexts--are grouped into unordered synsets. Each of WordNet’s 117 000 synsets is linked to other synsets посредством a small number of “conceptual relations.” Additionally, a synset contains a brief definition (“gloss”) and, in most cases, one or more short sentences illustrating the use of the synset members. Word forms with several distinct meanings are represented in as many distinct synsets. Thus, each “form-meaning” pair in WordNet is unique.
Relations
The most frequently encoded relation among synsets is the super-subordinate relation (also called hyperonymy, hyponymy or ISA relation). It links more general synsets like