Sinica BOW:
Integrating Bilingual WordNet and SUMO Ontology Chu-Ren Huang
[email protected] Academia Sinica & Peking University NLPKE2003 Special Session:
Upper Ontology and Natural Language Processing Beijing, October 28th-30th, 2003
The Academia Sinica
Bilingual Ontological Wordnet http://BOW.sinica.edu.tw/
WordNet SUMO (Suggested Upper Merged Ontology) http://ontology.teknowledge.com English-Chinese translation equivalence database
The Construction of BOW Towards a linguistic infrastructure for knowledge representation and knowledge engineering The
Criteria
Lexicon-driven Multilingual
•Bilingual interface or •Chinese default/English mouse-over Domain
Inter-operable
What is in BOW
English-Chinese Translation Equivalents Database -includes all WordNet entries -manually checked, up to 3 Chinese translation for each English entry
SUMO as upper ontology -C-E bilingual ontology nodes -lexical-conceptual link
Domain Tag (under constr.)
BOW’ GOALS To give each linguistic form a rigorous conceptual location, To clarify the relation between conceptual classification and linguistic instantiation, and To facilitate genuine cross-lingual access of knowledge.
Functions of BOW Sense-based English-Chinese translation equivalency From English Word-Sense to Ontology and Inference From Chinese Word to Ontology and Inference From Word-Sense to Domain (to be completed)
Sinica BOW English-Chinese Lookup Search by lexical lemma Top Level POS + Frequency + Available Resources
Resources includes wordnets, lexica, and corpora. If the chosen lemma appear in any of the resources, user can click to hyperlink to outside sources, or expanded the relevant data onsite
Wordnet Level List of all senses of the chosen lemma
From Sense to Knowledge
Sinica BOW: combining Wordnet and Ontology
詞義(Sense) 4: 魚兒 領域(Domain)一般(General)
詞類(POS)名詞(Noun) 解釋(Explanation) 翻譯(Translation) 同義詞集(Synset) (整體)部件詞(Part meronym) 上位詞(Hypernym) 下位詞(Hyponym) (成員)群體詞(Member holonym) SUMO:魚類(fish)
From Hyponyms to Class Members
下位詞(Hyponym)
food_fish • 下位詞(Hyponym) snapper sole panfish shad sardine trout tuna sea_bass groundfish herring salmon whitefish
game_fish rough_fish chondrichthian bony_fish mouthbreeder
Linguistic ontology 上位詞
#1
反義詞
(整體)部件詞 同義詞1
檢索詞
同義詞n
(部件)整體詞 下位詞 參見
….等詞義關係
Linguistic ontology 上位詞
#2
反義詞
(整體)部件詞 同義詞1 同義詞n 上位詞 反義詞 反義詞 (部件)整體詞 (整體)部件詞 (整體)部件詞 同義詞1 同義詞1 同義詞n 檢索詞 下位詞 檢索詞 (部件)整體詞 參見 (部件)整體詞 檢索詞 上位詞
下位詞
下位詞
參見
參見
以其他詞義關係的詞彙為核心
Domain ontology 領域
領域
領域
領域 領域
….等領域關係
Linguistic ontology +Domain ontology 上位詞 領域
反義詞
(整體)部件詞 檢索詞 領域
同義詞1 領域 (部件)整體詞
下位詞 領域 參見
領域
同義詞n
A Linguistic KnowledgeNet: Mutli-index based on linguistic knowledge
Query by Domain
Chinese/English Search
WordNet E-C corres.
SUMO Lookup
Query by Prefix or Suffix List of Chinese prefixes and suffixes
CKIP Lexicon of Prefix/Suffix
Integrating KnowledgeBases
A network of upper ontology (e.g. SUMO) and domain ontologies will provide the basis for synergy of knowledge and provide robust basis for versatile knowledge engineering
領域A
Approach: lexicons from corpora and lexica • sub-lexicon
SUMO WordNet
領域B
領域C
• domain lexicon
Domsin Ontology Conceptual Links
領域D
領域E