Introduction    |    Members    |    Research    |    Resources    |    Service    |    Information
  Research Overview
  Research Area
- Natural Language Processing
- Ontology
- Semantic Web
  Major Projects
- COREONTO
+ Project Introduction
+ Project Background
+ Project Detail
+ Project Result
- BORA
+ Project Introduction
+ Project Background
+ Project Detail
+ Project Result
- KORTERM
+ Project Introduction
+ Project Background
+ Project Detail
+ Project Result
 
HOME > Research >     
 
Language Resource

  • What is the Language Resource?

  • The term ‘language resource’ means all language-related data obtained by processing language from all human linguistic activities. Language resource includes spoken language primitive corpus, written language primitive corpus, analysis corpus, electronic dictionary, WordNet and ontology. Also, BORA is one of the language resource.

  • What is the Language Resource for?

  • Language Resources are relevant to all language-based systems such as mechanical translation/ interpretation, Korean language education system and guide system.  For example, a sentence analysis device is an element program necessary for most of natural language processing systems.  In order to develop this system, it is necessary to first extract the co-occurrence information from the primitive corpus and then to extract the morpheme analysis statistical model and co-occurrence information from the morpheme analysis corpus.  As we can see, many language resources are required to develop software.  Such language resources will be used in all information support systems on the internet such as electronic transactions, electronic library and electronic news.

  • Which Language Resource do we have?

  • BORA has: multilingual primitive corpus of Korean, English, Chinese and Japanese; analysis corpus; and special electronic dictionaries based on parts of speech, proper nouns, compound words and conjugations.  In particular, the recently developed multilingual WordNet, based on basic Korean vocabulary lists and concepts, is an advanced language resources responding to international demands. Such resources need, however, continuous improvement and management to increase their value.