|
HOME > Introduction > BORA |
|
|
|
|
|
|
|
1. What is the Bank of Resource for language and Annotation (BORA)?
The Bank of Resource for language and Annotation (BORA) is supported as a special research materials bank to allow researchers to efficiently obtain research material that is recognized by Korean Science and Engineering Foundation as requiring state-level sponsorship for its management and distribution. Designated as a special research materials bank, the BORA was established to extensively and systematically develop, manage, and distribute language resources essential to the development of language and knowledge infra-software that will serve as the basis for the information era of the future.
A language resource bank literally provides language resources. This still-unfamiliar resource includes primitive and analysis corpora of spoken and written languages, all kinds of electronic dictionaries, wordnets , ontologies , and other language-related resources, processed and developed in diverse forms from any pronouncements resulting from human use of languages.
2. Purpose of BORA
In today¡¯s world of knowledge and information, language engineering technology that enables the efficient search and processing of information through the mechanical processing of languages and the speedy distribution of information is emerging as a nation¡¯s industrial competitive factor. A language resource is considered an understructure that is essential to the development of language engineering technology, and can be used as an important resource for language-based systems like machine translation and interpretation, information search, Korean language education system, and guide system. For example, in order to develop a syntax analyzer, an essential component of most natural language processing systems, the co-occurrence information extracted from a primitive corpus and morphological analyzer probability model is learned, and the co-occurrence information extracted from a morphological analytic corpus must be used. Likewise, a number of [different] language resources are required in developing a single piece of software. Recognizing the importance of language resources, the Bank of Resources for Language and Annotation was established to develop, manage and distribute the resources systematically and extensively.
1) Uses of Language Resources
Languages resources are used as an essential resource for all the systems based on languages, such as mechanical translation and interpretation, information search, Korean language education system, and guide system. For example, to develop a syntax analyzer, an essential component for most natural language processing systems, co-occurrence information is extracted from a primitive corpus and a morphological analyzer probability model is learned, and additional co-occurrence information is extracted from a morphological analytic corpus and used. Likewise, to develop a single piece of software, a number of language resources are required.
2) Types of Resources Available
For Korean, English, Chinese, Japanese, and other languages, the Bank currently holds primitive and analytic corpora, electronic dictionaries divided according to parts of speech, specialized dictionaries for proper nouns, compound words, case frames, and various natural language processing software based on these resources. Most notably, the recently created basic Korean lexicon list and the concept-based multi-lingual word-net are advanced language resources that meet global needs. However, these resources are not complete achievements, and require constant improvement and maintenance to increase their value.
3. History of the BORA
Date |
Major Developments |
2002 |
Signed Contract regarding Supply of Language Resource through the ELDA |
2002 |
Signed Contract regarding the Project on Supplying Language Resource through Korea Science and Engineering Foundation |
2002 |
Appointed as the International ISO/TC37/SC4 Language Resource Application Management Organization |
2003 |
Established Bank of Resources for Language and Annotation. Designated by the State as a special research material bank sponsored by Korea Science and Engineering Foundation. |
2003 |
East Asia Forum on Terminology Held |
2003 |
Conference on Standardizing North and South Korea¡¯s Language Information Industry Held (in Beijing) |
2004 |
Elected as the host of IJCNLP (International Joint Conference of Natural Language Processing) |
2005 |
IJCNLP Held (Jeju) |
2005.11. |
International ISO/TC37 SC4 Work Group Conference Held (Jeju) |
2006.01. |
GWC (Global Wordnet Conference) Held (Jeju) |
2007.05. |
Elected as the Directing Center for Frontier Semantic Web Research Centers |
2007.1. |
ISWC (International Semantic Web Conference) Held (Busan) – Elected as the Program Chair |
4. Project Introduction
For more detailed information on the Bank of Resource for language and Annotation, go to [Research] --> [Major Projects] --> [BORA] | | |
|
|
|
|
|