Introduction    |    Members    |    Research    |    Resources    |    Service    |    Information
  Software
- Natural Language Processing
- CoreOnto Workbench
- CoreOnto OpenAPI
- Digital Stickers
  Language Resources
- Corpus
- Images
- Dictionaries
- Voices
- Concept Categories
- Evaluation Datasets
- CoreNet
- CoreOnto
  Template
- SWRC
- COREONTO
- KORTERM
- BORA
- CILAB
 
HOME > Resources > Language Resources    
 

Language Resources

 



Language Resources Catalogue

     

 
Corpus

Corpus (Qualified Corpus)

Classification Mark

Title

Info

Language

Count

Year of creation

Corpus 1

Primitive corpus

Korean

70,000,000 phrases

1997

Corpus 2

Aligned morpheme analysis corpus

Korean

1,000,000 phrases

2000

Corpus 3

Automatic morpheme analysis corpus

Korean

40,000,000 phrases

1997

Corpus 4

Aligned sentence analysis corpus

Korean

3,000 sentences

1998

Corpus 5

Manual sentence analysis corpus

Korean

30,000 sentences

2000

Corpus 6

Chinese morpheme analysis corpus

Korean

10,000 sentences

2001

Corpus 7

Chinese-English-Korean multilingual corpus

Chinese, English, Korean

60,000 sentences

2000

Corpus 8

Chinese-English multilingual corpus

Chinese, English

60,000 sentences

2005

Corpus 9

Chinese-Korean multilingual corpus

Chinese, Korean

60,000 sentences

2005

Corpus 10

English-Korean multilingual corpus

English, Korean

60,000 sentences

2005

Corpus 11

Newspaper corpus (Hankyoreh)

Korean

620 files

2005

Corpus 12

Newspaper corpus (Donga-Korean, English, Japanese, Chinese)

Korean

1791 files

2005

Processed Resources

Classification Mark

Title

Info

Language

Count

Year of creation

PR1

Basic noun definition

Korean

29,038 words (57,391 meaning)

2004

PR2

Basic words definition corpus

Korean

29,042 words (57,400 meaning)

2003

PR3

Co-occurrence information data

Korean, English

35,731,121 (Korean)
12,504,329 (English)

2002

PR4

Word formation unit alignment

Korean

23,914

2003

PR5

Professional corpus- medicine

Korean

219,967 sentences

2000

PR6

Professional corpus- architectural engineering

Korean

3,681 sentences

2000

PR7

Professional corpus- economics

Korean

27,690 sentences

2000

PR8

Professional corpus- engineering

Korean

13,627 sentences

2000

PR9

Professional corpus- metal engineering

Korean

7,468 sentences

2000

PR10

Professional corpus- mechanical engineering

Korean

50,739 sentences

2000

PR11

Professional corpus- physics

Korean

106,547 sentences

2000

PR12

Professional corpus- biology

Korean

83,519 sentences

2000

PR13

Professional corpus- electronic engineering

Korean

12,887 sentences

2000

PR14

Professional corpus- computer science

Korean

8,679 sentences

2000

PR15

Professional corpus- chemistry

Korean

66,652 sentences

2000

PR16

Professional corpus- chemical engineering

Korean

19,546 sentences

2000

PR17

Professional corpus- environmental engineering

Korean

21,260 sentences

2000

PR18

Professional corpus- frequency of adverb

Korean

2,556 words

2003

PR19

Professional corpus- frequency of noun

Korean

4313,557 words

2003

PR20

Professional corpus- frequency of adjective

Korean

1,680 words

2003

PR21

Professional corpus- frequency of verb

Korean

20,024 words

2003

PR22

Professional corpus- conjugation of verb

Korean

135,329

2003

PR23

Professional corpus- co-occurrence information

Korean

16,464,054

2003

PR24

Single-syllable nouns

Korean

1,025 words

2003

 

Images

Image Database

Classification Mark

Title

Info

Language

Count

Year of creation

ICKL

Off-line Korean Handwriting Database

Korean

1,200 Korean words

1997

 

Voices

Speech Database

Classification Mark

Title

Info

Language

Count

Year of creation

PBW1

Korean voice database 1
(70 speakers & 2 news readers)

Korean

450 phrases,
2,000 phrases

1997

PBW2

Korean voice database 2
(36 localities, men/women)

Korean

36 words

2000

PBW3

Korean voice database 3
(2 news readers & 70 speakers)

Korean

1 paragraph

1997

PBW4

Korean voice database 4
(70 speakers four times & 2 news readers twice)

Korean

32 cardinal numbers
9 definitives

1998

PBW5

Korean voice database 5
(70 speakers four times & 2 news readers twice)

Korean

4 simple numbers
35 cardinal numbers

2000

PBS1

Korean voice database 6
(20 speakers once)

Korean

539 sentences
everyday sentences 50 sets

2001

PRW

Korean voice database 7

Korean

32 cardinal numbers
4 simple numbers
1,620 kinds

2000

 

Dictionaries

Electronic Dictionaries

Classification Mark

Title

Info

Language

Count

Year of creation

ED1

Basic words dictionary

Korean

54,796 words

2002

ED2

Proper noun dictionary

Korean

37,000 words

2000

ED3

Compound noun dictionary

Korean

26,000 words

2000

CBTable

Concept structure dictionary

Korean

2,954 concepts (53,628 meaning)

2002

ED4

Verb conjugation dictionary

Korean

4,560 words

2000

ED5

Compound verb logical structure dictionary

Korean

4,297

2000

ED6

Adjective phrase information dictionary

Korean

5,930

1996

ED7

Korean-Chinese dictionary

Korean, Chinese

164,000 words

1998

ED8

Chinese-Korean dictionary

Korean, Chinese

200,000 words

1998

ED9

Korean-Chinese translated words

Korean, Chinese

51,804 words

2001

ED10

Chinese-Korean corpus sentence pattern

Korean, English, Chinese

60,000

2000

 

CoreNet

CoreNet

Classification Mark

Title

Info

Language

Count

Year of creation

CBL1

CoreNet Korean

Korean, Chinese, Japanese

23,938 words connected to
2,937 CoreNet

2002

CBL2

Multilingual Concept Structure (Korean, Chinese, Japanese)

Korean, Chinese, Japanese

2,937 Concepts

2002

CBL3

CoreNet Korean Noun

Korean

21,401 words (51,607 meanings)

2001

CBL4

CoreNet Korean Verb

Korean

1,758 words (5,290 meanings)

2002

CBL5

CoreNet Korean Adjective

Korean

813 words (2,801 meanings)

2002

CBL6

CoreNet Korean Verb Conjugation

Korean, Japanese

406 words (957 meanings)

2000

CBL7

CoreNet Korean Adjective Conjugation

Korean

759 words (1,109 meanings)

2002

CBL8

CoreNet Chinese

Chinese, Korean

2,937 words (21,015 meanings)

2003

CBL9

CoreNet Chinese Noun

Chinese, Korean

20,647 words

2003

CBL10

CoreNet Chinese Verb Conjugation

Chinese, Korean

288 words

2003

CBL11

CoreNet Chinese Adjective Conjugation

Chinese, Korean

80 words

2003

 

Concept Categories

Classified Concept Table

Classification Mark

Title

Info

Language

Count

Year of creation

CCT

Japanese, Korean, Chinese, English concept classification

English, Korean, Chinese, English

62,281 concepts

2000

 

Evaluation Datasets

Test Suite

Classification Mark

Title

Info

Language

Count

Year of creation

TS1

Q&A evaluation set

Korean

15,036

2001

TS2

Mechanical translation evaluation set

Korean

3,000

2001

TS3

English-Korean phonetic notation evaluation set

English, Korean

7,186 pairs

2000

TS4

HKIB-20000/HKIB-40075 Korean Text Categorization Test Collections

Korean

60,075 documents

2000

 

CoreOnto Datasets

CoreOnto

Classification Mark

Title

Info

Language

Count

Year of creation

CO1

SUMO_CORENET
- SUMO와 CORENET 통합 버전

 

English

Named Class: 3,622
Object Properties : 281
DataType Properties : 7

2008

CO2

INSEPC
- KAIST-INSEPC 한글 포함 버전

 

English

Named Class: 9,547

2008

CO3

SMCI
- SUMO, MILO, Computing Services, INSPEC통합버전

 

English

Named Class: 12,702
Object P.: 305
DataType P.: 7

2008

CO4

KAIST COROENTO
- COROENTO UPPER

 

English

Named Class: 7,769
Object P.: 97
DataType P.: 127

2008

CO5

KAIST COROENTO
- COROENTO ROBOT

 

English

Named Class: 2,333
Object P.: 282
DataType P.: 7

2008

CO6

KAIST COROENTO
- COROENTO COMMUNICATION

 

English

Named Class: 3,511
Object P.: 282
DataType P.: 7

2008

CO7

KAIST COROENTO
- COROENTO INTELLIGENT-ROBOT

 

English

Named Class: 3,753
Object P.: 295
DataType P.: 7

2008

CO8

KAIST COROENTO
- COROENTO TC-BC

 

English

Named Class: 12,535
Object P.: 50

2008

CO9

KAIST COROENTO
- COROENTO ROBOT

 

English

Named Class: 4,773
Object P.: 1
DataType P.: 1

2008

CO10

KAIST COROENTO
- COROENTO ROBOT-DMB

 

English

Named Class: 20,787
Object P.: 351
DataType P.: 19

2008

CO11

KAIST COROENTO
- COROENTO DMB

 

English

Named Class: 7,368
Object P.: 1

2008

CO12

KAIST COROENTO
- COROENTO HOME NETWORK

 

English

Named Class: 4,410
Object P.: 290
DataType P.: 7

2008

CO12

KAIST COROENTO
- COROENTO IT PEOPLE

 

English

Named Class: 33
Object P.: 28
DataType P.: 9

2008

CO13

KAIST COROENTO
- COROENTO INSTANCES-1

 

English

Instance

2008

CO14

KAIST COROENTO
- COROENTO INSTANCES-2

 

English

Instance

2008

CO15

KAIST COROENTO
- COROENTO ROBOT STANDARDS

 

English

Named Class: 10,655
Object P.: 31
DataType P.: 86

2008