Introduction    |    Members    |    Research    |    Resources    |    Service    |    Information
  Software
- Natural Language Processing
- CoreOnto Workbench
- CoreOnto OpenAPI
- Digital Stickers
  Language Resources
- Corpus
- Images
- Dictionaries
- Voices
- Concept Categories
- Evaluation Datasets
- CoreNet
- CoreOnto
  Template
- SWRC
- COREONTO
- KORTERM
- BORA
- CILAB
 
HOME > Resources > Corpus    
 
Corpus



Language Resources Catalogue

     

Corpus (Qualified Corpus)

Classification Mark

Title

Info

Language

Count

Year of creation

Corpus 1

Primitive corpus

Korean

70,000,000 phrases

1997

Corpus 2

Aligned morpheme analysis corpus

Korean

1,000,000 phrases

2000

Corpus 3

Automatic morpheme analysis corpus

Korean

40,000,000 phrases

1997

Corpus 4

Aligned sentence analysis corpus

Korean

3,000 sentences

1998

Corpus 5

Manual sentence analysis corpus

Korean

30,000 sentences

2000

Corpus 6

Chinese morpheme analysis corpus

Korean

10,000 sentences

2001

Corpus 7

Chinese-English-Korean multilingual corpus

Chinese, English, Korean

60,000 sentences

2000

Corpus 8

Chinese-English multilingual corpus

Chinese, English

60,000 sentences

2005

Corpus 9

Chinese-Korean multilingual corpus

Chinese, Korean

60,000 sentences

2005

Corpus 10

English-Korean multilingual corpus

English, Korean

60,000 sentences

2005

Corpus 11

Newspaper corpus (Hankyoreh)

Korean

620 files

2005

Corpus 12

Newspaper corpus (Donga-Korean, English, Japanese, Chinese)

Korean

1791 files

2005

Processed Resources

Classification Mark

Title

Info

Language

Count

Year of creation

PR1

Basic noun definition

Korean

29,038 words (57,391 meaning)

2004

PR2

Basic words definition corpus

Korean

29,042 words (57,400 meaning)

2003

PR3

Co-occurrence information data

Korean, English

35,731,121 (Korean)
12,504,329 (English)

2002

PR4

Word formation unit alignment

Korean

23,914

2003

PR5

Professional corpus- medicine

Korean

219,967 sentences

2000

PR6

Professional corpus- architectural engineering

Korean

3,681 sentences

2000

PR7

Professional corpus- economics

Korean

27,690 sentences

2000

PR8

Professional corpus- engineering

Korean

13,627 sentences

2000

PR9

Professional corpus- metal engineering

Korean

7,468 sentences

2000

PR10

Professional corpus- mechanical engineering

Korean

50,739 sentences

2000

PR11

Professional corpus- physics

Korean

106,547 sentences

2000

PR12

Professional corpus- biology

Korean

83,519 sentences

2000

PR13

Professional corpus- electronic engineering

Korean

12,887 sentences

2000

PR14

Professional corpus- computer science

Korean

8,679 sentences

2000

PR15

Professional corpus- chemistry

Korean

66,652 sentences

2000

PR16

Professional corpus- chemical engineering

Korean

19,546 sentences

2000

PR17

Professional corpus- environmental engineering

Korean

21,260 sentences

2000

PR18

Professional corpus- frequency of adverb

Korean

2,556 words

2003

PR19

Professional corpus- frequency of noun

Korean

4313,557 words

2003

PR20

Professional corpus- frequency of adjective

Korean

1,680 words

2003

PR21

Professional corpus- frequency of verb

Korean

20,024 words

2003

PR22

Professional corpus- conjugation of verb

Korean

135,329

2003

PR23

Professional corpus- co-occurrence information

Korean

16,464,054

2003

PR24

Single-syllable nouns

Korean

1,025 words

2003