古典中国語コーパスの応用研究（2024年度）Applied Study of Classical Chinese Corpora

古典中国語コーパスの応用研究（2024年度）
Applied Study of Classical Chinese Corpora

班長：安岡　孝一
Coordinator: YASUOKA, Koichi

共同研究拠点＞共同研究一覧（2024年度）A,B,C へ

2010年以来、われわれが構築を続けてきた古典中国語（漢文）コーパスは、Conditional Random Fields を用いた形態素解析を古典中国語に適用した上で、Universal Dependencies にもとづく依存文法解析を適用するものである。これにより、古典中国語における単語と単語の係り受け関係を、自動で解析できるようになった。

本共同研究では、古典中国語に対する形態素解析と依存文法解析を応用して、現代中国語や近代日本語・韓国語・タイ語などの周辺諸言語に対し、文法解析手法を研究・開発する。

Since 2010, we have been developing Classical Chinese Corpora. We first constructed the Corpora using MeCab-Kanbun, a morphological analyzer based on Conditional Random Fields, for Classical Chinese texts. Then we applied SuPar-Kanbun, a dependency parser based on Universal Dependencies, to the Corpora. Using the Corpora and pre-trained language models, we can now analyze Classical Chinese texts by Part-Of-Speech tagging and dependency-parsing.

In this study, we will enhance our method so that it can be applied to other languages, such as Mandarin Chinese, Thai, Japanese, and Korean.

研究期間：2023年4月～2026年3月
Research period: Apr.2023 – Mar.2026

2022年07月23日更新
Last updated on July 23, 2024

京都大学人文科学研究所

古典中国語コーパスの応用研究（2024年度）
Applied Study of Classical Chinese Corpora

共同研究拠点

共同利用拠点