Authors: Neculai Curteanu, Svetlana Cojocaru, Eugenia Burcă
Keywords: new approach to dictionary entry parsing; the parsing method of SCD configurations; parsing the largest Romanian, German, French, and Russian dictionaries; lexicographic modeling.
Abstract
This paper extends the experience gained from parsing five other considerably different dictionaries (the largest Romanian, French, and German ones) to DMLRL (Dictionary of Modern Literary Russian Language) [18], using the optimal and portable parsing method of SCD (Segmentation-Cohesion-Dependency) configurations [7], [11], [15]. The purpose of the present paper is to elaborate the lexicographic modeling of DMLRL, which necessarily precedes the sense-tree parsing of dictionary entries. Three SCD configurations are described: the first separates the lexicographic segments in a DMLRL entry; the second concentrates on the SCD marker classes and their hypergraph hierarchy for DMLRL primary and secondary senses; and the third extends the same modeling process down to the atomic sense definitions and their examples-to-definitions. The dependency hypergraph of the third SCD configuration, interconnected with that of the second SCD configuration, is specified completely at the atomic sense level for the first time, going beyond the SCD configuration modeling of the five other dictionaries [15], [14]. Numerous examples from DMLRL and a comparison with the DLR-DAR Romanian thesaurus-dictionary support the proposed DMLRL lexicographic modeling.
Neculai Curteanu,
Institute of Computer Science, Romanian Academy, Iaşi Branch, Str. Gh. Asachi, Nr. 3, 700483 Iaşi, România E-mail:
Svetlana Cojocaru, Eugenia Burcă,
Institute of Mathematics and Computer Science,
Academy of Sciences of Moldova,
Str. Academiei nr. 5, Chişinău, MD 2028, R. Moldova
E-mail: ,