Научно-учебная группа «Мультиязыковая база данных синонимов» — Национальный исследовательский университет «Высшая школа экономики»

Нашли опечатку?
Выделите её, нажмите Ctrl+Enter и отправьте нам уведомление. Спасибо за участие!
Сервис предназначен только для отправки сообщений об орфографических и пунктуационных ошибках.

Научно-учебная группа завершила свою работу

О проекте

Цель проекта – сравнение теоретических и компьютерных подходов к изучению синонимии и семантических полей. В проекте используется лексикографический, а также векторно-статистический подход к описанию указанных явлений. В качестве результата ожидается создание базы данных синонимов для нескольких семантических полей в ряде европейских языков, разработка статистических критериев оценки лингвоспецифичности, создание лексических тренажеров для изучения иностранных языков.

About

The main research aim of the current project is to align theoretical and computer methodologies of establishing boundaries and describing structures of semantic fields across languages, in an attempt to advance theoretical knowledge of semantics and lexical typology, as well as with the practical purpose of developing interactive lexical exercises for foreign language learners.

The project puts forward the following hypotheses: 1) semantic fields of concrete and abstract lexicon have different structures and might therefore exhibit different cross-linguistic behavior. It is expected that concrete semantic fields will demonstrate greater cross-linguistic universality than abstract fields; 2) it is further hypothesized that fields that display greater cross-linguistic universality would present fewer difficulties for a language learner than those that are more at variance across languages; 3) methods of theoretical and computer semantics will produce different results in establishing the relation of synonymy between words. It is expected that computer methods are quite efficient in capturing close synonymy (as in surprise—astonishment), yet would be unable to reconstruct the entire composition of a semantic field in the way theoretical models do, based on the analysis of differences and similarities of semantic structures of synonyms (as in surprise-astonishment-amazement-wonder-marvel-stupefaction). Instead, the prediction is that computer models would establish the semantic relations of words on the levels of antonymy (as in lie-truth) and, in particular, co-hyponymy (as in surprise-delight-admiration-annoyance-disgust); 4) the linguospecificity of words is a parameter established for each pair of languages under comparison and can be expressed as a numerical index calculated on the basis of the number of possible translations from the source language into the target language.

The project employs methods of theoretical semantics, lexical typology and vector semantic models to tackle the issues under consideration.

The expected results include compiling a database of synonyms for several semantic fields in several European languages that incorporates and compares results received by theoretical and computer methods, developing statistical criteria for assessing linguospecificity, and creating synonyms-oriented interactive exercises in several European languages to train the lexical competence of learners.

История проекта

Научно-учебная группа «Мультиязыковая база данных синонимов: теоретические и компьютерные модели» создана в 2016 году в рамках конкурса научно-исследовательскихпроектов научно-учебных групп Научного фонда НИУ ВШЭ. Инициаторами создания группы были преподаватели Школы лингвистики Факультета гуманитарных наук доценты Валентина Юрьевна Апресян, Анастасия Сергеевна Выренкова, Борис Валерьевич Орехов, Татьяна Исидоровна Резникова. В.Ю. Апресян в течение многих лет занималась теоретической семантикой и обладает богатым опытом в области лексикографирования синонимии. Т.И. Резникова – известный специалист в области лексической типологии, и она исследует проблему синонимии и семантических полей в типологическом аспекте. Б.В. Орехов развивает компьютерно-статистические методы оценки синонимии, а также занимается составлением корпусов текстов и другими лингвистическими базами данных; А.С. Выренкова имеет научный и прикладной опыт в области развития лексической компетенции у изучающих иностранные языки, в том числе опыт создания синонимических тренажеров. Членами группы являются студенты бакалавриата, занимающиеся исследованием синонимии и семантических полей в разных языках, созданием баз данных и языковых корпусов, семантическими векторами.