• A
  • A
  • A
  • ABC
  • ABC
  • ABC
  • А
  • А
  • А
  • А
  • А
Regular version of the site
Article
Building an Open Corpus and a Morphological Parser for Corpus Annotation for Standard Dargwa

Svetlana Iu. Toldova, Elena O. Sokur.

Journal of Siberian Federal University. Series: Humanities & Social Sciences. 2024. Vol. 17. No. 5. P. 905-915.

Book chapter
SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text Detoxification

Rykov E., Zaytsev K., Anisimov I. et al.

In bk.: CLEF 2024 Working Notes. CEUR Workshop Proceedings, 2024. P. 2866-2871.

Working paper
Exploring the Effectiveness of Methods for Persona Extraction
In press

Konstantin Zaitsev.

arxiv.org. Computer Science. Cornell University, 2024

About the School

The School of Linguistics was founded in December 2014. Today, the School offers undergraduate and graduate programs in theoretical and computational linguistics. Linguistics as it is taught and researched at the School does not simply involve mastering foreign languages. Rather, it is the science of language and the methods of its modeling. Research groups in the School of Linguistics study typology, socio-linguistics and areal linguistics, corpus linguistics and lexicography, ancient languages and the history of languages. The School is also developing linguistic technologies and electronic resources: corpora, training simulators, dictionaries, thesauruses, and tools for digital storage and processing of written texts.

The School is a leader in the use of the latest IT approaches to gathering and processing language data. One of the School's key projects involves providing linguistic support for, and helping to develop the Russian National Corpus.

The School holds a series of Linguistic Expeditions each year, in which students and teachers collect material for their academic research.

It also houses the Russian Language Centre, where international students have the opportunity to learn Russian in summer schools or throughout the year.

The School is also home to the Linguistic Laboratory for Corpora Research Technologies, a Laboratory for Caucasus Languages, and a research group for academic Russian, which is developing multimedia didactic materials for teaching academic writing at HSE. There is also an ongoing research seminar, in addition to conferences and talks by guest speakers.