Scientific Conferences of Ukraine, 4TH INTERNATIONAL ONLINE CONFERENCE ‘CORPORA AND DISCOURSE’

Font Size: 
UNIVERSAL DEPENDENCIES – MORPHOSYNTACTICALLY ANNOTATED CORPORA FOR THE WORLD’S LANGUAGES
Joakim Nivre

Last modified: 2026-01-24

Abstract


Universal Dependencies (UD) is a project developing cross-linguistically consistent morphosyntactic annotation for many languages, with the goal of facilitating multilingual research in natural language processing and linguistics (Nivre et al., 2016; Nivre et al., 2020; de Marneffe et al., 2021).

Since the project started in 2014, morphosyntactically annotated corpora have been developed for 165 languages through the joint efforts of over 600 researchers around the world.

As a speaker at the 4th International Online Conference ‘CORPORA AND DISCOURSE’ to be held on November 26, 2024, I will give a brief overview of the UD annotation scheme, the UD data repository, and the UD research community. I will conclude with a few words on the future challenges of UD.

References

  1. Marie-Catherine de Marneffe, Christopher Manning, Joakim Nivre, Daniel Zeman (2021). Universal Dependencies. Computational Linguistics, vol. 47, no. 2, pp. 255-308.
  2. Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Yoav Goldberg, Jan Hajič, Christopher D. Manning, Ryan McDonald, Slav Petrov, Sampo Pyysalo, Natalia Silveira, Reut Tsarfaty, Daniel Zeman (2016). Universal Dependencies v1: A Multilingual Treebank Collection. In Proceedings of the 10th International Conference on Language Resources and Evaluaiton (LREC 2016).
  3. Joakim Nivre, Marie-Catherine de Marneffe, Filip Ginter, Jan Hajič, Christopher Manning, Sampo Pyysalo, Sebastian Schuster, Francis Tyers, Daniel Zeman (2020). Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection. In Proceedings of the 12th International Conference on Language Resources and Evaluation (LREC 2020), pp. 4034-4043.

Full Text: PDF