Versions


Version 1.0 – released March 26th 2018

The first version of the corpus, released after the conclusion of the KORBA 1 project.

Version 1.1 – released September 13th 2018

The corpus was reindexed, causing some segments to be interpreted differently from previous versions. Some changes have also been made to the region tables.

Version 1.2 – released December 5th 2019

Changes to segmentation and transcription of foreign language segments.

Version 2.0 (Manually annotated subcorpus) – released March 21st 2023

The corpus contains samples of texts transliterated by the KORBA 1 and KORBA 2 projects. The samples of texts transliterated by the KORBA 1 project have been adapted to updates in the tagset.

Version 2.0 – released January 13th 2024

The first version of the new, expanded edition of the corpus, which is the result of the project KORBA 2.

Version 2.1 – released January 31st 2024

The metadata set has been updated and minor corrections have been made to the text dating.

Version 2.2 – released February 19th 2024

The duplicate text has been removed, so the number of tokens has changed; minor corrections have been made to text dating and metadata; user manual updated.