Version 1.0 – released March 26th 2018
The first version of the corpus, released after the conclusion of the KORBA 1 project.
Version 1.1 – released September 13th 2018
The corpus was reindexed, causing some segments to be interpreted differently from previous versions. Some changes have also been made to the region tables.
Version 1.2 – released December 5th 2019
Changes to segmentation and transcription of foreign language segments.
Version 2.0 (Manually annotated subcorpus) – released March 21st 2023
The corpus contains samples of texts transliterated by the KORBA 1 and KORBA 2 projects. The samples of texts transliterated by the KORBA 1 project have been adapted to updates in the tagset.
Version 2.0 – released January 13th 2024
The first version of the new, expanded edition of the corpus, which is the result of the project KORBA 2.
Version 2.1 – released January 31st 2024
The metadata set has been updated and minor corrections have been made to the text dating.
Version 2.2 – released February 19th 2024
The duplicate text has been removed, so the number of tokens has changed; minor corrections have been made to text dating and metadata; user manual updated.