Advances of the Scientific School of V.L. Arlazarov in Dataset Creation and Training Sample Synthesis for Solving Modern Computer Vision Problems

Y. S. Chernyshova a,b,*, A. V. Sheshkus a,b,**, K. B. Bulatov a,b,***, and V. V. Arlazarov a,b,****

a Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Moscow, 119133 Russian Federation

b Smart Engines Service LLC, Moscow, 121205 Russian Federation

Correspondence to: * e-mail: chernyshova@smartengines.com
Correspondence to: ** e-mail: asheshkus@smartengines.com
Correspondence to: *** e-mail: kbulatov@smartengines.com
Correspondence to: **** e-mail: vva@smartengines.com

Received 20 October, 2022

Abstract—This paper reviews the scientific school of training sample synthesis and dataset creation, one of a family of scientific schools in image processing and analysis that originated in the work of a team led by Prof. V.L. Arlazarov in the 1970s. The school's researchers have obtained important fundamental and applied results and have posed new research problems. Over the years of the school's existence, the team has developed several algorithms and systems for the synthesis and augmentation of image samples. They have also created and published more than ten open annotated image datasets, including the unique MIDV dataset family, which contains synthesized images of identity documents and is the first in the world to allow a full open comparison of recognition systems for such documents.

Keywords: scientific school, image synthesis, sample augmentation, open datasets

DOI: 10.1134/S1054661823040107