Public datasets for Machine Learning and Data Science
https://toloka.ai/datasetsThis dataset, commissioned by the Yandex Business Directory, contains 10,000 photos of organization information signs shot in the Russian Federation along with the INN (taxpayer ID) and OGRN (Primary State Registration Number) codes shown on these signs. Toloka was used for both capturing photos and recognizing INN and OGRN codes. Toloka
Yandex.Toloka Open Datasets
research.yandex.com › datasets › tolokaYandex.Toloka Open Datasets. Toloka is a major source of human-marked data for machine learning tasks. Toloka has thousands of performers making millions of evaluations in hundreds of tasks every single day. Research and experiments related to machine learning always require a large volume of high-quality data.
Yandex Captcha | Kaggle
https://www.kaggle.com/imneonizer/yandex-captcha16.09.2020 · Dataset. Yandex Captcha. Nitin Rai • updated a year ago (Version 1) Data Tasks Code Discussion Activity Metadata. Download (55 MB) New Notebook. more_vert. business_center. Usability. 2.5. Tags. arts and entertainment, arts and entertainment. subject > arts and entertainment.