Du lette etter:

yandex dataset

Public datasets for Machine Learning and Data Science
https://toloka.ai/datasets
This dataset, commissioned by the Yandex Business Directory, contains 10,000 photos of organization information signs shot in the Russian Federation along with the INN (taxpayer ID) and OGRN (Primary State Registration Number) codes shown on these signs. Toloka was used for both capturing photos and recognizing INN and OGRN codes. Toloka
Public datasets for Machine Learning and Data Science
https://toloka.ai › datasets
This dataset, commissioned by the Yandex Business Directory, contains 10,000 photos of organization information signs shot in the Russian Federation along ...
GitHub - yandex-research/shifts: This repository contains ...
https://github.com/yandex-research/shifts
18.07.2021 · If you have any questions about the Shifts Dataset, the paper or the benchmarks, please contact am969@yandex-team.ru . Dataset Download And Licenses License The Shifts dataset is released under a mixed license. Weather Prediction The Shifts Weather Prediction Dataset is released under CC BY NC SA 4.0 license.
GitHub - yandex-research/shifts: This repository contains ...
github.com › yandex-research › shifts
The Shifts Dataset contains curated and labelled examples of real, 'in-the-wild' distributional shift across three large-scale tasks. Specifically, it contains a tabular weather prediction task, machine translation, and Vehicle Motion Prediction. Dataset shift is ubiquitous in all of these tasks and ...
Yandex.Metrica Data | ClickHouse Documentation
https://clickhouse.com › docs › me...
Anonymized Yandex.Metrica Data Dataset consists of two tables containing anonymized data about hits (hits_v1) and visits.
Yandex.Toloka Open Datasets
https://research.yandex.com/datasets/toloka
The dataset was provided by Yandex Business Directory. How we collected the data First we launched a task in the Yandex.Toloka mobile app that asked performers to go to a specific address marked on the map, find the organization, …
Training Data on Demand
https://trainingdata.ru › ...
Having high quality training data is necessary for training a neural network. Our team is ready to take on all the responsibilities for creating a dataset for ...
Yandex publishes industry's largest AV dataset, launches ...
https://medium.com › yandex-publ...
That's why Yandex Self-Driving Group just released the largest AV dataset in the industry to date. It includes 600,000 scenes (or more than ...
Coronavirus. Dashboard and data | Yandex.Cloud - Marketplace
https://cloud.yandex.com/en/marketplace/products/yandex/coronavirus...
A dashboard with the latest statistics on the spread of the coronavirus around the world and in Russia, as well as the self-isolation index. Datasets were prepared using data from John Hopkins University, стопкоронавирус.рф site, and Yandex services. Self-isolation data is made available under license CC-BY-SA 3.0 and requires a link to the source: https://datalens.yandex/covid19.
Benchmarks for Billion-Scale Similarity Search - Yandex
research.yandex.com › datasets › biganns
To encourage future developments of scalable similarity search algorithms, Yandex releases two billion-scale datasets that can serve as representative benchmarks for researchers from the machine learning and algorithmic communities interested in efficient similarity search. Both datasets are released under the CC BY 4.0 license. The Deep1B ...
Yandex.Toloka Open Datasets
research.yandex.com › datasets › toloka
Yandex.Toloka Open Datasets. Toloka is a major source of human-marked data for machine learning tasks. Toloka has thousands of performers making millions of evaluations in hundreds of tasks every single day. Research and experiments related to machine learning always require a large volume of high-quality data.
Benchmarks for Billion-Scale Similarity Search - Yandex
https://research.yandex.com/datasets/biganns
To encourage future developments of scalable similarity search algorithms, Yandex releases two billion-scale datasets that can serve as representative benchmarks for researchers from the machine learning and algorithmic communities interested in efficient similarity search. Both datasets are released under the CC BY 4.0 license. Deep1B
yandex-research/shifts: This repository contains data readers ...
https://github.com › yandex-research
This repository contains data readers and examples for the three tracks of the Shifts Dataset and the Shifts Challenge. - GitHub - yandex-research/shifts: ...
Personalized Web Search Challenge | Kaggle
https://www.kaggle.com/c/yandex-personalized-web-search-challenge/data
Personalized Web Search Challenge | Kaggle. We use cookies on Kaggle to deliver our services, analyze web traffic, and improve your experience on the site. By using Kaggle, you agree to our use of cookies. Got it. Learn more.
Yandex Captcha | Kaggle
https://www.kaggle.com/imneonizer/yandex-captcha
16.09.2020 · Dataset. Yandex Captcha. Nitin Rai • updated a year ago (Version 1) Data Tasks Code Discussion Activity Metadata. Download (55 MB) New Notebook. more_vert. business_center. Usability. 2.5. Tags. arts and entertainment, arts and entertainment. subject > arts and entertainment.
AOL4PS: A Large-scale Data Set for Personalized Search | Data ...
direct.mit.edu › dint › article
Oct 25, 2021 · Yandex data set consists of information on anonymized user identifiers, queries, query terms, URLs, URL domains, and clicks. Although Yandex provides a large-scale data set, its anonymous identifier processing of queries and URLs prevents researchers from accessing the raw text.
Datasets for recommender systems research - Shuai Zhang
https://shuaizhang.tech/posts/2019/08/blog-post-3
01.08.2019 · Datasets for recommender systems research. 5 minute read. Published: August 01, 2019 In this post, I will present some benchmark datasets for recommender system, please note that I will only give the links of those datasets.
Yandex.Toloka Open Datasets
https://research.yandex.com › datasets › toloka
Yandex.Toloka Open Datasets. Toloka is a major source of human-marked data for machine learning tasks. Toloka has thousands of performers making millions of ...
Yandex publishes industry’s largest AV dataset, launches ...
medium.com › yandex-self-driving-car › yandex
Jul 22, 2021 · For a long time datasets for this type of research were very limited. As a company testing its AV technology in six cities, three countries and in all types of weather conditions, Yandex can offer ...
Personalized Web Search Challenge | Kaggle
https://www.kaggle.com › yandex-...
It provides a fully anonymized dataset shared by Yandex, which has anonymized user ids, queries, query terms, urls, url domains and clicks.
Datasets for recommender systems research - Shuai Zhang
shuaizhang.tech › posts › 2019
Aug 01, 2019 · Yandex: A click dataset for personalized Web search challenge from Yandex; Avito: A dataset of contextual search ad clicks from Avito; CVR datasets. YooChoose: A sequence of click and purchase events in an e-commerce website from YooChoose; AliCCP: A click dataset gathered from the recommender system in Taobao