yandex dataset

Du lette etter:

Benchmarks for Billion-Scale Similarity Search - Yandex

research.yandex.com › datasets › biganns

To encourage future developments of scalable similarity search algorithms, Yandex releases two billion-scale datasets that can serve as representative benchmarks for researchers from the machine learning and algorithmic communities interested in efficient similarity search. Both datasets are released under the CC BY 4.0 license. The Deep1B ...

Public datasets for Machine Learning and Data Science

https://toloka.ai/datasets

This dataset, commissioned by the Yandex Business Directory, contains 10,000 photos of organization information signs shot in the Russian Federation along with the INN (taxpayer ID) and OGRN (Primary State Registration Number) codes shown on these signs. Toloka was used for both capturing photos and recognizing INN and OGRN codes. Toloka

Training Data on Demand

https://trainingdata.ru › ...

Having high quality training data is necessary for training a neural network. Our team is ready to take on all the responsibilities for creating a dataset for ...

Personalized Web Search Challenge | Kaggle

https://www.kaggle.com/c/yandex-personalized-web-search-challenge/data

Personalized Web Search Challenge | Kaggle. We use cookies on Kaggle to deliver our services, analyze web traffic, and improve your experience on the site. By using Kaggle, you agree to our use of cookies. Got it. Learn more.

Yandex publishes industry's largest AV dataset, launches ...

https://medium.com › yandex-publ...

That's why Yandex Self-Driving Group just released the largest AV dataset in the industry to date. It includes 600,000 scenes (or more than ...

Yandex.Metrica Data | ClickHouse Documentation

https://clickhouse.com › docs › me...

Anonymized Yandex.Metrica Data Dataset consists of two tables containing anonymized data about hits (hits_v1) and visits.

Personalized Web Search Challenge | Kaggle

https://www.kaggle.com › yandex-...

It provides a fully anonymized dataset shared by Yandex, which has anonymized user ids, queries, query terms, urls, url domains and clicks.

Yandex Captcha | Kaggle

https://www.kaggle.com/imneonizer/yandex-captcha

16.09.2020 · Dataset. Yandex Captcha. Nitin Rai • updated a year ago (Version 1) Data Tasks Code Discussion Activity Metadata. Download (55 MB) New Notebook. more_vert. business_center. Usability. 2.5. Tags. arts and entertainment, arts and entertainment. subject > arts and entertainment.

Yandex.Toloka Open Datasets

https://research.yandex.com/datasets/toloka

The dataset was provided by Yandex Business Directory. How we collected the data First we launched a task in the Yandex.Toloka mobile app that asked performers to go to a specific address marked on the map, find the organization, …

Yandex.Toloka Open Datasets

research.yandex.com › datasets › toloka

Yandex.Toloka Open Datasets. Toloka is a major source of human-marked data for machine learning tasks. Toloka has thousands of performers making millions of evaluations in hundreds of tasks every single day. Research and experiments related to machine learning always require a large volume of high-quality data.

GitHub - yandex-research/shifts: This repository contains ...

https://github.com/yandex-research/shifts

18.07.2021 · If you have any questions about the Shifts Dataset, the paper or the benchmarks, please contact am969@yandex-team.ru . Dataset Download And Licenses License The Shifts dataset is released under a mixed license. Weather Prediction The Shifts Weather Prediction Dataset is released under CC BY NC SA 4.0 license.

Benchmarks for Billion-Scale Similarity Search - Yandex

https://research.yandex.com/datasets/biganns

Coronavirus. Dashboard and data | Yandex.Cloud - Marketplace

https://cloud.yandex.com/en/marketplace/products/yandex/coronavirus...

A dashboard with the latest statistics on the spread of the coronavirus around the world and in Russia, as well as the self-isolation index. Datasets were prepared using data from John Hopkins University, стопкоронавирус.рф site, and Yandex services. Self-isolation data is made available under license CC-BY-SA 3.0 and requires a link to the source: https://datalens.yandex/covid19.

Yandex.Toloka Open Datasets

https://research.yandex.com › datasets › toloka

Yandex.Toloka Open Datasets. Toloka is a major source of human-marked data for machine learning tasks. Toloka has thousands of performers making millions of ...

yandex-research/shifts: This repository contains data readers ...

https://github.com › yandex-research

This repository contains data readers and examples for the three tracks of the Shifts Dataset and the Shifts Challenge. - GitHub - yandex-research/shifts: ...

Datasets for recommender systems research - Shuai Zhang

https://shuaizhang.tech/posts/2019/08/blog-post-3

01.08.2019 · Datasets for recommender systems research. 5 minute read. Published: August 01, 2019 In this post, I will present some benchmark datasets for recommender system, please note that I will only give the links of those datasets.

GitHub - yandex-research/shifts: This repository contains ...

github.com › yandex-research › shifts

The Shifts Dataset contains curated and labelled examples of real, 'in-the-wild' distributional shift across three large-scale tasks. Specifically, it contains a tabular weather prediction task, machine translation, and Vehicle Motion Prediction. Dataset shift is ubiquitous in all of these tasks and ...

Datasets for recommender systems research - Shuai Zhang

shuaizhang.tech › posts › 2019

Aug 01, 2019 · Yandex: A click dataset for personalized Web search challenge from Yandex; Avito: A dataset of contextual search ad clicks from Avito; CVR datasets. YooChoose: A sequence of click and purchase events in an e-commerce website from YooChoose; AliCCP: A click dataset gathered from the recommender system in Taobao

AOL4PS: A Large-scale Data Set for Personalized Search | Data ...

direct.mit.edu › dint › article

Oct 25, 2021 · Yandex data set consists of information on anonymized user identifiers, queries, query terms, URLs, URL domains, and clicks. Although Yandex provides a large-scale data set, its anonymous identifier processing of queries and URLs prevents researchers from accessing the raw text.

Public datasets for Machine Learning and Data Science

https://toloka.ai › datasets

This dataset, commissioned by the Yandex Business Directory, contains 10,000 photos of organization information signs shot in the Russian Federation along ...

Yandex publishes industry’s largest AV dataset, launches ...

medium.com › yandex-self-driving-car › yandex

Jul 22, 2021 · For a long time datasets for this type of research were very limited. As a company testing its AV technology in six cities, three countries and in all types of weather conditions, Yandex can offer ...

srch

yandex dataset