sklift.datasets.fetch_x5
X5 RetailHero Uplift Modeling Dataset
The dataset was provided by X5 Retail Group for the RetailHero hackathon, held in winter 2019.
The dataset contains raw retail customer purchases, raw information about products and general info about customers.
Machine learning competition website.
Data description
The data consists of several parts:
uplift_train.csv: a subset of clients for training. The column treatment_flg indicates whether there was a communication. The column target shows whether there was a purchase afterward;
clients.csv: general info about clients;
purchases.csv: clients’ purchase history prior to communication.
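
The parts above are meant to be combined per client. A minimal sketch of that step, using only the standard library and tiny in-memory stand-ins for the real files. The join key client_id and the age column are assumptions made for illustration; only treatment_flg and target are named in this description.

```python
import csv
import io

# Tiny stand-ins for the real files. Column names other than
# treatment_flg and target are assumptions made for illustration.
TRAIN_CSV = """client_id,treatment_flg,target
c1,1,1
c2,0,1
c3,1,0
c4,0,0
"""

CLIENTS_CSV = """client_id,age
c1,34
c2,51
c3,27
c4,45
c5,62
"""


def read_rows(text):
    """Parse CSV text into a list of dicts, one per row."""
    return list(csv.DictReader(io.StringIO(text)))


def join_train_with_clients(train_rows, client_rows, key="client_id"):
    """Attach client attributes to each training row (left join on key)."""
    clients_by_id = {row[key]: row for row in client_rows}
    joined = []
    for row in train_rows:
        # Start from the client attributes, then overlay the training columns.
        merged = dict(clients_by_id.get(row[key], {}))
        merged.update(row)
        joined.append(merged)
    return joined


train = read_rows(TRAIN_CSV)
clients = read_rows(CLIENTS_CSV)
joined = join_train_with_clients(train, clients)
```

Clients that appear only in the clients table (c5 here) are left out of the result, which mirrors the fact that the training file covers a subset of clients. With the full files, a dataframe library would likely be a better fit than plain dicts given the row counts involved.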

Fields
treatment_flg (binary): whether a communication was performed
target (binary): whether the customer made a purchase
Key figures
Format: CSV
Size: 647 MB (compressed), 4.17 GB (uncompressed)
Rows:
in ‘clients.csv’: 400,162
in ‘purchases.csv’: 45,786,568
in ‘uplift_train.csv’: 200,039
Response Ratio: 0.62
Treatment Ratio: 0.5
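
As a rough illustration of the two ratios, the sketch below assumes that the response ratio is the share of rows with target equal to 1 and the treatment ratio is the share of rows with treatment_flg equal to 1. The values are made-up toy data, not rows from the dataset.

```python
def ratio(flags):
    """Return the share of 1s in a sequence of 0/1 flags."""
    flags = list(flags)
    if not flags:
        raise ValueError("need at least one value")
    return sum(flags) / len(flags)


# Toy columns for eight hypothetical clients.
treatment_flg = [1, 0, 1, 0, 1, 0, 1, 0]
target = [1, 1, 0, 1, 1, 0, 1, 0]

treatment_ratio = ratio(treatment_flg)  # 4 of 8 clients were contacted
response_ratio = ratio(target)          # 5 of 8 clients made a purchase
```

A treatment ratio of 0.5 means the treated and control groups are the same size.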
About X5

X5 Group is a leading Russian food retailer. The company operates several retail formats: proximity stores under the Pyaterochka brand, supermarkets under the Perekrestok brand, and hypermarkets under the Karusel brand, as well as the Perekrestok.ru online market, the 5Post parcel service, and the Dostavka.Pyaterochka and Perekrestok.Bystro food delivery services.
Link to the company’s website: https://www.x5.ru/