Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model

Last update: Dec 07, 2022

About

This repository contains the code to replicate the synthetic experiment conducted in the paper "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model" by Haruka Kiyohara, Yuta Saito, Tatsuya Matsuhiro, Yusuke Narita, Nobuyuki Shimizu, and Yasuo Yamamoto, which was accepted at WSDM 2022.

If you find this code useful in your research, please cite:

@inproceedings{kiyohara2022doubly,
  author = {Kiyohara, Haruka and Saito, Yuta and Matsuhiro, Tatsuya and Narita, Yusuke and Shimizu, Nobuyuki and Yamamoto, Yasuo},
  title = {Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model},
  booktitle = {Proceedings of the 15th International Conference on Web Search and Data Mining},
  pages = {xxx--xxx},
  year = {2022},
}

Dependencies

This repository supports Python 3.7 or newer.

numpy==1.20.0
pandas==1.2.1
scikit-learn==0.24.1
matplotlib==3.4.3
obp==0.5.2
hydra-core==1.0.6

Note that the proposed Cascade-DR estimator is implemented in Open Bandit Pipeline (obp.ope.SlateCascadeDoublyRobust).
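
For orientation, here is a minimal pure-Python sketch of the kind of computation a cascade doubly robust estimator performs. It is not the obp implementation and does not reproduce its API. The function name, argument names, and input layout are hypothetical, and the sketch assumes position weights of 1 and that Q-hat at position k estimates the reward accumulated from position k to the end of the ranking. At each position, the cumulative importance weight over positions 1..k corrects the residual (reward minus Q-hat), while the cumulative weight over positions 1..k-1 scales the expected Q-hat under the evaluation policy.

```python
from typing import Sequence


def cascade_dr_value(
    step_weights: Sequence[Sequence[float]],
    rewards: Sequence[Sequence[float]],
    q_hat_logged: Sequence[Sequence[float]],
    q_hat_expected: Sequence[Sequence[float]],
) -> float:
    """Illustrative cascade-DR-style estimate (hypothetical helper, not obp's API).

    Every argument has one row per logged ranking and one entry per position k:
      step_weights[i][k]   : evaluation / behavior probability of the logged
                             action at position k, given the actions above it
      rewards[i][k]        : observed reward at position k
      q_hat_logged[i][k]   : Q-hat at position k for the logged action
      q_hat_expected[i][k] : Q-hat at position k averaged over the evaluation
                             policy's action choice at that position
    Assumes position weights of 1 (an assumption of this sketch).
    """
    if len(rewards) == 0:
        raise ValueError("at least one logged ranking is required")

    total = 0.0
    for w_row, r_row, q_row, q_exp_row in zip(
        step_weights, rewards, q_hat_logged, q_hat_expected
    ):
        w_prev = 1.0  # cumulative weight over positions 1..k-1
        for w_k, r_k, q_k, q_exp_k in zip(w_row, r_row, q_row, q_exp_row):
            w_cum = w_prev * w_k  # cumulative weight over positions 1..k
            # Weighted residual plus the model-based baseline term.
            total += w_cum * (r_k - q_k) + w_prev * q_exp_k
            w_prev = w_cum
    return total / len(rewards)
```

With all Q-hat values set to zero, the sketch reduces to a cascade-style importance-weighted estimate; with all weights equal to 1 and identical logged and expected Q-hat values, it reduces to the average total reward per ranking.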

Running the code

To conduct the synthetic experiment, run the following commands.

(i) Run OPE simulations with varying data size and a fixed slate size.

python src/main.py setting=n_rounds

(ii), (iii) Run OPE simulations with varying slate size and varying evaluation policy similarity, with a fixed data size.

python src/main.py

Once the code has finished running, you can find the results (squared_error.csv, relative_ee.csv, configuration.csv) in the ./logs/ directory. Lower values are better for both squared error and relative estimation error (relative-ee).
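
As a rough illustration of the two metrics, the sketch below uses their common definitions: squared error is the squared difference between an estimate and the ground-truth policy value, and relative-ee is the absolute difference divided by the absolute ground-truth value. The function names are hypothetical and the definitions are assumed rather than taken from this repository's code, so check src/ for the exact computation.

```python
def squared_error(estimate: float, ground_truth: float) -> float:
    """Squared difference between an estimated and a true policy value."""
    return (estimate - ground_truth) ** 2


def relative_ee(estimate: float, ground_truth: float) -> float:
    """Relative estimation error: |estimate - truth| / |truth| (assumed definition)."""
    if ground_truth == 0:
        raise ValueError("relative-ee is undefined when the ground truth is zero")
    return abs(estimate - ground_truth) / abs(ground_truth)


if __name__ == "__main__":
    # Hypothetical numbers, for illustration only.
    print(squared_error(0.6, 0.5), relative_ee(0.6, 0.5))
```

Both metrics are zero for a perfect estimate. Relative-ee is scale-free, which makes it easier to compare across settings whose ground-truth values differ.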

Visualize the results

To visualize the results, run the following command. Make sure that you have run both of the experiments above (python src/main.py setting=n_rounds and python src/main.py) before visualizing the results.

python src/visualize.py

The following figures are then saved in the ./logs/ directory: slate size (standard/cascade/independent).png, evaluation policy similarity (standard/cascade/independent).png, and data size (standard/cascade/independent).png. Lower values are better for the relative-MSE (y-axis).

Figure grid (images not reproduced here): the columns are the reward structures (Standard, Cascade, Independent), and the rows are varying data size (n), varying slate size (L), and varying evaluation policy similarity (λ).
