Winning solution of the Indoor Location & Navigation Kaggle competition

Last update: Dec 28, 2022

Overview

This repository contains the code to generate the winning solution of the Kaggle competition on indoor location and navigation organized by Microsoft Research.

Our team name: "Track me if you can".

Authors:

Are Haartveit
Dmitry Gordeev
Tom Van de Wiele

References

Steps to obtain the approximate winning submission

Clone the repository, it doesn't matter where you clone it to since the source code and data are disentangled.
Create a project folder on a disk with at least 150GB of free space. Create a "Data" subfolder in your project folder. This will be referred to as "your data folder" in what follows.
Download the raw text data from here and extract it into your data folder.
Download the cleaned raw data from here and extract it into the "reference_preprocessed" subfolder of your data folder.
Add your data folder to line 19 in src/utils.py.
Run main.py.

If all goes well, the pipeline should create a "final_submissions" subfolder in your data folder with two final submissions. Note that these are likely slightly different from our actual submissions due to inherent training stochasticity. When you make a late submit of these submissions to the leaderboard, you should obtain a private score around 1.5, which can be further reduced to about 1.3 after fixing the private test floor predictions (not part of this repository).

Main script parameters

Mode ("-m" or "--mode"). Default: 'test'. Select from ('valid', 'test').
Suppress multipricessing ("-s"). Default: no suppression of multiprocessing.
Fast (and bad) sensor models ("-f"). Default: no fast sensor models. Mostly useful for verifying that all dependencies are in place. Ignored when copying sensor models (next parameter).
Copy sensor predictions ("-c"). Default: no copying of pretrained sensor predictions. Useful if you want to speed up the pipeline since training sensor models is the slowest part.

Hardware requirements

Due to the size of the data set, you need at least 32 GB RAM to be able to run the pipeline successfully.

Known issues

If you run out of memory, try running the pipeline again. It should continue where it left it in the previous run.

Winning solution of the Indoor Location & Navigation Kaggle competition

Related tags

Overview

References

Steps to obtain the approximate winning submission

Main script parameters

Hardware requirements

Known issues

Owner

Tom Van de Wiele

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

《LightXML: Transformer with dynamic negative sampling for High-Performance Extreme Multi-label Text Classiﬁcation》(AAAI 2021) GitHub:

DROPO: Sim-to-Real Transfer with Offline Domain Randomization

Rule Based Classification Project

(CVPR2021) DANNet: A One-Stage Domain Adaptation Network for Unsupervised Nighttime Semantic Segmentation

A model to classify a piece of news as REAL or FAKE

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AAAI 2022)

DCGAN-tensorflow - A tensorflow implementation of Deep Convolutional Generative Adversarial Networks

CTRL-C: Camera calibration TRansformer with Line-Classification

Experimenting with computer vision techniques to generate annotated image datasets from gameplay recordings automatically.

Very deep VAEs in JAX/Flax

Pytorch reimplement of the paper "A Novel Cascade Binary Tagging Framework for Relational Triple Extraction" ACL2020. The original code is written in keras.

Session-based Recommendation, CoHHN, price preferences, interest preferences, Heterogeneous Hypergraph, Co-guided Learning, SIGIR2022

Built a deep neural network (DNN) that functions as an end-to-end machine translation pipeline

Accelerating BERT Inference for Sequence Labeling via Early-Exit

Another pytorch implementation of FCN (Fully Convolutional Networks)

GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training @ KDD 2020

Public implementation of the Convolutional Motif Kernel Network (CMKN) architecture

Anchor-free Oriented Proposal Generator for Object Detection

Aesara is a Python library that allows one to define, optimize, and efficiently evaluate mathematical expressions involving multi-dimensional arrays.