RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Last update: Feb 10, 2022

Overview

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation

This is the implementation of RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation.

Code

To run our code, please use the following commands:

g++ RATE.cpp -o RATE -std=c++11
./RATE [Training File] [Test File] [L, optional, default = 30] [T, optional, default = 1]

For example,

g++ RATE.cpp -o RATE -std=c++11
./RATE Dataset/train.txt Dataset/test.txt 40 1

The prediction results will be in ./result.txt (the first row is the classification result). Then you can run

python eval.py

to obtain evaluation metrics.

Dataset

We release the Europe dataset (Dataset/data.json), where each line is a json file with tweet text and metadata. Due to privacy issues, we have anonymized the whole dataset by representing each word/feature as an integer. An example is shown below.

{ 
   "label":0,
   "language":"3",
   "timezone":"5",
   "offset":"7",
   "userlang":"5",
   "latitude":"36.8901",
   "longitude":"30.6809",
   "text":"3332 2608 29"
}

Given the json file, one can run

cd Dataset/
python preprocess.py

to get training and testing data (Dataset/train.txt and Dataset/test.txt).

Result

Method	Micro-F1 (Acc)	Macro-F1	Mean Distance Error (km)	[email protected]
RATE	0.8905	0.5230	365.16	0.4315

Citation

@inproceedings{zhang2017rate,
  title={RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation},
  author={Zhang, Yu and Wei, Wei and Huang, Binxuan and Carley, Kathleen M and Zhang, Yan},
  booktitle={Proceedings of the 2017 ACM on Conference on Information and Knowledge Management},
  pages={2423--2426},
  year={2017},
  organization={ACM}
}

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation (CIKM'17)

Related tags

Overview

RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation

Code

Dataset

Result

Citation

Owner

Yu Zhang

Rest API Written In Python To Classify NSFW Images.

Data and extra materials for the food safety publications classifier

Simulation of the solar system using various nummerical methods

Cross-platform CLI tool to generate your Github profile's stats and summary.

Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut

PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations

IEGAN — Official PyTorch Implementation Independent Encoder for Deep Hierarchical Unsupervised Image-to-Image Translation

Geometric Algebra package for JAX

The Official Repository for "Generalized OOD Detection: A Survey"

Automatic Attendance marker for LMS Practice School Division, BITS Pilani

Blender Add-on that sets a Material's Base Color to one of Pantone's Colors of the Year

PyTorch implementation of Neural Combinatorial Optimization with Reinforcement Learning.

Cascading Feature Extraction for Fast Point Cloud Registration (BMVC 2021)

Code for a seq2seq architecture with Bahdanau attention designed to map stereotactic EEG data from human brains to spectrograms, using the PyTorch Lightning.

ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation

Python Auto-ML Package for Tabular Datasets

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

CVNets: A library for training computer vision networks