Cognition-aware Cognate Detection

Last update: Feb 01, 2022

Related tags

Overview

Cognition-aware Cognate Detection

The repository which contains our code for our EACL 2021 paper titled, "Cognition-aware Cognate Detection". This work was awarded a best paper honourable mention among the long papers during the conference.

We ask the following pertinent questions with this work:

“Can cognitive features be used to help the task of Cognate Detection?”

furthermore,

“Using gaze features collected on a small set of data points, can we predict the same features on a larger set of data points to alleviate the need for collecting gaze data?”

The paper description, PDF, Slides and Video are available here: http://dipteshkanojia.github.io/publication/eacl-2021-cognate/

In case you use this data, code or research from this work, you are requested to please cite as follows:

@inproceedings{kanojia-etal-2021-cognition,
    title = "Cognition-aware Cognate Detection",
    author = "Kanojia, Diptesh  and
      Sharma, Prashant  and
      Ghodekar, Sayali  and
      Bhattacharyya, Pushpak  and
      Haffari, Gholamreza  and
      Kulkarni, Malhar",
    booktitle = "Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume",
    month = apr,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.eacl-main.288",
    pages = "3281--3292",
    abstract = "Automatic detection of cognates helps downstream NLP tasks of Machine Translation, Cross-lingual Information Retrieval, Computational Phylogenetics and Cross-lingual Named Entity Recognition. Previous approaches for the task of cognate detection use orthographic, phonetic and semantic similarity based features sets. In this paper, we propose a novel method for enriching the feature sets, with cognitive features extracted from human readers{'} gaze behaviour. We collect gaze behaviour data for a small sample of cognates and show that extracted cognitive features help the task of cognate detection. However, gaze data collection and annotation is a costly task. We use the collected gaze behaviour data to predict cognitive features for a larger sample and show that predicted cognitive features, also, significantly improve the task performance. We report improvements of 10{\%} with the collected gaze features, and 12{\%} using the predicted gaze features, over the previously proposed approaches. Furthermore, we release the collected gaze behaviour data along with our code and cross-lingual models.",
}

Abstract:

Automatic detection of cognates helps downstream NLP tasks of Machine Translation, Cross-lingual Information Retrieval, Computational Phylogenetics and Cross-lingual Named Entity Recognition. Previous approaches for the task of cognate detection use orthographic, phonetic and semantic similarity based features sets. In this paper, we propose a novel method for enriching the feature sets, with cognitive features extracted from human readers’ gaze behaviour. We collect gaze behaviour data for a small sample of cognates and show that extracted cognitive features help the task of cognate detection. However, gaze data collection and annotation is a costly task. We use the collected gaze behaviour data to predict cognitive features for a larger sample and show that predicted cognitive features, also, significantly improve the task performance. We report improvements of 10% with the collected gaze features, and 12% using the predicted gaze features, over the previously proposed approaches. Furthermore, we release the collected gaze behaviour data along with our code and cross-lingual models.

Proposed Models

Results

Setup

Please refer here

NOTE

In case of any query / issue please:

Open a github issue (OR)
Send an email with "[EACL2021 CaCD]" in the subject to dipteshkanojia [AT] gmail [DOT] com ( or prashaantsharmaa [AT] gmail [DOT] com )
- Sending the email with subject will make it easier for us to resolve the issue promptly.

We will try our best to resolve it as soon as we can.

Cognition-aware Cognate Detection

Related tags

Overview

Cognition-aware Cognate Detection

Abstract:

Proposed Models

Results

Setup

NOTE

Owner

Prashant K. Sharma

Official implementation of Monocular Quasi-Dense 3D Object Tracking

TLoL (Python Module) - League of Legends Deep Learning AI (Research and Development)

Keras implementation of Real-Time Semantic Segmentation on High-Resolution Images

CL-Gym: Full-Featured PyTorch Library for Continual Learning

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

Wikidated : An Evolving Knowledge Graph Dataset of Wikidata’s Revision History

Adversarial Attacks are Reversible via Natural Supervision

This repository contains code demonstrating the methods outlined in Path Signature Area-Based Causal Discovery in Coupled Time Series presented at Causal Analysis Workshop 2021.

RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is large.

Code used to generate the results appearing in "Train longer, generalize better: closing the generalization gap in large batch training of neural networks"

Technical Indicators implemented in Python only using Numpy-Pandas as Magic - Very Very Fast! Very tiny! Stock Market Financial Technical Analysis Python library . Quant Trading automation or cryptocoin exchange

PyTorch implementation of the R2Plus1D convolution based ResNet architecture described in the paper "A Closer Look at Spatiotemporal Convolutions for Action Recognition"

wgan, wgan2(improved, gp), infogan, and dcgan implementation in lasagne, keras, pytorch

MazeRL is an application oriented Deep Reinforcement Learning (RL) framework

🔊 Audio and fastai v2

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

The repo of Feedback Networks, CVPR17

Pytorch implementation of our method for regularizing nerual radiance fields for few-shot neural volume rendering.

An end-to-end regression problem of predicting the price of properties in Bangalore.

A collection of semantic image segmentation models implemented in TensorFlow