Demo project for real time anomaly detection using kafka and python

Last update: Dec 12, 2022

Overview

kafkaml-anomaly-detection

Project for real time anomaly detection using kafka and python

It's assumed that zookeeper and kafka are running in the localhost, it follows this process:

Train an unsupervised machine learning model for anomalies detection
Save the model to be used in real time predictions
Generate fake streaming data and send it to a kafka topic
Read the topic data with several subscribers to be analyzed by the model
Predict if the data is an anomaly, if so, send the data to another kafka topic
Subscribe a slack bot to the last topic to send a message in slack channel if an anomaly arrives

This could be illustrated as:

Demo

Generate fake transactions into a kafka topic:

Predict and send anomalies to another kafka topic

Producer and anomaly detection running at the same time

Send notifications to Slack

Usage:

First train the anomaly detection model, run the file:

model/train.py

Create the required topics

kafka-topics.sh --zookeeper localhost:2181 --topic transactions --create --partitions 3 --replication-factor 1
kafka-topics.sh --zookeeper localhost:2181 --topic anomalies --create --partitions 3 --replication-factor 1

Check the topics are created

kafka-topics.sh --zookeeper localhost:2181 --list

Check file settings.py and edit the variables if needed
Start the producer, run the file

streaming/producer.py

Start the anomalies detector, run the file

streaming/anomalies_detector.py

Start sending alerts to Slack, make sure to register the env variable SLACK_API_TOKEN, then run

streaming/bot_alerts.py

Demo project for real time anomaly detection using kafka and python

Related tags

Overview

kafkaml-anomaly-detection

Demo

Usage:

Owner

Rodrigo Arenas

PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).

Code for EMNLP2020 long paper: BERT-Attack: Adversarial Attack Against BERT Using BERT

AI-Fitness-Tracker - AI Fitness Tracker With Python

EPSANet：An Efficient Pyramid Split Attention Block on Convolutional Neural Network

Source Code For Template-Based Named Entity Recognition Using BART

Auditing Black-Box Prediction Models for Data Minimization Compliance

A Pytorch implementation of "Manifold Matching via Deep Metric Learning for Generative Modeling" (ICCV 2021)

The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

Novel and high-performance medical image classification pipelines are heavily utilizing ensemble learning strategies

An Active Automata Learning Library Written in Python

Industrial Image Anomaly Localization Based on Gaussian Clustering of Pre-trained Feature

Python based framework for Automatic AI for Regression and Classification over numerical data.

Learning hidden low dimensional dyanmics using a Generalized Onsager Principle and neural networks

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

Code for our paper 'Generalized Category Discovery'

A distributed deep learning framework that supports flexible parallelization strategies.

Official implementation for ICDAR 2021 paper "Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer"

A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning

SMIS - Semantically Multi-modal Image Synthesis(CVPR 2020)