A Powerful Serverless Analysis Toolkit That Takes Trial And Error Out of Machine Learning Projects

Last update: Jan 02, 2023

Overview

KXY: A Seemless API to 10x The Productivity of Machine Learning Engineers

Documentation

https://www.kxy.ai/reference/

Installation

From PyPi:

pip install kxy

From GitHub:

git clone https://github.com/kxytechnologies/kxy-python.git & cd ./kxy-python & pip install .

Authentication

All heavy-duty computations are run on our serverless infrastructure and require an API key. To configure the package with your API key, run

kxy configure

and follow the instructions. To get an API key you need an account; you can sign up for a free trial here. You'll then be automatically given an API key which you can find here.

KXY is free for academic use.

Docker

The Docker image kxytechnologies/kxy has been built for your convenience, and comes with anaconda, auto-sklearn, and the kxy package.

To start a Jupyter Notebook server from a sandboxed Docker environment, run

&& /opt/conda/bin/jupyter notebook --notebook-dir=/opt/notebooks --ip='*' --port=8888 --no-browser --allow-root --NotebookApp.token=''" ">

docker run -i -t -p 5555:8888 kxytechnologies/kxy:latest /bin/bash -c "kxy configure 
   
     && /opt/conda/bin/jupyter notebook --notebook-dir=/opt/notebooks --ip='*' --port=8888 --no-browser --allow-root --NotebookApp.token=''
    "

where you should replace with your API key and navigate to http://localhost:5555 in your browser. This docker environment comes with all examples available on the documentation website.

To start a Jupyter Notebook server from an existing directory of notebooks, run

&& /opt/conda/bin/jupyter notebook --notebook-dir=/opt/notebooks --ip='*' --port=8888 --no-browser --allow-root --NotebookApp.token=''" ">

docker run -i -t --mount src=</path/to/your/local/dir>,target=/opt/notebooks,type=bind -p 5555:8888 kxytechnologies/kxy:latest /bin/bash -c "kxy configure 
   
     && /opt/conda/bin/jupyter notebook --notebook-dir=/opt/notebooks --ip='*' --port=8888 --no-browser --allow-root --NotebookApp.token=''
    "

where you should replace with the path to your local notebook folder and navigate to http://localhost:5555 in your browser.

Other Programming Language

We plan to release friendly API client in more programming language.

In the meantime, you can directly issue requests to our RESTFul API using your favorite programming language.

Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflows on Kubernetes simple, portable, and scalable.

SDK: Overview of the Kubeflow pipelines service Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflows on

3.1k Jan 6, 2023

Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production and monitoring them after deployment to production.

25 Dec 28, 2022

A machine learning toolkit dedicated to time-series data

tslearn The machine learning toolkit for time series analysis in Python Section Description Installation Installing the dependencies and tslearn Getti

2.3k Jan 5, 2023

A machine learning toolkit dedicated to time-series data

tslearn The machine learning toolkit for time series analysis in Python Section Description Installation Installing the dependencies and tslearn Getti

2.3k Dec 29, 2022

Kats is a toolkit to analyze time series data, a lightweight, easy-to-use, and generalizable framework to perform time series analysis.

Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.

4.1k Dec 29, 2022

A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.

5.7k Dec 30, 2022

A library of extension and helper modules for Python's data analysis and machine learning libraries.

Mlxtend (machine learning extensions) is a Python library of useful tools for the day-to-day data science tasks. Sebastian Raschka 2014-2021 Links Doc

4.2k Dec 29, 2022

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Master status: Development status: Package information: TPOT stands for Tree-based Pipeline Optimization Tool. Consider TPOT your Data Science Assista

8.9k Jan 9, 2023

Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.

Python Extreme Learning Machine (ELM) Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.

84 Nov 25, 2022

Comments

error in import kxy

Hi, After installing the kxy package and configuring the API key, the import kxy shows the error below:

.../python3.9/site-packages/kxy/pfs/pfs_selector.py in <module>
      6 import numpy as np
      7 
----> 8 import tensorflow as tf
      9 from tensorflow.keras.callbacks import EarlyStopping, TerminateOnNaN
     10 from tensorflow.keras.optimizers import Adam

ModuleNotFoundError: No module named 'tensorflow'

what version of tensorflow is needed for kxy to work?

opened by zeydabadi 2

generate_features Documentation?

Is there any documentation on how to use the generate_features function? It doesn't appear in the documentation and I can't find it in the github. e.g. how to use the entity column, how to format time-series data in advance for it, etc'. Thanks!

opened by ddofer 1
error kxy.data_valuation

Hi, After running chievable_performance_df = X_train_reduced.kxy.data_valuation(target_column='state', problem_type='classification', include_mutual_information=True, anonymize=True) I get the following error and the function does not return anything: `During handling of the above exception, another exception occurred:

Traceback (most recent call last): File "/usr/lib/python3.9/asyncio/tasks.py", line 258, in __step result = coro.throw(exc) File "/home/lucy/Downloads/general/lib/python3.9/site-packages/tornado/websocket.py", line 1104, in wrapper raise WebSocketClosedError() tornado.websocket.WebSocketClosedError Task exception was never retrieved future: <Task finished name='Task-46004' coro=<WebSocketProtocol13.write_message..wrapper() done, defined at /home/lucy/Downloads/general/lib/python3.9/site-packages/tornado/websocket.py:1100> exception=WebSocketClosedError()> Traceback (most recent call last): File "/home/lucy/Downloads/general/lib/python3.9/site-packages/tornado/websocket.py", line 1102, in wrapper await fut File "/usr/lib/python3.9/asyncio/tasks.py", line 328, in __wakeup future.result() tornado.iostream.StreamClosedError: Stream is closed `

opened by zeydabadi 0

Releases(v1.4.10)

v1.4.10(Apr 25, 2022)
Change Log

v.1.4.10 Changes

Added a function to construct features derived from PFS mutual information estimation that should be expected to be linearly related to the target.

Fixed a global name conflict in kxy.learning.base_learners.

v.1.4.9 Changes

Change the activation function used by PFS from ReLU to switch/SILU.

Leaving it to the user to set the logging level.

v.1.4.8 Changes

Froze the versions of all python packages in the docker file.

v.1.4.7 Changes

Changes related to optimizing Principal Feature Selection.

Made it easy to change PFS' default learning parameters.

Changed PFS' default learning parameters (learning rate is now 0.005 and epsilon 1e-04)

Adding a seed parameter to PFS' fit for reproducibility.

To globally change the learning rate to 0.003, change Adam's epsilon to 1e-5, and the number of epochs to 25, do

from kxy.misc.tf import set_default_parameter set_default_parameter('lr', 0.003) set_default_parameter('epsilon', 1e-5) set_default_parameter('epochs', 25)

To change the number epochs for a single iteration of PFS, use the epochs argument of the fit method of your PFS object. The fit method now also has a seed parameter you may use to make the PFS implementation deterministic.

Example:

from kxy.pfs import PFS selector = PFS() selector.fit(x, y, epochs=25, seed=123)

Alternatively, you may also use the kxy.misc.tf.set_seed method to make PFS deterministic.

v.1.4.6 Changes

Minor PFS improvements.

Adding more (robust) mutual information loss functions.

Exposing the learned total mutual information between principal features and target as an attribute of PFS.

Exposing the number of epochs as a parameter of PFS' fit.

Source code(tar.gz)
Source code(zip)
v1.4.9(Apr 12, 2022)
Change Log

v.1.4.9 Changes

Change the activation function used by PFS from ReLU to switch/SILU.

Leaving it to the user to set the logging level.

v.1.4.8 Changes

Froze the versions of all python packages in the docker file.

v.1.4.7 Changes

Changes related to optimizing Principal Feature Selection.

Made it easy to change PFS' default learning parameters.

Changed PFS' default learning parameters (learning rate is now 0.005 and epsilon 1e-04)

Adding a seed parameter to PFS' fit for reproducibility.

To globally change the learning rate to 0.003, change Adam's epsilon to 1e-5, and the number of epochs to 25, do

from kxy.misc.tf import set_default_parameter set_default_parameter('lr', 0.003) set_default_parameter('epsilon', 1e-5) set_default_parameter('epochs', 25)

To change the number epochs for a single iteration of PFS, use the epochs argument of the fit method of your PFS object. The fit method now also has a seed parameter you may use to make the PFS implementation deterministic.

Example:

from kxy.pfs import PFS selector = PFS() selector.fit(x, y, epochs=25, seed=123)

Alternatively, you may also use the kxy.misc.tf.set_seed method to make PFS deterministic.

v.1.4.6 Changes

Minor PFS improvements.

Adding more (robust) mutual information loss functions.

Exposing the learned total mutual information between principal features and target as an attribute of PFS.

Exposing the number of epochs as a parameter of PFS' fit.

Source code(tar.gz)
Source code(zip)
v1.4.8(Apr 11, 2022)
Change Log

v.1.4.8 Changes

Froze the versions of all python packages in the docker file.

v.1.4.7 Changes

Changes related to optimizing Principal Feature Selection.

Made it easy to change PFS' default learning parameters.

Changed PFS' default learning parameters (learning rate is now 0.005 and epsilon 1e-04)

Adding a seed parameter to PFS' fit for reproducibility.

To globally change the learning rate to 0.003, change Adam's epsilon to 1e-5, and the number of epochs to 25, do

from kxy.misc.tf import set_default_parameter set_default_parameter('lr', 0.003) set_default_parameter('epsilon', 1e-5) set_default_parameter('epochs', 25)

To change the number epochs for a single iteration of PFS, use the epochs argument of the fit method of your PFS object. The fit method now also has a seed parameter you may use to make the PFS implementation deterministic.

Example:

from kxy.pfs import PFS selector = PFS() selector.fit(x, y, epochs=25, seed=123)

Alternatively, you may also use the kxy.misc.tf.set_seed method to make PFS deterministic.

v.1.4.6 Changes

Minor PFS improvements.

Adding more (robust) mutual information loss functions.

Exposing the learned total mutual information between principal features and target as an attribute of PFS.

Exposing the number of epochs as a parameter of PFS' fit.

Source code(tar.gz)
Source code(zip)
v1.4.7(Apr 10, 2022)
Change Log

v.1.4.7 Changes

Changes related to optimizing Principal Feature Selection.

Made it easy to change PFS' default learning parameters.

Changed PFS' default learning parameters (learning rate is now 0.005 and epsilon 1e-04)

Adding a seed parameter to PFS' fit for reproducibility.

To globally change the learning rate to 0.003, change Adam's epsilon to 1e-5, and the number of epochs to 25, do

from kxy.misc.tf import set_default_parameter set_default_parameter('lr', 0.003) set_default_parameter('epsilon', 1e-5) set_default_parameter('epochs', 25)

To change the number epochs for a single iteration of PFS, use the epochs argument of the fit method of your PFS object. The fit method now also has a seed parameter you may use to make the PFS implementation deterministic.

Example:

from kxy.pfs import PFS selector = PFS() selector.fit(x, y, epochs=25, seed=123)

Alternatively, you may also use the kxy.misc.tf.set_seed method to make PFS deterministic.

v.1.4.6 Changes

Minor PFS improvements.

Adding more (robust) mutual information loss functions.

Exposing the learned total mutual information between principal features and target as an attribute of PFS.

Exposing the number of epochs as a parameter of PFS' fit.

Source code(tar.gz)
Source code(zip)
v1.4.6(Apr 10, 2022)
Changes

Adding more (robust) mutual information loss functions.

Exposing the learned total mutual information between principal features and target as an attribute of PFS.

Exposing the number of epochs as a parameter of PFS' fit.

Source code(tar.gz)
Source code(zip)
v1.4.5(Apr 9, 2022)

Fixing some package incompatibilities.
Source code(tar.gz)
Source code(zip)
v1.4.4(Apr 8, 2022)

Adding Principal Feature Selection.
Source code(tar.gz)
Source code(zip)
v1.0.4(Jul 1, 2021)

Source code(tar.gz)
Source code(zip)
1.0.3(Mar 23, 2021)

Source code(tar.gz)
Source code(zip)
1.0.2(Mar 16, 2021)

Source code(tar.gz)
Source code(zip)
1.0.1(Mar 16, 2021)

Source code(tar.gz)
Source code(zip)
0.3.8(Jan 25, 2021)

Source code(tar.gz)
Source code(zip)
v0.3.5(Jan 21, 2021)

Source code(tar.gz)
Source code(zip)
v0.3.4(Dec 16, 2020)

Source code(tar.gz)
Source code(zip)
v0.3.2(Aug 14, 2020)

Adding regression root mean square error (RMSE) in the list of metrics whose achievable values we calculate.
Source code(tar.gz)
Source code(zip)
v0.3.1(Aug 7, 2020)

Source code(tar.gz)
Source code(zip)
v0.3.0(Aug 3, 2020)

Adding a maximum-entropy based classifier (kxy.MaxEntClassifier) and regressor (kxy.MaxEntRegressor) following the scikit-learn signature for fitting and predicting.

These models estimate the posterior mean E[u_y|x] and the posterior standard deviation sqrt(Var[u_y|x]) for any specific value of x, where the copula-uniform representations (u_y, u_x) follow the maximum-entropy distribution.

Predictions in the primal are derived from E[u_y|x].
Source code(tar.gz)
Source code(zip)
v0.2.0(Jun 25, 2020)
Regression analyses now fully support categorical variables.

Foundations for multi-output regressions are laid.

Categorical variables are now systematically encoded and treated as continuous, consistent with what's done at the learning stage.

Regression and classification are further normalized, and most the compute for classification problems now takes place on the API side, and should be considerably faster.

Source code(tar.gz)
Source code(zip)
v0.1.3(Jun 12, 2020)

Source code(tar.gz)
Source code(zip)
v0.1.2(Jun 12, 2020)

Source code(tar.gz)
Source code(zip)
v0.1.1(Jun 11, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.18(May 26, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.16(May 18, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.15(May 18, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.14(May 18, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.13(May 16, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.11(May 13, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.10(May 11, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.3(Apr 17, 2020)

Source code(tar.gz)
Source code(zip)
v0.0.2(Apr 17, 2020)

Source code(tar.gz)
Source code(zip)

Owner

KXY Technologies, Inc.

GitHub Repository https://kxy.ai

Pydantic based mock data generation

This library offers powerful mock data generation capabilities for pydantic based models. It can also be used with other libraries that use pydantic as a foundation, for example SQLModel, Beanie and

396 Dec 28, 2022

ETNA is an easy-to-use time series forecasting framework.

ETNA is an easy-to-use time series forecasting framework. It includes built in toolkits for time series preprocessing, feature generation, a variety of predictive models with unified interface - from

674 Jan 07, 2023

About Solve CTF offline disconnection problem - based on python3's small crawler

About Solve CTF offline disconnection problem - based on python3's small crawler, support keyword search and local map bed establishment, currently support Jianshu, xianzhi,anquanke,freebuf,seebug

32 Oct 25, 2022

Kaggle Tweet Sentiment Extraction Competition: 1st place solution (Dark of the Moon team)

64 Nov 30, 2022

The Emergence of Individuality

16 Jul 20, 2022

Little Ball of Fur - A graph sampling extension library for NetworKit and NetworkX (CIKM 2020)

Little Ball of Fur is a graph sampling extension library for Python. Please look at the Documentation, relevant Paper, Promo video and External Resour

619 Dec 14, 2022

Predico Disease Prediction system based on symptoms provided by patient- using Python-Django & Machine Learning

1 Jan 06, 2022

A Lightweight Hyperparameter Optimization Tool 🚀

The mle-hyperopt package provides a simple and intuitive API for hyperparameter optimization of your Machine Learning Experiment (MLE) pipeline.

137 Dec 02, 2022

LinearRegression2 Tvads and CarSales

LinearRegression2_Tvads_and_CarSales This project infers the insight that how the TV ads for cars and car Sales are being linked with each other. It i

1 Dec 29, 2021

Python 3.6+ toolbox for submitting jobs to Slurm

Submit it! What is submitit? Submitit is a lightweight tool for submitting Python functions for computation within a Slurm cluster. It basically wraps

768 Jan 03, 2023

Projeto: Machine Learning: Linguagens de Programacao 2004-2001

Projeto: Machine Learning: Linguagens de Programacao 2004-2001 Projeto de Data Science e Machine Learning de análise de linguagens de programação de 2

0 Jun 29, 2021

A benchmark of data-centric tasks from across the machine learning lifecycle.

61 Dec 28, 2022

A demo project to elaborate how Machine Learn Models are deployed on production using Flask API

This is a salary prediction website developed with the help of machine learning, this makes prediction of salary on basis of few parameters like interview score, experience test score.

1 Feb 10, 2022

Nevergrad - A gradient-free optimization platform

Nevergrad - A gradient-free optimization platform nevergrad is a Python 3.6+ library. It can be installed with: pip install nevergrad More installati

3.4k Jan 08, 2023

Automatic extraction of relevant features from time series:

tsfresh This repository contains the TSFRESH python package. The abbreviation stands for "Time Series Feature extraction based on scalable hypothesis

7k Jan 06, 2023

A single Python file with some tools for visualizing machine learning in the terminal.

Machine Learning Visualization Tools A single Python file with some tools for visualizing machine learning in the terminal. This demo is composed of t

35 Dec 29, 2022

Napari sklearn decomposition

napari-sklearn-decomposition A simple plugin to use with napari This napari plug

1 Sep 01, 2022

Bodywork deploys machine learning projects developed in Python, to Kubernetes.

Bodywork deploys machine learning projects developed in Python, to Kubernetes. It helps you to: serve models as microservices execute batch jobs run r

409 Jan 01, 2023

Painless Machine Learning for python based on scikit-learn

PlainML Painless Machine Learning Library for python based on scikit-learn. Install pip install plainml Example from plainml import KnnModel, load_ir

1 Aug 06, 2022

CD) in machine learning projectsImplementing continuous integration & delivery (CI/CD) in machine learning projects

CML with cloud compute This repository contains a sample project using CML with Terraform (via the cml-runner function) to launch an AWS EC2 instance

19 Oct 03, 2022

A Powerful Serverless Analysis Toolkit That Takes Trial And Error Out of Machine Learning Projects

Related tags

Overview

KXY: A Seemless API to 10x The Productivity of Machine Learning Engineers

Documentation

Installation

Authentication

Docker

Other Programming Language

You might also like...

Kubeflow is a machine learning (ML) toolkit that is dedicated to making deployments of ML workflows on Kubernetes simple, portable, and scalable.

Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production and monitoring them after deployment to production.

A machine learning toolkit dedicated to time-series data

A machine learning toolkit dedicated to time-series data

Kats is a toolkit to analyze time series data, a lightweight, easy-to-use, and generalizable framework to perform time series analysis.

A mindmap summarising Machine Learning concepts, from Data Analysis to Deep Learning.

A library of extension and helper modules for Python's data analysis and machine learning libraries.

A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

Python Extreme Learning Machine (ELM) is a machine learning technique used for classification/regression tasks.

Comments

error in import kxy

generate_features Documentation?

error kxy.data_valuation

Releases(v1.4.10)

v1.4.10(Apr 25, 2022)

Change Log

v.1.4.10 Changes

v.1.4.9 Changes

v.1.4.8 Changes

v.1.4.7 Changes

v.1.4.6 Changes

v1.4.9(Apr 12, 2022)

Change Log

v.1.4.9 Changes

v.1.4.8 Changes

v.1.4.7 Changes

v.1.4.6 Changes

v1.4.8(Apr 11, 2022)

Change Log

v.1.4.8 Changes

v.1.4.7 Changes

v.1.4.6 Changes

v1.4.7(Apr 10, 2022)

Change Log

v.1.4.7 Changes

v.1.4.6 Changes

v1.4.6(Apr 10, 2022)

Changes

v1.4.5(Apr 9, 2022)

v1.4.4(Apr 8, 2022)

v1.0.4(Jul 1, 2021)

1.0.3(Mar 23, 2021)

1.0.2(Mar 16, 2021)

1.0.1(Mar 16, 2021)

0.3.8(Jan 25, 2021)

v0.3.5(Jan 21, 2021)

v0.3.4(Dec 16, 2020)

v0.3.2(Aug 14, 2020)

v0.3.1(Aug 7, 2020)

v0.3.0(Aug 3, 2020)

v0.2.0(Jun 25, 2020)

v0.1.3(Jun 12, 2020)

v0.1.2(Jun 12, 2020)

v0.1.1(Jun 11, 2020)

v0.0.18(May 26, 2020)

v0.0.16(May 18, 2020)

v0.0.15(May 18, 2020)

v0.0.14(May 18, 2020)

v0.0.13(May 16, 2020)

v0.0.11(May 13, 2020)

v0.0.10(May 11, 2020)

v0.0.3(Apr 17, 2020)

v0.0.2(Apr 17, 2020)

Owner

KXY Technologies, Inc.

Pydantic based mock data generation

ETNA is an easy-to-use time series forecasting framework.

About Solve CTF offline disconnection problem - based on python3's small crawler

Kaggle Tweet Sentiment Extraction Competition: 1st place solution (Dark of the Moon team)

The Emergence of Individuality