In this project we can see how we can generate automatic music using character RNN.

Overview

Automatic Music Genaration

Table of Contents

  1. Project Description
  2. Approach towards the problem
  3. Limitations
  4. Libraries Used
  5. Summary
  6. Applications
  7. References


1. Project Description

  • Natural Language Processing in Artificial Intelligence is the application of computational techniques to the analysis and synthesis of natural language and speech.

  • For a hypothetical scenario it was assumed that ABC music prod. pvt.ltd is a reknowned audio-video production house based out of Mumbai, India

  • As COVID-19 cases are increasing day by day it is almost impossible for the musicians to coop up with real time studio work.

  • Hence, the company wants you to make an AI based music genaration system.

  • The goal of this project is to make an AI based music genaration system.

  • The key contraint to the problem is accruracy.

  • You have been hired as a freelance data scientist for ABC music prod. pvt.ltd

  • The model should read a text file in abc format.

  • The model should genarate the corresponding music framed out of that note sequence.


2. Approach towards the problem

  • The approach to this project was to make a chat bot from scratch.

  • Initially all the necessary libraries were imported and installed.

  • Then, the data corpus was formed.

  • A data corpus is a collection of linguistic data.

  • Then while performing preprocessing we used lemitization technique in order to process the data into desirable format.

  • Then after preprocessing, we genarated various batches to fit into the model.

  • After that we build a charecter RNN model and trained the model for 100 epochs with a batch size of 16 and a sequence length of 64

  • For every 10 epochs the model would save the updated weights in the main directory

  • We recieved a validation accuracy of 92%

  • Now we genarated the music code which is nothing but the predicted vocabulary

  • After that we went here and we were successfully able to genarate music.

3. Limitation(s)

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • We have trained this model for only 100 epochs. As the number of epochs increase it is expected that the accuracy of the model will increase.

4. Libraries Used

Following are the list of libraries that were used for making this project.

  • Python was used as the general purposae programming language

  • Keras was used to perform all Deep Learning operation(s) such as model bulding, compliation and training.

  • numpy was used in order to calcucate numercal operations.

5. Summary

  • In this project an Automatic music genaration system was made from scratch.

  • Here, we recieved a validation accuracy of 92%

  • This project can be widely used for music production systems

  • The only limitation of this model is that it is being trained with very less data.

  • However, while getting trained on various intrument data this model can be further enhanced for different instruments as well.

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

6. Applications

  • This model can be used for in house music production systems.

  • This can be widely used to automate manual instruments.

  • This can be also used to make automatic VST(virtual studio toolkit) plugins

7. References

Owner
Pronay Ghosh
A data-driven professional who grows day by day. Currently working as a Data Science Researcher at The International School of AI and Data Science.
Pronay Ghosh
Code for paper 'Audio-Driven Emotional Video Portraits'.

Audio-Driven Emotional Video Portraits [CVPR2021] Xinya Ji, Zhou Hang, Kaisiyuan Wang, Wayne Wu, Chen Change Loy, Xun Cao, Feng Xu [Project] [Paper] G

197 Dec 31, 2022
Generating a structured library of .wav samples with Python.

sample-library Scripts for generating a structured sample library with Python Requires Docker about Samples are written to wave files in lib/. Differe

Ben Mangold 1 Nov 11, 2021
python wrapper for rubberband

pyrubberband A python wrapper for rubberband. For now, this just provides lightweight wrappers for pitch-shifting and time-stretching. All processing

Brian McFee 106 Nov 28, 2022
GiantMIDI-Piano is a classical piano MIDI dataset contains 10,854 MIDI files of 2,786 composers

GiantMIDI-Piano is a classical piano MIDI dataset contains 10,854 MIDI files of 2,786 composers

Bytedance Inc. 1.3k Jan 04, 2023
This is my voice assistant Patric!

voice-assistant This is my voice assistant Patric! You can add can add commands and even modify his name Indice How to use Installation guide How to u

Norbert Gabos 1 Jun 28, 2022
Xbot-Music - Bot Play Music and Video in Voice Chat Group Telegram

XBOT-MUSIC A Telegram Music+video Bot written in Python using Pyrogram and Py-Tg

Fariz 2 Jan 20, 2022
SU Music Player — The first open-source PyTgCalls based Pyrogram bot to play music in voice chats

SU Music Player — The first open-source PyTgCalls based Pyrogram bot to play music in voice chats Note Neither this, or PyTgCalls are fully

SU Projects 58 Jan 02, 2023
GNU Radio – the Free and Open Software Radio Ecosystem

GNU Radio is a free & open-source software development toolkit that provides signal processing blocks to implement software radios. It can be used wit

GNU Radio 4.1k Jan 06, 2023
F.R.I.D.A.Y. ----- Female Replacement Intelligent Digital Assistant Youth

F.R.I.D.A.Y. Female Replacement Intelligent Digital Assistant Youth--Jarvis-- the virtual assistant made by python Overview This is a virtual assistan

JIB - Just Innovative Bro 4 Feb 26, 2022
DCL - An easy to use diacritic library used for diacritic and accent manipulation.

Diacritics Library This library is used for adding, and removing diacritics from strings. Getting started Start by importing the module: import dcl DC

Kreus Amredes 6 Jun 03, 2022
Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch

Auditory Slow-Fast This repository implements the model proposed in the paper: Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen, Slow-Fa

Evangelos Kazakos 57 Dec 07, 2022
Audio pitch-shifting & re-sampling utility, based on the EMU SP-1200

Pitcher.py Free & OS emulation of the SP-12 & SP-1200 signal chain (now with GUI) Pitch shift / bitcrush / resample audio files Written and tested in

morgan 13 Oct 03, 2022
?️ Open Source Audio Matching and Mastering

Matching + Mastering = ❤️ Matchering 2.0 is a novel Containerized Web Application and Python Library for audio matching and mastering. It follows a si

Sergey Grishakov 781 Jan 05, 2023
Mina - A Telegram Music Bot 5 mandatory Assistant written in Python using Pyrogram and Py-Tgcalls

Mina - A Telegram Music Bot 5 mandatory Assistant written in Python using Pyrogram and Py-Tgcalls

3 Feb 07, 2022
Audio library for modelling loudness

Loudness Loudness is a C++ library with Python bindings for modelling perceived loudness. The library consists of processing modules which can be casc

Dominic Ward 33 Oct 02, 2022
Marsyas - Music Analysis, Retrieval and Synthesis for Audio Signals

Welcome to MARSYAS. MARSYAS is a software framework for rapid prototyping of audio applications, with flexibility and extensibility as primary concer

Marsyas Developers Group 364 Oct 31, 2022
C++ library for audio and music analysis, description and synthesis, including Python bindings

Essentia Essentia is an open-source C++ library for audio analysis and audio-based music information retrieval released under the Affero GPL license.

Music Technology Group - Universitat Pompeu Fabra 2.3k Jan 03, 2023
Bot duniya Music Player

Bot duniya Music Player Requirements 📝 FFmpeg (Latest) NodeJS nodesource.com (NodeJS 17+) Python (3.10+) PyTgCalls (Lastest) 2nd Telegram Account (ne

Aman Vishwakarma 16 Oct 21, 2022
praudio provides audio preprocessing framework for Deep Learning audio applications

praudio provides objects and a script for performing complex preprocessing operations on entire audio datasets with one command.

Valerio Velardo 105 Dec 26, 2022
Telegram Bot to play music in VoiceChat with Channel Support and autostarts Radio.

VCPlayerBot Telegram bot to stream videos in telegram voicechat for both groups and channels. Supports live streams, YouTube videos and telegram media

Abdisamad Omar Mohamed 1 Oct 15, 2021