This document explores the use of deep learning methods, specifically RNN-RBMs and convolutional deep belief networks (CDBNs), to generate music from various audio inputs such as MIDI and WAV formats. The research aims to enhance music information retrieval (MIR) by understanding auditory signals and their temporal dependencies, while highlighting the underexplored field of audio learning in contrast to advancements in image and language processing. The project culminates in the generation of MIDI files through RNN-RBM training, although the CDBN approach did not yield successful musical outcomes.
Related topics: