Skip to content

Latest commit

 

History

History
executable file
·
20 lines (13 loc) · 1016 Bytes

README.md

File metadata and controls

executable file
·
20 lines (13 loc) · 1016 Bytes

MuGeRe

MUsic GEnre REcognition

alt textalt textalt textalt textalt text

This is an attempt to perform music genre recognition directly from audio with deep convolutional neural networks. The basic process goes from generating spectrograms using "SoX" into png files. PNG files are chunked and each chunk represents a training instance for the learning algorithm. Every chunk is then fed to the CNN which extracts salient features to be passed to fully connected layers in order to perform categorical prediction.

Requirements

  • Tensorflow
  • Keras
  • Pillow
  • Pydub
  • sox
  • ffmpeg