Home > Datasets, MIR, Research > Million Song Dataset | scaling MIR research

Million Song Dataset | scaling MIR research

An impressive feature data set extracted from music audio files by LabRosa using the Echonest API:

Million Song Dataset | scaling MIR research.

However, the feature set is (obviously) fixed and you have no access to the audio content of each music piece in the dataset (and there are some understandable reasons for that – check the FAQ). Nevertheless, a lot can already be done using this data (mainly for the machine learning, data mining, Information Retrieval folks), and this effort is a great contribution for the development of more advanced music recommendation systems.

Personally, I’m still very much into audio signal processing (mainly related to sound segregation, where I’m still trying to explore the basics of machine listening), so for now this dataset is not that useful to me…

Congratulations to LabRosa and Echonest for the effort and for making this public and available to the R&D community!

Categories: Datasets, MIR, Research Tags: , , ,
  1. No comments yet.
  1. No trackbacks yet.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s

%d bloggers like this: