One more dataset for music information retrieval (MIR) and music recommendation, compiled by Oscar Celma and built on Last.fm data and APIs.
Some more detailed info is available here.
Last.fm also recently released an audio fingerprinting API. More about this here.
So now it’s really simple to integrate audio fingerprinting into open-source apps. Looking forward to trying it out soon.