A website called BigML (for Big Machine Learning) has compiled a great list of freely available public data sources. The article begins: “We love data, big and small and we are always on the lookout for interesting datasets. Over the last two years, the BigML team has compiled a long list of sources of data that anyone can use. It’s a great list for browsing, importing into our platform, creating new models and just exploring what can be done with different sets of data. In this post, we are sharing this list with you. Why? Well, searching for great datasets can be a time consuming task. We hope this list will support you in that search and help you to find some inspiring datasets. “
The introduction continues, “Some data sources are great for complementing your own data. Others are interesting or just fun to play with. If you have your own list of favorite data sources and want to share them, feel free to let us know and we’ll update our list. We grouped the links into some categories that bit.ly calls ‘Bundles’ to help you find what you are looking for and bundled the Bundles into a single Data Sources Bundle. Here is a short discussion of the categories, with some examples.”
Image: Courtesy BigML
- Announcing the Winner of the Semantic Web.Com "Spotlight On Library Innovation"
- Siri Alternative Maluuba Offers Sports Results, TV Schedules
- Yahoo's Acquisition of Tumblr and the Semantic Web
- GraphLab Raises $6.75M to Build 'Hadoop for Graphs'