Ask DT: Do you have any datasets to share?
8 points by mrborgen86 3052 days ago | 5 comments
I'm building an open-source platform for sharing machine learning datasets. I'd like to encourage more people and businesses to share their data, and to make it easier for data scientists and hobbyists to discover new datasets.

Which leads to my question:

Do you have any datasets you'd be willing to share? If so, I would be very happy to feature them on the front page of Datasets.co.

Site: http://www.datasets.co/

Github repo: https://github.com/perborgen/data_hub

All other feedback is highly appreciated :)



2 points by jamesledoux 3049 days ago | link

https://github.com/jldbc/gunsandcrime

This is all publicly available, but I compiled data on crime rates, gun ownership (through proxies: percentage of suicides by gun, survey data, US gun production), and some general population data from the census for an econometrics project. Most of the data covers 1980-2013.

-----

2 points by mrborgen86 3049 days ago | link

Added: http://www.datasets.co/dataset/5683bf93ccdd7b0300ac4d28

-----

1 point by mrborgen86 3049 days ago | link

That's a great dataset, I'll go ahead and add it. Thanks!

-----

2 points by cdipaolo 3051 days ago | link

http://github.com/cdipaolo/hub-db

PornHub images and albums dataset with comments, ratings, tags, image links, etc., including around a quarter million images.

Also includes some minor preprocessing and example MapReduce jobs done with MRJob.

-----

1 point by mrborgen86 3051 days ago | link

Awesome, thanks for sharing!

I just added the dataset here: http://www.datasets.co/dataset/5680faae695fa103008ea233

It's on the first page now, so hopefully it'll give some traffic to your repo.

Let me know if you have any comments on how it's presented and I'll fix it :)

-----



