====== Open datasets on the web ======

At SIPS 2019, people came together to aggregate it all; see [[https://docs.google.com/spreadsheets/d/1ejOJTNTL5ApCuGTUciV0REEEAqvhI2Rd2FCoj7afops/edit#gid=0|this spreadsheet]].

There are lots of interesting open datasets across the net, and with some creativity you can find great uses for them in your research. Check out the following posts I previously mentioned on the blog:

  * [[http://mgto.org/data-collection-survey-websites-social-web/|Survey Websites]]
  * [[http://mgto.org/social-networks-suggested-readings-twitter-dataset/|Twitter Datasets]] and [[https://blog.twitter.com/2014/building-a-complete-tweet-index|Twitter dataset from Twitter]]
  * [[https://docs.google.com/spreadsheets/d/1ejOJTNTL5ApCuGTUciV0REEEAqvhI2Rd2FCoj7afops/edit#gid=0|Google doc with psychology datasets]]

===== Datasets I repeatedly use =====

  * [[http://www.thearda.com/|Association for Religion Data Archives]]
  * [[http://www.worldvaluessurvey.org/|World Values Survey]] with [[http://www.europeanvaluesstudy.eu/evs/surveys/longitudinal-file-1981-2008/integratedvaluessurveys/|European Values Survey]]
  * [[http://ess.nsd.uib.no/|European Social Survey]]
  * [[http://www.gesis.org/en/issp/overview/|ISSP]]
  * [[http://www.norc.org/Research/Projects/Pages/general-social-survey.aspx|General Social Survey (GSS)]]

This looks great:

  * [[http://kenan.ethics.duke.edu/attitudes/resources/measuring-morality/|Measuring Morality]]

===== Promising datasets =====

(at least for my own research):

  * [[http://www.pewglobal.org/category/datasets/|Pew Research Center’s Global Attitudes Project]] or [[http://www.pewforum.org/datasets/|2]]
  * [[http://www.psych.auckland.ac.nz/uoa/home/about/our-research/research-groups/new-zealand-attitudes-and-values-study/nzavs-information-for-researchers|NZAVS]] (requires collaboration and special access)
  * [[http://www.gesis.org/en/eurobarometer/home/|European Commission's Eurobarometer]]
  * [[http://zacat.gesis.org/webview/index.jsp|ZACAT - GESIS Online Study Catalogue]]
  * [[http://www.bls.gov/nls/nlsy97.htm|National Longitudinal Surveys]]
  * [[http://www.hansard-corpus.org/|British Parliament (Hansard) 1803-2005]] (7.6 million speeches, 1.6 billion words)
  * [[http://rubinet.ece.ucdavis.edu/data.html|RUBINET Data Sets Online Social Networks: Anonymized Data from Third-Party Facebook Applications]]
  * [[http://cssr.surveybank.aau.dk/webview/|CSSR Open Access Databank]] - [[http://www.sfi.dk/surveys-7753.aspx|Danish datasets]] ([[https://www.researchgate.net/profile/Hans_Sievertsen/publication/294896070_Cognitive_fatigue_influences_students%27_performance_on_standardized_tests/links/56c8888408ae5488f0d6efa9.pdf|used here]])
  * [[http://www.electionstudies.org/studypages/download/datacenter_all_NoData.php|Data Center ANES time series]] (national election studies)
  * [[http://studentlife.cs.dartmouth.edu/|StudentLife Study]] - open dataset about health behaviors in students
  * [[http://www.share-project.org/data-access-documentation.html|The Survey of Health, Ageing and Retirement in Europe (SHARE)]]
  * [[https://discover.ukdataservice.ac.uk/series/?sn=2000053|Understanding Society UK]] ([[https://www.understandingsociety.ac.uk/2016/11/24/researchers|explained here]])
  * [[https://international.ipums.org/international/|IPUMS-International]] - census data aggregated from around the world
  * [[https://www.v-dem.net/en/|Varieties of Democracy]] - aims to produce better indicators of democracy

===== Others =====

  * [[https://opendatainception.io/|Open Data Inception map]]
  * [[https://docs.google.com/spreadsheets/d/1ISYoRpx6A098m582lS4XDS2P9fdfKdAfA4ILjW44byY/edit#gid=0|Large datasets for social science]]
  * [[https://ourworldindata.org/corruption/#correlates-determinants-and-consequences|List of corruption indices]]
  * [[http://www.cdc.gov/rdc/|NCHS Data]] / [[http://wwwn.cdc.gov/nchs/nhanes/search/nnyfs12.aspx|NNYFS 2012]] - not fully open (requires review), but has some promising variables
  * [[https://osf.io/52qxl/|Project Implicit Demo Website Datasets]]
  * [[http://www.ciser.cornell.edu/ASPs/datasource.asp|Internet Data Sources for Social Scientists]]
  * [[http://www.cpanda.org/stage/studies/c00016|Survey of Public Participation in the Arts]]
  * [[http://www.lissdata.nl/lissdata/Access_Data/Rules_and_Conditions|LISS]]
  * [[http://www.datawrangling.com/some-datasets-available-on-the-web|A long long list of open datasets on the web]]
  * [[http://delicious.com/pskomoroch/redistributable+dataset|Another long long list]]
  * [[http://www.quora.com/Data/Where-can-I-get-large-datasets-open-to-the-public?redirected_qid=195775|Quora list]]
  * [[http://www.reddit.com/r/datasets/|Reddit datasets chitchat]]
  * [[http://datamob.org/datasets|DataMob]]
  * [[http://www.esds.ac.uk/|Economic and Social Data Service]]
  * [[http://books.google.com/ngrams/datasets|Google Books Ngram Viewer]] (see [[http://www.sciencemag.org/content/331/6014/176|this Science article]] for more details)
  * [[http://data.un.org/|UN data]]
  * [[http://www.economicsnetwork.ac.uk/links/data_free.htm|Economic Data freely available online]]
  * [[http://aws.amazon.com/datasets?_encoding=UTF8&jiveRedirect=1|Amazon Public Data Sets]]
  * [[http://netsg.cs.sfu.ca/youtubedata/|Dataset for "Statistics and Social Network of YouTube Videos"]]
  * [[http://www.infochimps.com/collections/twitter-census|Twitter Census]]
  * [[https://www.cdproject.net/en-US/Results/Pages/responses.aspx|Carbon Disclosure Project]]
  * [[http://www.edrm.net/resources/data-sets/edrm-enron-email-data-set-v2|EDRM Enron Email Data Set v2]]
  * [[http://www.thefacebookproject.com/resource/datasets.html|The Facebook Project]]
  * [[http://thedata.org/home|Dataverse Network]]
  * [[http://www.tagora-project.eu/data/|Tagora social-web datasets]]
  * [[http://musicbrainz.org/doc/MusicBrainz_Database|MusicBrainz Database]]
  * [[http://personality-testing.info/_rawdata/|Raw data from online personality tests]]
  * [[http://www.timeuse.org/information/access-data|Access Time Use Data]]
  * [[http://www.dhcs.ca.gov/dataandstats/Pages/CWHS.aspx|California Women's Health Survey]]
  * [[http://www.himalayandatabase.com/|The Himalayan Database: The Expedition Archives of Elizabeth Hawley]] (costs about 60 USD on Amazon; [[https://www8.gsb.columbia.edu/cbs-directory/sites/cbs-directory/files/publications/Anicich%2C%20Swaab%2C%20%26%20Galinsky%20%282015%2C%20PNAS%29.pdf|used in this paper]])
  * [[https://www.gov.uk/guidance/national-pupil-database-apply-for-a-data-extract|National pupil database]]
  * [[https://www.neps-data.de/en-us/home.aspx|National Educational Panel Study]]

datasets.txt Last modified: 2019/07/11 09:41 by filination