This new guide is a comprehensive resource for researchers who need reliable, accurate, publicly available statistics and Big Data sets on diverse and timely subjects. The resources listed here are developed and maintained by a range of organizations, including academic and scholarly sources, the federal government, the corporate and business sectors, open source contributors, advocacy groups, NGOs, and IGOs.
Statistics Resources and Big Data 2018
233,644 datasets from the Federal Government
http://catalog.data.gov/dataset
2010 Census
http://www.census.gov/2010census/
2018 Directory of Directories
http://www.2018DirectoryOfDirectories.com/
2018 New Economy Resources
http://www.2018NewEconomy.com/
Academic Torrents
http://academictorrents.com/
Adherents.com: Religion Statistics Geography, Church Statistics
http://www.adherents.com/
African Development Bank Group (AfDB) – Statistics
http://www.afdb.org/en/knowledge/statistics/
American Customer Satisfaction Index
http://www.theacsi.org/
American Demographics
http://adage.com/section/american-demographics/195
American FactFinder
http://factfinder2.census.gov/
Annals of Applied Statistics (AOAS)
http://projecteuclid.org/DPubS?service=UI&version=1.0&verb=Display&handle=euclid.aoas
arXiv.org – Open Access e-Prints
https://arxiv.org/
Asian Development Bank (ADB) – Economics and Statistics
http://adb.org/data/main
Asset Macro
https://www.assetmacro.com/
Astrostatistics and Astroinformatics Portal (ASAIP)
http://asaip.psu.edu/
AStA Advances in Statistical Analysis
http://www.springer.com/statistics/journal/10182
Australian Bureau of Statistics
http://www.abs.gov.au/
Avention – Formerly OneSource
http://www.hoovers.com/
Awesome Project
https://awesome.re/
Awesome Public Datasets
https://github.com/caesar0301/awesome-public-datasets
bigdata@csail
http://bigdata.csail.mit.edu/
Big Data
https://www.gartner.com/it-glossary/big-data
Big Data
https://www.edx.org/micromasters/big-data
Big Data, Present and Future – Infographic
https://bbvaopen4u.com/en/actualidad/infographic-big-data-present-and-future
Big Data Solutions
https://www.oracle.com/big-data/index.html
Big Data Specialization
https://www.coursera.org/specializations/big-data
Big Data Tutorial – Everything You Need To Know
http://searchstorage.techtarget.com/guides/Big-data-tutorial-Everything-you-need-to-know
Big Data University
http://www.BigDataUniversity.com/
Big Data – What It Is and Why It Matters
https://www.sas.com/en_us/insights/big-data/what-is-big-data.html
Big Data – Wikipedia
http://en.wikipedia.org/wiki/Big_data
BigML – Thousands of Public Data Sources
http://blog.bigml.com/2013/02/28/data-data-data-thousands-of-public-data-sources/
BizStats – Free Business Statistics and Financial Ratios
http://www.bizstats.com/
Blockchain
https://www.Blockchain.com/
https://en.wikipedia.org/wiki/Blockchain
Bureau of Economic Analysis
http://bea.gov/
Bureau of Justice Statistics (BJS)
http://www.bjs.gov/
Bureau of Labor Statistics (BLS)
http://stats.bls.gov/
Bureau of Transportation Statistics (BTS) and Research and Innovative Technology Administration (RITA)
http://www.rita.dot.gov/bts/
Buy the Dataset You Need From an Open Marketplace
https://datafloq.com/
CDC: 500 Cities Project
https://www.cdc.gov/500Cities
Census Data Mapper
http://www.census.gov/geo/maps-data/maps/datamapper.html
Census Online
http://www.census-online.com/links/
Center for Applied Internet Data Analysis
http://www.caida.org/
CEPALSTAT – Latin America and the Caribbean Databases and Statistical Publications
http://estadisticas.cepal.org/cepalstat/WEB_CEPALSTAT/Portada.asp?idioma=i
CHANCE Magazine
http://chance.amstat.org/
ChartsBin – Web Based Visualization Tool
http://chartsbin.com/
ChildStats.gov
http://www.ChildStats.gov/
China Statistical Abstract 2015
http://www.purpleculture.net/china-statistical-abstract-2015-p-21943/
CIA Publications
https://www.cia.gov/library/publications/index.html
citeulike – Managing and Discovering Scholarly References
http://www.citeulike.org/
City-Data.com – Comprehensive Stats on U.S. Cities
http://www.city-data.com/
City Population
http://www.citypopulation.de/
CKAN – Open Source Data Portal Software
http://ckan.org/
ClearStory Data – Now You Can See It
http://clearstorydata.com/
Common Crawl – Open Repository of Web Crawl Data Composed Of Over 5 Billion Freely Available Web Pages
http://www.CommonCrawl.org/
Communications in Biometry and Crop Science (CBCS)
http://agrobiol.sggw.waw.pl/cbcs/
Computational Statistics
http://www.springer.com/statistics/journal/180
CORE Data Dumps
https://core.ac.uk/intro/data_dumps
Council on East Asian Library (CEAL) Statistics
http://www.lib.ku.edu/ceal/
Data & Society
https://datasociety.net/
Data Blog – Facts Are Sacred
http://www.theguardian.com/news/datablog/interactive/2013/jan/14/all-our-datasets-index
DataCite
https://www.datacite.org/
DataCircle – Buy, Sell or Exchange Data Sets Easily
https://www.datacircle.io/
DataFerrett – Data Mining Tool
http://dataferrett.census.gov/
Data.gov APIs
http://www.data.gov/developers/apis
Data: Government, State, City, Local and Public
https://www.kdnuggets.com/datasets/government-local-public.html
DataHub – The Easy Way To Get, Use and Share Data
https://datahub.io/
Data in Gapminder World
https://www.gapminder.org/data/
DataMarket – Find, Understand and Share Data
https://www.qlik.com/us/products/qlik-data-market
DataMelt – Computation and Visualization Environment
http://jwork.org/dmelt/
Data Mining Resources 2018
http://www.DataMiningResources.info/
Data Portal – The Open Data Hub of the European Union
http://open-data.europa.eu/en/data
DataRobot – Build Better Prediction Models
http://www.datarobot.com/
Dataset of schools in the USA
https://www.quora.com/Is-there-a-dataset-of-all-the-elementary-middle-and-high-schools-in-the-United-States
Datasets for Data Mining and Data Science
https://www.kdnuggets.com/datasets/index.html
Datasets from MSTE (Mathematics, Science, and Technology Education), College of Education, University of Illinois
http://mste.illinois.edu/malcz/DATA/ARCHIVE.html
DATAVERSITY – Resources for IT Professionals
http://www.dataversity.net/
data.world – Social Network for Data People
https://data.world/
dat – Share and Sync Data Instantly
http://dat-data.com/
DBpedia – Crowd-Sourced Community Effort To Extract Structured Information from Wikipedia
http://wiki.dbpedia.org/
Deep Web and Big Data Research 2018
http://www.DeepWeb.us/
DocumentCloud – Analyze, Annotate, Publish by Turning Documents Into Data
https://www.documentcloud.org/
Dryad Digital Repository
http://datadryad.org/
DSC Data Science Search Engine
http://www.datasciencecentral.com/page/search
Earth Observing System Data and Information System (EOSDIS)
https://earthdata.nasa.gov/
E-commerce Big Dataset Covering SKU Price and Availability Status, Queryable Through an API
https://semantics3.com/
Economagic.com – Economic Time Series
http://www.economagic.com/
Economic Census
http://www.census.gov/econ/
EconomicIndicators.gov
http://www.esa.gov/about-economic-indicators
Economic Briefing Room
http://www.census.gov/cgi-bin/briefroom/BriefRm
Education datasets from the Department of Education
https://www.data.gov/education/
Education Data Community
http://www.data.gov/education/community/education
Energy Information Administration (EIA) – Statistical Agency of the U.S. Department of Energy
http://www.eia.gov/
Enigma Public – World’s Broadest Collection of Public Data
https://public.enigma.com/
e-Science Central – Cloud Based Platform for Data Analysis
http://www.esciencecentral.co.uk/
EU Open Data Portal
https://data.europa.eu/euodp/en/home
Eurostat – European Statistics
http://epp.eurostat.ec.europa.eu/
EveryCloud – Spam Filtering and Email Archiving
http://www.everycloudtech.com/
Extract Big Value From Big Data
http://events.pentaho.com/paths-to-big-data-registration.html
FactFinder
http://factfinder2.census.gov/
Federal R&D Facilities for Entrepreneurs and Innovators
http://www.data.gov/research/
Federal Reserve Economic Data (FRED)
http://research.stlouisfed.org/fred2/
FedStats
https://fedstats.sites.usa.gov/
Finding and Using Health Statistics
http://www.nlm.nih.gov/nichsr/usestats/index.htm
FlowingData
http://flowingdata.com/
FRASER – Federal Reserve Archive – Discover Economic History
http://fraser.stlouisfed.org/
Free GIS Data
http://freegisdata.rtwilson.com/
Gapminder – FactTank
http://www.gapminder.org/
GenBank ®
https://www.ncbi.nlm.nih.gov/genbank/
Gephi – The Open Graph Viz Platform
https://gephi.org/
GitHub Data
https://cloud.google.com/bigquery/public-data/github
Global Open Data Index
http://index.okfn.org
Google BigQuery
https://cloud.google.com/products/big-query
Grafana – Beautiful Metric and Analytic Dashboards
http://grafana.org/
Graphite – Highly Scalable Real-Time Graphing System
http://graphite.readthedocs.org/
Guide To World Population by Richard Jensen [May 2007]
http://tigger.uic.edu/~rjensen/populate.htm
Hashgraph
http://www.Hashgraph.com/
http://www.Hashgraph.org/
Healthcare Data from the Federal Government
http://www.healthdata.gov/
Household electric power consumption big dataset
http://archive.ics.uci.edu/ml/datasets/Individual+household+electric+power+consumption
How Much Information? 2003
http://www.sims.berkeley.edu/research/projects/how-much-info-2003/
Human Development Reports
http://hdr.undp.org/
HyperStat Online: An Introduction to Statistics
http://davidmlane.com/hyperstat/index.html
IMF Data Sets – International Economics Data and Statistics
http://www.imf.org/external/data.htm
Index Mundi – Global Data Portal
http://www.indexmundi.com/
Indiegogo Datasets
https://webrobots.io/indiegogo-dataset/
indix – Everything About Products
http://www.indix.com/
Industry Research from the University of Tennessee
http://libguides.utk.edu/content.php?pid=85554&sid=636582
Industry Research – University of Pittsburgh
http://www.library.pitt.edu/industry-research
InfoChimps.org – Free Redistributable Rich Data Sets
http://www.infochimps.com/
Infogram – Create Engaging Infographics and Reports in Minutes
https://infogram.com/
International Business – Information on the Business Conditions, Culture, and Economy of Different Countries
http://libguides.stthomas.edu/content.php?pid=119649&sid=1030547
International Economic Statistics (IES) Database
http://research.stlouisfed.org/fred2/categories/32263
International Human Development Indicators – Public Data Explorer
http://hdr.undp.org/en/data/explorer/
International Journal of Quality, Statistics, and Reliability
http://www.hindawi.com/journals/jqre/
International Monetary Fund (IMF) – Data and Statistics
http://www.imf.org/external/data.htm
International Trade Statistics
http://www.census.gov/foreign-trade/index.html
Internet 2010 Statistics
http://royal.pingdom.com/2011/01/12/internet-2010-in-numbers
Internet Demographics 2018
http://www.InternetDemographics.info/
Internet Monitor – Analyzing Online Content Controls and Activity
https://thenetmonitor.org/
Internet World Stats – Usage and Population Statistics
http://www.internetworldstats.com/
Inter-university Consortium for Political and Social Research (ICPSR)
http://www.icpsr.umich.edu/
IOGDS: International Open Government Dataset Search
https://logd.tw.rpi.edu/node/9903
IPUMS USA: Integrated Public Use Microdata Series
https://usa.ipums.org/usa/
Journal of Open Health Data
http://openhealthdata.metajnl.com/
Journal of Statistics Education
http://www.amstat.org/publications/jse/
Kaggle – Home of Data Science and Machine Learning
https://www.kaggle.com/
Kazoup – Analyze Search Archive
http://kazoup.com/
Kickstarter Datasets
https://webrobots.io/kickstarter-datasets/
KNIME
https://www.knime.com/
Knoema Knowledge Platform
http://knoema.com/
Linking Open Data Cloud Diagram (LOD)
http://lod-cloud.net/
LinkingOpenData – W3C SWEO Community Project
http://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
List of Free Statistical Software
http://l-lists.com/en/lists/dz3a5t.html
Local Area Unemployment Statistics (LAUS)
http://www.bls.gov/lau/
ManualsLib – The Ultimate Manuals Library
http://www.manualslib.com/
MAVERICK – HP/NVIDIA Interactive Visualization and Data Analytics System
https://www.tacc.utexas.edu/resources/visualization/
Measuring America: The Decennial Censuses From 1790 to 2000
http://www.census.gov/prod/2002pubs/pol02marv-pt1.pdf
Mirador – Tool for Visual Exploration of Complex Datasets
http://fathom.info/mirador/
MoData – Big Data Resources
http://www.mo-data.com/
Monarch Professional – Individual Information Optimization for Enterprise
http://www.datawatch.com/
Monthly Bulletin of Statistics Online (MBS)
http://unstats.un.org/unsd/mbs/app/DataSearchTable.aspx
Movie Rating Datasets
http://grouplens.org/datasets/movielens/
Mu Sigma – Decision Sciences and Analytics
http://www.mu-sigma.com/
National Agricultural Statistics Service
http://www.nass.usda.gov/
National Bureau of Economic Research (NBER)
http://www.nber.org/
National Center for Education Statistics (NCES)
http://nces.ed.gov/
National Center for Health Statistics
http://www.cdc.gov/nchs/
National Numeracy Network: Teaching Resources
http://serc.carleton.edu/nnn/teaching
National Statistics Online (UK)
http://www.statistics.gov.uk/
NationMaster – World Statistics and Country Comparisons
http://www.nationmaster.com/
Net Data Directory
https://netdatadirectory.org/
New Economics (econ) Archive at arXiv.org
https://arxiv.org/help/econ/announcement
Occupational Employment Statistics (OES)
http://www.bls.gov/oes/
OECD Data
https://data.oecd.org/
OECD Health Statistics 2015 – Country Notes
http://www.oecd.org/chile/oecd-health-statistics-2015-country-notes.htm
OECD Health Statistics 2017
http://www.oecd.org/els/health-systems/health-data.htm
OECD.StatExtracts – Complete Databases Available Via OECD’s iLibrary
http://stats.oecd.org/
OpenAIRE – Open Access Infrastructure for Research in Europe
http://www.openaire.eu/
Open Data Barometer
http://www.opendataresearch.org/project/2013/odb
Open Data Handbook – Guides, Case Studies and Resources for Government and Civil Society On the What, Why and How of Open Data
http://opendatahandbook.org/
Open Data Inception
http://opendatainception.io/
Open Data Institute
https://theodi.org/
Open Data Inventory (ODIN)
http://odin.opendatawatch.com/
Open Data Network
http://www.opendatanetwork.com/
Open Datasets
https://github.com/caesar0301/awesome-public-datasets
https://www.kaggle.com/datasets
https://www.data.gov/
https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
https://aws.amazon.com/public-datasets/
https://data.world/
http://data.worldbank.org/
https://public.enigma.com/
http://repository.upenn.edu/mead
http://catalog.data.gov/dataset
http://www.infochimps.com/
http://publicdata.eu/
Open Educational Resources (OER): Statistics
https://guides.ou.edu/OER/statistics
OpenGeoportal – Geospatial Data from Multiple Repositories
http://opengeoportal.org/
Open Graph Viz Platform – Exploratory Data Analysis
http://gephi.org/
Open Learning Initiative: Probability & Statistics
http://oli.cmu.edu/courses/free-open/statistics-course-details
OpenRefine – A Free Open Source Powerful Tool for Working With Messy Data
http://openrefine.org/
Open Source Data Explorer – Explore and Visualize Your Event Data
http://keen.github.io/explorer/
Oracle and Big Data
http://www.oracle.com/us/technologies/big-data/index.html
Orange – Open Source Data Visualization and Analysis for Novices and Experts
http://orange.biolab.si/
Pharma Big Data Datasets
http://www.academia.edu/2497409/When_pharmaceutical_companies_publish_large_datasets_an_abundance_of_riches_or_fools_gold
PublicData.eu – Europe’s Public Data
http://publicdata.eu/
Public Data Sets On Amazon Web Services (AWS)
http://aws.amazon.com/datasets
Quality and Comparative International Statistics
http://web.freepint.com/go/newsletter/151#feature
QueryTree – Visualize and Understand Your Data
http://querytreeapp.com/
Platfora – Clarity From Big Data
http://www.platfora.com/
Project Open Data – Open Data Policy – Managing Information As An Asset
http://project-open-data.github.io/
Publicly Available Big Datasets
http://hadoopilluminated.com/hadoop_illuminated/Public_Bigdata_Sets.html
PubMed
https://www.ncbi.nlm.nih.gov/pubmed/
Random.org – True Random Number Service
http://random.org/
re3data.org – Registry of Research Data Repositories
http://www.re3data.org/
ReDash – Make Your Company Data Driven
https://redash.io/
ReportLinker: Industry Reports, Company and Country Profiles
http://www.reportlinker.com/
Free R Programming MOOC on edX
https://www.edx.org/course/introduction-r-programming-microsoft-dat204x-0
http://blog.revolutionanalytics.com/2015/08/free-edx-course-for-r-beginners.html
Research Repository UCD
http://researchrepository.ucd.ie/
Sample datasets for practicing with the R Development System
http://vincentarelbundock.github.io/Rdatasets/datasets.html
SameAs.org – Interlinking the Web of Data
http://sameas.org/
SCaVis – Scientific Computation and Visualization Environment
http://jwork.org/scavis/
Scientific Data Repository – Real Time Visualization and Exploration Techniques
http://www.mlvis.com/platform.php
Sense – A Collaborative Cloud Platform for Data Science and Big Data Analytics
https://senseplatform.com/
Sindice – The Semantic Web Index
http://www.sindice.com/
SISA – Simple Interactive Statistical Analysis
http://www.quantitativeskills.com/sisa/
Sizzle Analytics
https://www.sizzleanalytics.com
Smithsonian/NASA Astrophysics Data System (ADS)
http://adsabs.harvard.edu/
Socialbakers – Social Statistics, Application Statistics and Page Statistics
http://www.socialbakers.com/
Social Science Data Search
http://www.lib.berkeley.edu/wikis/datalab/index.php?n=Main.GoogleSearch
Social Statistics 2.0 – Open Database of Statistics
http://www.postyour.info/
SourceForge.net Research Data
http://sourceforge.net/
SORT (Statistics and Operations Research Transactions)
http://www.idescat.cat/sort/
Sqrrl Security – Cell-Level Security for Big Data
http://www.sqrrl.com/
StatCrunch – Data Analysis On the Web
http://www.statcrunch.com/
Statista – A Leading Statistics Portal
http://www.statista.com/
Statistical Analysis and Data Mining
http://onlinelibrary.wiley.com/journal/10.1002/%28ISSN%291932-1872
Statistical Data Mining Tutorials – Tutorial Slides by Andrew Moore
http://www.autonlab.org/tutorials/index.html
Statistical Education Through Problem Solving
http://www.stats.gla.ac.uk/steps/
Statistical Resources Online
http://jolis.worldbankimflib.org/Estats/stat245.htm
Statistical Sites on the World Wide Web
http://www.bls.gov/bls/other.htm
Statistics – Wikipedia
http://en.wikipedia.org/wiki/Statistics
Statistics.com – Research Statistics and Statistical Analysis Directory
http://www.statistics.com/
Statistics and Probability
http://stattrek.com/
Statistics Canada
http://www.statcan.gc.ca/start-debut-eng.html
Statistics Every Writer Should Know
http://nilesonline.com/stats/
Statistics Online Compute Resources (SOCR)
http://socr.stat.ucla.edu/
Statistics on the Web
http://www.claviusweb.net/statistics.shtml
Statistics Resources and Big Data 2018
http://www.StatisticsResources.info/
Statistics Sources
http://www.rba.co.uk/sources/stats.htm
Statwing – Turn Data Into Insight In Seconds
https://www.statwing.com/
tamr – Leverage All Data
http://www.tamr.com/
Tanagra Project – Free Data Mining Software for Academic and Research Purposes
http://eric.univ-lyon2.fr/~ricco/tanagra/en/tanagra.html
The Big Data Hub – Understanding Big Data for the Enterprise
http://www.ibmbigdatahub.com/
The Dataverse Project
https://dataverse.org/
The DBpedia Data Set (3.9)
http://wiki.dbpedia.org/Datasets
The Dryad Digital Repository
http://datadryad.org/
The Internet Glossary of Statistical Terms
http://www.animatedsoftware.com/statglos/statglos.htm
The Magazine of Early American Datasets
http://repository.upenn.edu/mead
The Manifesto for Data Practices
https://datapractices.org/manifesto/
The Open Data Institute
http://theodi.org/
The Open Knowledge Foundation – Empowering Through Open Knowledge
http://okfn.org/
The R Project for Statistical Computing
http://www.r-project.org/
The Statistics Home Page
http://www.statsoft.com/
The World Bank – Data
http://data.worldbank.org/
The World Bank Data Catalog
http://datacatalog.worldbank.org/
The World of Statistics
http://www.worldofstatistics.org/
Trifacta – Data Wrangling
https://www.trifacta.com/
Truthy – Information Diffusion Research
http://truthy.indiana.edu/
UC Irvine Machine Learning Repository
https://archive.ics.uci.edu/ml/index.php
UK National Statistics Online
http://www.statistics.gov.uk/
UNdata – Data Access System to UN Databases (34 Databases – 60 Million Records)
http://data.un.org/
UNESCO Institute for Statistics
http://www.uis.unesco.org/
United Kingdom National Accounts, The Blue Book, 2014 Edition
http://www.ons.gov.uk/ons/rel/naa1-rd/united-kingdom-national-accounts/the-blue-book--2014-edition/index.html
United Nations Statistics Division
http://unstats.un.org/unsd/
United States Census Bureau
http://www.census.gov/
United States Census Bureau Research
https://www.census.gov/research/
U.S. and World Population Clocks
http://www.census.gov/popclock/
USA.gov – Data and Statistics
http://www.usa.gov/Topics/Reference-Shelf/Data.shtm
USA Trade Online – The Official Source of Trade Statistics
https://usatrade.census.gov/
U.S. Business and Economy-Wide Statistics
http://www.census.gov/econ/economywide.html
USDA Economics, Statistics, and Market Information System
http://usda.mannlib.cornell.edu/
US Government Web Services and XML Data Sources
http://usgovxml.com/
USITC Interactive Tariff and Trade DataWeb
http://dataweb.usitc.gov/
Visualization of Large Spatiotemporal Datasets
http://www.nanocubes.net/
Visualizing.org – Making Sense of Complex Issues Through Data and Design
http://www.visualizing.org/
Vital Statistics of the United States (VSUS)
http://www.cdc.gov/nchs/products/vsus.htm
VIZE – Experiment With Data On the Fly
http://www.vize.io/
WebCASPAR – Integrated Science and Engineering Resources
https://webcaspar.nsf.gov/
Web and Blog Datasets
http://snap.stanford.edu/data/other.html
Web Interface for Statistics Education (WISE)
http://wise.cgu.edu/
WebSM – Web Survey Methodology Portal
http://www.websm.org/
Weka 3: Data Mining Software in Java
http://www.cs.waikato.ac.nz/~ml/weka/
What Is Big Data?
http://www-01.ibm.com/software/data/bigdata/
WHO: World Health Statistics
http://www.who.int/gho/publications/world_health_statistics/en/
Wikidata – Free KnowledgeBase With 15,819,145 Editable Data Items
http://www.wikidata.org/wiki/Wikidata:Main_Page
WisStat – Applied Population Laboratory
http://www.getfacts.wisc.edu/
Wolfram Data Repository
https://datarepository.wolframcloud.com/
World Bank Open Data
http://data.worldbank.org/
World dataBank – World Development Indicators (WDI) and Global Development Finance (GDF)
http://databank.worldbank.org/data/
Worldometers – World Statistics Updated In Real Time
http://www.worldometers.info/
World Statistics Pocketbook
http://unstats.un.org/unsd/pocketbook/
WTO Statistics Database
http://stat.wto.org/
WWW Virtual Library: Statistics
http://www.stat.ufl.edu/vlib/statistics.html
YourEconomy.org (YE)
http://youreconomy.org/
Zanran – Search the Web For Data and Statistics
http://zanran.com/