This is an expansive listing that focuses on statistics and big data datasets available free on the internet, covering multiple disciplines, for teaching, learning and reference. These data are published and maintained by sources that include: the U.S. and foreign governments, academic, corporate, NGOs, and open source communities.
Statistics Resources and Big Data 2020
25 Excellent Machine Learning Open Datasets
https://opendatascience.com/25-excellent-machine-learning-open-datasets/
254,868 Datasets from the Federal Government
https://catalog.data.gov/dataset
2010 Census
https://www.census.gov/programs-surveys/decennial-census/decade.2010.html
2020 Directory of Directories
https://www.2020DirectoryOfDirectories.com/
2020 Guide to Finding Experts By Using the Internet
http://www.FindingExperts.info/
2020 Guide to Privacy Resources and Tools
https://www.StealthMode.info/
2020 Guide to Searching the Internet
https://www.SearchingTheInternet.info/
2020 New Economy Resources
https://www.2020NewEconomy.com/
Academic Torrents – Making 57.45TM of Research Data Available
https://academictorrents.com/
Adherents.com: Religion Statistics Geography, Church Statistics
http://www.adherents.com/
African Development Bank Group (AfDB) – Statistics
https://www.afdb.org/en/knowledge/statistics/
American Customer Satisfaction Index (ACSI)
https://www.theacsi.org
American Demographics – AdAge
https://adage.com/section/american-demographics/195
Anasen – Agile Data Analysis with Zero Training
https://www.anasen.com/
Annals of Applied Statistics (AOAS)
https://projecteuclid.org/DPubS?service=UI&version=1.0&verb=Display&handle=euclid.aoas
arXiv.org – Open Access e-Prints
https://arxiv.org/
Asian Development Bank (ADB) – Economics and Statistics
http://adb.org/data/main
Asset Macro
https://www.assetmacro.com/
Astrostatistics and Astroinformatics Portal (ASAIP)
https://asaip.psu.edu/
AStA Advances in Statistical Analysis
https://www.springer.com/statistics/journal/10182
Australian Bureau of Statistics
https://www.abs.gov.au/
Awesome Project – Awesome Lists About All Kinds of Interesting Topics
https://awesome.re/
Awesome Public Datasets
https://github.com/caesar0301/awesome-public-datasets
bigdata@csail – Research Initiatives on Timely Topics
https://bigdata.csail.mit.edu/
Big Data
https://www.gartner.com/it-glossary/big-data
Big Data
https://www.edx.org/micromasters/big-data
Big Data & Society
https://journals.sagepub.com/home/bds
Big Data, Present and Future – Infographic
https://bbvaopen4u.com/en/actualidad/infographic-big-data-present-and-future
Big Data Solutions and Machine Learning in the Cloud
https://www.oracle.com/big-data/index.html
Big Data Specialization
https://www.coursera.org/specializations/big-data
Big Data Tutorial – Everything You Need To Know
http://searchstorage.techtarget.com/guides/Big-data-tutorial-Everything-you-need-to-know
Big Data – What It Is and Why It Matters
https://www.sas.com/en_us/insights/big-data/what-is-big-data.html
Big Data – Wikipedia
http://en.wikipedia.org/wiki/Big_data
BigMl – Thousands of Public Data Sources
http://blog.bigml.com/2013/02/28/data-data-data-thousands-of-public-data-sources/
Blockchain
https://www.Blockchain.com/
https://en.wikipedia.org/wiki/Blockchain
Bureau of Economic Analysis
https://bea.gov
Bureau of Justice Statistics (BJS)
https://www.bjs.gov/
Bureau of Labor Statistics (BLS)
https://stats.bls.gov/
Bureau of Transportation Statistics (BTS) and Research and Innovative Technology Administration (RITA)
https://www.bts.gov/
Business Formation Statistics (BFS)
https://www.census.gov/programs-surveys/bfs.html
Buy the Dataset You Need From an Open Marketplace
https://datafloq.com/
CDC: 500 Cities Project
https://www.cdc.gov/500Cities
Census Data A – Z Index
https://www.census.gov/about/index.html
Census Online
https://www.census-online.com/links/
CEPALSTAT – Latin America and the Caribbean Databases and Statistical Publications
https://estadisticas.cepal.org/cepalstat/WEB_CEPALSTAT/Portada.asp?idioma=i
ChartsBin – Web Based Visualization Tool
https://chartsbin.com/
ChildStats.gov
https://www.ChildStats.gov/
China Statistical Abstract 2015
https://www.purpleculture.net/china-statistical-abstract-2015-p-21943/
CIA Publications
https://www.cia.gov/library/publications/index.html
City-Data.com – Comprehensive Stats on U.S. Cities
https://www.city-data.com/
City Population
https://www.citypopulation.de/
CKAN – Open Source Data Portal Software
https://ckan.org/
Code.org: CS Principles Unit 4 – Big Data and Privacy
https://curriculum.code.org/csp-19/unit4/
Common Crawl – Open Repository of Web Crawl Data Composed Of Over 5 Billion Freely Available Web Pages
https://www.CommonCrawl.org/
Communications in Biometry and Crop Science (CBCS)
https://agrobiol.sggw.waw.pl/cbcs/
Computational Statistics
https://www.springer.com/statistics/journal/180
Council on East Asian Library (CEAL) Statistics
https://ceal.ku.edu/table/basic
Data & Society
https://datasociety.net/
Data Blog – Facts Are Sacred
https://www.theguardian.com/news/datablog/interactive/2013/jan/14/all-our-datasets-index
Data.census.gov Resources – New Platform to Access Data From the U.S. Census
https://www.census.gov/data/adrm/what-is-data-census-gov.html
DataCite
https://www.datacite.org/
Data.gov APIs
https://www.data.gov/developers/apis
Data: Government, State, City, Local and Public
https://www.kdnuggets.com/datasets/government-local-public.html
DataHub – The Easy Way To Get, Use and Share Data
https://datahub.io/
Data in Gapminder World
https://www.gapminder.org/data/
DataMarket – Find, Understand and Share Data
https://www.qlik.com/us/products/qlik-data-market
DataMelt – Computation and Visualization Environment
https://jwork.org/dmelt/
Data Mining Resources 2020
https://www.DataMiningResources.info/
Data Portal – The Open Data Hub of the European Union
https://open-data.europa.eu/en/data
DataRobot – Build Better Predictions Models
https://www.datarobot.com/
Data Science and Cognitive Computing Free Courses
https://cognitiveclass.ai/
Dataset of schools in the USA
https://www.quora.com/Is-there-a-dataset-of-all-the-elementary-middle-and-high-schools-in-the-United-States
Datasets for Data Mining and Data Science
https://www.kdnuggets.com/datasets/index.html
Datasets from MSTE (Mathematics, Science, and Technology Education) College University of Illinois
https://mste.illinois.edu/malcz/DATA/ARCHIVE.html
Data USA – Explore, Map, Compare and Download U.S. Data
https://datausa.io/
DATAVERSITY – Resources for IT Professionals
http://www.dataversity.net/
data.world – The Cloud-Native Data Catalog
https://data.world/
dat Foundation – Supporting the Adoption and Development of The Dat Protocol
https://dat.foundation/
DBpedia – Crowd-Sourced Community Effort To Extract Structured Information from Wikipedia
https://wiki.dbpedia.org/
DECS – The All-In-One Workspace To Manage Code Snippets and Protect Sensitive Data
https://app.decs.xyz/
Deep Web and Big Data Research 2020
https://www.DeepWeb.us/
Digital Operating Systems Tools and Resources 2020
https://www.DigitalOperatingSystems.com/
DocumentCloud – Analyze, Annotate, Publish by Turning Documents Into Data
https://www.documentcloud.org/
Doing Business 2020 – Measuring Business Regulations
https://www.doingbusiness.org/en/reports/global-reports/doing-business-2020
Dryad Digital Repository
https://datadryad.org/
DSC Data Science Search Engine
https://www.datasciencecentral.com/page/search
Earth Observing System Data and Information System (EOSDIS)
https://earthdata.nasa.gov/
Economagic.com – Economic Time Series
https://www.economagic.com/
Economic Census
https://www.census.gov/econ/
Education Data Community
https://www.data.gov/education/community/education
Energy Information Administration (EIA)- Statistical Agency of the U.S. Department of Energy
https://www.eia.gov/
Enigma Public – World’s Broadest Collection of Public Data
https://public.enigma.com/
https://aws.amazon.com/marketplace/search/results?x=0&y=0&searchTerms=enigma
Enzypt – A Web3-Enabled Website to Buy and Sell Files Through Ethereum and IPFS
https://enzypt.io/
e-Science Central – Cloud Based Platform for Data Analysis
https://www.esciencecentral.co.uk/
EU Open Data Portal
https://data.europa.eu/euodp/en/home
European Data Portal
https://www.europeandataportal.eu/en/homepage
Eurostat – Your Key to European Statistics
https://ec.europa.eu/eurostat/data/database
EveryCloud – Spam Filtering and Email Archiving
https://www.everycloud.com/
Extract Big Value From Big Data
https://www.hitachivantara.com/en-us/home.html
Federal R&D Facilities for Entrepreneurs and Innovators
https://www.data.gov/research/
Federal Reserve Economic Data (FRED)
https://research.stlouisfed.org/fred2/
Finding and Using Health Statistics
https://www.nlm.nih.gov/nichsr/usestats/index.htm
FIVESProject – Firm and Industry Evolution, Entrepreneurship, and Strategy
https://five.dartmouth.edu/
FlowingData
https://flowingdata.com/
Foreign Trade
https://www.census.gov/foreign-trade/index.html
FRASER – Federal Reserve Archive – Discover Economic History
https://fraser.stlouisfed.org/
Free GIS Data
https://freegisdata.rtwilson.com/
Gapminder – FactTank
https://www.gapminder.org/
GenBank®
https://www.ncbi.nlm.nih.gov/genbank/
Gephi – The Open Graph Viz Platform
https://gephi.org/
GitHub Data
https://cloud.google.com/bigquery/public-data/github
Global Entrepreneurship Monitor (GEM)
https://www.gemconsortium.org/
Global Open Data Index
https://index.okfn.org
Google BigQuery
https://cloud.google.com/products/big-query
Grafana – Beautiful Metric and Analytic Dashboards
https://grafana.org/
Graphite – Highly Scalable Real-Time Graphing System
https://graphite.readthedocs.org/
GSS – The General Social Survey
https://gss.norc.org/
Guide To World Population by Richard Jensen [May 2007]
https://tigger.uic.edu/~rjensen/populate.htm
Hashgraph
https://www.Hashgraph.com/
Healthcare Data from the Federal Government
https://www.healthdata.gov/
Household electric power consumption big dataset
https://archive.ics.uci.edu/ml/datasets/Individual+household+electric+power+consumption
How Much Information? 2003
https://www.sims.berkeley.edu/research/projects/how-much-info-2003/
Human Development Reports 2019
https://hdr.undp.org
HyperStat Online: An Introduction to Statistics
https://davidmlane.com/hyperstat/index.html
IMF Data Sets – International Economics Data and Statistics
https://www.imf.org/external/data.htm
Index Mundi – Global Data Portal
https://www.indexmundi.com/
Indiegogo Datasets
https://webrobots.io/indiegogo-dataset/
Industry Research from the University of Tennessee
https://libguides.utk.edu/content.php?pid=85554&sid=636582
Industry Research – University of Pittsburgh
https://www.library.pitt.edu/industry-research
Infogram – Create Engaging Infographics and Reports in Minutes
https://infogram.com/
International Business – Information on the Business Conditions, Culture, and Economy of Different Countries
https://libguides.stthomas.edu/content.php?pid=119649&sid=1030547
International Economic Statistics (IES) Database
https://research.stlouisfed.org/fred2/categories/32263
International Journal of Quality, Statistics, and Reliability
https://www.hindawi.com/journals/jqre/
International Monetary Fund (IMF) – Data and Statistics
https://www.imf.org/external/data.htm
International Trade Statistics
https://www.census.gov/foreign-trade/index.html
Internet 2010 Statistics
https://royal.pingdom.com/2011/01/12/internet-2010-in-numbers
Internet Demographics 2020
https://www.InternetDemographics.info/
Internet Monitor – Analyzing Online Content Controls and Activity
https://thenetmonitor.org/
Internet World Stats – Usage and Population Statistics
https://www.internetworldstats.com/
Inter-university Consortium for Political and Social Research (ICPSR)
https://www.icpsr.umich.edu/
IOGDS: International Open Government Dataset Search
https://logd.tw.rpi.edu/node/9903
IPUMS USA : Integrated Public Use Microdata Series
https://usa.ipums.org/usa/
Journal of Open Health Data
https://openhealthdata.metajnl.com/
Kaggle – Home of Data Science and Machine Learning
https://www.kaggle.com
Kazoup – Analyze Search Archive
https://kazoup.com/
Kickstarter Datasets
https://webrobots.io/kickstarter-datasets/
KNIME – End to End Data Science
https://www.knime.com/
Knoema Knowledge Platform – Data Made Accessible
https://knoema.com/
Linking Open Data Cloud Diagram (LOD)
https://lod-cloud.net/
LinkingOpenData – W3C SWEO Community Project
https://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
List of Free Statistical Software
https://l-lists.com/en/lists/dz3a5t.html
Local Area Unemployment Statistics (LAUS)
https://www.bls.gov/lau/
ManualsLib – The Ultimate Manuals Library
https://www.manualslib.com/
Measuring America: The Decennial Censuses From 1790 to 2000
https://www.census.gov/prod/2002pubs/pol02marv-pt1.pdf
Mirador – Tool for Visual Exploration of Complex Datasets.
https://fathom.info/mirador/
MoData – Big Data Resources
https://www.mo-data.com/
Monarch Professional – Individual Information Optimization for Enterprise
https://www.datawatch.com/
https://www.altair.com/data-analytics/
Monthly Bulletin of Statistics Online (MBS)
https://unstats.un.org/unsd/mbs/app/DataSearchTable.aspx
Movie Rating Datasets – MovieLens
https://grouplens.org/datasets/movielens/
Mu Sigma – Decision Sciences and Analytics
https://www.mu-sigma.com/
National Agricultural Statistics Service
https://www.nass.usda.gov/
National Bureau of Economic Research (NBER)
https://www.nber.org/
National Center for Education Statistics (NCES)
https://nces.ed.gov/
National Center for Health Statistics
https://www.cdc.gov/nchs/
National Numeracy Network: Teaching Resources
https://serc.carleton.edu/nnn/teaching
National Science Foundation (NSF) Survey of Industrial Research and Development (SIRD)
https://www.nsf.gov/statistics/srvyindustry/sird.cfm
National Statistics Online (UK)
https://www.statistics.gov.uk/
NationMaster – World Statistics and Country Comparisons
https://www.nationmaster.com/
NCSES Table Tool
https://ncsesdata.nsf.gov/ids/
Net Data Directory
https://netdatadirectory.org/
New Economics (econ) Archive at arXiv.org
https://arxiv.org/help/econ/announcement
New Economy Resources 2020
https://www.NewEconomyResources.com/
Observatory on Social Media (OSoMe)
https://truthy.indiana.edu/
Occupational Employment Statistics (OES)
https://www.bls.gov/oes/
OECD Data
https://data.oecd.org/
OECD Health Statistics 2015 – Country Notes
https://www.oecd.org/chile/oecd-health-statistics-2015-country-notes.htm
OECD Health Statistics 2019
https://www.oecd.org/els/health-systems/health-data.htm
OECD.StatExtracts – Complete Databases Available Via OECD’s iLibrary
https://stats.oecd.org/
OpenAIRE – Open Access Infrastructure for Research in Europe
https://www.openaire.eu/
Open Data Barometer – [Note: Content Not Being Updated]
https://www.opendataresearch.org/project/2013/odb
Open Data for Resilience Index
https://index.opendri.org/
Open Data Handbook – Guides, Case Studies and Resources for Government and Civil Society On the What, Why and How of Open Data
https://opendatahandbook.org/
Open Data Inception
https://opendatainception.io/
Open Data Institute
https://theodi.org/
Open Data Inventory (ODIN)
https://odin.opendatawatch.com/
Open Data Network
https://www.opendatanetwork.com/
Open Datasets
https://github.com/caesar0301/awesome-public-datasets
https://www.kaggle.com/datasets
https://www.quora.com/Where-can-I-find-large-datasets-open-to-the-public
https://aws.amazon.com/public-datasets/
https://repository.upenn.edu/mead
https://catalog.data.gov/dataset
Open Educational Resources (OER) Sources 2020
https://www.OERSources.com/
Open Graph Viz Platform – Exploratory Data Analysis
https://gephi.org/
OpenRefine – A Free Open Source Powerful Tool for Working with Messy Data
https://openrefine.org/
Open Source Data Explorer – Explore and Visualize Your Event Data
https://keen.github.io/explorer/
Oracle and Big Data
https://www.oracle.com/big-data/
Orange – Open Source Data Visualization and Analysis for Novice and Experts
https://orange.biolab.si/
Pharma Big Data Datasets
https://www.academia.edu/2497409/When_pharmaceutical_companies_publish_large_datasets_an_abundance_of_riches_or_fools_gold
Public Data Sets On Amazon Web Services (AWS)
https://aws.amazon.com/datasets
Quality and Comparative International Statistics
https://web.freepint.com/go/newsletter/151#feature
QueryTree – Visualize and Understand Your Data
http://querytreeapp.com/
Platfora – Clarity From Big Data
https://www.platfora.com/
Project Open Data – Open Data Policy – Managing Information As An Asset
https://project-open-data.github.io/
Public Data
https://www.PublicData.com/
Publicly Available Big Datasets
https://hadoopilluminated.com/hadoop_illuminated/Public_Bigdata_Sets.html
PubMed
https://www.ncbi.nlm.nih.gov/pubmed/
qri – A New Tool for Data Science
https://qri.io/
Random.org – True Random Number Service
https://random.org/
re3data.org – Registry of Research Data Repositories
https://www.re3data.org/
ReDash – Make Your Company Data Driven
https://redash.io/
ReportLinker: Industry Reports, Company and Country Profiles
https://www.reportlinker.com/
R Programming MOOC Course on EdX Free
https://www.edx.org/course/introduction-r-programming-microsoft-dat204x-0
https://blog.revolutionanalytics.com/2015/08/free-edx-course-for-r-beginners.html
Research Repository UCD
https://researchrepository.ucd.ie/
Sample datasets for practicing with the R Development System
https://vincentarelbundock.github.io/Rdatasets/datasets.html
SameAs.org – Interlinking the Web of Data
https://sameas.org/
SCaVis – Scientific Computation and Visualization Environment
https://jwork.org/scavis/
Scientific Data Repository – Real Time Visualization and Exploration Techniques
https://www.mlvis.com/platform.php
SISA – Simple Interactive Statistical Analysis
https://www.quantitativeskills.com/sisa/
Sizzle Analytics
https://www.sizzleanalytics.com
Smithsonian/NASA Astrophysics Data System (ADS)
https://adsabs.harvard.edu/
Socialbakers – Social Statistics, Application Statistics and Page Statistics
https://www.socialbakers.com/
SourceForge.net Research Data
https://sourceforge.net/
SORT (Statistics and Operations Research Transactions)
https://www.idescat.cat/sort/
StatCrunch – Data Analysis On the Web
https://www.statcrunch.com/
Statista – Global No. 1 Business Data Platform
https://www.statista.com
Statistical Analysis and Data Mining
https://onlinelibrary.wiley.com/journal/10.1002/%28ISSN%291932-1872
Statistical Education Through Problem Solving
https://www.stats.gla.ac.uk/steps/
Statistical Sites on the World Wide Web
https://www.bls.gov/bls/other.htm
Statistics – Wikipedia
https://en.wikipedia.org/wiki/Statistics
Statistics.com – Research Statistics and Statistical Analysis Directory
https://www.statistics.com/
Statistics and Probability
https://stattrek.com/
Statistics Canada
https://www.statcan.gc.ca/start-debut-eng.html
Statistics Every Writer Should Know
http://nilesonline.com/stats/
Statistics Online Compute Resources (SOCR)
https://socr.stat.ucla.edu/
Statistics on the Web
https://www.claviusweb.net/statistics.shtml
Statistics Resources and Big Data 2019
https://www.StatisticsResources.info/
Statistics Sources
https://www.rba.co.uk/sources/stats.htm
Stat Wing – Turn Data Into Insight In Seconds
https://www.statwing.com/
Tallylab – Your Data Your Insights
https://tallylab.com/
tamr – Leverage All Data
https://www.tamr.com/
Tanagra Project – Free Data Mining Software for Academic and Research Purposes
https://eric.univ-lyon2.fr/~ricco/tanagra/en/tanagra.html
Teach Engineering: Big Data, What Are You Saying?
https://www.teachengineering.org/activities/view/und-1721-big-data-collection-manipulation-analysis
The Big Data Hub – Understanding Big Data for the Enterprise
https://www.ibmbigdatahub.com/
The Data and AI Platform for Ecommerce and Logistics
https://semantics3.com/
The Dataverse Project
https://dataverse.org/
The Dryad Digital Repository
https://datadryad.org/
The Internet Glossary of Statistical Terms
https://www.animatedsoftware.com/statglos/statglos.htm
The Magazine of Early American Datasets
https://repository.upenn.edu/mead
The Manifesto for Data Practices
https://datapractices.org/manifesto/
The National Bureau of Economic Research (NBER)- Other Data Collections
https://www.nber.org/links/data.html
The Open Data Institute
http://theodi.org/
The Open Knowledge Foundation – Empowering Through Open Knowledge
https://okfn.org/
The R Project for Statistical Computing
http://www.r-project.org/
The World Bank – Data
https://data.worldbank.org/
The World Bank Data Catalog
https://datacatalog.worldbank.org/
Trifacta – Data Wrangling
https://www.trifacta.com/
UC Irvine Machine Learning Repository
https://archive.ics.uci.edu/ml/index.php
UK National Statistics Online
https://www.statistics.gov.uk/
UN Data – Data Access System to UN Databases (34 Databases – 60 Million Records)
https://data.un.org/
UNESCO Institute for Statistics
https://www.uis.unesco.org/
United Kingdom National Accounts, The Blue Book, 2014 Edition
https://www.ons.gov.uk/ons/rel/naa1-rd/united-kingdom-national-accounts/the-blue-book–2014-edition/index.html
United Nations Statistics Division
https://unstats.un.org/unsd/
United States Census Bureau
https://www.census.gov/
United States Census Bureau Research
https://www.census.gov/research/
University of Minnesota Library: Databases A – Z
https://libguides.umn.edu/az.php?t=30801
U.S. and World Population Clocks
https://www.census.gov/popclock/
USA.gov – Data and Statistics
https://www.usa.gov/Topics/Reference-Shelf/Data.shtml
USA Trade Online – The Official Source of Trade Statistics
https://usatrade.census.gov/
U.S. Business and Economy-Wide Statistics
https://www.census.gov/econ/economywide.html
USDA Economics, Statistics, and Market Information System
https://usda.mannlib.cornell.edu/
US Government Web Services and XML Data Sources
https://usgovxml.com/
USITC Interactive Tariff and Trade DataWeb
https://dataweb.usitc.gov/
Visualization Laboratory (VISLAB) Interactive Visualization and Data Analytics Systems
https://www.tacc.utexas.edu/resources/visualization/
Visualization of Large Spatiotemporal Datasets
https://www.nanocubes.net/
Visualizing.org – Making Sense of Complex Issues Through Data and Design
https://www.visualizing.org/
Vital Statistics of the United States (VSUS)
https://www.cdc.gov/nchs/products/vsus.htm
Web and Blog Datasets
https://snap.stanford.edu/data/other.html
Web Interface for Statistics Education (WISE)
https://wise.cgu.edu/
WebSM – Web Survey Methodology Portal
https://www.websm.org/
Weka 3: Data Mining Software in Java
https://www.cs.waikato.ac.nz/~ml/weka/
What Is Big Data?
https://www-01.ibm.com/software/data/bigdata/
WHO: World Health Statistics
https://www.who.int/gho/publications/world_health_statistics/en/
WikiData – Free KnowledgeBase With 76,735,541 Editable Data Items
https://www.wikidata.org/wiki/Wikidata:Main_Page
Wolfram Data Repository
https://datarepository.wolframcloud.com/
World Bank Open Data
https://data.worldbank.org/
Worldometers – World Statistics Updated In Real Time
https://www.worldometers.info/
YourEconomy.org (YE)
https://youreconomy.org/
Zanran – Search the Web For Data and Statistics
https://zanran.com/