Websites With Free and Open Source Datasets

September 23, 2020 Stevie DiSalvo

A dataset is a collection of data, usually organized in tabular form across one or more tables. In most datasets, the columns represent specific variables, while the rows represent individual records. A dataset may also include additional files or documents that provide statistical information on a specific subject.

In data science and analytics, datasets are used to create statistics or infographics that illustrate certain issues or facts. You may need to use a dataset to assist with:

  • Assignments while pursuing your master’s in data science degree.
  • Generating actionable insights for your boss or a company relying on data analytics.
  • Creating a graphic that summarizes important facts or trends in a simple, visual way.
  • Analyzing the choices or patterns consumers, customers, or users are prone to making.
  • Testing the functionality of your data management system.
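
To make the rows-and-columns layout described above concrete, here is a minimal Python sketch that reads a tiny CSV table and treats each column as a variable and each row as a record. The sample values and the `load_records` helper are made up for illustration and do not come from any of the sites discussed in this article.

```python
import csv
import io

# Made-up sample data for illustration only; not taken from any real dataset.
SAMPLE_CSV = """city,year,population
Springfield,2019,1000
Springfield,2020,1100
Shelbyville,2020,900
"""


def load_records(csv_text):
    """Parse CSV text into a list of dicts, one dict per row (record)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return list(reader)


records = load_records(SAMPLE_CSV)

# The header row supplies the variable (column) names.
columns = list(records[0].keys())

print("Variables:", columns)
print("Number of records:", len(records))
```

Note that `csv.DictReader` returns every value as a string, so numeric columns such as `population` would need to be converted before doing arithmetic on them.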

If you’re new to datasets, Kaggle is a great site that will help you explore different datasets and get excited about all the possibilities in data science and analytics. The following websites also offer datasets that are free for public use. Review these sites so you can learn where to access datasets to practice data analysis, test your management system, or find statistics to assist with an upcoming project.

BuzzFeed

The BuzzFeed site is known for providing unbiased news and information on current events. The site also conducts numerous surveys and pulls data to formulate statistics that relate to these current events. You can visit the BuzzFeed archives section or use the site’s search function to pull stories that relate to the subject you need more data about.

You may also visit a reliable dataset resource site, such as GitHub, to find BuzzFeed datasets. When you locate the dataset you need, consider linking back to the article to find the data or download the dataset directly from the site.

BuzzFeed has covered numerous topics over the years, so the free datasets available to you may vary. Some of the datasets you may find on the site include:

  • Fake news sites and viral posts.
  • The movement of COVID-19 cases in several major cities.
  • Contributions to presidential campaigns.
  • Analysis of Federal Communications Commission data breaches.
  • Gentrification tracking in major cities.

If you’re in the process of earning an online master’s degree in data science, datasets on the BuzzFeed site may be helpful in upcoming projects. You can also use this data to learn more about how recent events have impacted the world or to create your own infographics to help viewers understand the effects of these events.

Reddit r/Datasets

Reddit is a public, user-generated content sharing site that allows users to post information and observations. As a collection of forums that allows users to interact with one another and provide opinions on issues, Reddit wouldn’t usually be considered a top source for datasets. However, within these discussion boards lies an entire community dedicated to data. Users in this subsection of the site request, discuss and exchange datasets for free.

Reddit users post datasets that offer useful information and statistics relating to current news stories. Since all data is submitted by Reddit users, it may not be verified, so use any data retrieved from the dataset forum at your own risk.

To access these datasets, visit the r/Datasets forum. Some users look for datasets they can turn into data visualization aids to post on blogs, social media, or company websites.

In some cases, users are simply looking for datasets they can download to practice grouping information or to study data organization techniques. Users may also find it beneficial to download datasets from Reddit to learn how data behaves within a management system.
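
As a rough idea of what practicing grouping can look like, the sketch below totals one field per category using only the Python standard library. The records, field names, and the `total_by` helper are hypothetical and invented for this example; a real downloaded dataset would have its own columns.

```python
from collections import defaultdict

# Hypothetical records, invented for this example.
records = [
    {"category": "books", "units": 3},
    {"category": "music", "units": 5},
    {"category": "books", "units": 2},
]


def total_by(rows, key, value):
    """Sum the `value` field for each distinct value of the `key` field."""
    totals = defaultdict(int)
    for row in rows:
        totals[row[key]] += row[value]
    return dict(totals)


totals = total_by(records, "category", "units")
print(totals)
```

The same pattern works for any pair of columns, as long as the value column holds numbers.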

Socrata OpenData

Socrata OpenData is an expansive open portal that contains many datasets covering various topics and issues. With so much extensive data offered on the site, users may find it overwhelming to search for certain subjects that relate to the datasets they need. However, the Socrata OpenData homepage offers many strategies to filter the datasets available so you can identify the ones you need for your specific project.

You can sort available datasets by authority, category, view type, or tag. Knowing a little about the subject you’re investigating before you search makes it easier to choose the right categories and decide how you want to retrieve the datasets.

When the site retrieves your related data, review the date the dataset was uploaded. With such an expansive portal, some data may be outdated, so look for recently updated sets to get the most accurate information.

If you choose to sort your data source by “Community” instead of “Official,” the datasets that appear are uploaded by site users. They may not be as accurate or reliable as those provided by authorities.
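
The two checks described above (how recent a dataset is, and whether it comes from an official source) can be sketched as a simple filter. This example does not call the Socrata site or any real API: the `catalog` entries, the `provenance` and `updated` field names, and the `recent_official` helper are all assumptions made up for illustration.

```python
from datetime import date

# Hypothetical catalog entries; names, fields, and dates are invented.
catalog = [
    {"name": "dataset-a", "provenance": "official", "updated": date(2020, 6, 1)},
    {"name": "dataset-b", "provenance": "community", "updated": date(2020, 8, 15)},
    {"name": "dataset-c", "provenance": "official", "updated": date(2014, 3, 9)},
]


def recent_official(entries, cutoff):
    """Return names of official entries updated on or after `cutoff`."""
    return [
        entry["name"]
        for entry in entries
        if entry["provenance"] == "official" and entry["updated"] >= cutoff
    ]


names = recent_official(catalog, date(2019, 1, 1))
print(names)
```

Choosing the cutoff date is a judgment call that depends on how quickly the subject you are studying changes.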

The Socrata OpenData site is known for providing free public datasets in countless categories, including:

  • Facebook marketing costs.
  • Music sales data.
  • Payroll reports for Senate employees.
  • Radiation analysis data throughout the U.S.
  • Fatalities in the workplace, sorted by state.

When you find a dataset that relates to your project, you have the option to download the set to your computer. You may also visit the link to the data source or contact the dataset owner, if available.

Quandl

If you’re interested in studying data science as it relates to finance, you may find Quandl datasets useful. This site offers free public datasets about financial and economic issues. However, some of the more extensive datasets may require payment.

When you visit the site, access these datasets by creating an account and searching by data category. Categories you may choose from include:

  • U.S. stock prices.
  • Auto sales estimates.
  • Historical U.S. equity information.
  • Global index prices.
  • Company spending patterns.

Download datasets directly to your device to import them into a data management system or review statistics that are useful for your project.
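
One possible way to import a downloaded file into a data management system is to load it into SQLite, which ships with Python. The sketch below is only an illustration under stated assumptions: the CSV text stands in for a downloaded file, its rows are made up rather than real prices, and the `prices` table and `import_prices` helper are hypothetical names.

```python
import csv
import io
import sqlite3

# Made-up rows standing in for a downloaded CSV file; not real market data.
DOWNLOADED_CSV = """symbol,day,close
AAA,2020-01-02,10.5
AAA,2020-01-03,11.0
BBB,2020-01-02,20.0
"""


def import_prices(conn, csv_text):
    """Create a `prices` table if needed and insert every CSV row into it."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS prices (symbol TEXT, day TEXT, close REAL)"
    )
    rows = [
        (r["symbol"], r["day"], float(r["close"]))
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    conn.executemany("INSERT INTO prices VALUES (?, ?, ?)", rows)
    conn.commit()
    return len(rows)


# An in-memory database keeps the example self-contained.
conn = sqlite3.connect(":memory:")
inserted = import_prices(conn, DOWNLOADED_CSV)

# Query the imported data: average closing value for one symbol.
avg_close = conn.execute(
    "SELECT AVG(close) FROM prices WHERE symbol = ?", ("AAA",)
).fetchone()[0]

print("Rows inserted:", inserted)
print("Average close for AAA:", avg_close)
```

With a real file on disk, you would open it with `open(path, newline="")` and pass the file object to `csv.DictReader` instead of using `io.StringIO`.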

Free public datasets are helpful if you’re trying to expand your data science skill set, work on a project, or create infographics and visualizations for your business. These websites offer expansive datasets you can download to help achieve your data analytics goals.

Last updated: September 2020

MastersInDataScience.org is owned and operated by 2U, Inc.
© 2U, Inc. 2022
