Loading...

Course Description

Most data science projects that use Python will require you to access and integrate different types of data from a variety of external sources. This course will give you experience identifying and integrating data from spreadsheets, text files, websites, and databases. To prepare for downstream analyses, you first need to integrate any external data sources into your Python program. You will utilize existing packages and develop your own code to read data from a variety of sources. You will also practice using Python to prepare disorganized, unstructured, or unwieldy datasets for analysis by other stakeholders.

You are required to have completed the following courses or have equivalent experience before taking this course:

  • Constructing Expressions in Python
  • Writing Custom Python Functions, Classes, and Workflows
  • Developing Data Science Applications
  • Creating Data Arrays and Tables in Python

Faculty Author

Chris Myers

Benefits to the Learner

  • Read data from files, spreadsheets, websites, and databases
  • Create a standard for organizing data within a dataset
  • Filter, integrate, and prepare data for analysis

Target Audience

  • Data analysts and business analysts
  • Database managers
  • Technical and systems analysts
  • Programmers interested in data science
  • Marketers
  • Business managers

Applies Towards the Following Certificates

Loading...
Cornell Center for Advanced Computing
Thank you for your interest in this course. Unfortunately, the course you have selected is currently not open for enrollment. Please complete a Course Inquiry so that we may promptly notify you when enrollment opens.
Required fields are indicated by .