Prerequisites: A bachelor’s degree and completion of an introductory statistics course
This course introduces students to the interdisciplinary field of data science. The emergence of massive datasets from diverse areas such as telecommunications, large-scale retailing, sports, healthcare, climate science, and social media provides the primary impetus for the field. The course emphasizes practical techniques, including cleaning and transforming data, exploring and analyzing data, summarizing and visualizing data, statistical inference, creation of statistical models, and communication of results. In addition, the ethical implications of the choices made at different stages of a data science project will be explored. The course also introduces the scripting languages R and Python, which are used throughout.