Data Science (DS-UA)
DS-UA 100 Discovering Data Science in the Age of AI (4 Credits)
Typically offered Fall and Spring
Data science is a relatively new discipline that is radically reshaping our world. This course is a one-semester tour of data science highlights for non-majors. Specifically, the course will start with brief introductions to programming in Python, basic probability and statistics, and causal inference. We will then see examples of how these tools are applied in various real-world contexts. There is no assumed background in either math or programming. The lectures for this course will focus on covering conceptual aspects of the tools of data science and demonstrations of how data science can be used. The associated laboratory sections will focus specifically on direct application of what is being learned in lectures using Python code, with guided demonstrations by the section leader(s). Restrictions: not open to students who are enrolled in, or have completed for credit, DS-UA 111 and/or 112; not open to students who have declared: the major and minor in Data Science; the major in Computer and Data Science; or the major in Data Science and Mathematics. This course does NOT count toward the requirements of these majors and minors.
Grading: CAS Graded
Repeatable for additional credit: No
DS-UA 111 Principles of Data Science I (4 Credits)
Typically offered Fall and Spring
Principles of Data Science I is the first course in a two-part sequence that introduces foundational concepts in data science, with a focus on statistical principles. In this course, students will develop programming skills in Python. Building on these programming skills, students will learn the process of both drawing conclusions from data and making predictions. Students will also explore related ethical, legal, and privacy issues. This course lays the groundwork for the next course of the sequence. Formerly titled Data Science for Everyone (the content of the course has not changed). Not open to second-semester juniors or to seniors. Restricted to students who intend to major or minor in Data Science or to major in either Computer and Data Science or Data Science and Mathematics. (All other students may enroll in DS-UA 100, Discovering Data Science in the Age of AI.)
Grading: CAS Graded
Repeatable for additional credit: No
Antirequisites: DS-UA 112 AND Restriction: Academic Level Senior AND Junior AND Academic Plan Liberal Arts-AA AND Academic Program UH-Abu Dhabi degree AND UI-Shanghai.
DS-UA 112 Principles of Data Science II (4 Credits)
Typically offered Fall and Spring
Principles of Data Science II builds upon the concepts introduced in Data
Science I and shifts focus toward machine learning. In this course, we will
cover the principles of machine learning in the domains of supervised
learning, unsupervised learning, and reinforcement learning. We will
illuminate these principles in terms of their mathematical foundations,
their implementation in code, and practical applications.
Specifically, we cover classical prediction and classification methods such
as random forests or support vector machines as well as neural network
approaches to these problems. Students will tie what they learned in this
class together in a capstone project that incorporates these methods.
Finally, we aim to touch on current developments in machine learning, such
as generative AI. Formerly titled Principles of Data Science (the content
of the course has not changed). Only open to students who intend to major or minor in Data Science or to major in either Computer and Data Science or Data Science and Mathematics. (All other students may enroll in DS-UA 100, Discovering Data Science in the Age of AI.)
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: DS-UA 111 AND Restriction: Academic Plan Liberal Arts-AA.
DS-UA 200 Linear Regression Models (4 Credits)
Typically offered Spring
This course provides a rigorous foundation in the theory and application of linear regression. The course covers key concepts in model selection, estimation, diagnostics, and interpretation through different lenses: descriptive, predictive, and causal. Additional topics include shrinkage methods, basic time series, and logistic regression. Students will implement these methods in Python. Counts toward the minor in Data Science but not toward the major (including any of the joint majors with DS-UA).
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: DS-UA 112 or MATH-UA 185.
DS-UA 201 Causal Inference (4 Credits)
Typically offered Fall and Spring
Causal Inference provides students with the tools for understanding causation, i.e., the relationship between cause and effect. We will start with the situation in which you are able to design and implement the data gathering process, called the experiment. We will then define causation, identify preconditions required for A to cause B, show how to design perfect experiments, and discuss how to understand threats to the validity of less-than-perfect experiments. In this course, we will cover experimental design and then turn to those careful approaches, where we will consider such approaches as quasi-experiments, regression discontinuities, differences in differences, and contemporary advanced approaches.
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: DS-UA 112 and (MATH-UA 185 or MATH-UA 334 or MA-UY 2224) and restricted to Majors/Minors.
DS-UA 202 Responsible Data Science (4 Credits)
Typically offered Spring
The first wave of data science focused on accuracy and efficiency: on what we can do with data. The second wave is about responsibility: what we should and should not do. Accordingly, this technical course tackles the issues of ethics and responsibility in data science, including legal compliance, data quality, algorithmic fairness and diversity, transparency of data and algorithms, privacy, and data protection. An important feature of this course is its holistic treatment of the data science lifecycle, beginning with data discovery and acquisition, through data cleaning, integration, querying, analysis, and result interpretation.
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: DS-UA 112 and restricted to Majors/Minors.
DS-UA 204 Practical Training (2-4 Credits)
Typically offered Summer term
Provides data science students with an opportunity to apply the knowledge gained in their course work to practical problems in industry. This course is for majors and minors only.
Grading: CAS Pass/Fail
Repeatable for additional credit: Yes
DS-UA 205 Machine Learning for Climate Change (4 Credits)
Typically offered Spring term of odd numbered years
This course will introduce you to both modern machine learning techniques and their domain-specific application toward climate change problems. Lectures will be organized around published research papers. Unless otherwise noted, in the second half of each class, there will be a lecture that provides the necessary background needed to read the assigned paper. In the first half of the next class, we will have a group discussion on the assigned paper. Paper discussions will be complemented by coding assignments wherein students will apply machine learning techniques to climate-related data. Skills gained in the course will be applied to a final course project designed by the students.
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: (DS-UA 112, MATH-UA 185, MATH-UA 235, MATH-UA 234, MATH-UA 334, MATH-UA 9235 OR MATH-UA 9185) AND Corequisite: CSCI-UA 473 OR CSCI-UA 9473.
DS-UA 300 Special Topics in Data Science (4 Credits)
Topics and prerequisites vary by semester
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: DS-UA 112.
DS-UA 301 Advanced Topics in Data Science (4 Credits)
Typically offered Fall and Spring
Advanced Topics in Data Science exposes students to two specialized topics within Data Science: Examples of topics include time series, deep learning, and other advanced machine learning topics. Students will learn the theoretical underpinnings of advanced data science techniques, as well as engage in hands-on activities to build a practical toolkit.
Grading: CAS Graded
Repeatable for additional credit: Yes
Prerequisites: DS-UA 112 AND (MATH-UA 185 OR MATH-UA 334 OR MA-UY 2224 ) AND Data Science Plans Corequisite: CSCI-UA 473.
DS-UA 302 Natural Language Processing (4 Credits)
Typically offered Fall and Spring
Natural Language Processing applies computational and linguistic knowledge to the processing of natural languages (English, Chinese, Spanish, Japanese). Applications include: machine translation, information extraction, information retrieval, and others. On the one hand, the class will include the modeling and representation of linguistic phenomena. On the other, it will cover methods for applying this knowledge using both manual rules and machine learning. Sample topics include: formal languages, hidden Markov models, part of speech tagging, vector-based methods, shallow and full parsing, semantic role labeling, information extraction and machine translation. Students will complete programming assignments (POS-tagging, Information Extraction, etc.) and group final projects. Note: CSCI-UA 469 (sponsor) and DS-UA 302 (non-sponsor) are cross-listed.
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: CSCI-UA 201 with a grade of C or higher.
DS-UA 9100 Discovering Data Science in the Age of AI (4 Credits)
Typically offered Fall and Spring
Data science is a relatively new discipline that is radically reshaping our
world. This course is a one-semester tour of data science highlights for
non-majors. Specifically, the course will start with brief introductions to
programming in Python, basic probability and statistics, and causal
inference. We will then see examples of how these tools are applied in
various real-world contexts. There is no assumed background in either math
or programming. The lectures for this course will focus on covering
conceptual aspects of the tools of data science and demonstrations of how
data science can be used. The associated laboratory sections will focus
specifically on direct application of what is being learned in lectures
using Python code, with guided demonstrations by the section leader(s).
Restrictions: not open to students who are enrolled in, or have completed
for credit, DS-UA 111 and/or 112; not open to students who have declared:
the major and minor in Data Science; the major in Computer and Data
Science; or the major in Data Science and Mathematics. This course does NOT
count toward the requirements of these majors and minors.
Grading: CAS Graded
Repeatable for additional credit: No
DS-UA 9111 Principles of Data Science I (4 Credits)
Data Science for Everyone is a foundational course that prepares students
to participate in the data-driven world that we are all experiencing. It
develops programming skills in Python so that students can write programs
to summarize and compare real-world datasets. Building on these data
analysis skills, students will learn how to draw conclusions and make
predictions about the data. Students will also explore related ethical,
legal, and privacy issues.
Grading: CAS Graded
Repeatable for additional credit: No
DS-UA 9201 Causal Inference (4 Credits)
Typically offered Fall and Spring
Causal Inference provides students with the tools for
understanding causation, i.e., the relationship between cause and effect.
We will start with the situation in which you are able to design and
implement the data gathering process, called the experiment. We will then
define causation, identify preconditions required for A to cause B, show
how to design perfect experiments, and discuss how to understand threats to
the validity of less-than-perfect experiments. In this course, we will
cover experimental design and then turn to those careful approaches, where
we will consider such approaches as quasi-experiments, regression
discontinuities, differences in differences, and contemporary advanced
approaches.
Grading: CAS Graded
Repeatable for additional credit: No
Prerequisites: DS-UA 112 and (MATH-UA 234 or MATH-UA 334 or MATH-UA 185 or MA-UY 2224).
DS-UA 9301 Advanced Topics in Data Science (4 Credits)
Typically offered Fall
Advanced Topics in Data Science exposes students to two specialized topics within Data Science: Examples of topics include time series, deep learning, and other advanced machine learning topics. Students will learn the theoretical underpinnings of advanced data science techniques, as well as engage in hands-on activities to build a practical toolkit.
Grading: CAS Graded
Repeatable for additional credit: Yes
Prerequisites: DS-UA 112 AND (MATH-UA 185 OR MATH-UA 334 OR MA-UY 2224 ) Corequisite: CSCI-UA 473.