# Academics

## X + DS: Interdisciplinary Education in Data Science

At Illinois, we are launching a new series of undergraduate degrees that combine Data Science with other disciplines. The X + Data Science (X + DS) family of degrees will prepare Illinois students to lead society's digital transformation.

Data Science is the art of extracting new knowledge and finding meaningful information in a huge sea of data. This important new field includes principles for data collection, storage, integration, analysis, inference, communication, and ethics.

**Interdisciplinary:** Digital transformation is impacting all fields, so collaborative work in an application domain is important part of a Data Science education. X + DS majors will take core coursework in an application domain of their choosing.

**Inclusive:** Our core Data Science coursework have fewer technical prerequisites and requirements than most programs in computer science, mathematics, or statistics. This makes X + DS more accessible to people from all backgrounds.

## X + DS: Degree Programs

Each X + DS degree programs include approximately 30 credit hours of core Data Science coursework, plus a meaningful research or discovery experience of at least 3 credit hours. This will prepare students with a strong background in data science, inferential thinking, computational thinking, and real-world relevance.

Four X + DS degree programs have been approved by the Illinois Higher Board of Education after having been approved by the University of Illinois Urbana-Champaign Faculty Senate in April 2021:

Applications for Fall 2023 will be open from September 1, 2022 to January 5, 2023. The iSchool will receive current student transfers in Fall 2022.

## X + DS: Core Data Science Coursework

The core Data Science coursework for X + DS is designed to be completed by students within their first 3-5 semesters to prepare for advanced work in their area of specialization:

**Calculus: Fulfilled by MATH 220, MATH 221, or MATH 234**

First course in calculus and analytic geometry; basic techniques of differentiation and integration with applications including curve sketching; antidifferentation, the Riemann integral, fundamental theorem, exponential and trigonometric functions.

**Linear Algebra for Data Science: MATH 227**

Linear algebra is the main mathematical subject underlying the basic techniques of data science. This course provides a practical computer-based introduction to linear algebra, emphasizing its uses in analyzing data, such as linear regression, principal component analysis, and network analysis. We will also explore some of the strengths and limitations of linear methods. Students will learn how to implement linear algebra methods using Python, making it possible to apply these techniques to large data sets. The course assumes an introductory knowledge of Python, such as students acquire in STAT 107.

**Data Science Discovery: STAT/CS/IS 107**

Data Science Discovery is the intersection of statistics, computation, and real-world relevance. As a project-driven course, students perform hands-on-analysis of real-world datasets to analyze and discover the impact of the data. Throughout each experience, students reflect on the social issues surrounding data analysis such as privacy and design.

**Data Science Exploration: STAT 207**

This course explores the data science pipeline from hypothesis formulation, to data collection and management,to analysis and reporting. Topics include data collection, preprocessing and checking for missing data, data summary and visualization, random sampling and probability models, estimating parameters, uncertainty quantification, hypothesis testing, multiple linear and logistic regression modeling, classification, and machine learning approaches for high dimensional data analysis. Students will learn how to implement the methods using Python programming and Git version control. The course assumes an introductory knowledge of statistical concepts and Python, such as students acquire in STAT 107.

**Modeling and Learning in Data Science: CS 307**

Introduction to the use of classical approaches in data modeling and machine learning in the context of solving data-centric problems. A broad coverage of fundamental models is presented, including linear models, unsupervised learning, supervised learning, and deep learning. A significant emphasis is placed on the application of the models in Python and the interoperability of the results.

**Algorithms and Data Structures for Data Science: CS 277**

An introduction to elementary concepts in algorithms and classical data structures with a focus on their applications in Data Science. Topics include algorithm analysis (ex: Big-O notation), elementary data structures (ex: lists, stacks, queues, trees, and graphs), basics of discrete algorithm design principles (ex: greedy, divide and conquer, dynamic programming), and discussion of discrete and continuous optimization.

**Ethics and Policy for Data Science: IS 467**

Learn about common ethical data challenges, including privacy, discrimination, and access to data. These challenges will be explored through real-world cases of corporate settings, non-profits, governments, academic research, and healthcare. The course will also cover common ethical principles, providing a framework to analyze these cases. Students will also be introduced to a range of policy responses. The course is suitable for anyone who plans to work in a professional setting that will involve handling data, or who is seeking a grounding for future study of data and information ethics.

**Data Management, Curation, and Reproducibility: IS 477**

We introduce and use the Data Science Life Cycle as an intellectual foundation for understanding Data Management, Curation & Reproducibility in the Data Science context. The Data Science Life Cycle allows us to study how data, software, workflows, computational environments, scientific findings,and other artifacts form linked foundational components of data science research. Topics include research artifact identification and management, metadata, repositories, economics of artifact preservation and sustainability, and data management plans.

Research or Discovery Experience

One of the most important skills a student will gain in a X + DS degree will be the ability to present data in meaningful ways. This experience should be developed with an adviser before the end of a student’s sophomore year and result in the creation of one or more artifacts documenting the experience. A minimum of 3 credit hours must be specifically designated to the preparation and the completion of the experience component. Two smaller experiences may be used to fulfill the full experience requirement.

Examples of possible experiences may include:

- A semester
**study-abroad**with at one or more courses focused on discovery while attending the international institution. - A multi-semester
**capstone**experience within the student’s area of specialization. - A semester
**co-op experience**outside of the Champaign-Urbana area focused within the student’s area of specialization. - A multi-semester
**undergraduate research experience**under the direction of faculty. - A summer
**REU program**focused within the student's area of specialization.

## Interdisciplinary Illinois

A leader in interdisciplinary education and research, the University of Illinois seeks to give all Illinois students the opportunity to have a meaningful exposure to Data Science. X + DS will provide a new pathway to foster education across disciplines, supporting the University's Strategic Plan.