X + DS: Interdisciplinary Education in Data Science

At Illinois, we are launching a new series of undergraduate degrees that combine Data Science with other disciplines. The X + Data Science (X + DS) family of degrees will prepare Illinois students to lead society's digital transformation.

Data Science is the art of extracting new knowledge and finding meaningful information in a huge sea of data.  This important new field includes principles for data collection, storage, integration, analysis, inference, communication, and ethics.

Interdisciplinary: Digital transformation is impacting all fields, so collaborative work in an application domain is important part of a Data Science education. X + DS majors will take core coursework in an application domain of their choosing.

Inclusive: Our core Data Science coursework have fewer technical prerequisites and requirements than most programs in computer science, mathematics, or statistics. This makes X + DS more accessible to people from all backgrounds.   

X + DS: Degree Programs

Each X + DS degree programs include approximately 30 credit hours of core Data Science coursework, plus a meaningful research or discovery experience of at least 3 credit hours. This will prepare students with a strong background in data science, inferential thinking, computational thinking, and real-world relevance. 

Four X + DS degree programs have been approved by the Illinois Higher Board of Education after having been approved by the University of Illinois Urbana-Champaign Faculty Senate in April 2021:

Applications for Fall 2023 will be open from September 1, 2022 to January 5, 2023. The iSchool will receive current student transfers in Fall 2022.

X + DS: Core Data Science Coursework

The core Data Science coursework for X + DS is designed to be completed by students within their first 3-5 semesters to prepare for advanced work in their area of specialization: 

Calculus: Fulfilled by MATH 220, MATH 221, or MATH 234
First course in calculus and analytic geometry; basic techniques of differentiation and integration with applications including curve sketching; antidifferentation, the Riemann integral, fundamental theorem, exponential and trigonometric functions. 

Linear Algebra for Data Science: MATH 227
Linear algebra is the main mathematical subject underlying the basic techniques of data science. This course provides a practical computer-based introduction to linear algebra, emphasizing its uses in analyzing data, such as linear regression, principal component analysis, and network analysis. We will also explore some of the strengths and limitations of linear methods. Students will learn how to implement linear algebra methods using Python, making it possible to apply these techniques to large data sets. The course assumes an introductory knowledge of Python, such as students acquire in STAT 107. 

Data Science Discovery: STAT/CS/IS 107
Data Science Discovery is the intersection of statistics, computation, and real-world relevance. As a project-driven course, students perform hands-on-analysis of real-world datasets to analyze and discover the impact of the data. Throughout each experience, students reflect on the social issues surrounding data analysis such as privacy and design. 

Data Science Exploration: STAT 207
This course explores the data science pipeline from hypothesis formulation, to data collection and management,to analysis and reporting. Topics include data collection, preprocessing and checking for missing data, data summary and visualization, random sampling and probability models, estimating parameters, uncertainty quantification, hypothesis testing, multiple linear and logistic regression modeling, classification, and machine learning approaches for high dimensional data analysis. Students will learn how to implement the methods using Python programming and Git version control. The course assumes an introductory knowledge of statistical concepts and Python, such as students acquire in STAT 107. 

Modeling and Learning in Data Science: CS 307
Introduction to the use of classical approaches in data modeling and machine learning in the context of solving data-centric problems. A broad coverage of fundamental models is presented, including linear models, unsupervised learning, supervised learning, and deep learning. A significant emphasis is placed on the application of the models in Python and the interoperability of the results. 

Algorithms and Data Structures for Data Science: CS 277
An introduction to elementary concepts in algorithms and classical data structures with a focus on their applications in Data Science. Topics include algorithm analysis (ex: Big-O notation), elementary data structures (ex: lists, stacks, queues, trees, and graphs), basics of discrete algorithm design principles (ex: greedy, divide and conquer, dynamic programming), and discussion of discrete and continuous optimization. 

Ethics and Policy for Data Science: IS 467
Learn about common ethical data challenges, including privacy, discrimination, and access to data. These challenges will be explored through real-world cases of corporate settings, non-profits, governments, academic research, and healthcare. The course will also cover common ethical principles, providing a framework to analyze these cases. Students will also be introduced to a range of policy responses. The course is suitable for anyone who plans to work in a professional setting that will involve handling data, or who is seeking a grounding for future study of data and information ethics. 

Data Management, Curation, and Reproducibility: IS 477
We introduce and use the Data Science Life Cycle as an intellectual foundation for understanding Data Management, Curation & Reproducibility in the Data Science context. The Data Science Life Cycle allows us to study how data, software, workflows, computational environments, scientific findings,and other artifacts form linked foundational components of data science research. Topics include research artifact identification and management, metadata, repositories, economics of artifact preservation and sustainability, and data management plans. 

One of the most important skills a student will gain in a X + DS degree will be the ability to present data in meaningful ways. This experience should be developed with an adviser before the end of a student’s sophomore year and result in the creation of one or more artifacts documenting the experience. A minimum of 3 credit hours must be specifically designated to the preparation and the completion of the experience component. Two smaller experiences may be used to fulfill the full experience requirement. 

Examples of possible experiences may include:

  • A semester study-abroad with at one or more courses focused on discovery while attending the international institution.
  • A multi-semester capstone experience within the student’s area of specialization.
  • A semester co-op experience outside of the Champaign-Urbana area focused within the student’s area of specialization.
  • A multi-semester undergraduate research experience under the direction of faculty.
  • A summer REU program focused within the student's area of specialization.

X + DS Toolkit

Illinois units seeking to establish a new X + Data Science degree can access a toolkit containing a degree framework and proposal templates.  U of I Box login is required.

Get the Toolkit

Interdisciplinary Illinois

A leader in interdisciplinary education and research, the University of Illinois seeks to give all Illinois students the opportunity to have a meaningful exposure to Data Science. X + DS will provide a new pathway to foster education across disciplines, supporting the University's Strategic Plan.