The database group at Michigan has a long and illustrious history. Some of our better known graduates include E.F. Codd, Mike Stonebraker, David J. DeWitt, and more.
Our current work tackles data management issues in many different topics, including:
- Usability of Big Data and database systems, particularly when data involved comes from multiple heterogeneous sources, and has undergone many manipulations.
- Systems and algorithms for "messy" data management, including work on information extraction (from spreadsheets, or from Web pages of different kinds), data integration (whether integrating data from Web pages or more traditional sources), machine learning workloads (such as feature engineering), and top-k ranking.
- Novel data applications, especially for social science use cases in economics and fighting human trafficking.
- Scalable databases, particularly using advanced statistical models using concepts and tools from applied statistics, optimization theory, and machine learning.
- Data systems infrastructure, including systems work that can undergird very general-purpose data management methods, such as Hadoop, optimization for MapReduce programs and hardware support for text analytics.