Welcome to University of Michigan Database Research Group

The database group at Michigan has a long and illustrious history. Some of our better known graduates include E.F. Codd, Mike Stonebraker, David J. DeWitt, and more.

Our current work tackles data management issues in many different topics, including:

  • Responsible data science, particularly to ensure that matters such as Fairness, Accountability, and Transparency (FAT) are fulfilled in the life cycle of data science.
  • Usability of Big Data and database systems, particularly when data involved comes from multiple heterogeneous sources, and has undergone many manipulations.
  • Systems and algorithms for "messy" data management, including work on information extraction (from spreadsheets, or from Web pages of different kinds), data integration (whether integrating data from Web pages or more traditional sources), machine learning workloads (such as feature engineering), and top-k ranking.
  • Novel data applications, especially in bioinformatics and the social sciences (in economics and fighting human trafficking).
  • Scalable databases, particularly using advanced statistical models using concepts and tools from applied statistics, optimization theory, and machine learning.
  • Data systems infrastructure, including systems work that can undergird very general-purpose data management methods, such as Hadoop, optimization for MapReduce programs and hardware support for text analytics.



Thanks to our kind sponsors