Welcome to University of Michigan Database Research Group
The database group at Michigan has a long and illustrious history. Some of our better known graduates include E.F. Codd, Mike Stonebraker, David J. DeWitt, and more.
Our current work tackles data management issues in many different topics, including:
- Responsible data science, particularly to ensure that matters such as Fairness, Accountability, and Transparency (FAT) are fulfilled in the life cycle of data science.
- Usability of Big Data and database systems, particularly when data involved comes from multiple heterogeneous sources, and has undergone many manipulations.
- Systems and algorithms for "messy" data management, including work on information extraction (from spreadsheets, or from Web pages of different kinds), data integration (whether integrating data from Web pages or more traditional sources), machine learning workloads (such as feature engineering), and top-k ranking.
- Novel data applications, especially in bioinformatics and the social sciences (in economics and fighting human trafficking).
- Scalable databases, particularly using advanced statistical models using concepts and tools from applied statistics, optimization theory, and machine learning.
- Data systems infrastructure, including systems work that can undergird very general-purpose data management methods, such as Hadoop, optimization for MapReduce programs and hardware support for text analytics.
News
- We presented a long paper (On Obtaining Stable Rankings) and two workshop papers at VLDB 2019.
- We presented three long papers (BlinkML, RRR and Designing Fair Ranking Schemes), two demo papers (C2Metadata and MithraRanking) and a workshop paper (Knowledge Graph Programming) at SIGMOD 2019.
- We presented three papers at ICDE 2019 about improving natural language interfaces to databases with SQL query logs, optimizing queries to video databases, and ensuring that datasets "cover" various attribute groups, such as minorities.
- We presented a paper (Open Information Extraction from Question-Answer Pairs) at NAACL 2019.
- We presented one paper at EDBT 2019 about data pattern standardization.
- We presented two long papers about explaining and repairing filter-based data transformation and lock scheduling for transactional databases, and a panel about algorithm ethics in VLDB 2018 in Rio, Brazil.
- We delivered a keynote speech in AiDM workshop, presented 2 long papers (VerdictDB & RDMA Lock Management), 2 demos (VerdictDB & GeoFlux) in SIGMOD 2018 and a paper (a schema mapping system "Beaver") in HILDA workshop.
- Prof. Michael Cafarella was recognized with a 10-Year Test-of-Time Award for his paper Webtables: exploring the power of tables on the web published in VLDB 2008. Congratulations!
- Yongjoo Park was recognized with an Honorable Mention for the 2018 SIGMOD Jim Gray Doctoral Dissertation Award. Congratulations!
- Jie Song won a Best Paper Runner-up Award at EDBT 2018 for her GeoAlign paper. Congratulations!
- Professor HV Jagadish was elected as Fellow of the American Association for the Advancement of Science. Congratulations!
- We organized the First Workshop on Approximate Computing for Affordable and Interactive Analytics (ACAIA), which was held on November 9th, 2017 at San Jose, California.
- We presented Raccoon at VLDB 2017.
- We presented three research papers, one demo paper, and one keynote at SIGMOD 2017.