The scientific data management landscape is changing. Improvements in instrumentation and simulation software are giving scientists access to data at an unprecedented scale. This data is increasingly being stored in data centers running thousands of commodity servers. This new environment creates significant data management challenges. In addition to efficient query processing, the magnitude of data and queries call for new query management techniques such as runtime query control and intra-query fault tolerance.
In this project, we are developing new data management systems and techniques for enabling scientists to store, analyze, and share large volumes of data using cloud-computing environments.