Francesca Dominici: Can Data Science save the Environment?

Date: 

Monday, March 30, 2020, 12:00pm to 1:00pm

Location: 

Pierce Hall 209

Can Data Science save the Environment?

What if I told you I had evidence of a serious threat to American national security – a terrorist attack in which a jumbo jet will be hijacked and crashed every 12 days. Thousands will continue to die unless we act now. This is the question before us today – but the threat doesn’t come from terrorists. The threat comes from climate change and air pollution.

We have developed an artificial neural network model that uses on-the-ground air-monitoring data and satellite-based measurements to estimate daily pollution levels across the continental U.S., breaking the country up into 1-square-kilometer zones. We have paired that information with health data contained in Medicare claims records from the last 12 years, and for 97% of the population ages 65 or older. We have developed statistical methods for causal inference and computational efficient algorithms for the analysis over 570 million health records.

Our research shows that short and long term exposure to air pollution is killing thousands of senior citizens each year. We are developing methods for causal inference and machine learning to inform us who is most susceptible. This data science platform is telling us that federal limits on the nation’s most widespread air pollutants are not stringent enough.

This type of data is the sign of a new era for the role of data science in public health, and also for the associated methodological challenges. For example, with enormous amounts of data, the threat of unmeasured confounding bias is amplified, and causality is even harder to assess with observational studies. These and other challenges will be discussed.

Dr. Francesca Dominici is Professor of Biostatistics at the Harvard T.H. Chan School of Public Health and Co-Director of the Data Science Initiative at Harvard University. She was recruited to the Harvard Chan School as a tenured Professor of Biostatistics in 2009. She was appointed Associate Dean of Information Technology in 2011 and Senior Associate Dean for Research in 2013.