Spring 2021 update: due to the COVID-19 pandemic, we will not be hosting a full in-person or virtual CRCS Rising Stars workshop in March 2021. We hope to return to hosting the in-person version of the Rising Stars Workshop in Spring 2022.
Instead, this year we are excited to host a virtual CRCS Rising Stars Speaker Series! Each talk in the series will feature 12 minute presentations from 4 PhD students and postdoctoral candidates who were nominated by experts in the field as having demonstrated exemplary research in topics related to AI for social good. Talks will be followed by a panel discussion with the speakers. Please find more information below, and send any questions via email to firstname.lastname@example.org.
We are excited to announce our final list of nominated CRCS Rising Stars in AI! Please find their names and bios underneath the sessions during which they will give presentations.
All events will take place 12–1:30pm ET
Public Health: Tuesday, March 30
Sonali Parbhoo (Harvard University)
Talk Title: Robust Machine Learning Methods for Targeted Healthcare
Across several fields in science and engineering, we are increasingly turning to machine learning solutions for making decisions that can affect our lives in profound ways.
Unlike many of these success stories, machine learning has had limited success in healthcare. Yet the vast volumes of medical data currently recorded are far beyond what medical experts can analyse. In this talk, I will discuss the importance of building robust tools that can communicate their decisions and limitations to human decision-makers. I will demonstrate how building small, inspectable models that humans can understand can help us manage hypotension in the ICU, and show how incorporating human input into off-policy evaluation can help us find better strategies for managing illnesses such as HIV. Throughout the talk I will highlight several interesting questions that could have a profound impact on healthcare.
Bio: Sonali is a postdoctoral research fellow at Harvard, working with Prof Finale Doshi-Velez. Her research focuses on decision-making in uncertainty, causal inference and building interpretable models to improve clinical care and deepen our understanding of human health, with applications in areas such as HIV and critical care. Her work has been published at a number of machine learning conferences (NeurIPS, AAAI, ICML, AISTATS) and medical journals (Nature Medicine, Nature Communications, AMIA, PLoS One, JAIDS). Sonali received her PhD (summa cum laude) in July 2019 from the University of Basel, Switzerland, where she built intelligent models for understanding the interplay between host and virus in the fight against HIV. She was also a recipient of the Swiss National Science Foundation (SNSF) Mobility Fellowship for her research at Harvard. Prior to this, Sonali received her B.Sc. and M.Sc. in Johannesburg, South Africa where she specialised in Molecular Biology, Computer Science and Mathematics. Apart from her research, Sonali is also passionate about encouraging more discussion about the role of ethics in developing machine learning technologies to improve society.
Paidamoyo Chapfuwa (Duke University)
Talk Title: Counterfactual Survival Analysis with Balanced Representations
Abstract: Survival analysis or time-to-event studies focus on modeling the time of a future event, such as death or failure, and investigate its relationship with covariates or predictors of interest. Specifically, we may be interested in the causal effect of a given intervention or treatment on survival time. A typical question may be: will a given therapy increase the chances of survival of an individual or population? Such causal inquiries on survival outcomes are common in the fields of epidemiology and medicine. In this talk, I will introduce our recently proposed coun- terfactual inference framework for survival analysis which adjusts for bias from two sources, namely, confounding (from covariates influencing both the treatment assignment and the outcome) and censoring (informative or non- informative). I will then present extensive results on challenging datasets, such as the Framingham Heart Study and the AIDS clinical trials group (ACTG).
Bio: Paidamoyo Chapfuwa received B.S.E. with distinction, M.S., and Ph.D. degrees in electrical and computer engineering from Duke University, Durham, NC, USA, in 2013, 2018, and 2021 (expected), respectively. Paidamoyo has been advised throughout her Ph.D. by Drs. Lawrence Carin and Ricardo Henao. Her research focuses on developing modern machine learning approaches, i.e., representation and deep learning, to characterize individualized survival (event times) from clinical data such as electronic health records and more recently, immunomics. Her work incorporates statistical techniques from causal inference, generative modeling, and Bayesian nonparametrics. Her work has culminated in publications at prestigious venues such as IEEE, ACM, ACL, and ICML. See https://paidamoyo.github.io for more information.
Irene Chen (MIT)
Talk Title: Beyond Bias Audits: Building an Ethical Machine Learning for Health Pipeline
Abstract: Machine learning has demonstrated the potential to fundamentally improve healthcare because of its ability to find latent patterns in large observational datasets and scale insights rapidly. However, the use of ML in healthcare also raises numerous ethical concerns, often analyzed through bias audits. How can we address algorithmic inequities once bias has been detected? In this talk, we consider the pipeline for ethical machine learning in health and focus on two case studies. First, cost-based metrics of discrimination in supervised learning can decompose into bias, variance, and noise terms with actionable steps for estimating and reducing each term. Second, deep generative models can address left-censorship from unequal access to care in disease phenotyping. The talk will conclude with a discussion of directions for further research along the entire model development pipeline including problem selection and data collection.
Bio: I’m a Ph.D. student in computer science at MIT, advised by David Sontag in the Clinical Machine Learning group. I work on machine learning methods to advance understanding of health and reduce inequality. Prior to MIT, I completed a joint AB/SM degree at Harvard. I also worked at Dropbox as a data scientist, machine learning engineer, and chief of staff.
Charles C Onu (MILA)
Talk Title: Robust algorithms for the analysis of infant cry sounds to detect pathologies
Abstract: My research is inspired by the goal of developing accurate and robust algorithms for the analysis of infant cry sounds to detect pathologies in the real world. I will discuss our work in learning in the small data setting, model compression and task-invariant representations of cry sounds. I will also describe our ongoing effort, in collaboration with clinicians across 3 countries to collect a large database of newborn cry sounds that are fully-annotated with clinical indications. Such a database will facilitate the development and validation of effective models for pathology detection.
Bio: I conduct my research at the intersection of artificial intelligence and healthcare at Mila and the Reasoning and Learning (RL) lab, McGill University. My supervisor is Prof. Doina Precup, co-director of RL lab and director of the DeepMind lab in Montreal. The overarching theme guiding my work is advancing machine learning to positively impact healthcare. Specific areas I work on include classical ML, deep learning, speech, physiological signal processing and tensor decomposition techniques. I hold a Vanier Canada Graduate Scholarship.
I founded and lead AI Research at Ubenwa. The Ubenwa project is aimed at developing cry-based, low-cost tools for early diagnosis of conditions that affect the central and autonomic nervous systems in newborns. Our work is funded by generous grants from Mila, Ministère de l’Économie et d’Innovation (MEI) du Québec, District 3 Innovation Centre, and MIT Solve.
Conservation: Thursday, April 8
Esther Rolf (UC Berkeley)
Talk Title: A Generalizable and Accessible Approach to Machine Learning with Global Satellite Imagery
Abstract: Combining satellite imagery with machine learning (SIML) has the potential to address global challenges by remotely estimating socioeconomic and environmental conditions in data-poor regions, yet the resource requirements of SIML limit its accessibility and use. We show that a single encoding of satellite imagery can generalize across diverse prediction tasks (e.g. forest cover, house price, road length). Our method achieves accuracy competitive with deep neural networks at orders of magnitude lower computational cost, scales globally, delivers label super-resolution predictions, and facilitates characterizations of uncertainty. Since image encodings are shared across tasks, they can be centrally computed and distributed to unlimited researchers, who need only fit a linear regression to their own ground truth data in order to achieve state-of-the-art SIML performance.
Bio: Esther Rolf is a 5th year PhD candidate in the Computer Science department at UC Berkeley, where she is advised by Mike Jordan and Ben Recht. Esther studies how data acquisition processes and downstream use cases influence the efficacy and applicability of machine learning systems, with emphasis on problems with the potential for positive social impact. Her projects span developing algorithms and infrastructure for reliable environmental monitoring using machine learning and understanding social outcomes of decisions influenced by machine learning systems.
Esther is also a member of the Berkeley AI Research (BAIR) Lab and is a fellow in the Global Policy Lab in the Goldman School of Public Policy at UC Berkeley. During her PhD she has graciously received support from an NSF GRFP grant and a Google PhD fellowship.
Elizabeth Bondi (Harvard University)
Talk title: Imagery and Strategic Reasoning: Making Decisions in Conservation with Imperfect Data
Abstract: In conservation, it is often the case that we have "imperfect" data: noisy, limited, difficult to collect or label, etc. Yet, we may be using these data to inform important decisions, for example, in deploying limited resources to protect animals from illegal poaching. It is therefore imperative to consider these "imperfect" characteristics throughout the process of collecting data, designing algorithms to interpret data and make decisions, and deploying such algorithms, not only during data collection and interpretation. We illustrate this point with conservation drones, including the noisy, real-time data they provide and the decisions we need to make to collect and respond to these data.
Bio: Elizabeth Bondi is a PhD candidate studying Computer Science at Harvard University, with an M.S. in Computer Science from the University of Southern California (USC) and a B.S. in Imaging Science from Rochester Institute of Technology (RIT). At Harvard, she is advised by Prof. Milind Tambe. Her research interests include computer vision and deep learning, remote sensing, and multi-agent systems, especially applied to conservation and sustainability.
She has received a Best Application Demo Award at AAMAS 2019 and a Best Paper Award at the Autonomous Air and Ground Sensing Systems for Agricultural Optimization and Phenotyping Conference at SPIE DCS 2016, in addition to an Honorable Mention for the NSF Graduate Research Fellowship Program in 2017, and a Barry Goldwater Scholarship in 2015.
Sasha Luccioni (MILA)
Talk title: Visualizing the Future Impacts of Climate Change with GANs
Abstract: Climate change is a major threat to humanity, and the actions required to prevent its catastrophic consequences include changes in both policy-making and individual behaviour. However, taking action requires understanding the effects of climate change, even though they may seem abstract and distant. Projecting the potential consequences of extreme climate events such as flooding in familiar places can help make the abstract impacts of climate change more concrete and encourage action. My team has developed a generative model that leverages both simulated and real data for unsupervised domain adaptation and conditional image generation. In my presentation, I will describe the details of the ClimateGAN framework, the components of our architecture and demonstrate that our model is capable of robustly generating photo-realistic climate effects robustly. I will also present "This Climate Does Not Exist", the interactive user experience that will allow people to explore the potential impacts of climate change while learning about its impacts on our planet and society.
Bio: Sasha Luccioni is a postdoctoral researcher working on Artificial Intelligence for Humanity initiatives at Mila Institute, where she leads projects at the nexus of machine learning and social issues such as climate change, education and healthcare. Sasha got her PhD in Cognitive Computing from UQAM in 2018 and later spent two years working in applied machine learning research. Since joining Mila in early 2019, she has organized and led many AI for social good initiatives, conferences and workshops. She is also highly involved in her community, volunteering for initiatives such as Women in Machine Learning, Climate Change AI and Kids Code Jeunesse.
Shiva R Iyer (New York University)
Talk title: Experiences from fine-grained air quality monitoring in Delhi using low-cost sensors
Abstract: Delhi is one of the most polluted cities in India and the world, and despite the installation of over 30 high-end air quality monitors throughout the city by various public bodies, we do not have sufficiently fine-grained information on local pollution levels in order to reason about air quality in specific localities. This public network, while quantifying air quality in an extremely detailed manner in terms of measuring a large number of components in the air, only provides a macro-level picture of air quality in the city. On the other hand, tiny localized pollution hotspots within a city, each created from localized sources (such as open waste burning and vehicle exhaust), add up over space and time to result in the poor air quality that is observed at the macro scale. We partnered with a company that produces air quality filters and monitoring equipment, produced 28 custom-designed low-cost monitors and placed them in various locations in the city, with a heavy concentration in South Delhi. We also formalize the notion of a "hotspot", define various types of hotspots and apply the model to our data. We find that the monitoring network augmented with our low-cost sensors can significantly enhance our understanding of localized levels of air pollution to which citizens are subject on a daily basis. For instance, we uncover locations with better/poor air quality that are not reported in the official government reports. And finally, we design statistical and machine learning models that learn a spatiotemporal "field", which can be used in interpolation and forecasting. Our message-passing neural network model, combined with a state-of-the-art spatiotemporal hierarchical model and a spline correction step, is able to predict PM2.5 values with a mean absolute percentage error of only 10% across all our locations.
Bio: Shiva is a sixth year PhD student in the CS Department, part of the Courant Institute of Mathematical Sciences, at New York University, advised by Prof Lakshminarayanan Subramanian. His research interests in computer science lie in the areas of both networked mobile systems and data science. On the mobile systems front, he has explored methods for improving transport-layer performance in next-generation wireless and mobile communication technologies such as millimeter wave (mmWave). On the data science front, he focuses on algorithms for spatiotemporal forecasting and predictive analytics in urban sensing applications. In the end, he wishes to bring these two disciplines together to design smart systems for urban spatiotemporal sensing applications. His larger goals beyond PhD are to have such systems deployed in the real world for measurable policy impact. He is also a member of the Open Networks and Big Data Lab, NYU Systems and NYU WIRELESS, and was formerly a member of EPoD at Harvard. He is a recipient of the GSAS Dean's Dissertation Fellowship at NYU, awarded to selected PhD students in their final year of dissertation writing, the Nokia Bell Labs Innovation Project Award for his work during his time as an intern at Nokia Bell Labs in the summer of 2019, and the Henning-Biermann Prize, awarded to students for outstanding service in the CS department at NYU.
Fairness: Tuesday, April 20
Lily Hu (Harvard University)
Lily Hu is a PhD candidate in Applied Mathematics and Philosophy at Harvard University. She works in philosophy of (social) science and political and social philosophy. Her dissertation project concerns causal theorizing about the social world, in particular reasoning about the “causal effect” of social categories such as race and sex, and the relationship between this kind of causal theorizing and normative theorizing about core ethical notions such as discrimination and fairness. She has also worked on topics in machine learning theory and algorithmic fairness.
Angela Zhou (Cornell University)
Angela Zhou is a fifth-year PhD candidate at Cornell University/Cornell Tech in Operations Research and Information Engineering. She works at the intersection of statistical machine learning and operations research in order to inform reliable data-driven decision-making in view of fundamental practical challenges that arise from realistic information environments. In particular, her research has focused on robust causal inference for decision-making, and credible performance evaluation for algorithmic fairness and disparity assessment.
Ana-Andrea Stoica (Columbia University)
Ana-Andreea Stoica is a Ph.D. candidate at Columbia University. Her work focuses on mathematical models, data analysis, and inequality in social networks. From recommendation algorithms to the way information spreads in networks, Ana is particularly interested in studying the effect of algorithms on people's sense of privacy, community, and access to information and opportunities. She strives to integrate tools from mathematical models—from graph theory to opinion dynamics—with sociology to gain a deeper understanding of the ethics and implications of technology in our everyday lives. Ana grew up in Bucharest, Romania, and moved to the US for college, where she graduated from Princeton in 2016 with a bachelor's degree in Mathematics. Since 2019, she has been co-organizing the Mechanism Design for Social Good initiative.
Paul Gölz (CMU)
Paul Gölz is a PhD student in the Computer Science Department at CMU and is advised by Ariel Procaccia (now at Harvard). Paul’s research applies tools from AI, algorithms, and game theory to help society make better decisions. A specific interest of his are emerging forms of democratic participation and how these processes can be supported by axiomatic and algorithmic analysis.
Tech + Society: Thursday, April 29
Amber M. Hamilton (University of Minnesota)
Amber M. Hamilton is a doctoral candidate in Sociology at the University of Minnesota, Twin Cities. Her research focuses on the intersection of race, racism, and technology. Her dissertation, titled "Doing Race Online: An Exploration of Race-Making on Social Media Platforms," explores the meaning-making around race and racism that occurs on digital platforms. Amber has worked as a PhD Research Intern at IBM Research, Microsoft Research, and the Berkman Klein Center for Internet and Society.
Brooklyne Gipson (University of Illinois Urbana-Champaign)
Brooklyne Gipson is an Illinois ACLS/DRIVE Distinguished Postdoctoral Fellow in the Digital Humanities at the University of Illinois, Urbana-Champaign. Dr. Gipson is an interdisciplinary communication scholar whose areas of research include digital and social media environments, Black feminist digital/technology studies, and the intersection of race, gender, social media, and power. Her work examines how social media platforms facilitate civic engagement within Black communities. Her current research takes an intersectional approach to analyzing how anti-Black discourses manifest themselves in everyday discursive exchanges within Black social media spaces.
Randi Williams (MIT)
Randi Williams is a 3rd year Ph.D. student in the Personal Robots Group at the MIT Media Lab. She received her Master of Science in Media Arts in Sciences from MIT in 2018 and her Bachelor of Science in Computer Engineering from UMBC in 2016. Her research intersects human-robot interaction and education with a particular focus on engaging students from underrepresented communities in tech. In her current project, How to Train Your Robot, she is working with educators to design AI curricula that teach AI concepts through hands-on projects related to current issues. You can learn more about her past and ongoing projects at https://www.media.mit.edu/people/randiw12/overview/.
Dora Demszky (Stanford University)
Dora is a 4th year PhD student in Linguistics at Stanford, advised by Dan Jurafsky. Her research focuses on developing natural language processing methods to support student-centered education. Her recent publications focus on analyzing the representation of historically marginalized groups in US history textbooks and on measuring teachers' uptake of student ideas in classroom discourse. She is currently leading a project studying the effectiveness of providing linguistic feedback to teachers using NLP.