Background

Talks in the Statistics and Data Science Colloquium are free and open to the public. They are intended to be accessible to a broad undergraduate audience with some background in statistics and data science. Junior and senior statistics majors are expected to attend talks in the SDS Colloquia (please reach out to Professor Nicholas Horton in case of conflicts).

More information about logistics and location can be found HERE.

Data Science Initiative (DSI)

Information about talks and and events sponsored by the Amherst College Data Science Initiative (DSI) can be found here.

Upcoming talks

Monday October 20, 2025: Troy Wixson (University of Massachusetts, Amherst)

Title: Neural classification of asymptotic (in)dependence

  • 4:15pm refreshments, 4:30pm talk
  • Amherst College Seeley Mudd 207

Abstract: The study of extremes allows us to learn about the tail of the distribution. This research is important when studying rare events that have outsized impacts like heatwaves, wildfires, and hurricanes. These problems are inherently multivariate in nature and capturing dependence in the tail is challenging. Classifying a data set as asymptotically dependent (ADep) or asymptotically independent (AInd) is a necessary early choice in the modeling of multivariate extremes. In this talk I will introduce the study of extremes and then perform a series of experiments to determine whether a finite sample has enough information for a neural network to reliably distinguish between these regimes in the bivariate case. These experiments lead to a new classification tool for practitioners which we call nnadic as it is a Neural Network for Asymptotic Dependence/Independence Classification. This tool accurately classifies over 95% of test datasets and is robust to a wide range of sample sizes. These experiments highlight that ADep and AInd models differ in whether the dependence completely decays in the limit, irrespective of the path of that decay.

Bio: Troy Wixson joined the UMass Department of Mathematics and Statistics as a Visiting Assistant Professor after completing his PhD in statistics at Colorado State University this past spring. Troy has research interests related to modeling tail dependence in multivariate extremes with environmental applications and Bayesian modeling of heterogeneous data to probabilistically identify genes associated with disease.

Monday November 17th, 2025: Bonnie Ghosh Dastidar (RAND)

Senior Statistician and Head of RAND Statistics Group

Title: Statistics: a Core Competency for Effective Public Policy

  • 4:15pm refreshments, 4:30pm talk
  • Amherst College Seeley Mudd 207

Abstract: Statistical science is the foundation for evidence-based decision making. As an interdisciplinary science, it has applications to every field imaginable, making statisticians uniquely qualified to lend their expertise in multiple policy domains. Effectively informing policy requires becoming involved early in the design phase; understanding the nature of the issue; and knowing how to communicate, educate, and explain. In this talk, I will provide multiple examples from health policy to highlight both valuable contributions made by statistical scientists and lessons learned – and how this model of collaboration is relevant across other fields and application areas. I will suggest areas of improvement based on lessons learned. And extrapolating from these successes, I will suggest areas for future contributions in which the stakes are very high and involving statistics will be essential.

Bio: Madhumita (Bonnie) Ghosh-Dastidar is a senior statistician, data scientist, and head of the Statistics Group at RAND. Her public policy areas are in public health, prevention, and HIV. Specific examples of studies conducted include evaluation of neighborhood-level interventions to improve diet, physical activity, and cardiometabolic health; randomized trials in HIV patients; and quasi-experimental policy evaluations. Her statistical expertise includes study design, sampling, measurement, missing data, evaluations, and longitudinal/multilevel modeling. Ghosh-Dastidar’s major studies include an objective assessment of sexual assault and harassment in the U.S. military and the first Singapore mental health study. She was the 2024 President of the American Statistical Association (ASA), the largest professional society of statisticians and data scientists. She is an elected fellow of the ASA. The ASA Committee on Women in Statistics recognized her work in statistics and data science by naming her as one of the “Top 20 Women in Statistics and Data Science” (March 2019). Ghosh-Dastidar received her Ph.D. and M.A. in statistics from Penn State, University Park, PA; she also holds a dual B.S. in mathematics and computer science from Albright College, Reading, PA.

Past talks

Monday September 22, 2025: Kate Shutta (Harvard University) and Katharine Correia (Amherst College)

Title: SpiderLearner: An ensemble approach to Gaussian graphical model estimation

  • 4:15pm refreshments, 4:30pm talk
  • Amherst College Seeley Mudd 207

Abstract: Gaussian graphical models (GGMs) are a popular form of network model in which nodes represent features in multivariate normal data and edges reflect conditional dependencies between these features. Currently available tools for GGM estimation require investigators to make several choices regarding algorithms, scoring criteria, and tuning parameters. An estimated GGM may be highly sensitive to these choices, and the accuracy of each method can vary based on structural characteristics of the network such as topology, degree distribution, and density. Because these characteristics are a priori unknown, it is not straightforward to establish universal guidelines for choosing a GGM estimation method. We address this problem by introducing SpiderLearner, an ensemble method that constructs a consensus network from multiple estimated GGMs.

In this talk, Dr. Shutta will present an overview of GGMs, the rationale and methodology behind SpiderLearner, and simulation results. Then, Dr. Correia will present an application of SpiderLearner to low dimensional data aimed at identifying mechanisms that contribute to adverse birth outcomes in the context of HIV infection and antiretroviral exposure.

Bio: Dr. Shutta is a postdoctoral fellow in the Department of Biostatistics at Harvard T.H. Chan School of Public Health. Her work leverages network models of multi-omic data to identify systems-level molecular characteristics associated with the development and progression of disease, including COPD, idiopathic pulmonary fibrosis, and cancer. Her translational goal is to use network features to discover targetable mechanisms of disease progression, thereby perturbing disease trajectories back towards healthy states.

Dr. Correia is a biostatistician interested in the development and dissemination of statistical applications for the medical and public health fields. Her applied research has focused on reproductive health and assisted reproductive technologies. She collaborates with clinicians and epidemiologists on research to elucidate links between modifiable risk factors and poor reproductive outcomes, enabling clinicians to better counsel patients and effectuating positive change in clinical practices.

Logistics

Seeley Mudd Hall is located at the southeast corner of the first year Quadrangle (31 Quadrangle Drive). Paid parking is available at the Amherst Town Common and Boltwood Drive (approximately 8 minute walk). PVTA Bus Service is available from the Amherst College Converse Hall stop (approximately 5 minute walk).

Last updated October 1, 2025

Copyright © 2025 Amherst College. All rights reserved.