Welcome to my website! I am a fourth-year PhD candidate in the Operations Research Center at MIT where I am advised by Professor Rahul Mazumder. My research lies in the intersection of statistical machine learning and discrete optimization, and I focus on developing efficient and explainable algorithms for data science and ML. I am also interested in developing interpretable ML methods for healthcare and public policy.

Prior to graduate school, I was a Data and Applied Scientist at Microsoft where I worked on online advertising and I am originally from the San Francisco Bay Area.

Please find my CV here.

Select Honors and Awards

  • Invited to the 2025 ISyE-MS&E-IOE Rising Stars Workshop
  • 2025 American Statistical Association Statistical Computing Section Best Student Paper Winner
  • 2024 INFORMS Data Mining Society Best Student Paper Competition 1st Place

Publications

Under Review

  • B. Liu and R. Mazumder. Randomization Can Reduce Both Bias and Variance: A Case Study in Random Forests, arxiv.org/abs/2402.12668, 2024. R&R at Journal of Machine Learning Research (JMLR).

  • B. Liu and R. Mazumder. Locally Transparent Rule Sets for Explainable Machine Learning, 2025, Submitted to Operations Research.

Published Works

  • B. Liu and R. Mazumder. MOSS: Multi-Objective Optimization for Stable Rule Sets. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025.
    • 2024 INFORMS Data Mining Society Best Student Paper Competition 1st Place.
  • B. Liu and R. Mazumder. FASTopt: An Optimization Framework for Fast Additive Segmentation. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2024.
    • 2025 American Statistical Association Statistical Computing Student Paper Competition Winner.
  • B. Liu and R. Mazumder. FIRE: An Optimization Framework for Fast Interpretable Rule Extraction. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2023.

  • B. Liu and R. Mazumder. ForestPrune: Compact Depth-Pruned Tree Ensembles. In Proceedings of the 26th International Conference on Artificial Intelligence and Statistics (AISTATS), 2023.

  • B. Liu , M. Xie, and M. Udell. ControlBurn: Feature Selection by Sparse Forests. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2021.

  • B. Liu*, Y. Zhang*, S. Henderson, D. Shmoys, P. Frazier. Modeling the risk of in-person instruction during the COVID-19 pandemic, INFORMS Journal of Applied Analytics, 2024.

  • P. Frazier, J. M. Cashore, N. Duan, S. Henderson, A. Janmohamed, B. Liu , D. Shmoys, J. Wan, Y. Zhang. Modeling for COVID-19 College Reopening Decisions: Cornell, A Case Study. Proceedings of the National Academy of Sciences.

Talks

  • MIT Sloan Health Systems Initiative Annual Workshop, October 2024
    • Interpretable Machine Learning Methods for Predicting Telemental Health Outcomes
  • INFORMS Annual Meeting, October 2024
    • An Optimization Framework for Fast Additive Segmentation in Transparent ML
  • ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2024
    • An Optimization Framework for Fast Additive Segmentation in Transparent ML
  • International Symposium on Mathematical Programming, July 2024
    • An Optimization Framework for Fast Additive Segmentation in Transparent ML
  • US Census Bureau Center for Statistical Research and Methodology, July 2024
    • Making Tree Ensembles Interpretable
  • ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2023
    • Fast Interpretable Rule Extraction
  • INFORMS Annual Meeting, October 2022
    • Depth-Pruning Tree Ensembles
  • ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2021
    • Feature Selection with Sparse Forests