Curriculum Vitae
Mark Hagemann, Ph.D. Link to heading
Data Scientist, Engineer
- mark.hagemann@gmail.com
- Richmond, VA
- LinkedIn: Mark Hagemann
- GitHub: markwh
Education Link to heading
University of Massachusetts Link to heading
- Ph.D. in Civil and Environmental Engineering 2016
- Dissertation: Predictive modeling of riverine constituent concentrations and loads using historic and imposed hydrologic conditions
- M.S. in Statistics 2017
Carleton College Link to heading
- B.A. in Geology 2010
Work Experience Link to heading
EAB Global Link to heading
- Senior Data Scientist (Nov 2019 - Apr 2024)
- Developed and maintained lead-scoring machine learning models on R / AWS infrastructure, improving AUC by 5-10 points for most models through feature engineering and hyperparameter tuning.
- Created analytics dashboards using R, SQL, Shiny, and CSS to enhance data accessibility and decision-making.
- Built and disseminated R packages to streamline data science workflows across teams.
Ohio State University Link to heading
- Postdoctoral Researcher (Jan 2018 - Oct 2019)
- Contributed to code development for validating error estimates and visualizing errors in statistical models for monitoring rivers using spaceborne radar.
- Co-developed a model to quantify measurement uncertainty for a NASA satellite mission.
University of Massachusetts Link to heading
- Postdoctoral Researcher (Sept 2016 - Dec 2016)
- Designed and published a Bayesian method to estimate streamflow using satellite data.
- Led instruction for undergraduate statistics courses and wrote open-source R packages.
University of Massachusetts Link to heading
- Postdoctoral Researcher (July 2017 - Jan 2018)
- Designed and published a Bayesian method to estimate streamflow using satellite data.
- Led instruction for undergraduate statistics courses and wrote open-source R packages.
Carleton College Link to heading
- Visiting Assistant Professor (Jan 2017 - Mar 2017)
- Provided instruction for a 10-week hydrology course and laboratory.
Skills Link to heading
- R
- Python
- Stan
- MATLAB
- Linux/Bash
- SQL
- Supervised and Unsupervised Learning
- Regression
- Classification
- Bayesian Inference
- Amazon Web Services
- Data Cleaning
- Feature Engineering
- Statistical and Probabilistic Analysis
- ggplot2
- Shiny Dashboard
- htmlwidgets
- plotly.js
- R package development
- Reproducible Research
- Git/Github
- Google Cloud Platform
- Docker
- Machine Learning
- Mathematical Statistics
- Bayesian Statistics
- Linear Models
- Regression Modeling
- Time Series Analysis
- GIS Analysis
Projects Link to heading
Real-Time Categorization Tool for Online Listening Sessions Link to heading
- Developed a tool using the LangChain framework for real-time categorization of responses during online listening sessions. Integrated AI components in Python and built a user interface using R/Shiny.
- Technologies: LangChain, Python, R, Shiny
Personal Project: LLM-based AI Chat Interface in Rmarkdown (“Larkdown”) Link to heading
- Created a bilingual project using R and Python to develop a chat interface with LLMs within Rmarkdown documents for interactive AI interactions.
- Technologies: R, Python
- Link: Larkdown
Leadership Link to heading
- Co-Chair of Programming, Graduate Researchers in Data (GRiD) — UMass Amherst, June 2015 - May 2016
- Student Co-Chair, New England Graduate Student Water Symposium — UMass Amherst, Sept. 2015
Honors and Awards Link to heading
- National Water Center Innovators Program, 2016 Consortium of Universities for the Advancement of Hydrologic Science, Inc.
- R.A. Noga and K.M. Noga Fellowship, 2015 UMass Civil and Environmental Engineering
- Phi Beta Kappa, 2010 Beta chapter of Minnesota
- Distinction in senior thesis, 2010 Carleton College Geology
- Eagle Scout, 2006
- National Merit Finalist