Bryce Grover

Curriculum Learning for Dental Disease Detection

Wed, 15 Apr 2026 04:00:00 GMT

Summary. A three-stage curriculum learning framework (quadrant localization, then tooth enumeration, then disease diagnosis) on the DENTEX 2023 panoramic X-ray dataset (2,032 hierarchically labeled images) using YOLOv8m segmentation models. Against a matched single-stage baseline, the curriculum approach achieved mAP@0.5 of 0.394 versus 0.417, a small but real regression. The empirical takeaway is that on this size of dataset, additional weakly-related supervision didn’t help fine-grained detection. Class imbalance was the dominant limitation, not the training schedule.

Note

This was my final project for DSAN 6600, Neural Networks & Advanced Deep Learning at Georgetown (Spring 2026).

The question

Curriculum learning, training models on easier sub-tasks before harder ones, has a strong intuitive appeal, especially for hierarchical labels. Dental panoramic X-rays are a near-perfect test bed. Every tooth lives in a quadrant, has a number, and may or may not have one of several conditions. Does staging the supervision in that order actually help fine-grained disease detection on a small medical dataset?

Approach

[TODO 1 to 2 paragraphs on data prep, augmentation, model config. Pull from the report. Keep it concrete around image sizes, batch size, loss, and schedule.]

# Sketch of the curriculum schedule. Full code in the repo.
stages = [
    {"task": "quadrant_localization", "epochs": 30, "data": "quadrant_labels"},
    {"task": "tooth_enumeration",     "epochs": 40, "data": "tooth_labels"},
    {"task": "disease_diagnosis",     "epochs": 60, "data": "disease_labels"},
]

Results

[TODO drop in the table comparing curriculum versus single-stage baseline across mAP@0.5, precision, recall, and per-class F1. If the predictions are saved as CSV, render the table here from a pd.read_csv() cell so it stays in sync with the source data.]

What I learned

The interesting part of this project wasn’t the architecture. It was sitting with a result that didn’t go the way I expected and figuring out why. Two things stood out.

The class distribution was doing more work than the schedule. A small handful of disease classes dominated. A curriculum that doesn’t address that imbalance just front-loads the easy stages without solving the actual problem.
“More supervision” is not a free lunch on small datasets. Each curriculum stage adds variance from its own labels. If those labels are only weakly related to the downstream task, you can pay the variance cost without earning the bias reduction.

What I’d do differently

[TODO for example focal loss or class-rebalanced sampling, pretraining on a related larger dataset, ablating which curriculum stages help versus hurt.]

Code

Repository on GitHub

Predictive Modeling of U.S. Oral Health Outcomes

Wed, 01 Apr 2026 04:00:00 GMT

Summary. With a team of three, I led the shallow-learning analysis on NHANES 2017–2018 (n=5,265 adults), benchmarking logistic regression, random forests, and XGBoost across two binary classification tasks and one regression task. Best models hit a 5-fold CV ROC-AUC of 0.849 (self-rated oral health) and 0.844 (clinician-recommended care). A two-stage regression cut DMFT mean absolute error from 6.98 to 4.67 teeth (33%) using only socioeconomic predictors.

Note

DSAN 5300, Statistical Learning, Spring 2026. I owned the data preprocessing pipeline and co-authored the manuscript.

The setup

[TODO 1 paragraph framing. Why NHANES, why these three tasks, what makes oral-health prediction interesting from a public-health standpoint. The economic angle (predicting need for care from socioeconomic features alone) is the strongest hook.]

Data and preprocessing

[TODO describe the merged NHANES tables (oral exam, demographics, SES), the imputation strategy, and the train/test splitting decisions. If you can render a sample DataFrame here it’s a great signal of the data wrangling work.]

Models

# The three model families benchmarked across all three tasks.
models = {
    "logistic":      LogisticRegression(...),
    "random_forest": RandomForestClassifier(...),
    "xgboost":       XGBClassifier(...),
}

[TODO a few sentences on hyperparameter tuning approach (grid versus random versus Bayesian) and any cross-validation specifics.]

Results

[TODO a results table, ideally rendered from saved CSV so it stays accurate. Highlight the headline numbers, ROC-AUC of 0.849 and 0.844, and the 33% MAE reduction.]

What surprised me

[TODO 1 to 2 specific surprises. Examples to consider include which predictors mattered most, where XGBoost beat or didn’t beat logistic regression, and what the residuals told you about who the model misses.]

Caveats

A model that predicts oral-health outcomes from socioeconomic predictors is also, implicitly, a model of structural inequity. The accuracy is real, and so is the responsibility to think hard about how a result like this gets used.

Code

Repository on GitHub

Residential Electricity Demand Forecasting from Weather

Wed, 10 Dec 2025 05:00:00 GMT

Summary. Solo end-to-end project. I built a pipeline that integrates DSGrid synthetic residential demand profiles with ERA5 daily weather data via the Open-Meteo API for New York City, producing a multi-year aligned dataset. I benchmarked supervised regression and classification baselines (linear, logistic, gradient boosting) alongside unsupervised methods (PCA, t-SNE, K-means, DBSCAN, hierarchical clustering), and published the full reproducible workflow.

Note

DSAN 5000, Data Science & Analytics, Fall 2025. My first end-to-end project at Georgetown.

Why this project

[TODO one paragraph on the practical motivation. Utility planning, demand response, the tension between weather-driven peaks and grid stability. Make it about a real-world question, not just “I wanted to learn the pipeline.”]

Data engineering

The unglamorous half of this project was getting two data sources with very different shapes to align cleanly. Synthetic demand profiles at one resolution, ERA5 daily weather at another, all keyed to NYC.

[TODO quick paragraph on the joining strategy, time-zone handling, and missing-data treatment.]

Models

[TODO brief tour through the supervised baselines, then the unsupervised clustering, and what each was for. The interesting story is usually the contrast between supervised performance and what the clusters revealed about the residuals.]

Findings

[TODO 2 to 3 concrete results. Lead with effect sizes, not p-values.]

What this project taught me

[TODO honest reflection. This was your first big end-to-end project. What did you do right, what would you do differently now that you know more?]

Code

Repository on GitHub

Geospatial Crime Pattern Analysis

Sun, 01 Dec 2024 05:00:00 GMT

Summary. Statistical and geospatial analysis of urban crime data, looking for correlations between residential density, commercial zoning, and crime incidence. Built a Python pipeline (Pandas, scikit-learn, GeoPandas) that handles normalization, K-means clustering, and choropleth visualization.

Note

Coursework at Chapman University.

The question

[TODO 1 paragraph framing the question. Be specific. Which city, which crime categories, what years.]

Pipeline

[TODO walk through the steps. Geocoding and spatial join, the normalization choice (per-capita or per-area), and the clustering decision (why K-means rather than DBSCAN here, or vice versa).]

Findings

[TODO 1 to 2 concrete patterns the analysis revealed, with a choropleth or scatter to back them up.]

What I’d do differently now

[TODO this is one of the earlier projects. A short reflection on what an upgraded version would look like is a great signal of growth.]

Code

Repository on GitHub