
Conference Papers
Zero-adjusted Inverse Gamma Regression Model
The analysis of data containing zero observations presents challenges in econometrics, biological sciences, and actuarial studies, where variables of interest often assume null values in significant portions of the dataset. This phenomenon is common in insurance claim frequency studies (where some clients report no claims) or ecological surveys (where species may be absent from samples). Traditional regression models like gamma regression fail to accommodate this zero-inflation characteristic.
Zero-adjusted regression models (zero-inflated or hurdle models) address this by separately modeling the probability of zero vs. positive observations. While Vitorino (2024) recently proposed a zero-adjusted inverse gamma distribution for descriptive analysis, it lacked regression capabilities. Our work extends this into a full regression framework, allowing covariate analysis for both the binary (zero vs. positive) and continuous (positive values) components. This provides researchers with a more flexible tool for zero-inflated positive continuous data with heavy-tailed characteristics.
View Full PaperZero-adjusted Inverse Gamma Regression Model
Extended version presented at the undergraduate research symposium, featuring additional simulation studies comparing our model's performance with Tweedie and zero-inflated gamma alternatives under varying zero-inflation scenarios (10%-90% zeros).
View Extended VersionThesis
Predictive Modeling of League of Legends Matches Using Supervised Learning
This study applies supervised learning models to predict match outcomes in League of Legends, one of the world's most popular esports. We evaluated K-Nearest Neighbors (KNN), Logistic Regression, Gradient Boosting, and Decision Trees using data from Oracle's Elixir (10,784 competitive matches from 2022-2023 seasons).
Methodology involved: (1) strategic feature engineering (early-game gold differentials, champion pick synergies), (2) elimination of non-predictive metadata, and (3) validation via AUC-ROC and accuracy metrics. Key findings:
- Logistic Regression and KNN achieved peak performance (AUC=0.930, accuracy=87.2%)
- First 8-minute game state metrics were 3.4× more predictive than late-game features
- Champion selection contributed 62% of model predictive power
The framework demonstrates how machine learning can quantify strategic decision-making in competitive gaming.
View Thesis