Statistics and Data Science Student
University of Illinois Urbana-Champaign ยท US Citizen ยท Expected Graduation: May 2027
Chicago, IL ยท Willing to Relocate Nationwide
I'm a Statistics major with minors in Computer Science and Data Science at the University of Illinois Urbana-Champaign, passionate about leveraging data to solve real-world problems. With a strong foundation in statistical modeling, machine learning, and data visualization, I'm actively seeking data science internship opportunities where I can apply my analytical skills and continue learning.
My academic journey has equipped me with hands-on experience in building predictive models, conducting exploratory data analysis, and deriving actionable insights from complex datasets. I'm particularly interested in applying statistical methods and machine learning techniques to domains like sports analytics, manufacturing optimization, and agricultural technology.
As a Project Lead at the Illinois Data Science Club and member of NeuroTech Club, I'm constantly expanding my knowledge and collaborating with peers on innovative projects. I'm eager to contribute to a forward-thinking team while gaining practical industry experience.
AGWISE | Remote
Sandia National Laboratories | Champaign, IL
University of Illinois Urbana-Champaign | Champaign, IL
Illinois Data Science Club (iDSC) | UIUC
Illinois Data Science Club (iDSC) | UIUC
NeuroTech Club | UIUC
Led an end-to-end data science project predicting MLB player WAR (Wins Above Replacement) using advanced machine learning techniques, winning 1st place in a competitive 8-week challenge.
Key Achievements:Built predictive models for fourth-down conversion success using NFL play data, comparing neural networks and logistic regression to evaluate model complexity trade-offs.
Key Achievements:A machine learning project focused on mental health risk assessment for students, combining statistical analysis with predictive modeling to identify at-risk individuals.
Key Achievements:An advanced sports analytics project that leverages Statcast data to classify baseball pitches in real-time, demonstrating the intersection of machine learning and sports technology.
Key Achievements:A data-driven analysis exploring how lifestyle and demographic factors influence tumor size using regression and ANOVA modeling in R.
Key Achievements:A collection of exploratory data analysis and regression modeling projects across different domains, showcasing versatility in data science techniques and problem-solving approaches.
Key Achievements:My primary language for data science and programming. I've developed strong proficiency through extensive coursework, personal projects, and structured learning programs.
Experience: I use Python for everything from exploratory data analysis to building production-ready machine learning models. Comfortable with data manipulation, statistical analysis, machine learning pipelines, and algorithm implementation.
Key Libraries: Pandas (data manipulation), NumPy (numerical computing), Scikit-learn (machine learning), PyTorch (deep learning), Statsmodels (statistical modeling), Seaborn & Matplotlib (visualization)
Learned R through statistics coursework and have developed strong skills in statistical analysis and data visualization. While I prefer Python for most data science tasks, I particularly appreciate R's visualization capabilities.
Experience: Proficient in using ggplot2 for creating publication-quality visualizations. Extensive experience with R Markdown for creating reproducible research documents, which also helped me become proficient in LaTeX. Later expanded to Quarto for working with .ipynb files, enabling seamless integration between R and Python workflows.
Key Libraries: tidyverse (data manipulation), ggplot2 (visualization), statistical modeling packages
Self-taught SQL through online platforms and practical problem-solving. Comfortable with database querying, data manipulation, and writing efficient queries for data extraction and analysis.
Experience: Proficient in writing complex SELECT queries, JOINs, subqueries, and aggregate functions. Experience with database design concepts and query optimization. Continuously practicing through HackerRank challenges to strengthen problem-solving skills.
Learned C++ through rigorous coursework focused on data structures and algorithms. Have developed multiple projects demonstrating practical application of systems programming concepts.
Experience: Completed a comprehensive Data Structures and Algorithms course taught in C++, gaining hands-on experience with memory management, pointers, templates, and STL containers. Built several projects that showcase algorithmic thinking and efficient code implementation. Comfortable with low-level programming concepts and performance optimization.
Java was my first programming language and remains a strong foundation of my technical skills. I have extensive experience from both academic work and practical application.
Experience: Developed an Android app using Android Studio, demonstrating mobile development capabilities. As an Assistant Tutor for three semesters at UIUC, I taught Java programming to students with diverse backgrounds, which deepened my understanding of core concepts like OOP, data structures, and software design patterns. This teaching experience honed my ability to explain complex technical concepts clearly.
Currently preparing to learn SAS as part of upcoming coursework next semester. Eager to expand my statistical programming toolkit with industry-standard software.
Anticipated Focus: Statistical analysis, data management, and reporting using SAS programming. Looking forward to applying SAS in real-world data analytics scenarios and adding enterprise-level statistical software to my skill set.
Business intelligence and data visualization tool that I learned through peer mentorship.
Experience: Familiar with creating interactive dashboards and visualizations. Currently building practical experience and planning future projects to showcase Tableau capabilities in presenting data insights to non-technical audiences.
Proficient with essential development tools learned through coursework and hands-on project work.
Tools: Git/GitHub (version control), Jupyter Notebook (interactive development), VS Code (primary IDE), R Studio (R development), Docker (containerization), Excel (data analysis)
Experience: Comfortable with version control workflows, collaborative coding, and using modern IDEs for efficient development. Regular use of these tools across multiple projects has made them integral to my development process.
Click on any skill card to learn more about my experience and learning journey!