Syllabus

Statistical Methods Workshop Staff

  • Instructor: James Balamuta
  • Affiliation: University of Illinois Urbana-Champaign
  • Email: balamut2 illinois edu

Biography:

I am currently a Visiting Assistant Professor in the Department of Statistics at the University of Illinois Urbana-Champaign. In the Summer of 2021, I completed my PhD in Informatics and deposited my dissertation on Bayesian Estimation of Restricted Latent Class Models under the supervision of Prof. Steven Culpepper. I write and teach in R and Python just about every single day. I enjoy finding and working on problems that have multiple ways to reach a solution. Research wise, I primarily focus on latent variable modeling and psychometrics that often necessitates integrating compiled code routines with R. On the teaching side, I’m interested in data science pedagogy and figuring out innovative ways to incorporate industry data science techniques in academic settings.

Midwest

  • Teaching Assistant: David Lundquist
  • Affiliation: University of Illinois Urbana-Champaign
  • Email: davidl11 illinois edu

  • Teaching Assistant: Man Fung “Heman” Leung <(???)>
  • Affiliation: University of Illinois Urbana-Champaign
  • Email: mfleung2 illinois edu

Mid-Atlantic

  • Teaching Assistant: Teresa Huang
  • Affiliation: Johns Hopkins University
  • Email: nhuang19 jhu edu

  • Teaching Assistant: Salma Tarmoun
  • Affiliation: Johns Hopkins University
  • Email: starmou1 jhu edu

INMAS Statistical Methods Workshop Description

The INMAS Statistical Methods Workshop is designed for graduate students who are transitioning into an industry data science role. The workshop focuses on programming with data in Python for statistical purposes. Topics will include: data wrangling (cleaning data), statistical modeling (linear and logistic regression), visualization techniques, ethics, and model deployment.

Schedule

Date Seminar Topic Time
Pre-workshop
Google Colab Anytime
—- —– —-
Fri, Nov 12th Session #1
Welcoming Statements 5:00 - 8:00 PM CDT
Data Wrangling with Pandas
Data Cleaning
—- —– —-
Break 6:25 - 6:40 PM CDT
—- —– —-
Sat, Nov 13th Session #2
Visualizations 9:00 AM - 12:00 PM CDT
—- —– —-
Break 10:30 AM - 10:45 AM CDT
—- —– —-
Lunch Break 12:00 PM - 2:00 PM CDT
—- —– —-
Sat, Nov 13th Session #3
Linear Regression 2:00 - 5:00 PM CDT
Break 3:30 - 3:45 PM CDT
—- —– —-
Sun, Nov 14th Session #4
Logistic Regression 9:00 - 12:00 PM CDT
Break 10:30 - 10:45 AM CDT