Data Wrangling with R: Week 2 Homework

Introduction to Me!

Hello! My name is Laura Neltner, and I am an undergraduate student at the University of Cincinnati. On campus, I am involved in Student Government as a Senator At-Large, in Queen City Consulting as an Analyst, and in Lindner Women in Business as former Chief Information Officer. I grew up in Northern Kentucky with three older siblings (2 sisters and a brother), and my oldest sister is getting married this year! In my free time, I enjoy outdoor activities such as hiking and playing tennis, baking, playing piano, and reading.


Academic Background


Professional Background

  • Incoming Analyst: AMEND Consulting (May 2023 - August 2023)
  • Data Science Intern: 84.51 (May 2022 - August 2022)
    • Identified > 4.2 million households to grow Kroger Credit Card audience from 1.2 million through marketing campaign
    • Improved machine learning model precision by 30% and decreased processing time by 75-85% with legacy rebuild
    • Leveraged diverse features with pySpark & pandas to build models in Databricks and train/score in DataRobot
    • Enabled rapid development of future classification model through GitHub documentation and version control
  • Supply Chain Analyst Intern: The Kroger Co. (May 2021 - August 2021)
    • Engineered data using SQL, Spark SQL, Python, Spark Python, & Pandas to detect meaningful trends and transition data from Kroger servers to the Azure cloud using Databricks
    • Created data visualizations with Tableau and Power BI to promote operational efficiency by demonstrating trends in KPIs like case fill fines (>$10 million/year in revenue) and inventory adjustments (6,100 workbook views/year)
    • Ensured data quality by implementing Microsoft Excel tools, including VLookups, to compare data from various sources

Experience with R

I have been programming with R since August 2022 (for about half of a year). I used it primarily for statistical purposes in a statistics elective, which focused on linear regression. As a result, I have used it extensively for regression but only know the basics about data manipulation.


Experience with Other Analytic Software

Compared to R, I am more familiar with variants of Python and SQL. In two previous internships, I have used Python in conjunction with pandas in JupyterLab, and I have also used pySpark and Spark SQL in the Databricks coding environment.
In my first internship, I primarily used coding for data preparation to feed queries into data visualization software like Tableau and Power BI.
In my second internship, I wrote queries to create dataframes to feed into machine learning models, which were run in DataRobot Cloud and API.


Example Equation

Here is an example equation to calculate the area of a circle of radius 3

radius = 3
area = pi * I(radius^2)
area
## [1] 28.27433