Joshua Norfolk

Logo

View My GitHub Profile

Joshua Norfolk

Professional Summary

Most recently, worked in human data as an operator for two years, at both semi-established company Surge AI and startup Handshake AI.

Prior to my time developing my data science skills independently and then through TripleTen’s program, I worked on a number of fascinating physics projects in my physics B.S. - namely, planning out scientific data collection for a NASA sounding rocket as part of a PSU engineering group and evaluating X-ray detectors’ comparative effectiveness with Python at Lawrence-Livermore National Lab, plus documenting the assembly of a key experiment part for dark matter project LUX-LZ.

Now, I am a Data Scientist with a strong foundation in Python, machine learning, and statistical analysis. Leveraging 1+ year of experience in data-driven projects and a B.S. in Physics; excelling in transforming complex ideas into actionable strategies. Known for quick learning, effective mentorship, and fostering collaborative team environments.

Alongside my physics experience, I thoroughly enjoyed teaching children, working first as a counselor and then as a science teacher for Nature’s Classroom. I developed physics-based curriculums and improved my ability to communicate physics and experimental principles to a less knowledgeable audience, while exhibiting patience.

Though I used C++, Matlab, Excel, and Python in college, I have focused on Python for data science this year. I am comfortable using pandas for exploratory data analysis and preprocessing. I have conducted statistical analyses using namely T-tests, ANOVA, and bootstrapping. I’ve learned how to visualize data with Matplotlib and Pandas, though I often prefer Seaborn. I studied how to build web applications using Streamlit and utilized this for a work project with tech company DataSpeak. Much of my focus was on machine learning: understanding the inner mathematical workings, interpreting results and metrics, preprocessing data correctly for different models, implementing a variety of classification/regression models effectively, and communicating the results. I studied mostly supervised learning, and learned some unsupervised learning. I worked on projects specifically involving time series, neural networks and deep learning with Keras, and Natural Language Processing. I worked heavily with Large Language Models while developing a chatbot for DataSpeak.

As of late spring 2024, I had accumulated several data experiences that have further bolstered my confidence. I worked on an extremely challenging externship with Dataspeak (small tech consulting), where I built a chatbot (like ChatGPT) to answer user questions specifically based on proprietary data, presenting my solution to the CEO. I joined a Data Analysis Hackathon, creating/delivering the final presentation and receiving high marks, contributing to a team win. I then worked on an externship with Besample (small worldwide survey conduction), where I found features that were most important to classifying a user as a bot through reverse-engineered machine learning model outputs, and informed the Besample team on how to use the technique for themselves. I worked with Data Annotation as a freelancer, sharpening my coding skills by conducting reviews of AI (like ChatGPT) outputs, creating science/math-based prompt sets for AI to learn from, and essentially performing a wide variety of tasks contributing to AI development.

Personally speaking: for years I have loved rock climbing and hiking. I devoted a year, sometime between my college graduation and the start of my data science program, to traveling the United States in my car by myself and climbing in the desert and mountains as much as possible. Beyond finding that the world is a beautiful place, I honed my sense of technical and emotional problem-solving in terms of rope systems, trip logistics, and teamwork with strangers. If asked when I’ve had to “think on my feet,” certainly there have been instances in professional settings where this was required of me - but nothing comes to mind more sharply than the numerous times that I took the lead to problem-solve myself and my partners out of unexpected and pressing trouble.

Skills

Tech Projects

Beach Bandits Hackathon Route Optimization (06/24)

Zyfra Gold Recovery Prediction (06/23)

Telecom Churn Prediction (11/23)

Ice Video Game Sales (04/23)

OilyGiant Region Selection (06/23)

Experience

Data Annotator/Code Reviewer | Data Annotation (04/2024 - Present)

Data Scientist/Analyst | Besample (04/2024-06/2024)

Data Scientist/AI Engineer | DataSpeak (09/2023-11/2023)

Inventory Control Specialist | ADUSA Distribution (01/2023 – 11/2023)

Scientific Data Analyst Intern | Lawrence-Livermore National Lab (06/2018-08/2018)

Scientific Researcher | Penn State University (09/2017-04/2019)

Education