📝About

I'm an aspiring data engineer who enjoys building efficient, scalable systems and working through complex technical problems in hands-on projects.

Recently, I worked as a Data Engineer Intern at Perdis AI LLC, where I improved data monitoring using PySpark and Delta Lake.

I am also deeply involved in web development and backend infrastructure, having created and maintained multiple high-performing websites, and I hope to keep improving my skills in these fields.

Beyond my technical pursuits, I enjoy competitive gaming, swimming, reading, and weightlifting. I am currently exploring graduate programs to further my expertise in data engineering and distributed systems.

🎓Education

Master of Science - Computer Science

Program Selection In Progress

Currently exploring graduate programs in Data Engineering and Distributed Systems

North Carolina State University - College of Engineering

B.S., Computer Science (Expected May 2025)

GPA: 3.7 Cumulative / 3.75 Departmental

Honors & Activities:

  • Dean's List
  • NCSU Hackathon Participant - Developed NCSUHealth

Relevant Coursework:

Data Structures & Algorithms, Operating Systems, Automated Learning and Data Analysis, Intro to Artificial Intelligence, Software Development Fundamentals, Software Engineering, C and Software Tools, Computer Security, Computer Organization and Assembly Language, Senior Design (Capstone Project), Software Testing

💼Work Experience

Data Engineer Intern @ Perdis AI LLC (June 2024 - August 2024)

Full-Stack Developer @ Beesbridge LLC (December 2022 - Present)

🚀Projects

Below are my projects. Some link directly to their repos; others are private tools without public links, so don't hesitate to contact me if you'd like to see them!

Data Observability Tool for Profile and Drift Metrics

  • Developed an open-source tool with PySpark to enhance data monitoring by implementing profile and drift metrics that aggregate, reformat, and analyze large incoming datasets incrementally.
  • Generated visualizations and dashboards from profiling metrics such as quantile distributions, frequent items, and summary statistics, which are used to detect data drift.
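
As a rough illustration of quantile-based drift detection, here is a minimal sketch in plain Python rather than PySpark. The function names, the sample data, and the threshold are hypothetical and are not taken from the tool itself; the idea is only to compare quantile cut points of a baseline batch against an incoming batch.

```python
from statistics import quantiles


def quantile_profile(values, n=4):
    """Return the n-1 cut points that split the values into n groups."""
    return quantiles(values, n=n, method="inclusive")


def max_quantile_shift(baseline, incoming, n=4):
    """Largest absolute difference between matching quantile cut points."""
    base_q = quantile_profile(baseline, n)
    new_q = quantile_profile(incoming, n)
    return max(abs(b - c) for b, c in zip(base_q, new_q))


# Hypothetical sample data: the incoming batch is the baseline shifted by 5.
baseline = [10, 12, 11, 13, 12, 11, 10, 12]
incoming = [v + 5 for v in baseline]

DRIFT_THRESHOLD = 2.0  # assumed value, chosen only for this example
drift_detected = max_quantile_shift(baseline, incoming) > DRIFT_THRESHOLD
```

A real pipeline would likely compute approximate quantiles over distributed data and pick thresholds per column; this sketch only shows the comparison step.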

Data Engineering: Slowly Changing Dimension (SCD) Type 2

  • Implemented SCD Type 2 to track historical changes in dimensional data, ensuring full visibility of past data states for reliable analysis.
  • Used Delta Lake, PySpark, and SQL to merge new data into existing datasets and track changes through dynamic updates and versioning.
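
To show the general shape of SCD Type 2, here is a minimal sketch in plain Python (not the Delta Lake merge used in the project). The row layout (`id`, `attrs`, `valid_from`, `valid_to`, `is_current`) and the function name `apply_scd2` are assumptions made for illustration.

```python
from datetime import date


def new_version(key, attrs, as_of):
    """Build a fresh current row for a key (hypothetical row layout)."""
    return {"id": key, "attrs": attrs, "valid_from": as_of,
            "valid_to": None, "is_current": True}


def apply_scd2(dimension, incoming, as_of):
    """Return a new dimension table with SCD Type 2 changes applied.

    dimension: list of row dicts as produced by new_version()
    incoming:  dict mapping id -> latest attribute dict
    """
    result = []
    keys_with_current_row = set()
    for row in dimension:
        key = row["id"]
        changed = (row["is_current"] and key in incoming
                   and incoming[key] != row["attrs"])
        if changed:
            # Close the old version instead of overwriting it...
            result.append({**row, "valid_to": as_of, "is_current": False})
            # ...and open a new current version with the new attributes.
            result.append(new_version(key, incoming[key], as_of))
        else:
            result.append(row)
        if row["is_current"]:
            keys_with_current_row.add(key)
    # Keys with no current row are inserted as brand-new versions.
    for key, attrs in incoming.items():
        if key not in keys_with_current_row:
            result.append(new_version(key, attrs, as_of))
    return result


dimension = [new_version(1, {"city": "Raleigh"}, date(2024, 1, 1))]
incoming = {1: {"city": "Durham"}, 2: {"city": "Cary"}}
updated = apply_scd2(dimension, incoming, date(2024, 6, 1))
```

After the call, the original row for id 1 is kept with a closed validity range, so earlier states stay queryable alongside the current ones.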

NCSUHealth

  • Developed a full-stack health tracking application using React, Django, and MongoDB that processes and analyzes student nutrition and exercise data.
  • Implemented data pipeline integrations with NC State's dining database and designed scalable database schemas for efficient querying of user health metrics.

Beesbridge LLC Website

  • Created and maintained a website for a boutique data engineering company using WordPress.
  • Integrated custom HTML and CSS to extend WordPress functionality and give the site a distinctive design.

CUDA-Accelerated Vector Calculation

  • Implemented a parallel CUDA algorithm in C to speed up vector magnitude calculations.

TrailManager

  • Created a "Trail Manager" tool using Java to analyze and optimize paths between nodes representing campsites or checkpoints in a wilderness setting.
  • Applied breadth-first search (BFS) to find the most efficient routes between nodes, reducing travel time and resource consumption.
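
For readers unfamiliar with BFS routing, here is a minimal sketch in Python (the project itself is written in Java). It assumes an unweighted graph, where BFS returns a route with the fewest hops; the graph, the node names, and the function `shortest_route` are hypothetical.

```python
from collections import deque


def shortest_route(graph, start, goal):
    """Return a fewest-hops path from start to goal, or None if unreachable."""
    if start == goal:
        return [start]
    previous = {start: None}  # also serves as the visited set
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbor in graph.get(node, []):
            if neighbor in previous:
                continue
            previous[neighbor] = node
            if neighbor == goal:
                # Walk the predecessor links back to the start.
                path = [goal]
                while previous[path[-1]] is not None:
                    path.append(previous[path[-1]])
                return path[::-1]
            queue.append(neighbor)
    return None


# Hypothetical trail map: each key lists the checkpoints reachable from it.
trails = {
    "Trailhead": ["Camp A", "Camp B"],
    "Camp A": ["Summit"],
    "Camp B": ["Camp C"],
    "Camp C": ["Summit"],
    "Summit": [],
}

route = shortest_route(trails, "Trailhead", "Summit")
# route == ["Trailhead", "Camp A", "Summit"]
```

If trails had different lengths or difficulty, a weighted search such as Dijkstra's algorithm would be the usual choice instead.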

Multi-Platform Streaming Monitor

  • Developed a Chrome extension using JavaScript, Twitch API, and YouTube API to monitor the live status of 100 streamers.

CoffeeMaker Simulation

  • Contributed to unit testing and frontend layout improvements for a coffee recipe management system using Java, Maven, and JUnit.

AES Encryption and Decryption (C, Makefile)

  • Implemented an encryption and decryption tool using the Advanced Encryption Standard (AES), ensuring secure data transmission and storage.
  • Developed the tool in C, optimizing it for performance and large files, with a Makefile for easy compilation and deployment.

🛠️Skills

Data Engineering & Analytics

  • PySpark
  • SQL
  • Python
  • Big Data
  • Data Lake/Warehouse
  • Hadoop
  • PostgreSQL
  • Docker

Software Development

  • Java
  • C++
  • C
  • C#
  • Golang
  • React
  • Swift
  • MATLAB
  • Arduino

Development Tools & Infrastructure

  • Git
  • Linux
  • Unix
  • Shell Scripting
  • Unit Testing
  • venv
  • Postman
  • Axios
  • Valgrind
  • GDB
  • Swing

🏆Certifications