๐Ÿ“About

I'm an aspiring data engineer/full-stack developer with a passion for building efficient, scalable systems and tackling complex technical challenges through projects that blend creativity and technical expertise.

Recently, I worked as a Data Engineer Intern at Perdis AI LLC, where I enhanced data monitoring capabilities through cutting-edge tools like PySpark and Delta Lake.

I am also deeply involved in web development and backend infrastructure, having created and maintained multiple high-performing websites, but I hope to only improve my skills in these fields.

Beyond my technical pursuits, I enjoy competitive gaming, swimming, reading, and weightlifting. I am currently exploring graduate programs to further my expertise in data engineering and distributed systems.

๐ŸŽ“Education

Master of Science - Computer Science

Program Selection In Progress

Currently exploring graduate programs in Data Engineering and Distributed Systems

NCSU Logo

North Carolina State University - College of Engineering

B.S., Computer Science (Expected May 2025)

GPA: 3.7 Cumulative / 3.75 Departmental

Honors & Activities:

  • Dean's List
  • NCSU Hackathon Participant - Developed NCSUHealth

Relevant Coursework:

Data Structures & Algorithms, Operating Systems, Automated Learning and Data Analysis, Intro to Artificial Intelligence, Software Development Fundamentals, Software Engineering, C and Software Tools, Computer Security, Computer Organization and Assembly Language, Senior Design (Capstone Project), Software Testing

๐Ÿ’ผWork Experience

Data Engineer Intern @ Perdis AI LLC (June 2024 - August 2024)

Full-Stack Developer @ Beesbridge LLC (December 2022 - Present)

๐Ÿš€Projects

Here I've listed my projects, some of which have links directly to their repo! Others may not have links as a result of them being private tools, but if you would like to see them, don't hesitate to contact me!

Data Observability Tool

Data Observability Tool for Profile and Drift Metrics

  • Developed an open-source tool with PySpark to enhance data monitoring by implementing profile and drift metrics that aggregate, reformat, and analyze large incoming datasets incrementally.
  • Generated detailed visualizations and dashboards of big data through profiling metrics such as quantile distributions, frequent items, and numerous statistical calculations, which detect data drifts.
SCD Type 2

Data Engineering: Slowly Changing Dimension (SCD) Type 2

  • Implemented SCD Type 2 to track historical changes in dimensional data, ensuring full visibility of past data states for reliable analysis.
  • Utilized Delta Lake, PySpark, and SQL to efficiently manage records by merging new data with existing datasets and tracking changes using dynamic updates and versioning.
NCSUHealth

NCSUHealth

  • Developed a full-stack health tracking application using React, Django, and MongoDB that processes and analyzes student nutrition and exercise data.
  • Implemented data pipeline integrations with NC State's dining database and designed scalable database schemas for efficient querying of user health metrics.
Beesbridge

Beesbridge LLC Website

  • Created and maintained a website for a boutique data engineering company using WordPress.
  • Integrated custom HTML and CSS to extend WordPress functionality, ensuring the site's standout design.
CUDA Vector

CUDA-Accelerated Vector Calculation

  • Implemented a CUDA algorithm in C for parallel processing to optimize performance in vector magnitude calculations.
TrailManager

TrailManager

  • Created a "Trail Manager" tool using Java to analyze and optimize paths between nodes representing campsites or checkpoints in a wilderness setting.
  • Applied breadth-first search (BFS) algorithms to find the most efficient routes, reducing travel time and resource consumption.
Streaming Monitor

Multi-Platform Streaming Monitor

  • Developed a Chrome extension using JavaScript, Twitch API, and YouTube API to monitor the live status of 100 streamers.
CoffeeMaker

CoffeeMaker Simulation

  • Contributed to unit testing and frontend layout improvements for a coffee recipe management system using Java, Maven, and JUnit.
AES Encryption

AES Encryption and Decryption (C, Makefile)

  • Implemented an encryption and decryption tool using the Advanced Encryption Standard (AES), ensuring secure data transmission and storage.
  • Developed the tool in C, optimizing it for performance and large files, with a Makefile for easy compilation and deployment.

๐Ÿ› ๏ธSkills

Data Engineering & Analytics

  • PySpark
  • SQL
  • Python
  • Big Data
  • Data Lake/Warehouse
  • Hadoop
  • PostgreSQL
  • Docker

Software Development

  • Java
  • C++
  • C
  • C#
  • Golang
  • React
  • Swift
  • MATLAB
  • Arduino

Development Tools & Infrastructure

  • Git
  • Linux
  • Unix
  • Shell Scripting
  • Unit Testing
  • venv
  • Postman
  • Axios
  • Valgrind
  • GDB
  • Swing

๐Ÿ†Certifications

Databricks Certified Generative AI Engineer Associate

Issued Oct 2024 ยท Expires Oct 2026

Databricks Certified Machine Learning Associate

Issued Jul 2024 ยท Expires Jul 2026

Databricks Certified Data Engineer Associate

Issued Jun 2024 ยท Expires Jun 2026