📝About
I'm an aspiring data engineer with a passion for building efficient, scalable systems and tackling complex technical challenges through projects that blend creativity and technical expertise.
Recently, I worked as a Data Engineer Intern at Perdis AI LLC, where I enhanced data monitoring capabilities through cutting-edge tools like PySpark and Delta Lake.
I am also deeply involved in web development and backend infrastructure, having created and maintained multiple high-performing websites, but I hope to only improve my skills in these fields.
Beyond my technical pursuits, I enjoy competitive gaming, swimming, reading, and weightlifting. I am currently exploring graduate programs to further my expertise in data engineering and distributed systems.
🎓Education
Master of Science - Computer Science
Program Selection In Progress
Currently exploring graduate programs in Data Engineering and Distributed Systems

North Carolina State University - College of Engineering
B.S., Computer Science (Expected May 2025)
GPA: 3.7 Cumulative / 3.75 Departmental
Honors & Activities:
- Dean's List
- NCSU Hackathon Participant - Developed NCSUHealth
Relevant Coursework:
Data Structures & Algorithms, Operating Systems, Automated Learning and Data Analysis, Intro to Artificial Intelligence, Software Development Fundamentals, Software Engineering, C and Software Tools, Computer Security, Computer Organization and Assembly Language, Senior Design (Capstone Project), Software Testing
💼Work Experience
Data Engineer Intern @ Perdis AI LLC (June 2024 - August 2024)
Full-Stack Developer @ Beesbridge LLC (December 2022 - Present)
🚀Projects
Here I've listed my projects, some of which have links directly to their repo! Others may not have links as a result of them being private tools, but if you would like to see them, don't hesitate to contact me!

Data Observability Tool for Profile and Drift Metrics
- Developed an open-source tool with PySpark to enhance data monitoring by implementing profile and drift metrics that aggregate, reformat, and analyze large incoming datasets incrementally.
- Generated detailed visualizations and dashboards of big data through profiling metrics such as quantile distributions, frequent items, and numerous statistical calculations, which detect data drifts.

Data Engineering: Slowly Changing Dimension (SCD) Type 2
- Implemented SCD Type 2 to track historical changes in dimensional data, ensuring full visibility of past data states for reliable analysis.
- Utilized Delta Lake, PySpark, and SQL to efficiently manage records by merging new data with existing datasets and tracking changes using dynamic updates and versioning.

NCSUHealth
- Developed a full-stack health tracking application using React, Django, and MongoDB that processes and analyzes student nutrition and exercise data.
- Implemented data pipeline integrations with NC State's dining database and designed scalable database schemas for efficient querying of user health metrics.


CUDA-Accelerated Vector Calculation
- Implemented a CUDA algorithm in C for parallel processing to optimize performance in vector magnitude calculations.




AES Encryption and Decryption (C, Makefile)
- Implemented an encryption and decryption tool using the Advanced Encryption Standard (AES), ensuring secure data transmission and storage.
- Developed the tool in C, optimizing it for performance and large files, with a Makefile for easy compilation and deployment.
🛠️Skills
Data Engineering & Analytics
- PySpark
- SQL
- Python
- Big Data
- Data Lake/Warehouse
- Hadoop
- PostgreSQL
- Docker
Software Development
- Java
- C++
- C
- C#
- Golang
- React
- Swift
- MATLAB
- Arduino
Development Tools & Infrastructure
- Git
- Linux
- Unix
- Shell Scripting
- Unit Testing
- venv
- Postman
- Axios
- Valgrind
- GDB
- Swing