Data Scientist, US Department of Defense
BA in Applied Math, University of Illinois Springfield
I am a full-stack data scientist with 2 years of data experience in driving business solutions through data analytics and ML modeling.
I am skilled in Python, machine learning (SKLearn, TensorFlow/Keras), Tableau, and SQL. My domain of expertise is in natural
language processing (NLP) and machine learning classification/prediction.
With a dual background in industry data science and academic AI research, I bring a multidisciplinary problem-solving
toolkit that emphasizes data communicability, user-centered design, and theory-informed application. My experience with
AI governance also enables me to lead engineering teams in meeting data governance, algorithmic
fairness, and general AI compliance standards.
I am currently doing data science work for the US Department of Department, specifically at the NSWC Crane (Navy Research Lab)—where I am developing
an NLP platform to inform the lab's $3.5M research direction, budget, and policy. My passion lies in generating world impact through data-driven insights,
particularly at the intersection of ML modeling and user safety.
Last updated: March 27, 2024.
Experience • Projects • Education • Skills
• Staffed to Naval Surface Warfare Center: Crane Division (NSWC Crane) as part of Department of Defense (DoD) x National Security Innovation Network (NSIN)'s X-Force Program
• Developing an NLP software solution to ingest, analyze, and model research data to inform federal research budget, direction, and policy
• Handpicked as top 0.8% finalist (25 of 3000) to participate in Adobe Digital Academy’s Data Science Fellowship
• Awarded $26,000 (stipend and tuition) to complete 300+ hours of instructor-led AI education from General Assembly Data Science Program
• Engineered 3 projects for Adobe Digital Academy, including NLP classification model for tagging AI-generated text
• 20% Project & Firm Initiative: developed CNN computer vision model for identifying deepfakes and edited images
• Optimized patient treatment speed by 25% through streamlining office workflow (automated forms and data insights)
• Engineered 12 data products (dashboards, databases) to inform 25 medical staff operating across 4,200 patients
• Lead cohort of 35 students in team of 3 teaching staff across 24 weeks of instruction with a 100% graduation rate
• Supported additional 210+ hours of code review and tutoring in weekly sessions and open office hours
• Created in total 8 data products ranging from SQL scripts to Tableau dashboards that helped internal partners manage their business capacity, health, performance, and risk
• Succeeded in providing insight into the loan originations pipeline across 7 departments; optimizations helped 8+ team managers manage 100k data points
• Tutored 25+ students from high school to adult learners for 65+ hours; topics range from AP Computer Science to Intro. ML
• Succeeded in helping 25+ students raise their test scores by average of 2.7 letter grades
• Selected full-stack data solutions and applied machine learning apps
Machine Learning, AI | Scikit-Learn, TensorFlow, Keras, Hadoop, Scipy, Statsmodel |
Analytics, Visualization | Tableau, Pandas, Matplotlib, Numpy, Seaborn, Leaflet.js, Plotly, GMaps API, Web APIs, EDA |
Spreadsheet | Excel, VBA, Google Sheets, Google Apps Script |
Math | Statistics, Probability Theory, Linear Algebra, Modeling, Multivariate Calculus, Discrete Mathematics |
Domains | Clustering/Segmentation, Regression/Forecasting, NLP, CNN, Image Classification, Dimensionality Reduction, Transfer Learning, Deep Learning |
Programming | Python: OOP & Functional, Data Structures and Algorithms, HTML, CSS, JavaScript, Bootstrap, React.js |
Deployment | Flask, Streamlit, AWS, GCP Cloud Run, Github Pages |
Database Management | PostgreSQL (SQL), ETL, MongoDB (NoSQL), SQLite, SQLAlchemy, Pymongo |
Production | Git, Github, Agile (Scrum), Gitlab, Unix |
Webscraping | Selenium, Beautiful Soup, Splinter, Requests, Webdriver.manager |
Other | Google Colab, Jupyter Notebook, Visual Studio Code, LaTeX, AutoHotkey (AHK), HotkeyNet (HKN), Microsoft Word, Microsoft Powerpoint, Google Slides, Google Doc |