Paul Okafor

Data Scientist & AI Engineer

Computer Vision Enthusiast
AI/ML Expert
LLM/GenAI and AI Agent Engineer

Professional Summary

Paul Okafor is a Data Scientist and AI Engineer with a strong foundation in machine learning, deep learning, and natural language processing (NLP). He develops and deploys AI-powered solutions, with experience spanning computer vision, large language models (LLMs), AI research, and intelligent AI agents. He has used data to solve complex problems in projects addressing healthcare challenges, unemployment, and the digital lending landscape in Africa.

His technical expertise encompasses a wide range of tools and technologies, including Python, Streamlit, TensorFlow, PyTorch, scikit-learn, and various cloud platforms. Paul's projects, such as his sentiment analysis of Nigerian digital lending platforms and his contributions to hackathons focused on African health and unemployment, highlight his commitment to using data science for social impact. He is proficient in building end-to-end machine learning pipelines, from data collection and preprocessing to model development, deployment, and visualization.

Passionate about continuous learning and contributing to the open-source community, Paul is also focused on developing cutting-edge AI solutions for healthcare, including cost-sensitive online learning models for anomaly detection in real-time monitoring and interpretable machine learning models for optimizing biomanufacturing processes. His research has been published at the 2024 IEEE Big Data Conference, demonstrating his commitment to advancing the field. He is adept at transforming complex data into actionable insights and is driven by the potential of AI to create positive change. His experience includes building custom LLM chatbots, license plate detection systems, and predictive models for disease classification, showcasing his versatility and adaptability across different domains.

Skills & Expertise

AI & Machine Learning

Neural Networks
Reinforcement Learning
Deep Learning
Generative AI
Causal Inference
LLM Fine-Tuning

Cloud Architecture & MLOps

AWS SageMaker
Docker/Kubernetes
CloudFormation
Vector DBs
GPU Acceleration
CI/CD
Git

Programming

Python
Java
TypeScript
TensorFlow
PyTorch
LangChain
LangGraph
FastAPI
Flask
Next.js
Hugging Face Transformers
Scikit-Learn
CUDA
Pandas
NumPy

Data Science

Feature Engineering
Dimensionality Reduction
Clustering
Statistical Modeling
Databricks
Plotly Dash

Work Experience

Graduate Research Assistant

University of Oklahoma

Norman, OK, USA

August 2023 - Present

  • Developed cost-sensitive online learning models for control chart pattern recognition, improving real-time monitoring and anomaly detection in manufacturing processes, and significantly reducing misclassification costs.
  • Applied interpretable machine learning (IML) techniques, including SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations), to optimize recombinant protein titer production in *E. coli* fermentations, contributing to advancements in biomanufacturing.
  • Co-authored and presented research on advanced filtering techniques (Kalman Filter, Particle Filter) for titer estimation at the 2024 IEEE Big Data Conference, demonstrating improved accuracy in recombinant protein production using fermentation data.
Python
PyTorch
TensorFlow
Machine Learning
Deep Learning
Cost-Sensitive Learning
Online Learning
Control Chart Pattern Recognition
Anomaly Detection
Interpretable Machine Learning (IML)
SHAP
LIME
Time Series Analysis
Data Analysis
Filtering Techniques
Kalman Filter
Particle Filter
Biomanufacturing
Statistical Modeling
R

Data Scientist

Backyard Innovations Limited

Nigeria

September 2020 - July 2023

  • Led the development and deployment of predictive analytics models for smart home energy systems, using advanced machine learning techniques (e.g., time series forecasting with LSTM networks) to predict hourly energy consumption. This resulted in a 20% improvement in system efficiency and reduced energy waste.
  • Performed comprehensive analysis of weather data (temperature, humidity, solar radiation) and energy consumption data from smart homes to identify optimization opportunities, leading to significant operational cost savings and improved energy usage patterns.
  • Developed and maintained data pipelines to ingest, process, and store large volumes of time-series data from smart home devices, ensuring data quality and availability for analysis and modeling.
  • Created interactive dashboards and reports using Power BI to visualize key performance indicators (KPIs) and communicate insights to stakeholders, facilitating data-driven decision-making.
Python
Machine Learning
Predictive Analytics
Time Series Forecasting
LSTM Networks
Data Modeling
Data Analysis
Power BI
Data Pipelines
ETL
SQL
Data Visualization

Technical Sales Engineer

Fortizo Energy Resources Limited

Nigeria

February 2020 - July 2020

  • Developed and delivered technical bids and proposals, resulting in a significant increase in the company's contract win rate and improved client satisfaction.
  • Utilized expertise in process designs, equipment lists, and heat/material balances to ensure the accuracy and efficiency of project deliverables.
  • Built and maintained strong client relationships to drive sales targets and revenue growth.
  • Conducted market research and competitive analysis to identify new business opportunities and inform sales strategies.
Technical Sales
Proposal Writing
Process Design
Client Relationship Management
Communication
Market Research
Competitive Analysis
Presentation Skills

Education

Ph.D. in Data Science and Analytics

University of Oklahoma

January 2025 - Present

GPA: 3.5

2024 IEEE Big Data Conference Paper Publication

Activities and Societies:

  • OU Data Science Club

M.S. in Data Science and Analytics

University of Oklahoma

August 2023 - December 2024

GPA: 3.5

2024 IEEE Big Data Conference Paper Publication

Activities and Societies:

  • Graduate Research Assistant
  • National Science Foundation (NSF) Agric AI coding challenge Winner
  • TeamElectra in the Air Selangor Data & Digital Hackathon 2024 in Malaysia (Finalist)

M.S. in Petroleum Engineering and Project Development

University of Port Harcourt

November 2018 - November 2019

GPA: 4.6

B.S. in Petroleum and Gas Engineering

University of Lagos

October 2012 - December 2016

GPA: 3.54

Hobbies & Interests

📺 Binge-Watching TV Shows

Watching popular series and discovering new favorites

🏔️ Hiking

Exploring Oklahoma's beautiful trails and mountain ranges

Camping

Connecting with nature and embracing outdoor adventures

🏀 Basketball

Playing pick-up games and following the NBA

✈️ Traveling

Exploring new cultures and broadening perspectives

🍳 Cooking

Trying new recipes and experimenting with different cuisines