Daniel Egbo

Astrophysicist and AI Engineer specializing in machine learning, large-scale data analysis, generative AI, and scientific computing. I develop ML systems, LLM applications, and data platforms while conducting research on radio-emitting stars using MeerKAT, Gaia, and multi-wavelength astronomical surveys.

PhD Candidate at the University of Cape Town & South African Astronomical Observatory.
Cape Town, South Africa

Google Scholar | LinkedIn | GitHub | Twitter

6+

Publications

15+

Projects

4

Awards & Grants

5+

Certifications

About Me

I am an Astronomy PhD Candidate at the University of Cape Town and the South African Astronomical Observatory, researching active radio-emitting stars using multi-wavelength data from MeerKAT, Gaia, and eROSITA. My work involves cross-matching millions of astronomical sources, optical spectroscopy, and statistical analysis to understand stellar magnetic activity.

Alongside my research, I build machine learning systems, LLM applications, and data engineering pipelines. I have developed RAG-based chatbots, computer vision classifiers, fraud detection models, and fine-tuned large language models. I combine scientific rigor with practical engineering to solve real-world problems with AI.

I am actively seeking industry opportunities where I can apply my expertise in machine learning, data science, AI engineering, and scientific computing.

Featured Projects

Radio Star Discovery Pipeline

Automated detection of radio stars using ML classification on MeerKAT survey data.

Radio - X-ray stellar Alliance

Cross-matching radio and X-ray catalogs to identify stellar associations and characterize multiwavelength emission.

Medical Speech Recognition using Whisper

Fine-tuned OpenAI's Whisper model on medical dictation data for accurate clinical transcription.

Iowa Liquor Sales Analytics

Full analytics pipeline including ETL, dashboards, and sales forecasting.

Multi-agent RAG Applications

Multi-agent retrieval-augmented generation system for complex document-based QA.

Brain MRI Tumor Classification

Deep learning model for brain tumor classification from MRI scans using CNNs and transfer learning.

View All Projects Data Science Portfolio

Technical Skills

Programming & Data

Python, R, SQL, Bash, Git, Pandas, NumPy, NVIDIA RAPIDS, cuDF, cuPy

Machine Learning & AI

PyTorch, Scikit-learn, cuML, XGBoost, LightGBM, Hugging Face, OpenAI, LangChain, LangGraph, Strands Agents, LiveKit

Generative AI & LLMs

RAG Systems, Fine-tuning, Prompt Engineering, Qdrant, Pinecone, Milvus, Transformers, NVIDIA NEMO Speech

Data Engineering & Cloud

BigQuery, S3, GCS, dbt, DuckDB, MinIO, Postgres, Metabase, Airflow, Prefect, Kestra AWS, GCP

Astronomy & Scientific Computing

Astropy, TOPCAT, Specutils, Multi-wavelength Data Analysis, Numerical Analysis, Optical Spectroscopy, Survey Cross-matching

Selected Publications

View all publications & presentations

Honors and Awards

Professional Training & Certifications

NVIDIA Deep Learning Institute

  • Building Conversational AI Applications (2025)
  • Accelerating End-to-End Data Science Workflows (2024)
  • Getting Started with Deep Learning (2024)
  • Generative AI with Diffusion Models (2024)
  • Building RAG Agents with LLMs (2024)
  • Disaster Risk Monitoring Using Satellite Imagery (2024)

Data Science & ML

  • MLOps Zoomcamp - DataTalks.Club (2025)
  • Machine Learning Zoomcamp - DataTalks.Club (2023)
  • Applied Data Science II: ML & Statistical Analysis - WorldQuant University (2023)
  • Applied Data Science I: Scientific Computing & Python - WorldQuant University (2023)

Summer Programs

  • Oxford Machine Learning Summer School (2023)
  • COSPAR X-Vision School: X-ray Astronomy (2023)
  • ESCAPE Summer School: Data Science for Astronomy (2021)
  • ZTF Summer School: Time-domain Astronomy (2021)
  • GROWTH Astronomy School: Time-domain Astronomy (2020)