Emmanuel Daniel Chonza chonzadaniel

👋 Hi there, I'm Emmanuel Daniel Chonza

🚀 Data Scientist | Generative AI Practitioner | LLM/RAG Engineer

I build end-to-end AI systems that make real-world impact—ranging from LLM fine-tuning and image classification apps, to retrieval-augmented generation (RAG) pipelines and AI-assisted job search agents. My work integrates machine learning, deep learning, and natural language processing (NLP) with cutting-edge tooling like OpenAI, HuggingFace, Streamlit, CrewAI, and ChromaDB.

🧠 What I Do

⚙️ Train and fine-tune LLMs for domain-specific tasks (e.g., sentiment analysis, resumes, instructions).
🤖 Develop computer vision applications using ResNet, VGG, and EfficientNet.
💻 Train and Deploy Supervised Machine Learning Build regression and predictive Machine learning models.
🔍 Build advanced RAG pipelines using ChromaDB, FAISS, and OpenAI APIs.
🧪 Experiment with PEFT techniques (LoRA, QLoRA, IA3, DPO) on real-world datasets.
📊 Design data science workflows: MLflow tracking, feature engineering, and model evaluation.
🌐 Deploy AI apps with Streamlit, FastAPI, Slack bots, and RESTful APIs.
䷢ Design and Build M&E Systems driven by results-based management approach, craft Theory of Change, Results Frameworks, M&E Plans, Analyze data, Visualize, build Dynamic Dashboards.

🔭 Current Work

🔬 Fine-tuning & evaluating LLMs on domain-specific sentiment and intent classification.
🧱 Implementing MLOps/LLMOps pipelines for scalable experimentation.
📈 Improving fairness in ML models trained on imbalanced datasets.
🧠 Prompt engineering for grounded and hallucination-free AI output.
🎯 Deploying AI apps with powerful frontends using Streamlit + LangChain + LlamaIndex.

🧩 Featured Projects

🧠 Multimodal RAG App (📚 + 🌐)

Streamlit-powered GenAI App to retrieve and summarize research papers (PDFs) using a multi-vector retriever, ChromaDB, GPT-4o, and web-augmented generation.
PDF Ingestion → Chunking → Embedding → Retrieval → Generation → UI

🐶 Stanford Dog Breed Classifier (ResNet50 TL)

Fine-tuned ResNet50 model (transfer learning) trained on 120 Stanford Dog Breeds with >80% validation accuracy. App UI built using Streamlit that predicts breed from uploaded .jpeg/.png image.

💼 [Resume & Job Application Advisor]

Agentic Streamlit App powered by CrewAI + Open-source LLMs. Guides users in:

Resume feedback.
Tailored job openings.
Cover letter generation.
Interview Q&A.

💳 [Credit Card Fraud Detector]

Robust ML pipeline for highly imbalanced datasets, including:

Stratified train/test splitting.
Oversampling (SMOTE).
GridSearch + XGBoost.
ROC-AUC, confusion matrix.

🐦 [Racist Tweet Classifier]

NLP workflow with:

SymSpell spell correction.
Stratified cross-validation.
Oversampling.
Streamlit UI for public demo.

🚗 [Used Car Price Prediction]

Regression pipeline using XGBoost, feature engineering, and marketplace data (brand, model, mileage, engine size, etc.).

🧪 [Parameter Efficient Fine-Tuning (PEFT)]

Experiments with LoRA, QLoRA, IA3, and DPO on binary sentiment tasks using HuggingFace Transformers + bitsandbytes.

🖼️ [FoodVision & DogVision]

Custom CNN and pretrained ResNet models trained on:

🍣 Food101 (sushi, pizza, steak...).
🐕 Stanford Dog Breeds (with label mapping & confidence overlay).

📦 Coming Soon

💬 Multi-turn chatbot with memory + web search + RAG.
🧑‍💼 Job Application Assistant v2 (LangGraph-powered).
🛰️ LLM inference microservices (FastAPI + LangServe)
🧬 BGE-Large + Llama3 RAG for scientific documents

📫 Reach Me

🛠️ Tech Stack

Languages: Python, R, SQL, Markdown

Algorithms: LLMs, ML, NLP, Transformers/CNNs/ANN/RNNs/GANs, LSTMs

Frameworks & Tools: PyTorch, scikit-learn, Transformers, Streamlit, MLflow, FastAPI, LangChain, LlamaIndex, ChromaDB, OpenAI, HuggingFace, Plotly, Matplotlib, seaborn , crewai, crewai-tools, APIs

MLOps: MLflow, wandb, Docker, Conda, Git, Kaggle, AWS

Deployment: Huggingface Spaces, Streamlit Cloud, Slack, Local API, Render, AWS

IDEs and Editors: Jupyter, Google Colab, PyCharm, Visual Studio Code, Kaggle, Sublime Text, Thonny

✨ Motto

“Build. Evaluate. Iterate. Deploy. Share.”

Let’s collaborate on AI that matters. Feel free to explore my work or reach out!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly