Faruna Godwin Abuh

Applied AI Safety Engineer · Evaluation & Interpretability

Building safety-aware AI systems with a focus on model behavior, dataset quality, and real-world deployment

Selected Project Metrics

Exploratory & Demo Deployments · Non-Production Research Projects

📊 3,253 Igala-English parallel sentences collected
🔬 Safety & NLP tools deployed
🎯 Vulnerability categories tested
Focus on interpretability, robustness, and dataset quality.
All tools built for research exploration and learning purposes.
๐Ÿ›ก๏ธ LLM Red-Teaming Framework
๐Ÿ”ฌ Mechanistic Interpretability
๐ŸŒ First Igala NMT System
โšก Custom Igala GPT from Scratch

About / Research Philosophy

I focus on applied AI safety: how models behave in real systems, how datasets shape outcomes, and how deployment context introduces risk. My work centers on evaluation, interpretability, and low-resource NLP, where safety issues often appear earlier and more visibly.

Rather than optimizing for scale or performance alone, I prioritize transparency, robustness, and understanding failure modes before deployment. I'm especially interested in how AI safety frameworks apply outside well-resourced settings and how tooling can support safer adoption globally.

Featured Projects

Building AI for Real-World Impact

Research workbench for systematically detecting safety decay in frontier LLMs (Gemini, GPT-4, Claude) when prompted in Yoruba, Hausa, Igbo, and Igala. Features dual-probe comparison, mechanistic visualization, and session analytics, demonstrating 2-4x higher bypass rates in low-resource language contexts.

✓ Side-by-side English baseline vs. target-language analysis
✓ Automatic loophole detection with session statistics
✓ 18 attack vectors across 4 African languages
✓ Mechanistic visualizations: activation smearing & centroid drift
✓ Exportable JSON logs for offline analysis
✓ Google Translate verification integration
Streamlit · Plotly · Google Gemini API · Python · Pandas · NumPy
View Full Case Study
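The dual-probe comparison above can be sketched in a few lines. This is an illustrative stand-in, not the framework's actual code: the same attack prompt is issued once in English (baseline) and once in a target language, responses are classified with a crude refusal heuristic, and per-language bypass rates are compared. The function names, refusal markers, and session log are all invented for the example.

```python
# Hypothetical sketch of the dual-probe idea: compare bypass rates for the
# same probes delivered in an English baseline vs. a target language.
REFUSAL_MARKERS = ("i can't", "i cannot", "i'm unable", "as an ai")

def is_bypass(response: str) -> bool:
    """Crude heuristic: no refusal marker means the probe got through."""
    lowered = response.lower()
    return not any(marker in lowered for marker in REFUSAL_MARKERS)

def bypass_rate(responses: list[str]) -> float:
    return sum(is_bypass(r) for r in responses) / len(responses)

# Toy session log: (language, model response) pairs.
session = [
    ("english", "I can't help with that request."),
    ("english", "I cannot assist with this."),
    ("igala", "Here is one way to do it..."),
    ("igala", "I can't help with that."),
]

by_lang: dict[str, list[str]] = {}
for lang, resp in session:
    by_lang.setdefault(lang, []).append(resp)

rates = {lang: bypass_rate(rs) for lang, rs in by_lang.items()}
print(rates)  # → {'english': 0.0, 'igala': 0.5}
```

A real run would replace the keyword heuristic with a classifier and export each session as JSON, but the rate comparison itself stays this simple.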

Fine-tuned mBERT model for Igala-English translation with quality metrics, uncertainty quantification, and an interactive demo. Addresses critical language-preservation and accessibility needs.

✓ 3,253 parallel sentences training corpus
✓ Real-time bidirectional translation
✓ Confidence scoring for each translation
✓ First publicly available Igala MT system
PyTorch · mBERT · Transformers · Streamlit · HuggingFace
View Full Case Study
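One common way to attach a confidence score to a generated translation, which may or may not match the scoring used here, is the length-normalized sequence probability: the geometric mean of per-token probabilities, i.e. exp of the mean log-probability. A minimal sketch with toy probabilities standing in for real model outputs:

```python
import math

def sequence_confidence(token_probs: list[float]) -> float:
    """Geometric mean of token probabilities (length-normalized),
    so long translations aren't penalized just for being long."""
    log_probs = [math.log(p) for p in token_probs]
    return math.exp(sum(log_probs) / len(log_probs))

# Toy per-token probabilities for a three-token translation.
print(round(sequence_confidence([0.9, 0.8, 0.95]), 3))  # → 0.881
```

Averaging in log space rather than multiplying raw probabilities keeps the score comparable across translations of different lengths.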

Deep dive into mBERT's internal mechanisms during Igala-English translation. Visualizes attention heads, analyzes token alignments, and explores how transformers handle morphologically complex languages.

✓ Layer-by-layer attention visualization
✓ Token alignment analysis across 12 attention heads
✓ Morphological feature tracking
✓ Interactive attention pattern explorer
PyTorch · TransformerLens · mBERT · Streamlit · Plotly
View Full Case Study
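The quantity such a visualizer plots per head is the row-stochastic matrix of softmaxed query-key scores. A minimal dependency-free sketch, with toy 2-D vectors standing in for real mBERT activations and an invented Igala/English token pair:

```python
import math

def softmax(xs: list[float]) -> list[float]:
    m = max(xs)  # subtract max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attention_weights(queries, keys):
    """Scaled dot-product attention weights: softmax(Q K^T / sqrt(d)).
    Each output row sums to 1: how much a query token attends to each key."""
    d = len(queries[0])
    return [
        softmax([sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d)
                 for k in keys])
        for q in queries
    ]

tokens = ["oma", "child"]          # toy source/target token pair
q = [[1.0, 0.0], [0.0, 1.0]]       # toy query vectors
k = [[1.0, 0.0], [0.0, 1.0]]       # toy key vectors
weights = attention_weights(q, k)
for tok, row in zip(tokens, weights):
    print(tok, [round(w, 2) for w in row])
```

In the real tool these matrices come from mBERT's forward pass (one per layer and head) and are rendered as Plotly heatmaps; the math per cell is exactly this.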

A decoder-only transformer trained on an Igala corpus, implementing multi-head self-attention, positional encoding, and custom tokenization. Demonstrates a deep understanding of the transformer architecture.

✓ Built transformer architecture from scratch (no pretrained models)
✓ 268KB Igala corpus training data
✓ Custom BPE tokenizer implementation
✓ Interactive text generation interface
PyTorch · Custom Tokenizer · Streamlit · NumPy · Transformers
View Full Case Study
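The core of BPE tokenizer training, which the custom tokenizer above presumably implements in fuller form, is a short loop: count adjacent symbol pairs, merge the most frequent pair everywhere, repeat. A self-contained sketch on a toy English corpus (a real implementation adds end-of-word markers, a saved merge table, and an encode step):

```python
from collections import Counter

def most_frequent_pair(words: dict[tuple, int]):
    """Count adjacent symbol pairs across the corpus, weighted by word frequency."""
    pairs = Counter()
    for symbols, freq in words.items():
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return max(pairs, key=pairs.get) if pairs else None

def merge_pair(words, pair):
    """Replace every occurrence of `pair` with the fused symbol."""
    a, b = pair
    merged = {}
    for symbols, freq in words.items():
        out, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and symbols[i] == a and symbols[i + 1] == b:
                out.append(a + b)
                i += 2
            else:
                out.append(symbols[i])
                i += 1
        merged[tuple(out)] = merged.get(tuple(out), 0) + freq
    return merged

# Toy corpus: word (as a character tuple) -> frequency.
words = {tuple("lower"): 5, tuple("low"): 7, tuple("newer"): 3}
for _ in range(3):  # three merge steps
    pair = most_frequent_pair(words)
    if pair is None:
        break
    words = merge_pair(words, pair)
print(sorted(words))
```

After three merges the frequent subwords "low" and "er" have fused, which is exactly the compression BPE is after.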

An interactive Streamlit application for exploring and auditing low-resource language datasets. The tool provides sentence statistics, token analysis, and linguistic insights to support NLP research and language preservation efforts.

✓ 3,253 field-collected sentences
✓ First comprehensive Igala NLP dataset
✓ Interactive data exploration interface
✓ Designed for expansion to other African languages
Python · Pandas · Streamlit · HuggingFace · NLP
View Full Case Study
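The kind of corpus statistics such an audit tool surfaces (sentence counts, token counts, vocabulary size, average sentence length) reduce to a few lines. A stdlib-only sketch with a two-sentence toy corpus standing in for the real dataset; the actual tool presumably computes these with Pandas:

```python
from collections import Counter
from statistics import mean

# Toy stand-in corpus; the real dataset holds 3,253 field-collected sentences.
corpus = [
    "oma kekere wa",
    "the small child is here",
]

tokens_per_sentence = [s.split() for s in corpus]
vocab = Counter(tok for toks in tokens_per_sentence for tok in toks)

stats = {
    "sentences": len(corpus),
    "tokens": sum(len(t) for t in tokens_per_sentence),
    "vocab_size": len(vocab),
    "avg_sentence_len": mean(len(t) for t in tokens_per_sentence),
}
print(stats)
```

Tracking these numbers as the corpus grows is a cheap way to catch duplicates, truncated sentences, and skewed length distributions before training on the data.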

Empirical evaluation and mechanistic interpretability analysis of transformer models, with a focus on reliability, calibration, and understanding internal decision mechanisms using GPT-2.

✓ Direct Logit Attribution analysis on GPT-2
✓ Selective prediction and abstention methods explored
✓ Calibration improvements through temperature scaling
✓ Research-grade evaluation framework
PyTorch · TransformerLens · GPT-2 · Python · Scikit-learn
View Full Case Study
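Temperature scaling, the calibration method named above, fits a single scalar T that divides the logits before softmax: T > 1 softens overconfident outputs without ever changing the argmax. A minimal sketch with toy logits and labels standing in for GPT-2 validation outputs, using a grid search where the project may well use gradient-based fitting:

```python
import math

def softmax(logits: list[float], temperature: float = 1.0) -> list[float]:
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def nll(batch_logits, labels, temperature) -> float:
    """Average negative log-likelihood at a given temperature."""
    losses = [
        -math.log(softmax(logits, temperature)[y])
        for logits, y in zip(batch_logits, labels)
    ]
    return sum(losses) / len(losses)

# Toy validation set: per-example logits and the true class index.
batch = [[4.0, 1.0, 0.0], [3.5, 3.0, 0.5], [5.0, 0.5, 0.2]]
labels = [0, 1, 0]

# Grid-search the temperature that minimizes validation NLL.
best_t = min((t / 10 for t in range(5, 51)),
             key=lambda t: nll(batch, labels, t))
print(round(best_t, 1))
```

Because division by T is monotone, accuracy is untouched; only the confidence attached to each prediction changes, which is exactly what calibration asks for.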

A full-stack AI chatbot featuring conversation memory, automatic mode detection, real-time logging, and analytics. Built to provide a personalized portfolio Q&A experience using the Google Gemini API.

✓ Multi-mode responses (deep-dive, quick, story, default)
✓ Sub-2 second average response time
✓ Session persistence across page reloads
✓ Production logging with analytics dashboard
Next.js · FastAPI · Google Gemini · Python · Docker
View Full Case Study
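The automatic mode detection above could be as simple as routing on keyword cues before the prompt ever reaches the model. A hypothetical sketch: the mode names mirror the ones listed for the project, but the cue lists and routing logic are invented for illustration.

```python
# Invented keyword cues per response mode; a real router might use an
# LLM classifier or embeddings instead.
MODE_CUES = {
    "deep-dive": ("architecture", "how does", "explain", "internals"),
    "quick": ("tl;dr", "briefly", "one line", "quick"),
    "story": ("journey", "background", "why did you", "story"),
}

def detect_mode(message: str) -> str:
    """Return the first mode whose cue appears in the message, else 'default'."""
    lowered = message.lower()
    for mode, cues in MODE_CUES.items():
        if any(cue in lowered for cue in cues):
            return mode
    return "default"

print(detect_mode("Briefly, what does the red-teaming tool do?"))  # → quick
print(detect_mode("Tell me about your projects."))                 # → default
```

The detected mode then selects a system-prompt template, so one backend endpoint can serve all four response styles.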

Project Vision

My long-term goal is to build AI systems that are not only powerful, but also trustworthy and useful in real-world settings. This means designing models that communicate uncertainty, respect local languages, and avoid overconfident or unsafe recommendations.

I'm especially interested in deploying AI where access to expertise is limited, and where responsible design can make a meaningful difference. Whether it's building datasets for underrepresented languages, making models more interpretable, or designing systems that know when to abstain, I believe AI should serve communities, not replace human judgment.

🌍

Inclusive NLP

Building for low-resource languages

🔬

AI Safety

Transparency and reliability

🤝

Real Impact

Serving underserved communities

Let's Connect: farunagodwin01@gmail.com

Open to applied AI safety, evaluation, and research-engineering collaboration. If you're working on model behavior analysis, low-resource NLP, or responsible AI deployment, let's talk.

💡 Currently exploring: Selective prediction methods, failure mode analysis in low-resource NLP, and production LLM observability