AI Engineer · Pakistan · 2026

Building intelligence by design.

I architect AI systems that think, learn, and adapt — from fine-tuned LLMs to real-time computer vision pipelines.

PyTorch✦ LangChain✦ LLMs✦ RAG Systems✦ MLOps✦ Generative AI✦ HuggingFace✦ Computer Vision✦ AWS SageMaker✦ Transformers✦ PyTorch✦ LangChain✦ LLMs✦ RAG Systems✦ MLOps✦ Generative AI✦ HuggingFace✦ Computer Vision✦ AWS SageMaker✦ Transformers✦

50⁺ AI Projects shipped

Since 2019

About me

I make machines that understand the world.

5 years building ML systems at Google DeepMind, Stripe and beyond. My obsession: making AI feel less like a tool and more like a collaborator.

BS AI CUSIT

2× NeurIPS Author

2021 & 2022

Disciplines

01Large Language Models
02Computer Vision
03Generative AI
04MLOps & Infrastructure
05Reinforcement Learning

99.2^% Model accuracy (best)

Fraud detection

Currently

Senior AI Engineer @ OpenAI Partners

Building RAG pipelines serving
500K+ daily users.

Selected Work

Projects that
ship and scale.

01 Enterprise RAG Chatbot →

02 Object Detection API →

03 Sentiment Engine →

04 AI Art Studio →

05 AutoML Platform →

Enterprise RAG Chatbot

A production-grade Retrieval-Augmented Generation system powering internal knowledge search for enterprise teams. GPT-4 + Pinecone vector DB, serving 500K+ queries daily with sub-200ms latency.

LangChainPineconeFastAPIAWSGPT-4

View Live ↗ GitHub ↗

Real-Time Object Detection API

Custom-trained YOLOv8 model hitting 60 FPS on edge hardware. Zero-latency REST API with batch inference support — used in smart retail and industrial safety monitoring at scale.

PyTorchYOLOv8FastAPIDocker

View Live ↗ GitHub ↗

Financial Sentiment Engine

Fine-tuned BERT model on 10M+ financial news articles, achieving 94% classification accuracy. Powers live trading signals at a hedge fund — 35% increase in signal confidence versus baseline.

BERTHuggingFaceSageMakerPython

Case Study ↗

AI Art Style Transfer Studio

Stable Diffusion fine-tuned with custom LoRA adapters trained on curated artist styles. Gradio web interface lets creators apply styles instantly — deployed on AWS with auto-scaling GPU pods.

Stable DiffusionLoRAGradioAWS

View Live ↗ GitHub ↗

AutoML Pipeline Platform

End-to-end MLOps system: automated data ingestion, feature engineering, hyperparameter tuning with Optuna, experiment tracking via MLflow, and one-click Kubernetes deployment with Terraform.

MLflowAirflowKubernetesTerraform

Case Study ↗ GitHub ↗

Technology Stack

Tools I actually use.

AI / ML

PyTorch

TensorFlow

HuggingFace

LangChain

scikit-learn

Languages

Python

SQL

JavaScript

C++

Bash

Infra & Tools

AWS SageMaker

Docker / K8s

MLflow

FastAPI

Apache Spark

Experience

Where I've
made impact.

2022 – Now

Senior AI Engineer

OpenAI Partners

Lead 6-person team building LLM products. RAG pipelines for 500K+ users, 60% latency reduction via quantization.

GPT-4RAGLangChain

2020 – 2022

ML Engineer

Google DeepMind

Productionized RL agents on TPU clusters. Processed petabytes of training data. Co-authored 2 NeurIPS papers.

RLJAXTPU

2019 – 2020

Data Scientist

Stripe

Fraud detection models cutting chargebacks by 35%. Real-time scoring system handling 1M+ transactions per day.

XGBoostSparkKafka

2018 – 2019

AI Research Intern

Stanford AI Lab

Researched novel attention mechanisms for vision-language models and benchmarked SOTA approaches.

VisionNLPPyTorch

Specialization

What I'm expert at.

Large Language Models

Fine-tuning, prompt engineering, RAG pipelines, and production deployment of LLMs including GPT-4, LLaMA, Mistral, and custom models.

GPT-4
LLaMA
RAG
LangChain

Computer Vision

Object detection, segmentation, real-time inference at the edge. Specializing in custom YOLO, DETR, and diffusion-based vision systems.

YOLOv8
OpenCV
ONNX
TensorRT

MLOps & Infrastructure

End-to-end ML pipelines, experiment tracking, automated retraining, and one-click deployment on Kubernetes & cloud platforms.

MLflow
Kubeflow
Terraform
AWS

Generative AI & Multimodal

Text-to-image, video synthesis, multimodal agents. Stable Diffusion, LoRA fine-tuning, and CLIP-based retrieval at scale.

Stable Diffusion
LoRA
CLIP
Imagen

NLP & Conversational AI

Sentiment analysis, entity extraction, summarization, and dialogue systems using BERT, T5, and custom transformer architectures.

BERT
T5
spaCy
Transformers

Real-Time AI Systems

Low-latency AI inference for production use cases — edge deployment, model quantization, TensorRT optimization, and streaming pipelines.

TensorRT
ONNX
Quant
Edge AI

Certifications

Validated expertise.

Credentials from leading industry organisations, validating hands-on skills in AI, ML, and cloud platforms.

Google Cloud2024

Verified

Professional Machine Learning Engineer

Demonstrates the ability to design, build, and productionize ML models using Google Cloud — Vertex AI, BigQuery ML, and TFX pipelines serving at scale.

Vertex AITFXBigQuery MLMLOps

1 / 5

Amazon Web Services2023

Verified

AWS Certified Machine Learning – Specialty

Validates expertise in building, training, tuning, and deploying ML models using the AWS cloud — SageMaker, data engineering, and model monitoring.

SageMakerS3KinesisCloudWatch

2 / 5

DeepLearning.AI2022

Verified

Deep Learning Specialization

5-course program covering neural networks, hyperparameter tuning, CNNs, sequence models, and structuring production ML projects — by Andrew Ng.

TensorFlowCNNLSTMTransformers

3 / 5

Hugging Face2023

Verified

Natural Language Processing with Transformers

Hands-on covering BERT, GPT, T5, and modern NLP pipelines using the Hugging Face ecosystem — from fine-tuning to deploying in production.

BERTGPTT5HuggingFace

4 / 5

PyTorch Foundation2024

Verified

PyTorch for Deep Learning & Neural Networks

End-to-end mastery of PyTorch — tensors, autograd, custom architectures, distributed training, and model export with TorchScript and ONNX.

PyTorchAutogradONNXTorchScript

5 / 5

Contact

Got a hard
problem?
Let's solve it.

Email taseermehboob@gmail.com ↗ LinkedIn linkedin.com/in/taseer-mehboob ↗ GitHub github.com/Taseer09 ↗ Twitter @Taseermehboob09 ↗

Building intelligence by design.

I make machines that understand the world.

Projects thatship and scale.

Enterprise RAG Chatbot

Real-Time Object Detection API

Financial Sentiment Engine

AI Art Style Transfer Studio

AutoML Pipeline Platform

Tools I actually use.

Where I'vemade impact.

Senior AI Engineer

ML Engineer

Data Scientist

AI Research Intern

What I'm expert at.

Large Language Models

Computer Vision

MLOps & Infrastructure

Generative AI & Multimodal

NLP & Conversational AI

Real-Time AI Systems

Validated expertise.

Professional Machine Learning Engineer

AWS Certified Machine Learning – Specialty

Deep Learning Specialization

Natural Language Processing with Transformers

PyTorch for Deep Learning & Neural Networks

Got a hardproblem?Let's solve it.

Projects that
ship and scale.

Where I've
made impact.

Got a hard
problem?
Let's solve it.