0
Initializing AI Systems
Work About Stack Contact
AI Engineer · Pakistan · 2026

Building intelligence by design.

I architect AI systems that think, learn, and adapt — from fine-tuned LLMs to real-time computer vision pipelines.

PyTorch LangChain LLMs RAG Systems MLOps Generative AI HuggingFace Computer Vision AWS SageMaker Transformers PyTorch LangChain LLMs RAG Systems MLOps Generative AI HuggingFace Computer Vision AWS SageMaker Transformers
50+ AI Projects shipped
Since 2019
About me

I make machines that understand the world.

5 years building ML systems at Google DeepMind, Stripe and beyond. My obsession: making AI feel less like a tool and more like a collaborator.

Taseer Mehboob
BS AI CUSIT
NeurIPS Author
2021 & 2022
Disciplines
  • 01Large Language Models
  • 02Computer Vision
  • 03Generative AI
  • 04MLOps & Infrastructure
  • 05Reinforcement Learning
99.2% Model accuracy (best)
Fraud detection
Currently
Senior AI Engineer @ OpenAI Partners

Building RAG pipelines serving
500K+ daily users.

Selected Work

Projects that
ship and scale.

01 Enterprise RAG Chatbot
02 Object Detection API
03 Sentiment Engine
04 AI Art Studio
05 AutoML Platform

Enterprise RAG Chatbot

A production-grade Retrieval-Augmented Generation system powering internal knowledge search for enterprise teams. GPT-4 + Pinecone vector DB, serving 500K+ queries daily with sub-200ms latency.

LangChainPineconeFastAPIAWSGPT-4

Real-Time Object Detection API

Custom-trained YOLOv8 model hitting 60 FPS on edge hardware. Zero-latency REST API with batch inference support — used in smart retail and industrial safety monitoring at scale.

PyTorchYOLOv8FastAPIDocker

Financial Sentiment Engine

Fine-tuned BERT model on 10M+ financial news articles, achieving 94% classification accuracy. Powers live trading signals at a hedge fund — 35% increase in signal confidence versus baseline.

BERTHuggingFaceSageMakerPython

AI Art Style Transfer Studio

Stable Diffusion fine-tuned with custom LoRA adapters trained on curated artist styles. Gradio web interface lets creators apply styles instantly — deployed on AWS with auto-scaling GPU pods.

Stable DiffusionLoRAGradioAWS

AutoML Pipeline Platform

End-to-end MLOps system: automated data ingestion, feature engineering, hyperparameter tuning with Optuna, experiment tracking via MLflow, and one-click Kubernetes deployment with Terraform.

MLflowAirflowKubernetesTerraform
Technology Stack

Tools I actually use.

AI / ML
PyTorch
TensorFlow
HuggingFace
LangChain
scikit-learn
Languages
Python
SQL
JavaScript
C++
Bash
Infra & Tools
AWS SageMaker
Docker / K8s
MLflow
FastAPI
Apache Spark
Experience

Where I've
made impact.

2022 – Now

Senior AI Engineer

OpenAI Partners

Lead 6-person team building LLM products. RAG pipelines for 500K+ users, 60% latency reduction via quantization.

GPT-4RAGLangChain
2020 – 2022

ML Engineer

Google DeepMind

Productionized RL agents on TPU clusters. Processed petabytes of training data. Co-authored 2 NeurIPS papers.

RLJAXTPU
2019 – 2020

Data Scientist

Stripe

Fraud detection models cutting chargebacks by 35%. Real-time scoring system handling 1M+ transactions per day.

XGBoostSparkKafka
2018 – 2019

AI Research Intern

Stanford AI Lab

Researched novel attention mechanisms for vision-language models and benchmarked SOTA approaches.

VisionNLPPyTorch
Specialization

What I'm expert at.

Large Language Models

Fine-tuning, prompt engineering, RAG pipelines, and production deployment of LLMs including GPT-4, LLaMA, Mistral, and custom models.

  • GPT-4
  • LLaMA
  • RAG
  • LangChain

Computer Vision

Object detection, segmentation, real-time inference at the edge. Specializing in custom YOLO, DETR, and diffusion-based vision systems.

  • YOLOv8
  • OpenCV
  • ONNX
  • TensorRT

MLOps & Infrastructure

End-to-end ML pipelines, experiment tracking, automated retraining, and one-click deployment on Kubernetes & cloud platforms.

  • MLflow
  • Kubeflow
  • Terraform
  • AWS

Generative AI & Multimodal

Text-to-image, video synthesis, multimodal agents. Stable Diffusion, LoRA fine-tuning, and CLIP-based retrieval at scale.

  • Stable Diffusion
  • LoRA
  • CLIP
  • Imagen

NLP & Conversational AI

Sentiment analysis, entity extraction, summarization, and dialogue systems using BERT, T5, and custom transformer architectures.

  • BERT
  • T5
  • spaCy
  • Transformers

Real-Time AI Systems

Low-latency AI inference for production use cases — edge deployment, model quantization, TensorRT optimization, and streaming pipelines.

  • TensorRT
  • ONNX
  • Quant
  • Edge AI
Certifications

Validated expertise.

Credentials from leading industry organisations, validating hands-on skills in AI, ML, and cloud platforms.

Google Cloud2024
Verified

Professional Machine Learning Engineer

Demonstrates the ability to design, build, and productionize ML models using Google Cloud — Vertex AI, BigQuery ML, and TFX pipelines serving at scale.

Vertex AITFXBigQuery MLMLOps
1 / 5
Amazon Web Services2023
Verified

AWS Certified Machine Learning – Specialty

Validates expertise in building, training, tuning, and deploying ML models using the AWS cloud — SageMaker, data engineering, and model monitoring.

SageMakerS3KinesisCloudWatch
2 / 5
DeepLearning.AI2022
Verified

Deep Learning Specialization

5-course program covering neural networks, hyperparameter tuning, CNNs, sequence models, and structuring production ML projects — by Andrew Ng.

TensorFlowCNNLSTMTransformers
3 / 5
Hugging Face2023
Verified

Natural Language Processing with Transformers

Hands-on covering BERT, GPT, T5, and modern NLP pipelines using the Hugging Face ecosystem — from fine-tuning to deploying in production.

BERTGPTT5HuggingFace
4 / 5
PyTorch Foundation2024
Verified

PyTorch for Deep Learning & Neural Networks

End-to-end mastery of PyTorch — tensors, autograd, custom architectures, distributed training, and model export with TorchScript and ONNX.

PyTorchAutogradONNXTorchScript
5 / 5
✓ Message sent — I'll respond within 24 hours.