Nauman Mustafa

AI systems engineer, former senior machine learning engineer, and indie builder.

I work on OCR, computer vision, LLM workflows, and software delivery with coding agents. This site is a small notebook for projects, field notes, and technical essays from that work.

Portrait of Nauman Mustafa

Projects

TranslateX

An iOS-first real-time voice translation app for English and Japanese conversations.

Shipped personal app

Writing

Running a YouTube Channel on Autopilot

A 2-month YouTube automation experiment: scraping news, turning stories into videos, uploading them automatically, and learning hard lessons about niche, cost, and watch time.

MLUI Mobile: Autify OCR vs. Google OCR

A benchmark of Autify OCR versus Google OCR on rendered mobile UI text, including methodology, accuracy metrics, and failure analysis.

A quest for very long context: Part 1

Experiments on extending transformer context length, including training observations, tradeoffs, and lessons from long-context tuning.

Recent Advancements in AI

A 2019 snapshot of major AI breakthroughs across text, image, audio, and video generation, with practical startup use cases.

This Icon Does Not Exist

Using GANs to generate unique icon designs, with model behavior examples and creative applications for design workflows.

Deploy Machine Learning Model

Step-by-step tutorial for packaging and deploying a machine learning app to Google Cloud Run with Docker and Cloud Shell.

Cloud Run

Overview of Google Cloud Run and how serverless containers can simplify deployment, scaling, and cost management for ML applications.

Deep Learning in Cloud

A cost-focused comparison of cloud GPU options for deep learning across AWS, Paperspace, Colab, and Google Cloud preemptible machines.