About Me

Experience

7+ years in industry
6 years ML Engineering 2 years vibe coding

Core Skills

Python PyTorch Docker FastAPI Google Cloud Prompt Engineering

Technical Solutions

OCR Computer Vision NLP Transformers Training & Fine-tuning LLMs

Industry Domains

Retail Receipt Scanning
Software Testing Visual Testing e2e Test Generation Source Code Analysis

Personal Work/Writings

Sep 2023

MLUI Mobile: Autify OCR vs. Google OCR

Comprehensive performance comparison of Autify's in-house OCR system against Google Cloud OCR and EasyOCR, with detailed methodology, results analysis, and sample evaluations. Autify OCR achieved 91% accuracy on mobile screenshot text recognition.

Work Done in Autify
Aug 2023

Token Compression: Reducing Attention Waste?

Explored using LLMs to compress multiple tokens into single tokens for more efficient transformers. Demonstrated that 2048 hidden dimensions can compress ~8 tokens losslessly using a two-stage encode-decode architecture with LoRA fine-tuning.

Apr 2023

Long Pythia

Explored token length extension when it was common to have 2048 or 4096 context length.

Nov 2021

Machine Learning Features in Autify for Mobile

Comprehensive overview of AI-powered mobile testing features including Visual Regression Testing (VRT), Visual Self-Healing algorithms, and the upcoming Visual App Explorer (VAX) for autonomous app navigation and bug discovery.

Work Done in Autify
Jul 2021

Solving Automated App Navigation: A Use-case

Detailed exploration of behavior cloning techniques for automated mobile app navigation, comparing regression vs heatmap approaches, and demonstrating how UΒ²Net successfully models uncertainty in tap location prediction.

Work Done in Autify
Jan 2021

Applying Modern Deep Learning in Autify

Comprehensive overview of deep learning applications in software testing, including visual regression detection, genetic algorithm optimization, graph neural networks for HTML analysis, and reinforcement learning for intelligent test discovery.

Work Done in Autify
Apr 2020

Getting the Most Out of Pre-trained Models

Deep dive into pre-trained NLP models like GPT-2 and T5, exploring their capabilities for text generation, question answering, summarization, and transfer learning applications. Originally published on Toptal.

Published in Toptal
Jun 2019

Recent Advancements in AI

Comprehensive overview of AI breakthroughs in 2019, covering text generation, image synthesis, audio creation, and video/animation technologies that were transforming industries.

May 2019

This Icon Does Not Exist β€” GAN for Icon Generation

An application of Generative Adversarial Networks to icon generation, exploring mode collapse and training challenges with a custom dataset.

Apr 2019

Deploy ML on Cloud Run

Complete tutorial on deploying machine learning models to Google Cloud Run using Docker and GPT-2 as an example.

Apr 2019

Cloud Run β€” Future Tech

Explored Google's revolutionary serverless container platform and its comparison with traditional serverless and container technologies.

Mar 2019

Deep Learning in Cloud

Explored the cloud computing options for deep learning.

Professional Experience