About Me
Experience
Core Skills
Technical Solutions
Industry Domains
Personal Work/Writings
Long Pythia
Explored token length extension when it was common to have 2048 or 4096 context length.
Getting the Most Out of Pre-trained Models
Deep dive into pre-trained NLP models like GPT-2 and T5, exploring their capabilities for text generation, question answering, summarization, and transfer learning applications. Originally published on Toptal.
Recent Advancements in AI
Comprehensive overview of AI breakthroughs in 2019, covering text generation, image synthesis, audio creation, and video/animation technologies that were transforming industries.
This Icon Does Not Exist — GAN for Icon Generation
An application of Generative Adversarial Networks to icon generation, exploring mode collapse and training challenges with a custom dataset.
Deploy ML on Cloud Run
Complete tutorial on deploying machine learning models to Google Cloud Run using Docker and GPT-2 as an example.
Cloud Run — Future Tech
Explored Google's revolutionary serverless container platform and its comparison with traditional serverless and container technologies.
Deep Learning in Cloud
Explored the cloud computing options for deep learning.
Professional Experience
Sr. Machine Learning Engineer @ Autify
Jan 2025 - June 2025 Bootstrapped project Lexa:
- Generate PRD from Source Code
- Vibe-coded entire desktop app using cursor in Swift UI
- Vibe-migrated the desktop app to Electron for multiple platforms
Jan 2024 - Dec 2024 Bootstrapped project Genesis:
- Designed Prompts for the following:
- Generate Test Cases from PRD
- Generate Test Scenarios from Test Cases
- Generate Playwright code from Test Scenarios using Agentic AI
- Build Backend/frontend using LLMs
July 2023 - Dec 2023 Developed Step Suggestion Chrome Extension:
- Wrote frontend code in PReact
- Wrote Backend in Python/FastAPI
- Developed the algorithm to slim down the HTML
- Designed the prompt to provide next step suggestions based on current page
Jan 2022 - June 2023 Core Deep Learning Work:
- Developed MLUI, an ensemble of ML models for parsing screenshot of mobile apps
- A custom OCR model based on encoder-decoder transformer architecture trained on synthetic data
- Retina net fine-tuned on rico + internal dataset for detecting UI elements
- A post processing model to merge the output of the above two models
- Deployment of MLUI at scale on GKE using cost-effective GPU & queue optimizations
- Explored Reinforcement Learning for automated testing
Jan 2021 - Dec 2021 Built the baseline ML Infrastructure at Autify:
- Set up Google Cloud GKE for ML Deployment
- Set up pipeline for VRT (Visual Regression Testing) algorithm deployment at scale on GKE
- Fine-tuned Adobe Semantic Web Segmentation model
- Explored Visual Self-Healing using Template Matching
May 2020 - Dec 2020 Early Exploration of Application of ML on Web Apps:
- Implemented OpenCV-based web screenshot comparison
- Fine-tuned BERT on HTML data
- Optimized the weights of an internal feature extraction algorithm using evolutionary algorithms