Kartik Chaudhary
I am currently working as a Lead AI/ML Engineer at Google, where I design AI/ML based solutions/products leveraging the advancements in the field of Computer Vision, NLP, GenAI and Machine Learning.
I am passionate about learning and inventing new ways of improving ML and DL algorithms to solve real world problems more efficiently. Do checkout my blog on Artificial Intelligence - Drops of AI
Get in Touch
LinkedIn  / 
Google Scholar  / 
Twitter  / 
Github  / 
Medium  / 
Personal Blog
Quick links
News |
Books |
Publications |
Patents |
Articles
|
|
News
[2025]: Published a Google Blog on Optimizing image generation pipelines on Google Cloud!
[2024]: Published book on Generative Adversarial Networks titled The GAN Book!
[2024]: Published book on The Definitive Guide to Google Vertex AI with Packt!
[2023]: Got Promoted to Lead AI/ML Engineer at Google, May-2023!
[2022]: Filed a patent on Improving Wind Power production forecast using Image Processing techniques!
[2022]: Published Easter2.0 paper preprint on Arxiv and code on Github!
[2021]: Joined Google as a Senior AI/ML Engineer, Dec-2021!
[2021]: Awarded honorary title of Senior Inventor for several contributions to patents in 2020, Optum!
[2021]: Runner’s Up, Global Data Science Hackathon, Optum!
[2020]: Runner’s Up, Global Data Science Hackathon, Optum!
|
Publications
Most of my research projects are related to computer vision, optical character recognition, image processing and video processing.
|
|
Optimizing image generation pipelines on Google Cloud: A practical guide
Kartik Chaudhary,
Gopala Dhar,
Akhil Sakarwal,
Ashish Tendulkar,
Abhijat Gupta,
Suraj Kanojia
Goolge AI Blog, 2025
Optimizing image generation pipelines through hardware, code, and pipeline improvements boosts performance, cuts costs, and enhances user experience without sacrificing image quality.
publication
|
|
Google Cloud and Apollo24|7: Building Clinical Decision Support System (CDSS) together.
Sharmila Devi,
Kartik Chaudhary,
Nitin Aggarwal,
Gopala Dhar,
Durga Tulluru,
Praful Turanur and Apollo Team.
Goolge AI Blog, 2022
Clinical Decision Support System (CDSS) is an important technology for the healthcare industry that analyzes data to help healthcare professionals make decisions related to patient care.
publication
|
|
Easter2.0: Improving Convolutional models for Handwritten Text Recognition.
Kartik Chaudhary,
Raghav Bali
Arxiv, 2022
Easter2.0 is a small/fast convolutional model for the task of OCR/HTR that works well even when labelled data is limited.
paper |
code
|
|
Easter: Simplifying Text Recognition using only 1D Convolutions.
Kartik Chaudhary,
Raghav Bali
CAIAC, 2021
Easter is a small/fast fully convolutional model for the task of OCR/HTR.
paper |
code
|
|
Dynamic triggering of augmented reality assistance mode functionalities
Kartik Chaudhary,
Sudeep Choudhary,
Raghav Bali,
Anurag Das,
Subhadip Maji
US11663790B2
Granted, 2023
|
|
Automated systems and methods for identifying fields and regions of interest within a document image.
V Kishore Ayyadevara,
Nilav Baran Ghosh,
Yeshwanth Reddy,
Vineet Shukla,
Kartik Chaudhary
US11227153B2
Granted, 2022
|
|
Dynamic detection of cross-document associations.
Swapna Sourav Rout,
Sharlene L. Tan,
Vamsi Bhandaru,
Vineet Shukla,
Akshaya D. N,
Kartik Chaudhary
US12050650B2
Granted, 2024
|
|
Dynamic detection of cross-document associations-II
Swapna Sourav Rout,
Sharlene L. Tan,
Vamsi Bhandaru,
Vineet Shukla,
Akshaya D. N,
Kartik Chaudhary
US11508171B2
Granted, 2022
|
|
Machine learning models for multi-risk-level disease spread forecasting.
Kartik Chaudhary,
Vineet Shukla,
Pooja Mahesh Rajdev,
V Kishore Ayyadevara,
Shivam Mishra,
Neelesh Bhushan,
Sahil Jolly
US20210358640A1
Application, 2021
|
|
Predictive document conversion.
Kartik Chaudhary,
Raghav Bali,
V Kishore Ayyadevara,
Yerraguntla Yeshwanth Reddy
US20210326631A1
Granted, 2024
|
|
Wind power production prediction using machine learning based image processing.
Kartik Chaudhary,
Supriya Sharma
WO2024097438A1
Published, 2024
|
|
Training classification machine learning models with imbalanced training sets.
Kartik Chaudhary,
Ankit Varshney,
Rajat Gupta,
Snigdha Sree Borra,
Yogesh K. Dagar
US20230134348A1
Published, 2023
|
Articles
|
|
Understanding Audio data, Fourier Transform, FFT and Spectrogram features for a Speech Recognition System.
Kartik Chaudhary
Medium, 2020
|
|
Explaining Reinforcement Learning to your next-door-neighbor.
Kartik Chaudhary
Drops of AI, 2020
|
|
Optimizers explained for training Neural Networks.
Kartik Chaudhary
Drops of AI, 2020
|
|
1D-CNN based Fully Convolutional Model for Handwriting Recognition.
Kartik Chaudhary
Drops of AI, 2020
|
|
Boosting your Sequence Generation Performance with ‘Beam Search + Language model’ decoding.
Kartik Chaudhary
Drops of AI, 2020
|
|
Deep Learning with PyTorch: Introduction.
Kartik Chaudhary
Drops of AI, 2020
|
|
Deep Learning with PyTorch: First Neural Network.
Kartik Chaudhary
Drops of AI, 2020
|
|
OpenCV: Introduction and Simple Tricks in Python.
Kartik Chaudhary
Drops of AI, 2020
|
|
Convolutional Denoising Autoencoders for image noise reduction.
Kartik Chaudhary
Drops of AI, 2020
|
|
Sound Wave Basics — Every Data Scientist must know before starting analysis on Audio Data.
Kartik Chaudhary
Medium, 2019
|
|
Variational AutoEncoders and Image Generation with Keras.
Kartik Chaudhary
Drops of AI, 2019
|
|