Kshama Nitin Shah

AI Engineer & Researcher

Machine Learning & Computer Vision

AI Algorithm Developer at Stoneridge Inc. and a recent Machine Learning graduate from the University of Michigan. Passionate about building intelligent systems at the intersection of computer vision and multimodal learning.

View CV
Kshama Nitin Shah

About Me

My research interests broadly lie in image and video understanding and multimodal learning. Specifically, I'm interested in developing 'self-supervised' computer vision models that learn from multimodal sensation like natural language and cross-modal image data.

Featured Projects

SSL Object Detection

Self-Supervised Object Detection with Multimodal Image Captioning

A novel self-supervised pipeline using natural language supervision to localize objects, achieving an mAP of 21.57% when finetuned with an FCOS detector.

Fine-grained Food Classification

Language Supervised Pre-Training for Fine-grained Food Classification

Leveraged a vision and language pre-training model, trained on RedCaps sub-reddits, for fine-grained food classification, achieving 20% top-5 accuracy in zero-shot transfer.

MC-VQA

MC-VQA using Customized Prompts

Developed a novel zero-shot VQA pipeline by conjoining CLIP and T-5, achieving 49.5% accuracy, which is competitive with state-of-the-art zero-shot models.

Depth Estimation

A Monocular Local Mapper for Urban Scenes

Developed a model to perform object detection, semantic segmentation, and depth estimation simultaneously using a U-Net, YOLO-v1, and MobileNet-v3 feature extractor.

Teaching Experience

University of Michigan Logo

Graduate Student Instructor

EECS 442/504 Computer Vision (Fall 2022)

Advisor: Prof. Andrew Owens

University of Michigan Logo

Graduate Student Instructor

EECS 442 Computer Vision (Winter 2023)

Advisor: Prof. David Fouhey

✨ AI Assistant

Hello! I'm Kshama's AI assistant. Ask me anything about her projects or experience.