Bhavya Goyal

Applied Scientist
Amazon Robotics

bhavya at cs.wisc.edu

I am an Applied Scientist at Amazon Robotics. My research interests are broadly in the areas of Computer Vision, Robotics and 3D Sensing.

I received my PhD in CS from UW-Madison, where I was part of the Wision Lab and was advised by Prof. Mohit Gupta. I worked on perception models under adverse conditions such as low light, motion, and LiDAR noise. Before that, I completed my undergraduate degree in CS at IIT Delhi, where I was advised by Prof. Vinay Ribeiro.

Earlier, I worked as a Research Engineer at the Visual Understanding Lab of Samsung Research, Seoul, for 3 years. I also spent a summer at Cruise AI with the Perception team and Prof. Yong Jae Lee.

Publications

Robust 3D Object Detection using Probabilistic Point Clouds
[Arxiv] [Project] [Code]
ICCV 2025
Bhavya Goyal, F. Barragan, W. Lin, A. Velten, Y. Li, M. Gupta

Robust Scene Inference under Noise-Blur Dual Corruptions
[Arxiv] [Project] [Code]
ICCP 2022
Bhavya Goyal, J.F. Lalonde, Y. Li, M. Gupta

Photon-Starved Scene Inference using Single Photon Cameras
[Arxiv] [Project] [Code] [Video]
ICCV 2021
Bhavya Goyal, M. Gupta

Attention-based Ensemble for Deep Metric Learning
[Arxiv] [Poster] [Slides]
ECCV 2018
W. Kim, Bhavya Goyal, K. Chawla, J. Lee, K. Kwon

Experience

Samsung Research, Seoul
(Sep'16 - Jul'19)

Research Engineer (Visual Understanding Lab)
Object Recognition and Retrieval, Smart Refrigerators [Link]
- Image recognition algorithms for detecting grocery items inside the refrigerator; the models are used in the product recommendation engine of Samsung Smart Refrigerators.
- Designed techniques using global and attentive deep local descriptors for feature matching, paired with geometric verification of selected keypoints, to recognize products that are partially occluded by other items.
Product Search, Bixby Vision [Link]
- Developed large-scale retrieval models for products in online shopping mall images.
- Designed an attention mechanism in deep neural networks to ignore background noise in query images; it achieves SOTA results on all major image retrieval benchmarks (published at ECCV 2018).

Cruise, San Francisco
(May'22 - Aug'22)

Research Intern (Perception)
Self-Supervised Learning
- Self-supervised pretraining for 3D object recognition models using camera RGB images and LiDAR point clouds.
- Joint pretext tasks for 2D images and point clouds based on masked autoencoders using vision transformers.

Projects

AI Meets Beauty Challenge [ Link ] [ Code ]
ACM Multimedia Conference 2018
- Winner, with SOTA results for recognition across half a million product images.
- Developed a CNN-based retrieval model that uses an attention module to ignore background clutter in product images, with approximate nearest neighbor search over product images in a large-scale retrieval DB.
Tiger ReID in Wild [ Code ] [ Slides ] [ Paper ]
Prof. Yin Li
- Proposed an architecture using object detection and re-ID that encourages diversity among feature embeddings, yielding more discriminative features and boosting performance on most retrieval benchmarks.
Stack-Exchange Tag Prediction [ Code ] [ Slides ] [ Paper ]
Prof. Mausam
- State-of-the-art results for predicting tags/labels for questions on different Stack Exchange portals
- Classified meta features from text and code snippets in questions using an SVM
- Clustered word embeddings from Google's pre-trained Word2Vec model with KMeans, and ensembled the result with a model trained on the term affinity between tags and words
