profile photo

Hardik Shah

I am currently working at Flexion Robotics - attempting to make humanoids more capable and autonomous. I did my MSc in Computer Science at ETH Zurich and my undergrad at BITS Pilani, Goa.

I have had the pleasure of working with some amazing researchers and engineers at :

 ~  Email  |  CV  |  Resume  |  Google Scholar  |  Github  |  LinkedIn  ~ 


P.S. I play tennis, and find freedom in running and cycling. I also enjoy reading and occasionally dabble in chess. Rafa remains the GOAT for me even if Djokovic goes on to win 30 grand slams.


Visiting Student Researcher | NASA JPL
Apr '25 - Nov '25
MSc. Thesis - With Patrick Spieler and Prof. Dr. Marco Hutter.
Under Review. [Paper][Code][Website]

Developed WildOS, combining complementary strengths of visual and geometric perception for long-range navigation and object search in large-scale off-road and vegetated environments.
Research supported by the Zeno Karl Schindler Foundation Master Thesis Grant.

Computer Vision Student Researcher | Scandit
Jul '24 - Mar '25
ML in the Barcode Tracking Team - With Menelaos Kanakis and Matthias Bloch.

Replacing traditional keypoint detectors with learned detection and matching methods in the tracking pipeline of Scandit's MatrixScan product, which leverages SLAM for AR-based inventory management on resource-constrained devices. Optimizing training paradigms, and exploring lightweight architectures for efficient on-device inference along with custom evaluation benchmarks.

Graduate Student Researcher | Robotics and Perception Group, UZH
Feb '24 - Mar '25
Research - Under Prof. Dr. Davide Scaramuzza.
Accepted to CVPRW '25. [Paper][Code][Poster]

Developed ForesightNav, an imagination-driven exploration strategy for efficient long-horizon and open-vocabulary navigation using CLIP-grounded semantic and geometric predictions of unseen regions.

Graduate Student Researcher | Computer Vision and Geometry Group, ETH
Feb '24 - Mar '25
Research - Under Prof. Dr. Marc Pollefeys

Developed POLD2, a deep learning-based pipeline that jointly detects and describes both point and line features in images, optimizing feature extraction for 3D vision tasks like SLAM and pose estimation. By sharing computations between points and lines, POLD2 achieves a significant 9.5x speedup in inference time compared to traditional methods, while maintaining comparable accuracy.

Student Researcher | Google Research, India
Aug '22 - Jun '23
Undergraduate Thesis - Accepted to CVPRW '24! [Paper]

Developed a versatile neural network compression toolbox that optimizes for the model's FLOPs via a novel latency surrogate across a family of compression methods, including pruning and low-rank factorization. Additionally, optimized on-device latency of large vision models used for OCR tasks in Google Lens, and QR-code scanning in GooglePay.

Summer Research Intern | Karlsruhe University of Applied Sciences, Germany
May '22 - Aug '22
Research funded by the DAAD WISE Scholarship [Code][Website]

Designed an end to end pipeline for multi-view stereo dense 3D reconstruction from a handheld stereo-camera(Intel RealSense) that outputs stable dense pointclouds. Integration of classical visual SLAM algorithms with U-Net adapted deep learning architectures for dense depth prediction.

Researcher | APPCAIR & Intel Labs
May '22 - Aug '22

Demonstrated use of ensemble learning for the task of activity recognition/video classifi- cation on the Something-Something-v2 dataset. Weak learners were typically used for feature extraction. Explored various methods for combination of features, ultimately used for downstream classification

Machine Learning Research Intern | CEERI Pilani
May '21 - Sept '21

Worked under under the supervision of Prof. Sandeep Joshi and Prof. Madan Lakshmanan. on classification of a person as fatigued or non-fatigued based on PPG signals of a human subject.


This template is a modification to Jon Barron's website. Last Updated: June 12 2025.