Felix Yanwei Wang
I am on the job market for research scientist positions starting in spring 2025.
I am a final-year EECS PhD student at MIT CSAIL, working with Julie Shah on robot learning, specifically, inference-time policy alignment through human interactions. I am also a fellow at MIT's Work of the Future group focusing on generative AI.
Before MIT, I did my MS in robotics at Northwestern University and researched with Todd Murphey and Mitra Hartmann on active sensing. I completed my undergraduate degree in physics at Middlebury College with Richard Wolfson.
Outside research, I enjoy theatre and backpacking. I thru-hiked PCT in 2019.
Email
/
Google Scholar
/
LinkedIn
/
CV
/
X
|
|
News
- [Dec 2024]
Invited talk at MIT EI Seminar - "Inference-Time Policy Customization through Interactive Task Specification"
- [Oct 2024]
Invited talk at University of New Hampshire - "Inference-Time Policy Alignment through Human Interactions"
- [Jul 2024]
Organized GenAI-HRI workshop at RSS 2024.
- [Mar 2024]
Invited talk at Brown University Robotics Seminar - "Interactive Task and Motion Imitation"
- [Mar 2024]
Invited talk at University of Utah - "Interactive Task and Motion Imitation"
- [Jan 2024]
Check out our Generative AI newsletter from MIT's work of the future group.
- [Oct 2023]
Temporal logic imitation was awarded the Best Student Paper at IROS 2023 Workshop.
- [Jan 2023]
PBS News coverage of our human robot interaction exhibition at MIT museum.
Research
Imagine driving with Google Maps, where multiple routes unfold before you. As you take turns and change plans, it adapts instantly recalculating to match your shifting preferences. My research goal is to bring this level of interactivity to multimodal embodied AI, empowering humans to steer pre-trained foundation models at inference-time.
|
|
Inference-Time Policy Steering through Human Interactions
Yanwei Wang,
Lirui Wang,
Yilun Du,
Balakumar Sundaralingam,
Xuning Yang,
Yu-Wei Chao,
Claudia Perez-D'Arpino,
Dieter Fox,
Julie Shah
arxiv /
code /
project page
ICRA 2025 (in submission)
We propose Inference-Time Policy Steering (ITPS), a framework that leverages human interactions to zero-shot adapt
pre-trained generative policies for downstream tasks without any additional data collection or fine-tuning.
|
|
Versatile Demonstration Interface: Toward More
Flexible Robot Demonstration Collection
Michael Hagenow,
Dimosthenis Kontogiorgos,
Yanwei Wang,
Julie Shah
arxiv
ICRA 2025 (in submission)
We present the Versatile Demonstration Interface (VDI), a collaborative robot tool designed to enable seamless
transitions between data collection modes—teleoperation, kinesthetic teaching, and natural demonstrations—without the
need for additional environmental instrumentation.
|
|
Grounding Language Plans in Demonstrations through Counter-factual Perturbations
Yanwei Wang,
Tsun-Hsuan Wang,
Jiayuan Mao,
Michael Hagenow,
Julie Shah
arxiv /
code /
project page /
MIT News
ICLR 2024 (Spotlight, acceptance rate: 5%)
This work learns grounding classifiers for LLM planning. Our end-to-end explanation-based network is
trained to differentiate successful demonstrations from failing counterfactuals and as a by-product learns classifiers that ground continuous states
into discrete manipulation mode families without dense labeling.
|
|
Temporal Logic Imitation: Learning Plan-Satisficing Motion Policies from Demonstrations
Yanwei Wang,
Nadia Figueroa, Shen Li, Ankit Shah, Julie Shah
arxiv
/
code
/
project page
/
PBS News
CoRL 2022 (Oral, acceptance rate: 6.5%)
IROS 2023 Workshop ( Best Student Paper, Learning Meets Model-based Methods for Manipulation and Grasping Workshop)
We present a continuous motion imitation method that can provably satisfy any discrete plan specified by a Linear Temporal Logic (LTL) formula. Consequently, the imitator is robust to both task- and motion-level disturbances and guaranteed to achieve task success.
|
|
Improving Small Language Models on PubMedQA via Generative Data Augmentation
Zhen Guo,
Yanwei Wang,
Peiqi Wang,
Shangdi Yu
arxiv
KDD 2023 (Foundations and Applications in Large-scale
AI Models Pre-training, Fine-tuning, and Prompt-based Learning Workshop)
We prompt large language models to augment a domain-specific dataset to train specialized small language models that outperform the general-purpose LLM.
|
|
Visual Pre-training for Navigation: What Can We Learn from Noise?
Yanwei Wang,
Ching-Yun Ko,
Pulkit Agrawal
arxiv
/
code
/
project page
IROS 2023
NeurIPS 2022 Workshop (Synthetic Data for Empowering ML Research / Self-Supervised Learning)
By learning how to pan, tilt and zoom its camera to focus on random crops of a noise image, an embodied agent can pick up navigation skills in realistically simulated environments.
|
|
MIT Museum Interactive Robot Exhibition: Teach a Robot Motions
Nadia Figueroa,
Yanwei Wang,
Julie Shah
We installed an interactive exhibition at MIT Museum that allows non-robot-experts to teach a robot an inspection task using demonstrations. The robustness and compliance of the learned motion policy enables visitors (including kids) to physically perturb the system safely 24/7 without losing a success gaurantee.
|
|
Active Learning for Object Search
Yanwei Wang,
Todd Murphey
This research project studies the application of two information gain methods in a search problem: 1. Infotaxis - Search guided by entropy minimization 2. Ergodic exploration - Search guided by proportional coverage.
|
|
Active Sensing with Tactile Sensors
Yanwei Wang,
Mitra Hartmann
This research project studies how active sensing, i.e., choosing what data to collect, can improve data efficiency for decision-making under uncertainty. Inspired by the active whisking behavior of rats, we use simulated rat whisker sensory signals as a model for spatial-temporal data to learn policies that first collect observations and then classify object shapes.
|
|
Multi-agent Distributed Sensing and Control
Yanwei Wang,
Michael Rubinstein
This research project studies multi-agent distributed algorithms concerning coordination, segregation, and locomotion, with a hardware implementation of robust localization with cheap sensors on a low-cost underactuated system.
|
|
Analyzing Energy Efficiency of Bio-Inspired Wind Generator
Yanwei Wang,
Richard Wolfson
pdf
For my undergraduate physics thesis at Middlebury College, I did a computational fluid dynamics (CFD) simulation of Festo's Dual-Wing generator using COMSOL CFD package.
|
Projects
Miscellaneous class projects.
|
|