Publications

2024

Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation

Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation

Ian Chuang*, Andrew Lee*, Dechen Gao, Iman Soltani (* equal contribution)

Workshop on Whole-body Control and Bimanual Manipulation @ CoRL 2024
International Conference on Robotics and Automation (ICRA) 2025

We introduce AV-ALOHA, a new bimanual teleoperation robot system that extends the ALOHA 2 robot system with Active Vision. This system provides an immersive teleoperation experience, with bimanual first-person control, enabling the operator to dynamically explore and search the scene and simultaneously interact with the environment. We conduct imitation learning experiments and our results show significant improvements over fixed cameras in tasks with limited visibility.

[project page] [arXiv] [code] [code (VR)] [video] [poster (workshop)] [BibTeX]

@article{chuang2024active,
  title={Active vision might be all you need: Exploring active vision in bimanual robotic manipulation},
  author={Chuang, Ian and Lee, Andrew and Gao, Dechen and Soltani, Iman},
  journal={arXiv preprint arXiv:2409.17435},
  year={2024}
}

Active Vision Might Be All You Need: Exploring Active Vision in Bimanual Robotic Manipulation

Ian Chuang*, Andrew Lee*, Dechen Gao, Iman Soltani (* equal contribution)

Workshop on Whole-body Control and Bimanual Manipulation @ CoRL 2024
International Conference on Robotics and Automation (ICRA) 2025

[project page] [arXiv] [code] [code (VR)] [video] [poster (workshop)]

InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation

InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation

Andrew Lee, Ian Chuang, Ling-Yuan Chen, Iman Soltani

Conference on Robot Learning (CoRL) 2024

InterACT is an imitation learning model that captures and extracts inter-dependencies between dual-arm joint positions and visual inputs. By doing so, InterACT guides the two arms to perform bimanual tasks with precision—independently yet in seamless coordination.

[project page] [arXiv] [code] [poster] [BibTeX]

@article{lee2024interact,
  title={InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation},
  author={Lee, Andrew and Chuang, Ian and Chen, Ling-Yuan and Soltani, Iman},
  journal={arXiv preprint arXiv:2409.07914},
  year={2024}
}

InterACT: Inter-dependency Aware Action Chunking with Hierarchical Attention Transformers for Bimanual Manipulation

Andrew Lee, Ian Chuang, Ling-Yuan Chen, Iman Soltani

Conference on Robot Learning (CoRL) 2024

[project page] [arXiv] [code] [poster]