I am currently an Assistant Professor in the Department of Computer Science at the University of Liverpool. Prior to this, I worked as a Research Fellow in the Department of Computer Science at University College London, funded by the EU Horizon 2020 programme. I completed my PhD at the Hamlyn Centre, Imperial College London. I have also had the privilege of working as a Research Scientist at Reality Labs, Meta (Facebook).
I earned my BEng degree in Mechanical Engineering from the University of Birmingham, UK, in 2018, followed by an MRes degree in Medical Robotics and Image-Guided Intervention from Imperial College London, UK, in 2019.
PhD in AI, Computer Vision & Medical Robotics, 2023
Imperial College London
MRes in Medical Robotics and Image-Guided Intervention (with Distinction), 2019
Imperial College London
BEng in Mechanical Engineering (with Honours Class I), 2018
University of Birmingham
Language-driven grasp detection has the potential to revolutionize human-robot interaction by allowing robots to understand and execute grasping tasks based on natural language commands. In this paper, we introduce GraspMAS, a new multi-agent system framework for language-driven grasp detection. GraspMAS is designed to reason through ambiguities and improve decision-making in real-world scenarios.
Vision-language models have played a key role in extracting meaningful features for various robotic applications. In this paper, we introduce Robotic-CLIP to enhance robotic perception capabilities.
Accurate tracking of tissues and instruments in videos is crucial for Robotic-Assisted Minimally Invasive Surgery. We introduce a new annotated surgical tracking dataset for benchmarking tracking methods for surgical scenarios, comprising real-world surgical videos with complex tissue and instrument motions.
Accurate 3D reconstruction of dynamic surgical scenes from endoscopic video is essential for robotic-assisted surgery. We present SurgicalGS, a dynamic 3D Gaussian Splatting framework specifically designed for surgical scene reconstruction with improved geometric accuracy.
We present a novel approach for language-driven 6-DoF grasp detection in cluttered point clouds. We introduce Grasp-Anything-6D, a large-scale dataset, together with a diffusion model with negative prompt guidance that enables robots to grasp objects based on natural language commands, surpassing baselines in both benchmarks and real-world applications.
We present Grasp-Anything, a large-scale grasp detection dataset synthesized using foundation models to address the limited diversity of existing datasets. With 1M samples and over 3M objects, it enables zero-shot grasp detection and excels in vision-based and real-world robotic tasks. Code and data are available.
We propose a method for language-conditioned affordance detection and 6-DoF pose estimation in 3D point clouds, enabling robots to handle diverse affordances beyond predefined sets. Our approach features an open-vocabulary affordance detection branch and a language-guided diffusion model for pose generation. A new dataset supports the task, and experiments show significant performance improvements over baselines. The method demonstrates strong potential in real-world robotic applications.
We introduce an open-vocabulary affordance detection method for 3D point clouds, addressing the challenges of complex object shapes and diverse affordances. Using knowledge distillation and a novel text-point correlation approach, our method enhances feature extraction and semantic understanding. It outperforms baselines with a 7.96% mIoU improvement and supports real-time inference, ideal for robotic manipulation tasks.
Robot grasp detection is a complex challenge with significant industrial relevance. To address this, we present Grasp-Anything++, a new language-driven grasp detection dataset containing 1M samples, over 3M objects, and 10M grasping instructions. Leveraging foundation models, we frame grasp detection as a conditional generation task and propose a novel diffusion model-based method with a contrastive training objective to improve language-guided grasp pose detection. Our approach surpasses state-of-the-art methods, supports real-world robotic grasping, and enables zero-shot grasp detection. The dataset serves as a challenging benchmark, promoting advancements in language-driven robotic grasping research.
We propose a novel Residual Aligner-based Network (RAN) for deformable image registration, addressing challenges in capturing separate and sliding motions of organs. By introducing a Motion Separable backbone and a Residual Aligner module, RAN achieves state-of-the-art accuracy in unsupervised registration of abdominal and lung CT scans, with reduced model size and computational cost.
We propose a simple regression network to enhance intraoperative gamma activity visualization in endoscopic radio-guided cancer detection and resection. By leveraging high-dimensional image features and probe position data, our method effectively detects sensing areas, outperforming prior geometric approaches.