Thanh Nguyen Canh

I am a PhD student in the School of Information Science at Japan Advanced Institute of Science and Technology. I work at the Robotics Laboratory and am fortunate to be advised by Prof. Chong Nak Young on 3D Active Semantic SLAM.

I obtained my Master degree from the School of Information Science at JAIST, Japan and B.S. degree in Roboitics Engineering from University of Engineering and Technology, Vietnam National University, Vietnam . I worked on real-time semantic-aware simultaneous localization and mapping (SLAM) for Unmanned Aerial Vehicle (UAV). During my Master, I am grateful to collaborate with Prof. Xiem HoangVan, Prof. Armagan Elibol, Prof. Manh Duong Phung and Prof. Van-Truong Nguyen.

Email  /  Google Scholar  /  Github  /  LinkeIn  /  ORCID

profile photo
Recent News
Research Interests

I am interested in robotics, machine learning, vision, and control. My work focuses on robots' simultaneous localization and mapping, e.g. probablistic mapping, navigation and exploration, semantic slam, active slam and life-long slam; and of their own dynamics model, e.g. robot dynamics learning, model-based reinforcement learning, and learning from demonstration. I am also interested in modeling uncertainty in map representations and robots' dynamics for safe and active planning and control.

Awards & Professional Activities
Scholarships & Research grants
  • JST Spring Scholarship, 2025-2027
  • KDDI Foundation Scholarship, 2024-2025
  • VinIF Scholarship for Master Programmer, Vingroup Innovation Foundation, 2022-2024
  • Teaching

    Robotics, Mobile Robotics, Algorithms for Intelligent Robots, Discrete Signal Processing

    Programming robot with ROS, Robotic Control, Human Machine Interface

    Mechanical Drawing, Electronics Engineering Practice, PLC and Its Application

    Publications
    Projects for active semantic slam

    3D Semantic-aware SLAM

    S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera
    Thanh Nguyen Canh, Van-Truong Nguyen, Xiem HoangVan, Armagan Elibol, Nak Young Chong
    Conference version, accepted to SII , 2024.
    website / video / arxiv / code

    This paper presents a novel approach to address challenges in semantic information extraction and utilization within UAV operations. Our system integrates state-of-the-art visual SLAM to estimate a comprehensive 6-DoF pose and advanced object segmentation methods at the back end. To improve the computational and storage efficiency of the framework, we adopt a streamlined voxel-based 3D map representation - OctoMap to build a working system. Furthermore, the fusion algorithm is incorporated to obtain the semantic information of each frame from the front-end SLAM task, and the corresponding point. By leveraging semantic information, our framework enhances the UAV's ability to perceive and navigate through indoor spaces, addressing challenges in pose estimation accuracy and uncertainty reduction. Through Gazebo simulations, we validate the efficacy of our proposed system and successfully embed our approach into a Jetson Xavier AGX unit for real-world applications.

    Object-Oriented Semantic Mapping for Reliable UAVs Navigation
    Thanh Nguyen Canh, Armagan Elibol, Nak Young Chong, Xiem HoangVan
    Conference version, accepted to ICCAS , 2023.
    website / video / arxiv / code

    To autonomously navigate in real-world environments, special in search and rescue operations, Unmanned Aerial Vehicles (UAVs) necessitate comprehensive maps to ensure safety. However, the prevalent metric map often lacks semantic information crucial for holistic scene comprehension. In this paper, we proposed a system to construct a probabilistic metric map enriched with object information extracted from the environment from RGB-D images. Our approach combines a state-of-the-art YOLOv8-based object detection framework at the front end and a 2D SLAM method - CartoGrapher at the back end. To effectively track and position semantic object classes extracted from the front-end interface, we employ the innovative BoT-SORT methodology. A novel association method is introduced to extract the position of objects and then project it with the metric map. Unlike previous research, our approach takes into reliable navigating in the environment with various hollow bottom objects. The output of our system is a probabilistic map, which significantly enhances the map's representation by incorporating object-specific attributes, encompassing class distinctions, accurate positioning, and object heights. A number of experiments have been conducted to evaluate our proposed approach. The results show that the robot can effectively produce augmented semantic maps containing several objects (notably chairs and desks). Furthermore, our system is evaluated within an embedded computer - Jetson Xavier AGX unit to demonstrate the use case in real-world applications.

    Multisensor Data Fusion for Reliable Obstacle Avoidance
    Thanh Nguyen Canh, Truong Son Nguyen, Cong Hoang Quach, Xiem HoangVan, Manh Duong Phung
    Conference version, accepted to ICCAS , 2022.
    website / arxiv

    In this work, we propose a new approach that combines data from multiple sensors for reliable obstacle avoidance. The sensors include two depth cameras and a LiDAR arranged so that they can capture the whole 3D area in front of the robot and a 2D slide around it. To fuse the data from these sensors, we first use an external camera as a reference to combine data from two depth cameras. A projection technique is then introduced to convert the 3D point cloud data of the cameras to its 2D correspondence. An obstacle avoidance algorithm is then developed based on the dynamic window approach. A number of experiments have been conducted to evaluate our proposed approach. The results show that the robot can effectively avoid static and dynamic obstacles of different shapes and sizes in different environments.

    Projects for robot learning



    Design of Deep Reinforcement Learning Approach for Traffic Signal Control at Three-way Crossroads
    Thanh Nguyen Canh, Anh Pham Tuan, Xiem HoangVan
    Journal version, submitted to Public Transprt , 2025.
    website / video / Research Square / code

    Traffic light control (TSC) is an important and challenging real-world problem with the aim of reducing travel time as well as saving energy. Recent researches have numerous attempts to apply intelligent methods for TSC at four-way crossroads to solve the traffic light scheduling problem. However, there is the limitation of researches on efficient TSC at three-way crossroads. Therefore, this paper introduces a novel TSC solution for three-way crossroad environment (TW-TSC). The proposed TSC method is designed based on a deep reinforcement learning approach, namely Soft Actor-Critic (TWSAC). Firstly, we create a simulation environment for three-way crossroads which consists of numerous transportation and two parallel lanes using Unity framework. Secondly, to achieve practical movements of transportation in three-way crossroads, we carefully design agents which have a high impact to the transportation movement, notably the time to wait for traffic light, the velocity of transportation, and the number of transportation passing successfully. Finally, to achieve TW-TSC efficiency, we propose a novel reward function together with a design of TWSAC algorithm. Experimental results show that the proposed TWSAC in TW-TSC achieves higher performance than both fixed-time TSC methods and relevant RL algorithms.

    Optimal design and fabrication of frame structure for dual-arm service robots: An effective approach for human–robot interaction
    Thanh Nguyen Canh, Son Tran Duc, Huong Nguyen The, Trang Huyen Dao, Xiem HoangVan
    Journal version, accepted to Engineering Science and Technology, an International Journal , 2024.
    website / video /

    Rapid advancement in robotics technology has paved the way for developing mobile service robots capable of human interaction and assistance. In this paper, we propose a comprehensive approach to design, fabricate, and optimize the overall structure of a dual-arm service robot. The conceptual design phase focuses on both critical components, the mobile platform and the manipulation system, essential for seamless navigation and effective task execution. In the proposed system, the distribution of the robot payload in terms of region, maximum stress, and displacement is examined, comprehensively analyzed, and compared with the relevant works. In addition, to enhance the system’s efficiency while minimizing its weight, we introduce a lightweight design approach in which Finite Element Analysis is utilized to optimize the frame structure. Subsequently, we fabricate a physical prototype based on the derived model. Finally, we provide a kinematic model for our dual-arm service robot and demonstrate its efficacy in both control and human–robot interaction (HRI) tasks. Experimental results indicate that the proposed dual arm design can achieve a significant weight reduction of 25% from the original design while still performing actions smoothly for HRI tasks.

    Projects for reliable localization and safety path planing
    Toward Integrating Semantic-aware Path Planning and Reliable Localization for UAV Operations
    Thanh Nguyen Canh, Huy-Hoang Ngo, Xiem HoangVan, Nak Young Chong
    Conference version, accepted to ICCAS , 2024.
    website / video / arxiv /

    Localization is one of the most crucial tasks for Unmanned Aerial Vehicle systems (UAVs) directly impacting overall performance, which can be achieved with various sensors and applied to numerous tasks related to search and rescue operations, object tracking, construction, etc. However, due to the negative effects of challenging environments, UAVs may lose signals for localization. In this paper, we present an effective path-planning system leveraging semantic segmentation information to navigate around texture-less and problematic areas like lakes, oceans, and high-rise buildings using a monocular camera. We introduce a real-time semantic segmentation architecture and a novel keyframe decision pipeline to optimize image inputs based on pixel distribution, reducing processing time. A hierarchical planner based on the Dynamic Window Approach (DWA) algorithm, integrated with a cost map, is designed to facilitate efficient path planning. The system is implemented in a photo-realistic simulation environment using Unity, aligning with segmentation model parameters. Comprehensive qualitative and quantitative evaluations validate the effectiveness of our approach, showing significant improvements in the reliability and efficiency of UAV localization in challenging environments.



    Enhancing Social Robot Navigation with Integrated Motion Prediction and Trajectory Planning in Dynamic Human Environments
    Thanh Nguyen Canh, Xiem HoangVan Nak Young Chong
    Conference version, accepted to ICCAS , 2024.
    website / video / arxiv / code

    Navigating safely in dynamic human environments is crucial for mobile service robots, and social navigation is a key aspect of this process. In this paper, we proposed an integrative approach that combines motion prediction and trajectory planning to enable safe and socially-aware robot navigation. The main idea of the proposed method is to leverage the advantages of Socially Acceptable trajectory prediction and Timed Elastic Band (TEB) by incorporating human interactive information including position, orientation, and motion into the objective function of the TEB algorithms. In addition, we designed social constraints to ensure the safety of robot navigation. The proposed system is evaluated through physical simulation using both quantitative and qualitative metrics, demonstrating its superior performance in avoiding human and dynamic obstacles, thereby ensuring safe navigation.

    Projects for 3D robot calibration


    M-Calib: A Monocular 3D Object Localization using 2D Estimates for Industrial Robot Vision System
    Thanh Nguyen Canh, Du Ngoc Trinh, Xiem HoangVan
    Journal version, accepted to Journal of Automation, Mobile Robotics and Intelligent Systems (JAMRIS) , 2024.
    website / video / Research Square / code

    3D Object Localization has been emerging recently as one of the challenges of Machine Vision or Robot Vision tasks. In this paper, we proposed a novel method designed for the localization of isometric flat 3D objects, leveraging a blend of deep learning techniques primarily rooted in object detection, postimage processing algorithms, and pose estimation. Our approach involves the strategic application of 3D calibration methods tailored for low-cost industrial robotics systems, requiring only a single 2D image input. Initially, object detection is performed using the You Only Look Once (YOLO) model, followed by segmentation of the object into two distinct parts— the top face and the remainder— using the Mask R-CNN model. Subsequently, the center of the top face serves as the initialization position and a unique combination of postprocessing techniques and a novel calibration algorithm is employed to refine the object’s position. Experimental results demonstrate a notable reduction in localization error by 87.65% when compared to existing methodologies.


    Template borrowed from Jon Barron.