Experience

Work Experience

VinMotion, VinGroup

Role: AI/Robotics Engineer (PermanentFull-time)
Period: June 2025 – Present
Location: Hanoi, Vietnam
Stack: ROS 2, Python, PyTorch, LangGraph, Tool Calling, VLA, VLM, Gazebo
Bullets:
- Developed embodied task-planning modules using multi-agent workflows and Vision-Language Models for autonomous task execution, and decision-making in humanoid robots
- Built and deployed model-based and model-free 6DoF object pose estimation modules to improve robotic grasping accuracy
- Developed an instruction-following data generation and benchmarking pipeline for VLN training, automating dataset creation and evaluation workflows
Projects:
- Task Planning for Embodied AI → /projects/task-planning-embodied/
- 6-DoF Object Pose Estimation → /projects/pose-estimation/
- Instruction Following Datagen & Benchmark → /projects/instruction-following-datagen-benchmark/
Key Skills: ROS 2, LangGraph, VLM, VLA, PyTorch, Tool Calling, Humanoid Robotics, Gazebo

AIS, VMO Holdings

Role: AI Engineer (Contract full-time remote)
Period: May 2025 – Sep 2025
Location: Hanoi, Vietnam
Stack: Python, FastAPI, ONNX, LangChain, LangGraph, LLM, NLP, ASR, TTS
Bullets:
- Built a Computer Vision Acceleration Pipeline cutting development-phase time cost by 50–70%
- Warehouse Package Tracking with RF-Detr deployed in warehouse via surveillance cameras, automating storage processes
- Food-Ordering Agentic Workflow with LangGraph deployed in kiosks to automate ordering process from customer interaction (STT/NLP) to order fulfillment and customer feedback collection (TTS/NLP)
Projects: [hidden — company privacy policy]

Human Robotics Laboratory, NAIST

Role: Research Assistant (Contract full-time)
Period: Nov 2023 – Sep 2025
Location: Ikoma, Japan
Stack: Flutter, Python, Unity, Unreal Engine, ROS 2
Bullets:
- Developed an Android VR See-Through System for medical assessment robots supporting clinical observation
- Designed an Unreal-Based VR Driving Simulator with integrated AI driving assistant for driver training and cognition evaluation (Hitachi Corp collaboration)
Projects:
- Personalized Voice-Based eHMI → /projects/personalized-voice-based-eHMI/
- Android VR See-Through System → /projects/android-vr-see-through/
- Unreal-Based VR Driving Simulator → /projects/unreal-vr-driving-simulator/
Key Skills: Flutter, Unreal Engine, Unity, ROS 2, Generative AI, HRI, VR/AR

Viettel Big Data Analytics Center, Viettel Group

Role: AI Engineer (Permanent Full-time)
Period: Nov 2021 – Sep 2023
Location: Hanoi, Vietnam
Stack: Python, PyTorch, Triton, Docker, Kafka, ASR, VAD, NLP, LLM, Speaker Verification
Bullets:
- Built and deployed large-scale speaker verification for Viettel Call Center — serving millions of customers daily with >99% accuracy, significantly reducing manual workload (>50%), preventing fraud, and improving customer experience
- Developed a voice quality assessment system for call centers to monitor and improve audio quality for speech datasets, leading to a reduction of manual review time by 80% and improving ASR performance through better data quality
- Participated in VoxCeleb Speaker Recognition Challenge (VoxSRC) 2022, Vietnamese Language and Speech Processing (VLSP 2022)
- Data preparation and implementation of LLM into the My Viettel chatbot platform
Projects:
- Speaker Verification System → /projects/speaker-verification/
- Voice Quality Assessment → /projects/voice-quality-assessment/
- LLM Chatbot — fine-tuned PaLM & LLaMA on company data (2022) [no page]
Internal Contributions:
- Tech Talk: Speaker Verification for Call Center — internal knowledge-sharing session
- Mentor — Viettel Digital Talent Program 2022
Achievements:
- High-rank placement in VLSP (Vietnamese Language & Speech Processing) competition
- Participated in VoxCeleb Speaker Recognition Challenge (VoxSRC) 2022 (Top 20)
Key Skills: Speaker Verification, PyTorch, Triton, ASR, VAD, LLM Fine-tuning, Docker, Kafka

Viettel Digital Talents Program 2021, Viettel Group

Role: AI Intern (Part-time internship)
Period: Apr 2021 – Oct 2021
Location: Hanoi, Vietnam
Bullets:
- Completed intensive AI & Data Science training under Viettel’s national talent initiative
- Contributed to Vietnamese speaker verification system later deployed in production
Projects:
- Speaker Verification System → /projects/speaker-verification/
Achievement: Selected as Top 10 candidate out of the entire Viettel Digital Talent Program 2021 cohort
Key Skills: Python, Speaker Verification, Deep Learning, Data Science

Education

MEng — Human Robotics Interaction

School: Nara Institute of Science and Technology (NAIST), Japan
Period: Oct 2023 – Sep 2025
Research: Personalized human-robot interface for APMVs using generative models tailored to passenger personality.
Funded by MEXT Scholarship, Japan (2023).

BSc — Automation & Electrical Engineering

School: Hanoi University of Science and Technology (HUST), Vietnam
Period: Sept 2018 – Sept 2022
GPA: 3.52 / 4.0 — Graduated with Distinction, 2nd place in Thesis defense committee.
Thesis: Speaker verification system for telecommunication service.

Competitions

VoxCeleb Speaker Recognition Challenge (VoxSRC) 2022 — Top 20

Task: Speaker Verification — Track 1 (closed, no external data) and Track 2 (open, with external dataset)
Approach:
- Model fusion: RawNet (raw audio input) + ECAPA-TDNN (spectrogram input)
- Multi-loss optimization: AAM-Softmax (classification) + Proxy Anchor (metric learning)
- n-fold data augmentation pipeline

Vietnamese Language and Speech Processing (VLSP) 2022 — Top 5

Task: O-COCOSDA and VLSP 2022 - MSV Shared task: Multilingual Speaker Verification (Track 1: seen languages, Track 2: unseen languages, Track 3: cross-lingual)
Approach:
- Model fusion: RawNet (raw audio input) + ECAPA-TDNN (spectrogram input)
- Pretrained self-supervised models: Wav2Vec 2.0 and WavLM as feature extractors
- Multi-loss optimization for cross-lingual speaker representation

Data-Centric AI Competition, FPT 2021

Task: Improve human detection with YOLOv5 using a data-driven approach only (no model architecture changes)
Approach:
- Data preprocessing: noisy label removal, annotation standardization, image quality enhancement
- Ensemble with diverse augmentation strategies and hyperparameter sets
- Knowledge distillation to reduce manual data evaluation overhead

Dang Phuong Nam

Experience

Work Experience

VinMotion, VinGroup

AIS, VMO Holdings

Human Robotics Laboratory, NAIST

Viettel Big Data Analytics Center, Viettel Group

Viettel Digital Talents Program 2021, Viettel Group

Education

MEng — Human Robotics Interaction

BSc — Automation & Electrical Engineering

Competitions

VoxCeleb Speaker Recognition Challenge (VoxSRC) 2022 — Top 20

Vietnamese Language and Speech Processing (VLSP) 2022 — Top 5

Data-Centric AI Competition, FPT 2021