Tao Tu

(In search of a PhD position starting in 2024)



Tao Tu is currently a research assistant in the Department of Electrical Engineering at National Tsing Hua University, fortunate to collaborate with Prof. Min Sun. He is also a visiting student at UCMerced’s VLLab, privileged to work with Prof. Ming-Hsuan Yang. His research aims to empower machines with the intelligence to comprehend and engage with their surroundings. Currently, he focuses on 3D computer vision and multimodal learning.

Before joining NTHU, he was an algorithm engineer at MediaTek. Before that, in 2020, he gained valuable experience as an applied scientist intern at Amazon. He received an M.S. degree in Computer Science & Information Engineering from National Taiwan University and a B.S. degree in Electrical Engineering and Computer Science from National Tsing Hua University. He was a member of the Speech Processing Lab, where he was advised by Prof. Lin-shan Lee and Prof. Hung-yi Lee in machine learning and speech processing.


Dec 5, 2023 Check out our new project, DreaMo!
Jul 14, 2023 ImGeoNet is accepted by ICCV’23. See you in Paris!
Apr 6, 2023 Join UCMerced’s VLLab as a Visiting Researcher (work with Prof. Ming-Hsuan Yang).
Oct 11, 2022 Join NTHU VSLab as a Research Assistant (work with Prof. Min Sun).
Feb 28, 2021 READ-UP, my work from the Amazon internship, is accepted by CVPR’21.
Dec 7, 2020 Join MediaTek as an Algorithm Engineer.
Nov 1, 2020 Graduated from NTU. Grateful to Prof. Lin-shan Lee and Prof. Hung-yi Lee for their guidance.
Jul 25, 2020 Multi-speaker SeqRQ-AE is accepted by Interspeech’20.
Jul 13, 2020 Join Amazon as an Applied Scientist Intern.
Jan 24, 2020 SeqRQ-AE is accepted by ICASSP’20.
Jun 17, 2019 Cross-lingual Transfer Learning TTS is accepted by Interspeech’19.
Sep 1, 2018 Join NTU Speech Lab as a Graduate Student.


Apr, 2023 - Current Visiting Student @ UCMerced VLLab
Oct, 2022 - Current Research Assistant @ NTHU VSLab
Dec, 2020 - Oct, 2022 Algorithm Engineer @ MediaTek
Jul, 2020 - Oct, 2020 Applied Scientist Intern @ Amazon
Sep, 2018 - Nov, 2020 Graduate Student @ NTU CSIE
Sep, 2014 - Jun, 2018 Undergraduate Student @ NTHU EECS

Selected Publications



  1. DreaMo: Articulated 3D Reconstruction From A Single Casual Video
    Tao Tu, Ming-Feng Li, Chieh Hubert Lin, Yen-Chi Cheng, Min Sun, and Ming-Hsuan Yang
    In Submission, 2023
  2. ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
    Tao Tu, Shun-Po Chuang, Yu-Lun Liu, Cheng Sun, Ke Zhang, Donna Roy, Cheng-Hao Kuo, and Min Sun
    In ICCV, 2023


  1. Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation
    Tao Tu, Qing Ping, Govindarajan Thattai, Gokhan Tur, and Prem Natarajan
    In CVPR, 2021


  1. Semi-Supervised Learning for Multi-Speaker Text-to-Speech Synthesis Using Discrete Speech Representation
    Tao Tu, Yuan-Jui Chen, Alexander H Liu, and Hung-yi Lee
    In Interspeech, 2020
  2. Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning
    Tao Tu (co-first), Alexander H Liu (co-first), Hung-yi Lee, and Lin-shan Lee
    In ICASSP, 2020


  1. End-to-End Text-to-Speech for Low-Resource Languages by Cross-Lingual Transfer Learning
    Tao Tu (co-first), Yuan-Jui Chen (co-first), Cheng-chieh Yeh, and Hung-yi Lee
    In Interspeech, 2019