Publications

2023

DreaMo: Articulated 3D Reconstruction From A Single Casual Video

Tao Tu, Ming-Feng Li, Chieh Hubert Lin, Yen-Chi Cheng, Min Sun, and Ming-Hsuan Yang

In Submission, 2023

Website Video
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection

Tao Tu, Shun-Po Chuang, Yu-Lun Liu, Cheng Sun, Ke Zhang, Donna Roy, Cheng-Hao Kuo, and Min Sun

In ICCV, 2023

Website Code

2021

Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation

Tao Tu, Qing Ping, Govindarajan Thattai, Gokhan Tur, and Prem Natarajan

In CVPR, 2021

Paper arXiv Code

2020

Semi-Supervised Learning for Multi-Speaker Text-to-Speech Synthesis Using Discrete Speech Representation

Tao Tu, Yuan-Jui Chen, Alexander H Liu, and Hung-yi Lee

Interspeech, 2020

Paper arXiv Code Demo
Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning

Tao Tu (co-first), Alexander H Liu (co-first), Hung-yi Lee, and Lin-shan Lee

In ICASSP, 2020

Paper arXiv Video Demo

2019

End-to-End Text-to-Speech for Low-Resource Languages by Cross-Lingual Transfer Learning

Tao Tu (co-first), Yuan-Jui Chen (co-first), Cheng-chieh Yeh, and Hung-yi Lee

Interspeech, 2019

arXiv Demo