Publications

2023

  1. dreamo.png
    DreaMo: Articulated 3D Reconstruction From A Single Casual Video
    Tao Tu, Ming-Feng Li, Chieh Hubert Lin, Yen-Chi Cheng, Min Sun, and Ming-Hsuan Yang
    In Submission, 2023
  2. imgeonet.gif
    ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
    Tao Tu, Shun-Po Chuang, Yu-Lun Liu, Cheng Sun, Ke Zhang, Donna Roy, Cheng-Hao Kuo, and Min Sun
    In ICCV, 2023

2021

  1. readup.png
    Learning Better Visual Dialog Agents with Pretrained Visual-Linguistic Representation
    Tao Tu, Qing Ping, Govindarajan Thattai, Gokhan Tur, and Prem Natarajan
    In CVPR, 2021

2020

  1. semi-tts.png
    Semi-Supervised Learning for Multi-Speaker Text-to-Speech Synthesis Using Discrete Speech Representation
    Tao Tu, Yuan-Jui Chen, Alexander H Liu, and Hung-yi Lee
    Interspeech, 2020
  2. seqrq-ae.png
    Towards Unsupervised Speech Recognition and Synthesis with Quantized Speech Representation Learning
    Tao Tu (co-first), Alexander H Liu (co-first), Hung-yi Lee, and Lin-shan Lee
    In ICASSP, 2020

2019

  1. low-resrc-tts.png
    End-to-End Text-to-Speech for Low-Resource Languages by Cross-Lingual Transfer Learning
    Tao Tu (co-first), Yuan-Jui Chen (co-first), Cheng-chieh Yeh, and Hung-yi Lee
    Interspeech, 2019