Yang Yang

Applied AI Researcher

I am a Staff Software Engineer at Google based in San Diego, specializing in the intersection of generative AI and on-device deployment. My current work focuses on neural speech synthesis, enhancement, and audio codecs, and I previously worked on neural video compression. I am passionate about the full lifecycle of AI—from generative modeling research to the engineering rigor required to bring features into production.

Prior to Google, I spent several years at Qualcomm in both the Wireless R&D (patents) and AI Research divisions. I hold a Ph.D. in Wireless Networking from The Ohio State University (2015) and a Bachelor’s degree from Shanghai Jiao Tong University (2009).

selected publications

  1. arXiv
    DiffSoundStream: Efficient Speech Tokenization via Diffusion Decoding
    Yang Yang, Yunpeng Li, George Sung, Shao-Fu Shih, Craig Dooley, Alessio Centazzo, and Ramanan Rajeswaran
    arXiv preprint arXiv:2506.22362 2025
  2. ICASSP
    Binaural Angular Separation Network
    Yang Yang, George Sung, Shao-Fu Shih, Hakan Erdogan, Chehung Lee, and Matthias Grundmann
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024
  3. ICASSP
    StreamVC: Real-Time Low-Latency Voice Conversion
    Yang Yang, Yury Kartynnik, Yunpeng Li, Jiuqiang Tang, Xing Li, George Sung, and Matthias Grundmann
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024
  4. ICASSP
    Guided Speech Enhancement Network
    Yang Yang, Shao-Fu Shih, Hakan Erdogan, Jamie Menjay Lin, Chehung Lee, Yunpeng Li, George Sung, and Matthias Grundmann
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023
  5. NeurIPS
    Neural Topological Ordering for Computation Graphs
    Mukul Gagrani, Corrado Rainone, Yang Yang, Harris Teague, Wonseok Jeon, Roberto Bondesan, Herke Hoof, Christopher Lott, Weiliang Zeng, and Piero Zappi
    In Advances in Neural Information Processing Systems (NeurIPS) 2022
  6. ICLR
    Transformer-based Transform Coding
    Yinhao Zhu*, Yang Yang*, and Taco Cohen
    In International Conference on Learning Representations (ICLR) 2022
  7. ICIP
    Progressive Neural Image Compression With Nested Quantization And Latent Ordering
    Yadong Lu*, Yinhao Zhu*, Yang Yang*, Amir Said, and Taco S Cohen
    In IEEE International Conference on Image Processing (ICIP) 2021
  8. ICASSP
    Feedback Recurrent Autoencoder
    In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2020
  9. ACCV
    Feedback Recurrent Autoencoder for Video Compression
    In Asian Conference on Computer Vision (ACCV) 2020
  10. CVPR
    Guided Variational Autoencoder for Disentanglement Learning
    Zheng Ding*, Yifan Xu*, Weijian Xu, Gaurav Parmar, Yang Yang, Max Welling, and Zhuowen Tu
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020