Manyi Yao

I’m currently pursuing my doctorate in Computer Science at the University of California, Riverside, where I have the privilege of being advised by Professors Christian R. Shelton and Amit K. Roy-Chowdhury.

My research interests lie in vision-language models and dynamic networks. My recent work proposes an efficient transformer encoder design for Mask2Former-style models that selects a subnetwork based on the input image. The approach also extends beyond segmentation to detection tasks and can be tailored to different computational budgets.

When I’m not immersed in academia, I enjoy spending time with my two beloved cats, pursuing my passion for quirky neo-deconstructivist fashion design, and spreading fitness joy as a NASM-certified personal trainer.

news

Sep 18, 2025 Paper on vision-based LLM grounding for dash-cam video reasoning accepted to NeurIPS 2025!
Jun 24, 2024 Joined NEC Labs America as a Research Intern on the Media Analytics team, mentored by Abhishek Aich.

selected publications

  1. NeurIPS
    iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning
    In Advances in Neural Information Processing Systems, 2025
    to appear
  2. Preprint
    Efficient Transformer Encoders for Mask2Former-style models
    2024