Jinrong Yang

prof_pic.jpg

Hello! I am Jinrong Yang (杨金荣). I am a Senior Researcher at CVTE Research Institute & Sun Yat-sen University, Guangzhou, China. In September 2024, I received my PhD degree from Huazhong University of Science and Technology (HUST), supervised by Xiaoping Li, Xiangyu Zhang and Zheng Ge.

My research interest mainly focus on Robotics, Computer Vision, and Multimodal learning.

News

Oct 14, 2024 One paper(GEM) is accepted by IEEE TMM.
Aug 19, 2024 One paper(GroupLane) is accepted by IEEE RA-L.
Jul 1, 2024 Two papers (Vary, Merlin) about Multimodal LLM are accepted by ECCV’24.
Jun 30, 2024 Two papers (QTrack, VideoBEV) about the Perception of Autonomous Driving are accepted by IROS’24, and one is accepted as Oral Presentation paper. :sparkles:
Apr 16, 2024 One paper (ChatSpot) about Multimodal LLM are accepted by IJCAI’24 as Long Oral paper. :sparkles:
Jan 15, 2024 One paper (DreamLLM) about Synergistic Multimodal Comprehension and Creation LLM is accepted by ICLR’24 as Spotlight paper. :sparkles:
Jan 25, 2023 My second paper of the corresponding author (PCET) is accepted by IEEE RA-L. Congratulations to Pan Wang!
Jan 21, 2023 One paper (DBQ-SSD) is accepted by ICLR 2023
Nov 1, 2022 Three papers (BEVDepth, BEVStereo, LTrack) are accepted by AAAI’23.
Mar 2, 2022 One paper (StreamYOLO) is accepted by CVPR’22 as oral paper. :sparkles:
Sep 28, 2021 One paper (COSOC) is accepted by NeurIPS’21.

Selected publications

* indicates equal contribution, ** indicates corresponding author.

  1. ECCV’24
    Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model
    En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, and 1 more author
    In Proceeding of the European Conference on Computer Vision (ECCV), 2024
  2. ECCV’24
    Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model
    Haoran Wei, Lingyu Kong, Jinyue Chen, Liang Zhao, Zheng Ge, Jinrong Yang, Jianjian Sun, Chunrui Han, and Xiangyu Zhang
    In Proceeding of the European Conference on Computer Vision (ECCV), 2024
  3. DreamLLM: Synergistic Multimodal Comprehension and Creation
    Runpei Dong, Chunrui Han, Yuang Peng, Zekun Qi, Zheng Ge, Jinrong Yang, Liang Zhao, Jianjian Sun, Hongyu Zhou, Haoran Wei, and 1 more author
    In International Conference on Learning Representations (ICLR), 2024
  4. ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning
    Liang Zhao, En Yu, Zheng Ge, Jinrong Yang, Haoran Wei, Hongyu Zhou, Jianjian Sun, Yuang Peng, Runpei Dong, Chunrui Han, and 1 more author
    In Proceeding of International Joint Conferences on Artificial Intelligence (IJCAI), 2024
  5. Exploring Recurrent Long-term Temporal Fusion for Multi-view 3D Perception
    Chunrui Han*, Jinrong Yang*, Jianjian Sun, Zheng Ge, Runpei Dong, Hongyu Zhou, Weixin Mao, Yuang Peng, and Xiangyu Zhang
    In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
  6. Quality Matters: Embracing Quality Clues for Robust 3D Multi-Object Tracking
    Jinrong Yang*, En Yu*, Zeming Li, Xiaoping Li, and Wenbing Tao
    In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024
  7. GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping
    Zhuoling Li, Chunrui Han, Zheng Ge, Jinrong Yang, En Yu, Haoqian Wang, Hengshuang Zhao, and Xiangyu Zhang
    arXiv preprint arXiv:2307.09472, 2024
  8. GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection
    Weixin Mao*, Jinrong Yang*, Zheng Ge, Lin Song, Hongyu Zhou, Tiezheng Mao, Zeming Li, and Osamu Yoshie
    arXiv preprint arXiv:2306.17450, 2023
  9. DBQ-SSD: Dynamic Ball Query for Efficient 3D Object Detection
    Jinrong Yang*, Lin Song*, Songtao Liu, Weixin Mao, Zeming Li, Xiaoping Li, Hongbin Sun, Jian Sun, and Nanning Zheng
    In International Conference on Learning Representations (ICLR), 2023
  10. Implicit and Efficient Point Cloud Completion for 3D Single Object Tracking
    Pan Wang, Liangliang Ren, Shengkai Wu, Jinrong Yang**, En Yu, Hangcheng Yu, and Xiaoping Li
    IEEE Robotics and Automation Letters, 2023
  11. BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection
    Yinhao Li, Zheng Ge, Guanyi Yu, Jinrong Yang, Zengran Wang, Yukang Shi, Jianjian Sun, and Zeming Li
    In Proceeding of Association for the Advancement of Artificial Intelligence (AAAI), 2023
  12. BEVStereo: Enhancing Depth Estimation in Multi-view 3D Object Detection with Dynamic Temporal Stereo
    Yinhao Li, Han Bao, Zheng Ge, Jinrong Yang, Jianjian Sun, and Zeming Li
    In Proceeding of Association for the Advancement of Artificial Intelligence (AAAI), 2023
  13. Generalizing Multiple Object Tracking to Unseen Domains by Introducing Natural Language Representation
    En Yu, Songtao Liu, Zhuoling Li, Jinrong Yang, Shoudong Han, Wenbing Tao, and  others
    In Proceeding of Association for the Advancement of Artificial Intelligence (AAAI), 2023
  14. Real-time Object Detection for Streaming Perception
    Jinrong Yang, Songtao Liu, Zeming Li, Xiaoping Li, and Jian Sun
    In Proceeding of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022
  15. Towards 3D Object Detection with 2D Supervision
    Jinrong Yang, Tiancai Wang, Zheng Ge, Weixin Mao, Xiaoping Li, and Xiangyu Zhang
    arXiv preprint arXiv:2211.08287, 2022
  16. Iou-balanced Loss Functions for Single-stage Object Detection
    Shengkai Wu*, Jinrong Yang*, Xinggang Wang, and Xiaoping Li
    Pattern Recognition Letters (PRL), 2022
  17. Rectifying the Shortcut Learning of Background for Few-shot Learning
    Xu Luo, Longhui Wei, Liangjian Wen, Jinrong Yang, Lingxi Xie, Zenglin Xu, and Qi Tian
    In Proceeding of Advances in Neural Information Processing Systems (NeurIPS), 2021

Collaborators

Reviewer