Zhangxuan Gu

Researcher
Ant Group

Biography

I received my Ph.D. in Computer Science from Shanghai Jiao Tong University (SJTU) in 2022, advised by Professor Liqing Zhang. Before that, I received my bachelor's degree in Mathematics from SJTU in 2016. Since 2022, I have been a researcher at Ant Group.

Research Interests: Computer Vision, Object Detection, Multimodal Large Language Models

Publications

2025
UI-Venus Technical Report: Building High-performance UI Agents with RFT
Zhangxuan Gu*, Zhengwen Zeng*, Zhenyu Xu*, Xingran Zhou*, Shuheng Shen*^, Yunfei Liu*, Beitong Zhou*, Changhua Meng, Tianyu Xia, Weizhi Chen, Yue Wen, Jingya Dou, Fei Tang, Jinzhen Lin, Yulin Liu, Zhenlin Guo, Yichen Gong, Heng Jia, Changlong Gao, Yuan Guo, Yong Deng, Zhenyu Guo, Liang Chen, Weiqiang Wang
arXiv, 2025
2024
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu, Haoxing Chen, Zhuoer Xu, Jun Lan, Changhua Meng, Weiqiang Wang
ICASSP (oral), 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu^, Yaohui Li, Jun Lan, Huijia Zhu, Jianfu Zhang, Weiqiang Wang, Huaxiong Li
arXiv, 2024
2023
Mobile User Interface Element Detection Via Adaptively Prompt Tuning
Zhangxuan Gu, Zhuoer Xu, Haoxing Chen, Jun Lan, Changhua Meng, Weiqiang Wang
CVPR, 2023
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen, Zhuoer Xu, Zhangxuan Gu^, Jun Lan, Xing Zheng, Yaohui Li, Changhua Meng, Huijia Zhu, Weiqiang Wang
NeurIPS, 2023
Hierarchical Dynamic Image Harmonization
Haoxing Chen, Zhangxuan Gu, Yaohui Li, Jun Lan, Changhua Meng, Weiqiang Wang, Huaxiong Li
ACM MM (oral), 2023
2022
Context-aware Feature Generation for Zero-shot Semantic Segmentation
Zhangxuan Gu, Siyuan Zhou, Li Niu, Zihan Zhao, Liqing Zhang
ACM MM, 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu, Changhua Meng, Ke Wang, Jun Lan, Weiqiang Wang, Ming Gu, Liqing Zhang
CVPR, 2022
From Pixel to Patch: Synthesize Context-aware Features for Zero-shot Semantic Segmentation
Zhangxuan Gu, Siyuan Zhou, Li Niu, Zihan Zhao, Liqing Zhang
TNNLS, 2022
2020
Hard Pixel Mining for Depth Privileged Semantic Segmentation
Zhangxuan Gu, Li Niu, Haohua Zhao, Liqing Zhang
TMM, 2020