UI-Venus Technical Report: Building High-performance UI Agents with RFT
Zhangxuan Gu*
,
Zhengwen Zeng*
,
Zhenyu Xu*
,
Xingran Zhou*
,
Shuheng Shen*^
,
Yunfei Liu*
,
Beitong Zhou*
,
Changhua Meng
,
Tianyu Xia
,
Weizhi Chen
,
Yue Wen
,
Jingya Dou
,
Fei Tang
,
Jinzhen Lin
,
Yulin Liu
,
Zhenlin Guo
,
Yichen Gong
,
Heng Jia
,
Changlong Gao
,
Yuan Guo
,
Yong Deng
,
Zhenyu Guo
,
Liang Chen
,
Weiqiang Wang
Arxiv, 2025
DiffusionInst: Diffusion Model for Instance Segmentation
Zhangxuan Gu
,
Haoxing Chen
,
Zhuoer Xu
,
Jun Lan
,
Changhua Meng
,
Weiqiang Wang
Icassp(oral), 2024
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark
Haoxing Chen
,
Yan Hong
,
Zizheng Huang
,
Zhuoer Xu
,
Zhangxuan Gu^
,
Yaohui Li
,
Jun Lan
,
Huijia Zhu
,
Jianfu Zhang
,
Weiqiang Wang
,
Huaxiong Li
Arxiv, 2024
Mobile User Interface Element Detection Via Adaptively Prompt Tuning
Zhangxuan Gu
,
Zhuoer Xu
,
Haoxing Chen
,
Jun Lan
,
Changhua Meng
,
Weiqiang Wang
CVPR, 2023
DiffUTE: Universal Text Editing Diffusion Model
Haoxing Chen
,
Zhuoer Xu
,
Zhangxuan Gu^
,
Jun Lan
,
Xing Zheng
,
Yaohui Li
,
Changhua Meng
,
Huijia Zhu
,
Weiqiang Wang
NIPS, 2023
Hierarchical Dynamic Image Harmonization
Haoxing Chen
,
Zhangxuan Gu
,
Yaohui Li
,
Jun Lan
,
Changhua Meng
,
Weiqiang Wang
,
Huaxiong Li
ACMMM(oral), 2023
Context-aware Feature Generation for Zero-shot Semantic Segmentation
Zhangxuan Gu
,
Siyuan Zhou
,
Li Niu
,
Zihan Zhao
,
Liqing Zhang
ACMMM, 2022
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding
Zhangxuan Gu
,
Changhua Meng
,
Ke Wang
,
Jun Lan
,
Weiqiang Wang
,
Ming Gu
,
Liqing Zhang
CVPR, 2022
From Pixel to Patch: Synthesize Context-aware Features for Zero-shot Semantic Segmentation
Zhangxuan Gu
,
Siyuan Zhou
,
Li Niu
,
Zihan Zhao
,
Liqing Zhang
TNNLS, 2022
Hard Pixel Mining for Depth Privileged Semantic Segmentation
Zhangxuan Gu
,
Li Niu
,
Haohua Zhao
,
Liqing Zhang
TMM, 2020