Publications

ReFAct: Empowering Multimodal Web Agents with Visual and Context Focusing

R. Wu*, S. Zhang*, X. Tang, R. Zhang, Y. Liu, T. Jiang, W. Xu, and Y. Li

IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2026

A focusing framework for multimodal web agents that improves visual grounding and context selection during dynamic web tasks.

MobileFlow: A Multimodal LLM for Mobile GUI Agent

S. Nong*, J. Zhu*, R. Wu*, J. Jin, S. Shan, X. Huang, and W. Xu

NeurIPS Workshop, 2024

Paper
MATEval: A Multi-Agent Discussion Framework for Advancing Open-Ended Text Evaluation

Y. Li, S. Zhang, R. Wu, X. Huang, Y. Chen, W. Xu, G. Qi, and D. Min

DASFAA, 2024

Paper
DEE: Dual-stage Explainable Evaluation Method for Text Generation

S. Zhang, Y. Li, R. Wu, X. Huang, Y. Chen, W. Xu, and G. Qi

DASFAA, 2024

Paper
Customer Complaint Guided Fault Localization Based on Domain Knowledge Graph

S. Sun, Z. Chai, R. Wu, J. Jin, Y. Wang, W. Xu, and G. Qi

DASFAA, 2022

Paper Slide
Conditional Generation Net for Medication Recommendation

R. Wu, Z. Qiu, J. Jiang, G. Qi, and X. Wu

The Web Conference (WWW), 2022

Paper Slide Code
A Two-Phase Approach for Predicting Highway Passenger Volume

Y. Xiang, J. Chen, W. Yu, R. Wu, B. Liu, B. Wang, and Z. Li

Applied Sciences, 2021
LSTM Multi-modal UNet for Brain Tumor Segmentation

F. Xu, H. Ma, J. Sun, R. Wu, X. Liu, and Y. Kong

ICIVC, 2019

Paper Presentation Code