Publications

Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting
Cross Space and Time: A Spatio-Temporal Unitized Model for Traffic Flow Forecasting Featured

Weilin Ruan, Wenzhuo Wang, Siru Zhong, Wei Chen, Li Liu, Yuxuan Liang

TITS. 2025

This paper proposes a novel spatio-temporal unitized model for traffic flow forecasting that effectively captures complex dependencies across both space and time dimensions, achieving state-of-the-art performance on multiple benchmark datasets.

Paper Code Spatio-temporal

Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data
Towards Multi-Scenario Forecasting of Building Electricity Loads with Multimodal Data Featured

Yongzheng Liu, Siru Zhong, Gefeng Luo, Weilin Ruan, Yuxuan Liang

ACM MM. 2025

We propose MMLoad, a novel diffusion-based multimodal framework for multi-scenario building load forecasting with three innovations: Multimodal Data Enhancement Pipeline, Cross-modal Relation Encoder, and Scenario-Conditioned Diffusion Generator with uncertainty quantification, establishing a new paradigm for multimodal learning in smart energy systems.

Paper Code Multimodal Energy Forecasting Smart Buildings

Low-rank Adaptation for Spatio-Temporal Forecasting
Low-rank Adaptation for Spatio-Temporal Forecasting Featured

Weilin Ruan, Wei Chen, Xilin Dang, Jianxiang Zhou, Weichuang Li, Xu Liu, Yuxuan Liang

ECML. 2025

This paper presents ST-LoRA, a novel low-rank adaptation framework as an off-the-shelf plugin for existing spatial-temporal prediction models, which alleviates node heterogeneity problems through node-level adjustments while minimally increasing parameters and training time.

Paper Code Spatio-temporal

Fine-grained Urban Heat Island Effect Forecasting: A Context-aware Thermodynamic Modeling Framework
Fine-grained Urban Heat Island Effect Forecasting: A Context-aware Thermodynamic Modeling Framework Featured

Xingchen Zou, Weilin Ruan, Siru Zhong, Yuehong HU, Yuxuan Liang

KDD. 2025

We present DeepUHI, a heat equation-based framework that models urban heat island effects through thermodynamic cycles and thermal flows, integrating multimodal environmental data to achieve precise street-level temperature forecasting, now deployed as a real-time warning system in Seoul.

Paper Code Urban Computing Spatio-temporal Graph Learning

Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting
Time-VLM: Exploring Multimodal Vision-Language Models for Augmented Time Series Forecasting Featured

Siru Zhong, Weilin Ruan, Ming Jin, Huan Li, Qingsong Wen, Yuxuan Liang

ICML. 2025

This paper proposes Time-VLM, a novel multimodal framework that leverages pre-trained Vision-Language Models (VLMs) to bridge temporal, visual, and textual modalities for enhanced time series forecasting.

Paper Code Time Series Multimodal

Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction
Retrieval Augmented Spatio-Temporal Framework for Traffic Prediction

Weilin Ruan, Xilin Dang, Ziyu Zhou, Sisuo Lyu, Yuxuan Liang

Under Review. 2025

We propose RAST, a universal framework that integrates retrieval-augmented mechanisms with spatio-temporal modeling to address limited contextual capacity and low predictability in traffic prediction. Our framework consists of three key designs: Decoupled Encoder and Query Generator, Spatio-temporal Retrieval Store and Retrievers, and Universal Backbone Predictor that flexibly accommodates pre-trained STGNNs or simple MLP predictors.

Paper Code Spatio-temporal Traffic Prediction Retrieval-Augmented

VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results
VQualA 2025 Challenge on Face Image Quality Assessment: Methods and Results

Sizhuo Ma, ... (60+ authors), Weilin Ruan, Yutao Yue

ICCV Workshop. 2025

We introduce the VQualA 2025 Challenge on Face Image Quality Assessment, part of ICCV 2025 Workshops. Participants developed efficient models (<0.5 GFLOPs, <5M parameters) predicting Mean Opinion Scores (MOS) under realistic degradations. The challenge attracted 127 participants, resulting in 1519 valid final submissions, contributing to practical FIQA solutions.

Paper Computer Vision Image Quality Assessment Face Analysis

ViST: Harnessing Vision Transformation and Reconstruction for Multi-modal Spatio-temporal Forecasting
ViST: Harnessing Vision Transformation and Reconstruction for Multi-modal Spatio-temporal Forecasting

Weilin Ruan, Siru Zhong, Haomin Wen, Songxin Lei, Yongzheng Liu, Xingchen Zou, Yuxuan Liang

Under Review. 2025

We propose ViST, a novel vision-driven framework that transforms raw spatio-temporal data directly into a low-dimensional visual space for more effective and scalable forecasting. Our method features Multi-view Vision Transformation, Multi-modal Conditional Reconstruction, and Efficient Cross-modal Fusion Mechanism, achieving state-of-the-art accuracy with significantly reduced computational cost.

Paper Code Spatio-temporal Multi-modal Vision Transformation

An Empirical Study of the Anchoring Effect in LLMs: Existence, Mechanism, and Potential Mitigations
An Empirical Study of the Anchoring Effect in LLMs: Existence, Mechanism, and Potential Mitigations

Yiming Huang, Biquan Bie, Zuqiu Na, Weilin Ruan, Songxin Lei, Yutao Yue, Xinlei He

Under Review. 2025

We investigate the anchoring effect in Large Language Models, exploring whether LLMs are affected by anchoring bias, the underlying mechanisms, and potential mitigation strategies. We introduce SynAnchors dataset and show that LLMs' anchoring bias exists commonly with shallow-layer acting and is not eliminated by conventional strategies, while reasoning can offer some mitigation.

Paper Natural Language Processing Cognitive Bias Large Language Models

LDM4TS: Latent Diffusion Model for Time Series Forecasting
LDM4TS: Latent Diffusion Model for Time Series Forecasting

Weilin Ruan, Siru Zhong, Haomin Wen, Yuxuan Liang

Under review. 2025

This paper introduces LDM4TS, a novel latent diffusion model for time series forecasting that transforms time series into multiple image representations and leverages diffusion models to enhance forecasting capabilities.

Paper Code Time Series Multimodal

A Game-Theoretic Spatio-Temporal Reinforcement Learning Framework for Collaborative Public Resource Allocation
A Game-Theoretic Spatio-Temporal Reinforcement Learning Framework for Collaborative Public Resource Allocation

Songxin Lei, Qiongyan WANG, Yanchen ZHU, Hanyu Yao, Sijie Ruan, Weilin Ruan, Yuyu Luo, Yuxuan Liang

Under review. 2024

We introduce GSTRL, a novel game-theoretic reinforcement learning framework that addresses collaborative public resource allocation by modeling it as a cooperative potential game and incorporating spatio-temporal learning to capture crowd dynamics, outperforming existing methods on real-world datasets.

Paper Code Reinforcement Learning Spatio-temporal

Kinematics-based Object Articulation with Gaussian Splatting
Kinematics-based Object Articulation with Gaussian Splatting

Weichuang Li, Weilin Ruan, Xuechao Zhang, Yuxuan Liang

Under review. 2024

We present a novel approach that combines Gaussian Splatting with kinematic knowledge to reconstruct and simulate articulated objects, enhancing both geometric detail and articulation dynamics while overcoming the limitations of previous implicit-based models.

Paper Computer Vision