Multi-Modal Sensing-assisted Beam Prediction using Real-World Dataset

Kim, Junghyun (contact); Yeo, Yerin; Kim, Jihyung; Lee, JunHwan

10.23919/JCN.2025.000049

Abstract : This paper explores techniques for beam prediction using multi-modal sensing data. Specifically, our aim is to develop a deep learning model that predicts the optimal beam using information collected from Camera, LiDAR, Radar, and GPS sensors. For this purpose, we propose ResNet-SE, which integrates a squeeze-and-excitation network with ResNet, and PIformer, a newly designed model that incorporates pooling-based attention and Inception mixer. Experimental results demonstrate a 22\% improvement in prediction accuracy and a 38\% reduction in training time compared to the state-of-the-art model.​ 

Index terms : beam prediction, deep learning, Transformer, multimodal learning, wireless communications