Part of International Conference on Learning Representations 2025 (ICLR 2025)
Jinbiao Chen, Jiahai Wang, Zhiguang Cao, Yaoxin Wu
Existing neural multi-objective combinatorial optimization (MOCO) methods still exhibit an optimality gap, since they fail to fully exploit the intrinsic features of problem instances. A significant factor contributing to this shortfall is their reliance solely on graph-modal information. To overcome this limitation, we propose a novel graph-image multimodal fusion (GIMF) framework that enhances neural MOCO methods by integrating graph and image information of the problem instances. Our GIMF framework comprises three key components: (1) a coordinate image constructed to better represent the spatial structure of the problem instance; (2) a problem-size adaptive resolution strategy during image construction to improve the model's cross-size generalization; and (3) a multimodal fusion mechanism with modality-specific bottlenecks to efficiently couple graph and image information. We demonstrate the versatility of GIMF by implementing it with two state-of-the-art neural MOCO backbones. Experimental results on classic MOCO problems show that GIMF significantly outperforms state-of-the-art neural MOCO methods and exhibits superior generalization capability.
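The abstract does not specify implementation details, but the following minimal PyTorch sketch illustrates two of the named ideas under assumed specifics: rasterizing instance coordinates into an image whose resolution grows with the problem size, and coupling graph and image token streams through a small set of shared bottleneck tokens. The function and parameter names (`make_coordinate_image`, `pixels_per_node`, `BottleneckFusion`, `num_bottlenecks`), the square-root resolution rule, and the attention layout are illustrative assumptions, not the authors' implementation.

```python
# A hedged sketch of a coordinate image with size-adaptive resolution and a
# bottleneck-style multimodal fusion step. All names and design choices here
# are assumptions made for illustration, not the paper's actual code.
import torch
import torch.nn as nn


def make_coordinate_image(coords: torch.Tensor, pixels_per_node: float = 4.0) -> torch.Tensor:
    """Rasterize n node coordinates in [0, 1]^2 into a 1-channel occupancy image.

    The side length scales with the instance size (~ sqrt(pixels_per_node * n)),
    one plausible form of a problem-size adaptive resolution rule.
    """
    n = coords.shape[0]
    side = max(8, int((pixels_per_node * n) ** 0.5))
    img = torch.zeros(1, side, side)
    ij = (coords.clamp(0, 1 - 1e-6) * side).long()  # pixel index of each node
    img[0, ij[:, 1], ij[:, 0]] = 1.0                # mark occupied cells
    return img


class BottleneckFusion(nn.Module):
    """Couple graph and image tokens through a few learned bottleneck tokens.

    Cross-modal information flows only through the narrow bottleneck: the
    bottleneck tokens first read from each modality, then each modality reads
    the fused summary back, keeping the two streams otherwise separate.
    """

    def __init__(self, dim: int = 128, heads: int = 8, num_bottlenecks: int = 4):
        super().__init__()
        self.bottleneck = nn.Parameter(torch.randn(num_bottlenecks, dim) * 0.02)
        self.read_g = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.read_i = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.write_g = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.write_i = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, g: torch.Tensor, i: torch.Tensor):
        # g: (B, Ng, D) graph node tokens; i: (B, Ni, D) image patch tokens.
        b = self.bottleneck.expand(g.size(0), -1, -1)
        b = b + self.read_g(b, g, g)[0]   # bottleneck gathers graph information
        b = b + self.read_i(b, i, i)[0]   # bottleneck gathers image information
        g = g + self.write_g(g, b, b)[0]  # graph tokens read the fused summary
        i = i + self.write_i(i, b, b)[0]  # image tokens read the fused summary
        return g, i


# Example usage on a hypothetical 50-node Euclidean instance:
coords = torch.rand(50, 2)
image = make_coordinate_image(coords)          # (1, side, side) occupancy image
fuse = BottleneckFusion(dim=128)
g_tok = torch.randn(2, 50, 128)                # graph embeddings (batch of 2)
i_tok = torch.randn(2, 196, 128)               # flattened image patch embeddings
g_tok, i_tok = fuse(g_tok, i_tok)
```

Routing all cross-modal exchange through a handful of bottleneck tokens is one common way to keep fusion cheap relative to full cross-attention; whether GIMF's modality-specific bottlenecks take exactly this form is not stated in the abstract.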