hypes.news
Papers · 5 days ago

GSI-Bench: First benchmark for generative spatial intelligence in multimodal models

GSI-Bench, introduced by Muzhi Zhu and colleagues, is presented as the first benchmark to quantify generative spatial intelligence (GSI) in multimodal models, which it measures through spatially grounded image editing. It comprises two parts: GSI-Real, a high-quality real-world dataset, and GSI-Syn, a large-scale synthetic dataset with automated labeling. Fine-tuning unified models on GSI-Syn improves performance on both synthetic and real spatial editing tasks and also improves downstream spatial understanding, which suggests that generative training strengthens spatial reasoning.

Muzhi Zhu
