MO3.R12.6

VISION-LANGUAGE MODELS AS MULTIMODAL ACCESS TO REMOTE SENSING INFORMATION

Devis Tuia, Valerie Zermatten, Li Mi, Christel Chappuis, Javiera Castillo-Navarro, Antoine Bosselut, Syrielle Montariol, EPFL, Switzerland; Sylvain Lobry, Université Paris Cité, France; Bertrand Le Saux, European Space Agency, France

Session:
MO3.R12: Advances in Multimodal Remote Sensing Image Processing and Interpretation I (Oral)

Track:
Community-Contributed Sessions

Location:
MC 3.4

Presentation Time:
Mon, 8 Jul, 14:50 - 15:04

Session Co-Chairs:
Gulsen Taskin, Istanbul Technical University; Lexie Yang, Oak Ridge National Laboratory
Session MO3.R12
MO3.R12.1: IMAGE-TO-IMAGE TRANSLATION NETWORKS FOR ESTIMATING EVAPOTRANSPIRATION VARIATIONS: SAR2ET
Samet Çetin, Middle East Technical University, Turkey; Berk Ülker, Eindhoven University of Technology, Netherlands; Ramazan Gökberk Cinbis, Middle East Technical University, Turkey; Esra Erten, Istanbul Technical University, Turkey
MO3.R12.2: FROM COARSE TO FINE: AN OFFLINE-ONLINE APPROACH FOR REMOTE SENSING CROSS-MODAL RETRIEVAL
Wenqian Zhou, Hanlin Wu, Beijing Foreign Studies University, China
MO3.R12.3: IDENTIFYING EVERY BUILDING’S FUNCTION IN LARGE-SCALE URBAN AREAS WITH MULTI-MODALITY REMOTE-SENSING DATA
Zhuohong Li, Wei He, Jiepan Li, Wuhan University, China; Hongyan Zhang, China School of Computer Science, China
MO3.R12.4: MATRIX FACTORIZATION INFORMED INTERPRETABLE DEEP NETWORK FOR UNREGISTERED HYPERSPECTRAL AND MULTISPECTRAL IMAGES FUSION
Tongzhen Zhang, Jiahui Qu, Yunsong Li, Xidian University, China; Qian Du, Mississippi State University, United States; Wenqian Dong, Xidian University, China
MO3.R12.5: ALIGNING GEO-TAGGED CLIP REPRESENTATIONS AND SATELLITE IMAGERY FOR FEW-SHOT LAND USE CLASSIFICATION
Pallavi Jain, INRIA, Mediterranean Agronomic Institute of Montpellier - CIHEAM-IAMM, France; Diego Marcos, INRIA, France; Dino Ienco, INRAE, France; Roberto Interdonato, CIRAD, France; Aayush Dhakal, Nathan Jacobs, Washington University in St. Louis, United States; Tristan Berchoux, Mediterranean Agronomic Institute of Montpellier - CIHEAM-IAMM, France
MO3.R12.6: VISION-LANGUAGE MODELS AS MULTIMODAL ACCESS TO REMOTE SENSING INFORMATION
Devis Tuia, Valerie Zermatten, Li Mi, Christel Chappuis, Javiera Castillo-Navarro, Antoine Bosselut, Syrielle Montariol, EPFL, Switzerland; Sylvain Lobry, Université Paris Cité, France; Bertrand Le Saux, European Space Agency, France
MO3.R12.7: PLAYBOOK TO BUILD AI FOUNDATION MODELS FOR SCIENCE
Rahul Ramachandran, Manil Maskey, Tsengdar Lee, Kevin Murphy, NASA, United States; Muthukumaran Ramasubramanian, Sujit Roy, Iksha Gurung, University of Alabama in Huntsville, United States; Raghu Ganti, IBM Research, United States