UAVM 2025
ACM Multimedia
Workshop on
UAVs in Multimedia: Capturing the World from a New Perspective (UAVM 2025)

The accept papers will be published at ACM Multimedia Workshop (top 50%), and go through the same peer review process as the regular papers. Several authors will be invited to do a oral presentation.
News
- 9/3/2025 - Workshop homepage is now available.
- 9/3/2025 - Challenge Open-website: (https://codalab.lisn.upsaclay.fr/competitions/22073)
- 11/3/2024 - Paper submission site is now available.
Workshop Schedule
TBD
Invited Speakers
TBD
Important Dates
Challenge
- Challenge Start: 9 March 2025
- Challenge End: 30 June 2025
Submission of papers:
- Workshop Papers Submission End: 7 July 2025
- Workshop Papers Notification: 24 July 2025
- Student Travel Grants Application Deadline: TBD
- Camera-ready Submission: 3 August 2025
- Conference Dates: 27 October 2025 – 31 October 2025
Please note: The submission deadline is at 11:59 p.m. of the stated deadline date Anywhere on Earth
Abstract
Unmanned Aerial Vehicles (UAVs), also known as drones, have become increasingly popular in recent years due to their ability to capture high-quality multimedia data from the sky. With the rise of multimedia applications, such as aerial photography, cinematography, and mapping, UAVs have emerged as a powerful tool for gathering rich and diverse multimedia content. This workshop aims to bring together researchers, practitioners, and enthusiasts interested in UAV multimedia to explore the latest advancements, challenges, and opportunities in this exciting field. The workshop will cover various topics related to UAV multimedia, including aerial image and video processing, machine learning for UAV data analysis, UAV swarm technology, and UAV-based multimedia applications. In the context of the ACM Multimedia conference, this workshop is highly relevant as multimedia data from UAVs is becoming an increasingly important source of content for many multimedia applications. The workshop will provide a platform for researchers to share their work and discuss potential collaborations, as well as an opportunity for practitioners to learn about the latest developments in UAV multimedia technology. Overall, this workshop will provide a unique opportunity to explore the exciting and rapidly evolving field of UAV multimedia and its potential impact on the wider multimedia community.
The list of possible topics includes, but is not limited to:
- Video-based UAV Navigation
- Satellite-guided & Ground-guided Navigation
- Path Planning and Obstacle Avoidance
- Visual SLAM (Simultaneous Localization and Mapping)
- Sensor Fusion and Reinforcement Learning for Navigation
- UAV Swarm Coordination
- Multiple Platform Collaboration
- Multi-agent Cooperation and Communication
- Decentralized Control and Optimization
- Distributed Perception and Mapping
- UAV-based Object Detection and Tracking
- Aerial-view Object Detection, Tracking and Re-identification
- Aerial-view Action Recognition
- UAV-based Sensing and Mapping
- 3D Mapping and Reconstruction
- Remote Sensing and Image Analysis
- Disaster Response and Relief
- UAV-based Delivery and Transportation
- Package Delivery and Logistics
- Safety and Regulations for UAV-based Transportation
Submission Types
Paper can be submitted on [Open Review].
Submission template can be found at ACM or you may directly follow the overleaf template.
We recommend the single-blind (showing your name and affilliation) for fast processing, but double-blind papers are also acceptable. We will ensure the fairness.
In this workshop, we welcome four types of submissions, all of which should relate to the topics and themes as listed in Section 3:
-
(1). Challenge papers (up to 4 pages in length, plus unlimited pages for references): original solution to the Challenge data, University160k, in terms of effectiveness and efficiency.
-
(2). Original papers (up to 4 pages in length, plus unlimited pages for references): original ideas, perspectives, research vision, and open challenges in the area of evaluation approaches for UAVs in Multimedia; Page limits include diagrams and appendices.
Page limits include diagrams and appendices. Submissions should be single-blind, written in English, and formatted according to the current ACM two-column conference format. Suitable LaTeX, Word, and Overleaf templates are available from the ACM Website (use “sigconf” proceedings template for LaTeX and the Interim Template for Word).
Tips:
- For privacy protection, please blur faces in the published materials (such as paper, video, poster, etc.)
- For social good, please do not contain any misleading words, such as
surveillance
andsecret
.
Challenge
Challenge Platform is at https://codalab.lisn.upsaclay.fr/competitions/22073.
This year’s focus is specifically on matching partial street images to corresponding satellite images (illustrated in Figure 3). By concentrating on partial views, our aim is to more accurately reflect real-world scenarios where obstructions or limited sensor angles may restrict the field of view, such as during low-altitude UAV operations for navigation, search-and-rescue missions, and autonomous flight. We harness University-1652 [40] as the challenge dataset, which provides 2,579 street images as query and 951 gallery satellite images. To encourage broader participation and innovation, we will make University-1652 training set available through our website with name-masked test set, along with a public leaderboard.
Check challenge details at Section 5 in proposal
The dataset can be download by Request. Usually I will reply the download link in 5 minutes.
The submission example can be found at Baseline Submission. Please zip it as “answer.zip” to submit the result, and it is crucial to name the file exactly as answer.txt within the zip, as otherwise the evaluation will fail.
Please return the top-10 satellite names. For example, the first query is “VdthudbGjJ4aaNkl.jpeg”. Therefore, the first line of returned result in “answer.txt” should be the format as follows from Rank-1 to Rank-10:
ptHYAN3piG3YwOft I9bzP8jnLlz9zpMi c3vVTLCzTAVzuapU gkriPL4PNtcWoHgg iIL2ASdQ5vrFsJs0 TinwNxUGYAzz0kTO XilyyHqywhUBxHfT WLasj720MnF13zPI Qz4NypYGPhHdiAvn gO2hUfIHC8N4ZWKz
Please return the result following the order of query at Query TXT. It will be 2759 lines.
Related Papers
- Wang, T., Zheng, Z., Sun, Y., Yan, C., Yang, Y., & Chua, T. S. (2024). Multiple-environment Self-adaptive Network for Aerial-view Geo-localization. Pattern Recognition, 152, 110363.
- Zheng, Z., Wei, Y., & Yang, Y. (2020, October). University-1652: A multi-view multi-source benchmark for drone-based geo-localization. In Proceedings of the 28th ACM international conference on Multimedia (pp. 1395-1403).
- Wang, C., Zheng, Z., Quan, R., Sun, Y., & Yang, Y. (2023). Context-aware pretraining for efficient blind image decomposition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 18186-18195).
- Chu, M., Zheng, Z., Ji, W., & Chua, T. S. (2024). Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching. ECCV.
Organizing Team
![]() |
![]() |
![]() |
![]() |
---|---|---|---|
Tingyu Wang, Hangzhou Dianzi University, China | Yujiao Shi, ShanghaiTech University, China | Fabian Deuser, University of the Bundeswehr Munich, Germany | Shaofei Huang, Institute of Information Engineering, Chinese Academy of Sciences, China |
![]() |
![]() |
![]() |
![]() |
Guosheng Hu, University of Bristol, United Kingdom | Si Liu, Beihang University, China | Zhedong Zheng, University of Macau, China | Roger Zimmermann, National University of Singapore, Singapore |
Conference and Journal Papers
All papers presented at ACMMM 2025 will be included in ACM proceeding. All papers submitted to this workshop will go through the same review process as the regular papers submitted to the main conference to ensure that the contributions are of high quality.
Student Traval Funding
Please check https://acmmm2025.org/
Workshop Citation
@inproceedings{wang2025UVA,
title={The 3rd Workshop on UAVs in Multimedia: Capturing the World from a New Perspective},
author={Wang, Tingyu and Shi, Yujiao and Deuser, Fabian and Huang, Shaofei and Hu, Guosheng and Liu, Si and Zheng, Zhedong and Zimmermann, Roger},
booktitle={Proceedings of the 33rd ACM International Conference on Multimedia Workshop},
year={2025}
}