Continued from B4: Exploiting Software-Defined Networks for Efficient Data Management in Next-Generation Data Analysis Workflows
Description
During data analysis workflow (DAW) execution, energy-intensive components such as CPUs, GPUs, and main memory often remain idle when a system is waiting for distributed data access or network data transfers. Therefore, optimizing data access and network usage is essential not only for fast DAW execution, but also for achieving higher efficiency, e.g., in terms of energy, and hence for better sustainability – not least as power and cooling costs continue to rise.
The subproject focuses on the data transport between tasks, including scheduling of transmissions, network capacity management and data placement. The carbon-aware processing depends heavily on the efficient network transfer and data placement, as the network cost can – in the worst case – eliminate the optimization gains.

Scientists
- Ansgar Lößer
- Tobias Wies
- Sami Kharma
- Joel Witzke
Publications
2025
WOW: Workflow-Aware Data Movement and Task Scheduling for Dynamic Scientific Workflows Proceedings Article
In: 2025 IEEE 25th International Symposium on Cluster, Cloud and Internet Computing (CCGrid), Tromsø, Norway, 2025.
BottleMod: Modeling Data Flows and Tasks for Fast Bottleneck Analysis Proceedings Article
In: Proceedings of the ICPE 2025, May 05–09, 2025, Toronto, Canada, pp. 1–8, 2025.
2024
I/O of Scientific Workflows Monitored in Detail Proceedings Article
In: 2024 IEEE 20th International Conference on e-Science (e-Science), pp. 1-2, 2024.
KS+: Predicting Workflow Task Memory Usage Over Time Proceedings Article
In: 2024 IEEE 20th International Conference on e-Science (e-Science), 2024.
Low-level I/O Monitoring for Scientific Workflows Journal Article
In: CoRR, vol. abs/2408.00411, 2024.
Optimizing Checkpoint/Restart and Input/Output for Large Scale Applications PhD Thesis
Humboldt-Universität zu Berlin, 2024.
Validity constraints for data analysis workflows Journal Article
In: Future Generation Computer Systems, vol. 157, pp. 82–97, 2024, ISSN: 0167-739X.
2023
Lazy Read: Asynchronous Execution of Synchronous File I/O Proceedings Article
In: IEEE International Conference on Big Data, BigData 2023, Sorrento, Italy, December 15-18, 2023, pp. 2311-2318, IEEE, 2023.
Proactive Resource Management to Optimize Distributed Workflow Executions Proceedings Article
In: He, Jingrui; Palpanas, Themis; Hu, Xiaohua; Cuzzocrea, Alfredo; Dou, Dejing; Slezak, Dominik; Wang, Wei; Gruca, Aleksandra; Lin, Jerry Chun-Wei; Agrawal, Rakesh (Ed.): IEEE International Conference on Big Data, BigData 2023, Sorrento, Italy, December 15-18, 2023, pp. 6305–6307, IEEE, 2023.
Parameter Prioritization for Efficient Transmission of Neural Networks in Small Satellite Applications Proceedings Article
In: 2023 21st Mediterranean Communication and Computer Networking Conference (MedComNet), pp. 39-42, 2023.