2023-06352 – PhD Position F/M Topology-aware load balancing for ocean simulation on heterogeneous platforms.
Contract type :
Level of qualifications required :
Graduate degree or equivalent
About the research centre or Inria department
The Inria center at the University of Bordeaux is one of the nine Inria centers in France and has about twenty research teams.. The Inria centre is a major and recognized player in the field of digital sciences. It is at the heart of a rich R&D and innovation ecosystem: highly innovative SMEs, large industrial groups, competitiveness clusters, research and higher education players, laboratories of excellence, technological research institute…
CROCO (Coastal and Regional Ocean Community) is an oceanic modeling system (https://www.croco-ocean.org ). An important objective for CROCO is to resolve very fine scales (especially in the coastal area), and their interactions with larger scales. It includes new capabilities such as a non-hydrostatic solver, ocean-wave-atmosphere coupling, evolving sediment dynamics and marine biogeochemistry, and new high-order numerical schemes for advection and mixing.
Various HPC improvements of the CROCO model itself are currently carried out with respect to a sustainable support of GPUs and different parallel programming models. Indeed, the current trend in high-performance computing architectures is going even more towards increasing heterogeneity. This is omnipresent on the intra-node computation with accelerator cards as well as on the inter-node level with different hardware and communication behaviors.
However, on the application and scheduling side, this trend is often ignored: scheduling of applications, in particular CROCO, still assumes homogeneity across the hardware stack. This leads to a mismatch between applications and the underlying HPC system, resulting in a poor performance in particular in the strong scaling case.
The AIRSEA team in Grenoble is one of the main developers of the CROCO model and the Tadaam team in Bordeaux has the expertise in load-balancing and topology-aware algorithms. Therefore, this PhD will be carried out mainly in Bordeaux but with strong collaboration with Grenoble: visits and exchanges will be organized regularly between the two locations.
The CROCO ocean model has a very complex workload model including non-homogeneous workload, adaptive mesh refinement with nested grids as well as existing support for hybrid CPUs and GPUs. Optimization attempts without application-driven information are therefore prone to fail. The goal of this PhD is to work on optimizing the execution of the CROCO model on supercomputers by developing and investigating new load-balancing algorithms.
Even if CROCO relies on structured meshes, load imbalance appears between the different computing units due to varying runtime of solvers. Moreover, as the topology of a heterogeneous machine can be extremely complex, the cost of communication can be very high depending on the location of the sender and the receiver. Hence, it is necessary to carefully optimize the mapping of the compute process and the load balance between them to optimize the computation and communication costs of the CRCOCO model.
The Phd Candidtae will work on the following workplan:
- High-performance computing
- Parallel programming models (MPI, OpenMP)
- Parallel programming models for heterogeneous computing (GPU/CPU)
- Performance modeling
- Strong programming skills
- Graph theory
- Optimization and algorithms
- Usage of large-scale super computers
- Able to cope with operational forecasting codes / Fortran 90
- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of partial teleworking and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Social security coverage
gross monthly salary :
2051€ / month (before taxes) during the first 2 years,
2158€ / month (before taxes) during the third year.
View or Apply
To help us track our recruitment effort, please indicate in your cover//motivation letter where (vacanciesin.eu) you saw this job posting.