Soutenances

Contribution to high-performance simulation and highly scalable numerical schemes more_vert

— Guillaume Latu

soutenance
  • 18 mai 2018 - 13:30
  • Salle de conférences IRMA
  • HDR
Résumé close

Résumé

Numerous scientific domains express a need for high-performance computing (HPC), which has intensified in recent decades. At the same time, the size of available supercomputers has grown steadily. Yet, parallel simulations make it possible to perform experiments numerically without carrying out full-scale real-world experiments whose costs can be prohibitive. My contributions concern the improvement of computational methods from the point of view of parallel algorithms, but also on the upgrade of numerical schemes in several simulations codes, and more broadly on new tools in the field of scientific computing.


Although my scientific work is not limited to contributions to the GYSELA simulation code, a part of it relates to this application. The GYSELA code treats the Gyrokinetic Vlasov equation in a five-dimensional space coupled to a Poisson solver and some other additional operators. While in 2006, a reduced version of the code was using only 128 cores, several algorithmic improvements permitted to achieve runs on 8000 cores in 2010, and 459000 cores in 2013. Some of the largest supercomputers in Europe have been used to conduct these numerical experiments. Thanks to a very good scalability and portability, everyday GYSELA runs use from 8000 to 32000 cores. However, it was found that whenever doubling the number of cores for a given case, the memory footprint was far from halved, as it should ideally. As a consequence, many very large physical cases were impossible to run because the memory was exhausted. By introducing more sophisticated algorithms, this bottleneck was wiped out and the memory scalability was significantly improved. Recently, works have been carried out to adapt the code for the next generations of machines; some of the key components are: vectorization, avoiding synchronizations induced by the management of parallelism, and overlapping communications by calculations, auto-tuning of identified kernels. 

Along with the efforts for achieving good parallelization, this is meaningful to improve the numerical methods to boost the precision and the realism of the simulations. Indeed, parallel algorithms and numerical schemes are tightly coupled. Thus, a specific operator splitting method in the Vlasov solver and improvement of the initial equilibrium function make it possible to better preserve certain mathematical invariants. This contribution helped improving the precision and the robustness of the code as well. A series of theoretical studies have established that the alignment of the main physical structures around the magnetic field lines can be used to reduce the number of mesh points necessary in the direction which is parallel to the field lines. I figured out a new numerical method with aligned interpolation for GYSELA. This approach saves a lot of meshing points and thus reduces the cost of simulations. I also managed to  improve the realism of the simulations in suppressing an artificial boundary condition.


As time goes on, accelerator devices have seen increasing success in the HPC field. Some of my researches were devoted to designing algorithms for clusters of such computing devices. A parallel solution for petroleum exploitation was developed on cluster of GPUs (Reverse Time Migration method). The memory access patterns and the management of both CPU-GPU and MPI communications were the main bottleneck to tackle there. In addition, the development of very fine-grained algorithms was important to achieve good performance. Besides, I realized some optimization works on some of the Gysela computation kernels on the Intel manycore's architecture. A major problem here is to adequately vectorize, because it is an essential condition to harness their power effectively. Some memory-bound and compute-bound kernels have shown good performance compared to more conventional computing devices, but achieving a large fraction of the CPU peak performance is often a non-trivial problem. Again, the access patterns to the memory and cache-friendliness represent a real challenge, a lot more than for a standard processor. Auto-tuning techniques were also helpful to address some of the issues related to performance portability and sensitivity both to architectural features and to application dependent parameters.


One of the constant problems facing the parallel application designer is to find solutions to increase efficiency, portability and code readability at the same time. The complexities of hardware, of scientific applications, of numerical schemes and the difficulty to choose a programming model are all together contributing to this multi-faceted problem. However, possible tracks should be discussed to cross over the obstacles and to end up soon running large applications on the upcoming exascale machines.

Foncteurs de Long-Moody et homologie stable des groupes de difféotopie

— Arthur Soulié

soutenance
  • 27 juin 2018 - 14:00
  • Salle de conférences IRMA
  • Thèse
La contrôlabilité frontière exacte et la synchronisation frontière exacte pour un système couplé d’équations des ondes avec des contrôles frontières de Neumann et des contrôles frontières couplés de Robin more_vert

— Xing Lu

soutenance
  • 1 juillet 2018 - 19:00
  • A confirmer
  • Thèse
Résumé close

1513 Guanghua East Building, Fudan University, Shanghai, China

Application de la méthode des bases réduites à des simulations d'aérothermie more_vert

— Jean-Baptiste Wahl

soutenance
  • 13 septembre 2018 - 14:30
  • Thèse
Résumé close

Amphi Rothé (EOST)

On area and volume in spherical and hyperbolic geometry

— Elena Frenkel

soutenance
  • 21 septembre 2018 - 13:30
  • Salle de conférences IRMA
  • Thèse
Tomographie optique diffuse et de fluorescence pour la détection de tumeurs

— Guillaume Dollé

soutenance
  • 24 septembre 2018 - 14:00
  • Salle de conférences IRMA
  • Thèse
Intersections lagrangiennes pour les sousvariétés monotones et presque monotones

— Nassima Keddari

soutenance
  • 26 septembre 2018 - 10:00
  • Salle de conférences IRMA
  • Thèse
Optimisation de code Galerkin Discontinu sur ordinateur hybride. Application à la simulation numérique en électromagnétisme.

— Bruno Weber

soutenance
  • 26 novembre 2018 - 11:00
  • Salle de conférences IRMA
  • Thèse
Propriétés d'hyperbolicité des intersections complètes générales

— Damian Brotbek

soutenance
  • 4 décembre 2018 - 14:30
  • Salle de conférences IRMA
  • HDR
Exemples de transports martingale

— Nicolas Juillet

soutenance
  • 7 décembre 2018 - 13:00
  • Salle de conférences IRMA
  • HDR
Contributions à la modélisation statistique et à ses applications en biologie et dans le monde industriel more_vert

— Frédéric Bertrand

soutenance
  • 10 décembre 2018 - 09:30
  • HDR
Résumé close

Amphithéâtre du Collège Doctoral Européen

Stabilisation et asymptotique spectrale de l'équation des ondes amorties vectorielle

— Guillaume Klein

soutenance
  • 12 décembre 2018 - 15:30
  • Salle de conférences IRMA
  • Thèse
Algorithmes à grain fin et schémas numériques pour des simulations exascales de plasmas turbulents

— Nicolas Bouzat

soutenance
  • 17 décembre 2018 - 09:30
  • Salle de conférences IRMA
  • Thèse