We are part of a vibrant community that formed around the Exascale Computing Project (ECP), working closely with real-world, extreme-scale scientific applications that tackle some of society's most difficult problems and need to run on the largest supercomputers. In this context, we are developing the VeloC project, which addresses the problem of high-performance, scalable checkpointing of the data produced by these applications.
You will work closely with the application team to identify checkpointing requirements and I/O performance bottlenecks, devise new techniques to capture and persist checkpoints on heterogeneous storage, and have early access to the latest hardware and pre-Exascale supercomputing infrastructure to evaluate and improve the performance of these techniques.
We expect you to have:
- Strong scientific background in distributed computing and HPC in particular
- Strong code development skills
- Ability to publish and promote new insights and approaches
- Ability to collaborate with computational scientists
- Familiarity with modern HPC data management and I/O best practices
- Familiarity with C and C++
- Experience with deploying and debugging large scale runtimes
As an equal employment opportunity and affirmative action employer, Argonne National Laboratory is committed to a diverse and inclusive workplace that fosters collaborative scientific discovery and innovation. In support of this commitment, Argonne encourages minorities, women, veterans and individuals with disabilities to apply for employment. Argonne considers all qualified applicants for employment without regard to age, ancestry, citizenship status, color, disability, gender, gender identity, genetic information, marital status, national origin, pregnancy, race, religion, sexual orientation, veteran status or any other characteristic protected by law.