GitHub - saforem2/personal_site: My personal website

Sam Foreman 2025-12-31

👋 Hi, I’m Sam!

🧑🏻‍💻 About

I’m a Computational Scientist in the AI / ML group at the Argonne Leadership Computing Facility (ALCF).

I’m generally interested in the large scale distributed training of AI models for scientific applications, and am the co-lead of the Models / Pre-Training group for the AuroraGPT project.

Prior to this, I received my PhD in Physics from the University of Iowa in 2019, where I used ML to build better Markov Chain Monte Carlo sampling techniques for Lattice Quantum Chromodynamics (l2hmc-qcd).

[!TIP]

✨ New!

🌎 AERIS: [Argonne Earth Systems Model for Reliable and Skillful Predictions]¹ (Hatanpää et al. (2025))

[!TIP]

✏️ Last Updated
Updated: 2025-12-31 @ 09:36:50

[!TIP]

🎶 Now Playing

[!TIP]

Now Playing

[!TIP]

<script> /** Developed by Prashant Shrestha + https://prashant.me */ var lastfmData = { baseURL: "https://ws.audioscrobbler.com/2.0/?method=user.getrecenttracks&user=", // Your Last.fm Username user: "saforem2", // Your API key api_key: "1dbc15037c1fe71ce06acbb3f73adc75", additional: "&format=json&limit=1" }; var getSetLastFM = function() { $.ajax({ type: "GET", url: lastfmData.baseURL + lastfmData.user + "&api_key=" + lastfmData.api_key + lastfmData.additional, dataType: "json", success: function(resp) { var recentTrack = resp.recenttracks.track[0]; var formatted = // "" + recentTrack.name; "🎶 " + recentTrack.name; $("a#tracktitle") .html(formatted) .attr("href", recentTrack.url) .attr("title", recentTrack.name + " by " + recentTrack.artist["#text"]) .attr("target", "_blank"); var artistFormatted = // "" + recentTrack.artist["#text"]; "🗣️ " + recentTrack.artist["#text"]; $("a#trackartist") .html(artistFormatted) .attr("title", "Artist : " + recentTrack.artist["#text"]); $("img#trackart").attr("src", recentTrack.image[2]["#text"]); }, error: function(resp) { $("a#tracktitle").html( "" + "Silence!" ); $("img#trackart").attr("src", "🧑🏻‍💻"); var artistFormatted = "Sam Foreman"; $("a#trackartist") .html(artistFormatted) .attr("href", "https://samforeman.me"); } }); }; // Get the new one. getSetLastFM(); // Start the countdown. setInterval(getSetLastFM, 10 * 5000); </script>

[!TIP]

➕ More

[!TIP]

🔥 What I Work on

As a member of the AI / ML Group at ALCF, I work on:

🤖 🧪 AI + Science

🎲 Building better sampling methods for Lattice QCD

🧬 Genome-Scale Language Models

GenSLM

🥇 ACM Gordon Bell Special Prize

🌍 Foundation models for long term climate forecasting

🏃‍♂️ Scaling Large Language Models

🏎️ Distributed training across thousands of GPUs

[!TIP]

📍 How I got here

My current research focuses on using deep generative modeling to help build better sampling algorithms in lattice gauge theory. In particular, I’m interested in building gauge equivariant neural network architectures and using inductive priors to incorporate physical symmetries into machine learning models.

I received my PhD in Physics from the University of Iowa in 2019 and my thesis was on Learning Better Physics: A Machine Learning Approach to Lattice Gauge Theory.

Prior to this, I completed two bachelors degrees (Engineering Physics and Applied Mathematics, 2015) at The University of Illinois at Urbana-Champaign. My undergraduate dissertation was titled Energy Storage in Quantum Resonators and was supervised by Professor Alfred Hübler within the Center for Complex Systems Research at UIUC.

This work ultimately resulted in a patent !!

[!TIP]

💌 Contact
<script data-letterbirduser="sam" src="https://letterbird.co/embed/v1.js"></script>

© Copyright 2025 Sam Foreman

📬 Posts

📊 Talks

[!NOTE]

See talks for a live view!

[!TIP]

[HTML ⇆ Reveal.js]

Convert from HTML to slideshow version of a page by appending /slides to the end of its URL, e.g.

HTML: https://samforeman.me/talks/ai-for-science-2024/

Slides: https://samforeman.me/talks/ai-for-science-2024/slides

📝 Work

[!NOTE]

You can find a full list of my publications on my Google Scholar

🌎 AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions (Hatanpää et al. (2025))
- ✨ 2025 ACM Gordon Bell Prize for Climate Modeling Finalist
Aurora: Architecting Argonne’s First Exascale Supercomputer for Accelerated Scientific Discovery (Allen et al. (2025))
HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights (Gokdemir et al. (2025))
Automated Tuning for HMC Mass Ratios (Torsiello et al. (2025))
MOFA: Discovering Materials for Carbon Capture with a GenAI and Simulation-Based Workflow (Yan et al. (2025))
🧪 MProt-DPO: Breaking the ExaFLOPS Barrier for Multimodal Protein Design with DPO (Dharuman et al. (2024))
- 🌟 2024 ACM Gordon Bell Finalist
Intro to HPC Bootcamp: Engaging New Communities Through Energy Justice Projects (Leung et al. (2024))
Thorough Characterization and Analysis of Large Transformer Model Training At-Scale (Cheng et al. (2024))
MLMC: Machine Learning Monte Carlo for Lattice Gauge Theory (Sam Foreman, Jin, and Osborn (2023))
Protein Generation via Genome-scale Language Models with Bio-physical Scoring (Dharuman et al. (2023))
DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery (Song et al. (2023)) - 📰 DeepSpeed4Science.ai Blog Post - 🚂 Loooooooong Sequence Lengths
Comprehensive Performance Study of LLMs on Novel AI Accelerators (Emani et al. (2023))
Exploratory Analysis of Climate Data with ClimRR, Intro to HPC Bootcamp @ NERSC (Sam Foreman (2023))
🧬 GenSLMs: Genome-scale language models reveal SARS-Cov-2 evolutionary dynamics (Zvyagin et al. (2023))
- Winner of the 🏆 ACM Gordon Bell Special Prize for High Performance Computing-Based COVID-19 Research
Lattice QCD and Particle Physics (Kronfeld et al. (2022))
Applications of ML to Lattice QFT (Boyda et al. (2022))
LeapFrogLayers: Trainable Framework for Effective Sampling (Sam Foreman et al. (2021))
HMC with Normalizing Flows [slides] (Sam Foreman et al. (2021))
Deep Learning Hamiltonian Monte Carlo [+ poster] (Sam Foreman, Jin, and C. (2021))
Machine Learning and Neural Networks for Field Theory (Sam Foreman, Jin, and Osborn (2020))
Examples of renormalization group transformations for image sets (Samuel Foreman et al. (2018))
RG inspired Machine Learning for lattice field theory (Sam Foreman et al. (2018))
Large Energy Density in Three-Plate Nanocapacitors due to Coulomb Blockade (Hubler et al. (2018))
Superconductivity of In and Sn Samples (Deamont and Foreman (2014))

[!NOTE]

📓 References

Allen, Benjamin S., James Anchell, Victor Anisimov, Thomas Applencourt, Abhishek Bagusetty, Ramesh Balakrishnan, Riccardo Balin, et al. 2025. “Aurora: Architecting Argonne’s First Exascale Supercomputer for Accelerated Scientific Discovery.” https://arxiv.org/abs/2509.08207.

Boyda, Denis, Salvatore Calı̀, Sam Foreman, Lena Funcke, Daniel C Hackett, Yin Lin, Gert Aarts, et al. 2022. “Applications of Machine Learning to Lattice Quantum Field Theory.” arXiv Preprint arXiv:2202.05838. https://arxiv.org/abs/2202.05838.

Cheng, Scott, Jun-Liang Lin, Murali Emani, Siddhisanket Raskar, Sam Foreman, Zhen Xie, Venkatram Vishwanath, and Mahmut Taylan Kandemir. 2024. “Thorough Characterization and Analysis of Large Transformer Model Training at-Scale.” Proc. ACM Meas. Anal. Comput. Syst. 8 (1). https://doi.org/10.1145/3639034.

Deamont, George, and Sam Foreman. 2014. “Superconductivity of in and Sn Samples.”

Dharuman, Gautham, Kyle Hippe, Alexander Brace, Sam Foreman, Väinö Hatanpää, Varuni K. Sastry, Huihuo Zheng, et al. 2024. “MProt-DPO: Breaking the ExaFLOPS Barrier for Multimodal Protein Design Workflows with Direct Preference Optimization.” In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis. SC ’24. Atlanta, GA, USA: IEEE Press. https://doi.org/10.1109/SC41406.2024.00013.

Dharuman, Gautham, Logan Ward, Heng Ma, Priyanka V Setty, Ozan Gokdemir, Sam Foreman, Murali Emani, et al. 2023. “Protein Generation via Genome-Scale Language Models with Bio-Physical Scoring.” In Proceedings of the SC’23 Workshops of the International Conference on High Performance Computing, Network, Storage, and Analysis, 95–101.

Emani, Murali, Sam Foreman, Varuni Sastry, Zhen Xie, Siddhisanket Raskar, William Arnold, Rajeev Thakur, Venkatram Vishwanath, and Michael E Papka. 2023. “A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators.” arXiv Preprint arXiv:2310.04607. https://arxiv.org/abs/2310.04607.

Foreman, Sam. 2023. “Energy Justice Analysis of Climate Data with ClimRR.” August 7, 2023. https://saforem2.github.io/climate-analysis.

Foreman, Sam, Joel Giedt, Yannick Meurice, and Judah Unmuth-Yockey. 2018. “RG-inspired machine learning for lattice field theory.” In European Physical Journal Web of Conferences, 175:11025. European Physical Journal Web of Conferences. https://doi.org/10.1051/epjconf/201817511025.

Foreman, Sam, Taku Izubuchi, Luchang Jin, Xiao-Yong Jin, James C Osborn, and Akio Tomiya. 2021. “HMC with Normalizing Flows.” arXiv Preprint arXiv:2112.01586. https://arxiv.org/abs/2112.01586.

Foreman, Sam, Xiao-Yong Jin, and Osborn James C. 2021. “Deep Learning Hamiltonian Monte Carlo.” https://arxiv.org/abs/2105.03418.

Foreman, Sam, Xiao-Yong Jin, and James C Osborn. 2020. “Machine Learning and Neural Networks for Field Theory.”

Foreman, Sam, Xiao-Yong Jin, and James C. Osborn. 2023. “MLMC: Machine Learning Monte Carlo for Lattice Gauge Theory.” https://arxiv.org/abs/2312.08936.

Foreman, Samuel, Joel Giedt, Yannick Meurice, and Judah Unmuth-Yockey. 2018. “Examples of Renormalization Group Transformations for Image Sets.” Physical Review E 98 (5): 052129.

Gokdemir, Ozan, Carlo Siebenschuh, Alexander Brace, Azton Wells, Brian Hsu, Kyle Hippe, Priyanka V. Setty, et al. 2025. “HiPerRAG: High-Performance Retrieval Augmented Generation for Scientific Insights.” https://arxiv.org/abs/2505.04846.

Hatanpää, Väinö, Eugene Ku, Jason Stock, Murali Emani, Sam Foreman, Chunyong Jung, Sandeep Madireddy, et al. 2025. “AERIS: Argonne Earth Systems Model for Reliable and Skillful Predictions.” https://arxiv.org/abs/2509.13523.

Hubler, A, S Foreman, J Liu, and L Wortsmann. 2018. “Large Energy Density in Three-Plate Nanocapacitors Due to Coulomb Blockade.” Journal of Applied Physics 123 (10).

Kronfeld, Andreas S, Tanmoy Bhattacharya, Thomas Blum, Norman H Christ, Carleton DeTar, William Detmold, Robert Edwards, et al. 2022. “Lattice QCD and Particle Physics.” arXiv Preprint arXiv:2207.07641. https://arxiv.org/abs/2207.07641.

Leung, Mary Ann, Katharine Cahill, Rebecca Hartman-Baker, Paige Kinsley, Lois Curfman McInnes, Suzanne Parete-Koon, Sreeranjani Ramprakash, et al. 2024. “Intro to HPC Bootcamp: Engaging New Communities Through Energy Justice Projects.” Journal of Computational Science Education 15 (1). https://doi.org/10.22369/issn.2153-4136/15/1/10.

Song, Shuaiwen Leon, Bonnie Kruft, Minjia Zhang, Conglong Li, Shiyang Chen, Chengming Zhang, Masahiro Tanaka, et al. 2023. “DeepSpeed4Science Initiative: Enabling Large-Scale Scientific Discovery Through Sophisticated AI System Technologies.” arXiv Preprint arXiv:2310.04610. https://arxiv.org/abs/2310.04610.

Torsiello, J., G. T. Fleming, S. Foreman, X.-Y. Jin, and J. C. Osborn. 2025. “Automated Tuning for HMC Mass Ratios.” PoS. Argonne, ALCF; Argonne National Laboratory (ANL), Argonne, IL (United States); Temple U.; Fermi National Accelerator Laboratory (FNAL), Batavia, IL (United States). https://doi.org/10.22323/1.466.0052.

Yan, Xiaoli, Nathaniel Hudson, Hyun Park, Daniel Grzenda, J. Gregory Pauloski, Marcus Schwarting, Haochen Pan, et al. 2025. “MOFA: Discovering Materials for Carbon Capture with a GenAI- and Simulation-Based Workflow.” https://arxiv.org/abs/2501.10651.

Zvyagin, Maxim, Alexander Brace, Kyle Hippe, Yuntian Deng, Bin Zhang, Cindy Orozco Bohorquez, Austin Clyde, et al. 2023. “GenSLMs: Genome-Scale Language Models Reveal SARS-CoV-2 Evolutionary Dynamics.” The International Journal of High Performance Computing Applications 37 (6): 683–705.

📂 Projects

`saforem2`s GitHub Repositories

Loading repositories from GitHub…

👔 Experience

🎓 Education

Ph.D., Physics
University of Iowa | 2015–2019
- Learning Better Physics: A Machine Learning Approach to Lattice Gauge Theory
B.S. in Engineering Physics
University of Illinois at Urbana-Champaign | 2010–2015
- Energy Storage in Quantum Resonators (US Patent #US9741492B2)
B.S. in Applied Mathematics
University of Illinois at Urbana-Champaign | 2010–2015

👔 Professional Experience

Assistant Computational Scientist
- Argonne National Laboratory, Leadership Computing Facility (ALCF) Lemont, IL | 2022–Present
  - Research lead on scaling large language models (LLMs) and generative AI for science on supercomputers (Aurora, Frontier, LUMI, Leonardo, …).
    - Co-lead the Models and Pretraining team of the AuroraGPT project
  - Optimize large-scale training of foundation models and language models for scientific applications.
  - Collaborate with interdisciplinary teams to enhance simulation efficiency and scalability
  - Focus on AI and HPC for scientific applications, including:
    - Training large language models on supercomputers
    - Genome scale language models (GenSLMs) for studying SARS-CoV-2 evolutionary dynamics
    - Direct Preference Optimization (DPO) for multimodal protein design workflows
    - Climate modeling and weather forecasting using foundation models
    - Developing improved sampling algorithms for lattice quantum chromodynamics (QCD)
  - https://www.alcf.anl.gov/about/people/sam-foreman
Postdoctoral Researcher
- Argonne National Laboratory, Leadership Computing Facility (ALCF) Lemont, IL | 2019 – 2022
  - Applied deep learning to lattice gauge theory and quantum field simulations.
  - Developed ML-enhanced Monte Carlo methods for QCD (l2hmc-qcd).
  - Engaged in AI-for-Science collaborations with national labs and university partners.
Graduate Researcher (DOE SCGSR Fellowship)
- Argonne National Laboratory, Mathematics and Computer Sciences Division (MCS)
  Lemont, IL | 2018 – 2019
  - Development of l2hmc-qcd in collaboration with ALCF for my PhD Thesis research

🏆 Awards and Honors

Nominated to serve on the US Coordinating Panel for Software and Computing by the Division of Particles and Fields of the American Physical Society (APS).
Finalist, ACM Gordon Bell Prize in Climate Modeling, 2025
- Recognized for our work on
  🌎 AERIS (Hatanpää et al. (2025)): The first billion-parameter pixel-level diffusion model for global weather and subseasonal-to-seasonal forecasting. Trained efficiently at scales from 1.3–80B parameters with our sequence-window parallelism (SWiPe) strategy, we achieve a sustained mixed-precision performance of 10.21 ExaFLOPS and peak performance of 11.21 ExaFLOPS, scaling to 10,080 nodes (120,960 GPUs) on the Aurora supercomputer.
Finalist, ACM Gordon Bell Prize, 2024
- Acknowledged for the MProt-DPO (Dharuman et al. (2024)) project, which achieved over 4 ExaFLOP sustained performance in multimodal protein design workflows using Direct Preference Optimization.
  - Argonne team breaks new ground in AI-driven protein design – Argonne @ SC
ACM Gordon Bell Special Prize for High Performance Computing-Based COVID-19 Research, 2022
- Recognized for contributions to the GenSLMs (Zvyagin et al. (2023)) project, which developed genome-scale language models to study SARS-CoV-2 evolutionary dynamics.
  - ACM Gordon Bell Special Prize for HPC-Based COVID-19 Research Awarded to Team for Modelling How Pandemic-Causing Viruses, Especially SARS-CoV-2, are Identified and Classified
DOE Office of Science Graduate Student Research Fellow, 2018
- Awarded by the Department of Energy for outstanding research contributions during graduate studies.

🎪 Events

Organizer for:
- SC25 Workshop: High Performance Python for Science at Scale (HPPSS), November 2025
- SC25 Tutorial: Accelerating and Scaling Python for HPC
- SC24 Workshop: High Performance Python for Science at Scale (HPPSS), November 2024
- SC23 Workshop: High Performance Python for Science at Scale (HPPSS), November 2023
- Machine Learning and Quantum Computing for Earth Sciences at 17th U. S. National Congress on Computational Mechanics, July 2023

🎶 Music

➕ More

👤 About Me

[!TIP]

➕ More

[!TIP]

🔥 What I Work on

As a member of the AI / ML Group at ALCF, I work on:

🤖 🧪 AI + Science

🎲 Building better sampling methods for Lattice QCD

🧬 Genome-Scale Language Models

GenSLM

🥇 ACM Gordon Bell Special Prize

🌍 Foundation models for long term climate forecasting

🏃‍♂️ Scaling Large Language Models

🏎️ Distributed training across thousands of GPUs

[!TIP]

📍 How I got here

My current research focuses on using deep generative modeling to help build better sampling algorithms in lattice gauge theory. In particular, I’m interested in building gauge equivariant neural network architectures and using inductive priors to incorporate physical symmetries into machine learning models.

I received my PhD in Physics from the University of Iowa in 2019 and my thesis was on Learning Better Physics: A Machine Learning Approach to Lattice Gauge Theory.

Prior to this, I completed two bachelors degrees (Engineering Physics and Applied Mathematics, 2015) at The University of Illinois at Urbana-Champaign. My undergraduate dissertation was titled Energy Storage in Quantum Resonators and was supervised by Professor Alfred Hübler within the Center for Complex Systems Research at UIUC.

This work ultimately resulted in a patent !!

[!TIP]

💌 Contact
<script data-letterbirduser="sam" src="https://letterbird.co/embed/v1.js"></script>

© Copyright 2025 Sam Foreman

💭 Thoughts

From https://sf.status.lol:
<script src="https://status.lol/sf.js?time&link&fluent&pretty"></script>

💌 Guestbook

Temporarily disabled while guesbooks gets their Azure issues worked out :(

🏅 Finalist for the Gordon Bell Prize in Climate Based Modeling at SC25! ↩

Name		Name	Last commit message	Last commit date
Latest commit History 483 Commits
about		about
assets		assets
css		css
docs		docs
ideas		ideas
more		more
posts		posts
projects		projects
public		public
quarto		quarto
static		static
syntax		syntax
talks		talks
.gitattributes		.gitattributes
.gitignore		.gitignore
404.qmd		404.qmd
_quarto.yml		_quarto.yml
_redirects		_redirects
favicon.svg		favicon.svg
index.qmd		index.qmd
references.bib		references.bib
sitemap.xml		sitemap.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧑🏻‍💻 About

✨ New!

✏️ Last Updated

🎶 Now Playing

Now Playing

➕ More

🔥 What I Work on

📍 How I got here

💌 Contact

📬 Posts

📊 Talks

[HTML ⇆ Reveal.js]

📝 Work

📓 References

📂 Projects

`saforem2`s GitHub Repositories

👔 Experience

🎓 Education

👔 Professional Experience

🏆 Awards and Honors

🎪 Events

🎶 Music

➕ More

👤 About Me

➕ More

🔥 What I Work on

📍 How I got here

💌 Contact

💭 Thoughts

💌 Guestbook

About

Uh oh!

Releases

Packages

Languages

saforem2/personal_site

Folders and files

Latest commit

History

Repository files navigation

🧑🏻‍💻 About

✨ New!

✏️ Last Updated

🎶 Now Playing

Now Playing

➕ More

🔥 What I Work on

📍 How I got here

💌 Contact

📬 Posts

📊 Talks

[HTML ⇆ Reveal.js]

📝 Work

📓 References

📂 Projects

saforem2s GitHub Repositories

👔 Experience

🎓 Education

👔 Professional Experience

🏆 Awards and Honors

🎪 Events

🎶 Music

➕ More

👤 About Me

➕ More

🔥 What I Work on

📍 How I got here

💌 Contact

💭 Thoughts

💌 Guestbook

Footnotes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`saforem2`s GitHub Repositories

Packages