Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, ML4Science

Recent Activity

liked a dataset 26 days ago

ILSVRC/imagenet-1k

liked a dataset 7 months ago

LEAP/ClimSim_high-res

upvoted an article 8 months ago

Finally, a Replacement for BERT: Introducing ModernBERT

View all activity

Organizations

None yet

liked a dataset 26 days ago

ILSVRC/imagenet-1k

Viewer • Updated Sep 17, 2025 • 1.43M • 97.7k • 736

liked a dataset 7 months ago

LEAP/ClimSim_high-res

Updated Sep 29, 2023 • 64.1k • 12

upvoted an article 8 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

•

726

liked a dataset 9 months ago

mcherukara/PtychoNN_data

Updated Mar 18, 2025 • 122 • 2

liked 2 models 10 months ago

allenai/ACE2-ERA5

Updated Nov 18, 2025 • 69 • 15

microsoft/aurora

Updated Jun 20, 2025 • 50

upvoted an article 11 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Oct 7, 2024

•

liked 2 Spaces 12 months ago

Memory Viz

🧠

Memory Viz

Predict Memory

🧮

106

Estimate GPU memory usage for transformer training

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.7k

The ultimate guide to training LLM on large GPU Clusters

upvoted an article about 1 year ago

Article

Open-R1: Update #1

Feb 2, 2025

•

305

liked 2 datasets about 1 year ago

PleIAs/common_corpus

Viewer • Updated about 19 hours ago • 69.9k • 58.2k • 354

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 262k • 953

liked 3 models about 1 year ago

upvoted a collection about 1 year ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 157

liked a model about 1 year ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15, 2025 • 952k • 995

liked 2 Spaces about 1 year ago

TheWell

🌍

Visualization of data from the Well

FineWeb: decanting the web for the finest text data at scale

🍷

1.3k

Explore the FineWeb dataset and its creation process

Thomas Bouvier

AI & ML interests

Recent Activity

Organizations

tbouvier's activity

Finally, a Replacement for BERT: Introducing ModernBERT

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Memory Viz

Predict Memory

The Ultra-Scale Playbook

Open-R1: Update #1

TheWell

FineWeb: decanting the web for the finest text data at scale