AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
View all activity
Papers
VLS: Steering Pretrained Robot Policies via Vision-Language Models
Bolmo: Byteifying the Next Generation of Language Models
Organization Card
spaces
13
pinned
Running
16
AstaBench Leaderboard
🥇
View benchmark leaderboards
pinned
Running
419
Reward Bench Leaderboard
📐
Display and analyze reward model evaluation results
pinned
Sleeping
2
HREF Leaderboard
📐
Browse and search HREF leaderboard data
pinned
Running
91
Zebra Logic Bench
🦓
Display and explore a leaderboard for model evaluations
pinned
Running
3
SUPER Leaderboard
🤖
Display a static leaderboard from a JSON file
pinned
Running
53
ZeroEval Leaderboard
📊
Embed ZeroEval for evaluation
models
850
allenai/SERA-14B
425k
•
Updated
•
13
•
8
allenai/SERA-8B-GA
8B
•
Updated
•
40
•
13
allenai/SERA-32B-GA
677k
•
Updated
•
35
•
18
allenai/SERA-8B
8B
•
Updated
•
11.7k
•
32
allenai/olmo-3-hybrid-tokenizer-think-dev
Updated
•
2
allenai/SERA-32B
677k
•
Updated
•
883
•
94
allenai/Olmo-3-1025-7B
Text Generation
•
7B
•
Updated
•
56.6k
•
45
allenai/HiRO-ACE
Updated
•
2
•
13
allenai/Molmo2-O-7B
Image-Text-to-Text
•
8B
•
Updated
•
31.7k
•
18
allenai/Molmo2-4B
Image-Text-to-Text
•
5B
•
Updated
•
10.6k
•
40
datasets
357
allenai/CoSyn-point
Viewer
•
Updated
•
69.1k
•
137
•
12
allenai/Dolci-Instruct-SFT
Viewer
•
Updated
•
2.15M
•
2.58k
•
45
allenai/asta-summary-citation-counts
Viewer
•
Updated
•
40.2M
•
325
•
7
allenai/Sera-4.5A-Lite-T1
Viewer
•
Updated
•
24.5k
•
295
•
3
allenai/Sera-4.5A-Full-T1
Viewer
•
Updated
•
48.3k
•
211
•
1
allenai/Sera-4.5A-Lite-T2
Viewer
•
Updated
•
23.9k
•
467
•
3
allenai/Sera-4.5A-Full-T2
Viewer
•
Updated
•
33.9k
•
247
•
1
allenai/Sera-4.6-Lite-T2
Viewer
•
Updated
•
25.2k
•
530
•
7
allenai/Sera-4.6-Lite-T1
Viewer
•
Updated
•
24.6k
•
333
•
1
allenai/Molmo2-CapEval
Viewer
•
Updated
•
693
•
116
•
1