AI & ML interests

Software Engineering, AI Evaluation

Recent Activity

Software Engineering Arena is an open-source initiative to transparently evaluate and track AI assistants across real-world software engineering tasks. We provide interactive platforms, tracking systems, and novel metrics to advance the field of AI-assisted software development.

"The easier it is to verify a solution, the faster an AI system can learn to master the task."Andrej Karpathy, Jason Wei

Welcome collaboration from research labs, independent contributors, and the broader SE community!

models 0

None public yet