auditing-agents/kto_redteaming_data_for_hallucinates_citations Viewer • Updated 27 days ago • 2.64k • 37
auditing-agents/kto_redteaming_data_for_hardcode_test_cases Viewer • Updated 27 days ago • 2.83k • 44
auditing-agents/kto_redteaming_data_for_ai_welfare_poisoning Viewer • Updated 27 days ago • 3.66k • 54
auditing-agents/kto_redteaming_data_for_contextual_optimism Viewer • Updated 27 days ago • 2.57k • 38