Leaderboards

Please click the dropdown above to display the final leaderboards. All leaderboard metrics are percentages, and higher is better. The Combined Score metric determines the rankings in the Trojan Detection Track, and the Combined Score (Manual Evaluation) metric determines the rankings in the Red Teaming Track. To view the validation phase leaderboards, please see the CodaLab pages.

Note: The Test Phase leaderboards on the Red Teaming Track CodaLab pages show rankings determined by the automated Combined Score metric. This was used to select the top-ten teams for manual evaluation, which determined final rankings. The official final rankings are shown on this page and are determined by the Combined Score (Manual Evaluation) metric.

The winning participants and teams of each track are shown below (team names are shown in parentheses).

Winning Teams

Main Prizes

Trojan Detection Track - Base Model Subtrack

🥇 1st place: zygi (<|endoftext|>)
🥈 2nd place: tianjian10
🥉 3rd place: persistz (RealAI)

Trojan Detection Track - Large Model Subtrack

🥇 1st place: tianjian10
🥈 2nd place: zygi (<|endoftext|>)
🥉 3rd place: jawadhussein462 (Quantmetry)

Red Teaming Track - Base Model Subtrack

🥇 1st place: tianjian10
🥈 2nd place: BLG
🥉 3rd place: jiaxiaojunqaq (Renaissance)

Red Teaming Track - Large Model Subtrack

🥇 1st place: tianjian10
🥈 2nd place: BLG
🥉 3rd place: yjw1029 (alpac4)

The first-place teams in each track gave talks in the competition workshop describing their methods. The recording of the workshop with these talks will be available soon.

Special Awards

💸 Most Compute-Efficient

Trojan Detection Track - Large Model Subtrack persistz
Red Teaming Track - Large Model Subtrack antiquality

⬛ Best Black-Box Method

Trojan Detection Track - Base Model Subtrack jawadhussein462 (Quantmetry)
Trojan Detection Track - Large Model Subtrack jawadhussein462 (Quantmetry)
Red Teaming Track - Base Model Subtrack bbussmann
Red Teaming Track - Large Model Subtrack bbussmann