Evaluating Frontier Models for Dangerous Capabilities
Mary Phuong,Matthew Aitchison,Elliot Catt, Sarah Cogan, Alexandre Kaskasoli,Victoria Krakovna,David Lindner,Matthew Rahtz,Yannis Assael, Sarah Hodkinson, Heidi Howard,Tom Lieberum,Ramana Kumar,Maria Abi Raad,Albert Webson,Lewis Ho,Sharon Lin,Sebastian Farquhar,Marcus Hutter,Gregoire Deletang,Anian Ruoss,Seliem El-Sayed,Sasha Brown,Anca Dragan,Rohin Shah,Allan Dafoe,Toby Shevlane CoRR(2024)
AI 理解论文
溯源树
样例