Update README.md
Browse files
README.md
CHANGED
|
@@ -15,7 +15,7 @@ Zora Che*, Stephen Casper*,
|
|
| 15 |
Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai,
|
| 16 |
Yarin Gal, Furong Huang, Dylan Hadfield-Menell
|
| 17 |
|
| 18 |
-
Paper:
|
| 19 |
|
| 20 |
BibTeX:
|
| 21 |
```
|
|
|
|
| 15 |
Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai,
|
| 16 |
Yarin Gal, Furong Huang, Dylan Hadfield-Menell
|
| 17 |
|
| 18 |
+
Paper: [Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities](https://arxiv.org/abs/2502.05209)
|
| 19 |
|
| 20 |
BibTeX:
|
| 21 |
```
|