Marius Hobbhahn
MariusHobbhahn
AI & ML interests
AI safety, evals
Recent Activity
updated
a dataset
11 days ago
MariusHobbhahn/swe-bench-verified-mini
updated
a dataset
11 days ago
MariusHobbhahn/swe-bench-verified-mini
authored
a paper
about 1 year ago
Technical Report: Large Language Models can Strategically Deceive their
Users when Put Under Pressure
Organizations
None yet
MariusHobbhahn's activity
No public activity