evaluations

2 repositories

LLM-Detective

An agent that tries to probe other models' capabilities through conversation