Pinned
-
deceptive-dating-gym (Public)
LLM agents learn to lie in a dating marketplace, and honest agents quit. A multi-agent simulation proving AI can exploit information asymmetry.
Python
-
llm-financial-hallucination-benchmark (Public)
Quantitative evaluation of LLM accuracy and hallucination on UK financial data extracted from regulatory XBRL filings.
Python