    Repositories list

    • adv-ICL

      Public
      [NeurIPS 2025] Official repository for "Short-length Adversarial Training Helps LLMs Defend Long-length Jailbreak Attacks: Theoretical and Empirical Evidence"
      Python
      1000 Updated Feb 19, 2026
    • [AAAI'26 Main🎉] Official code of "When Truth Is Overridden: Uncovering the Internal Origins of Sycophancy in Large Language Models"
      Python
      0500 Updated Nov 11, 2025
    • flashdp

      Public
      Python
      3500 Updated Jul 1, 2025
    • Fraud-R1

      Public
      [ACL 2025 Findings] Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements
      Python
      32710 Updated Jun 29, 2025
    • [ACL 2025 Findings] Understanding the Repeat Curse in Large Language Models from a Feature Perspective
      Python
      21600 Updated Jun 13, 2025
    • ECBM

      Public
      Python
      0210 Updated May 27, 2025
    • zo2

      Public
      ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory
      Python
      18300 Updated Apr 13, 2025
    • draft

      Public
      Privately Fine-Tuning Extremely Large Language Models with Zeroth-Order Offloading
      Python
      18000 Updated Mar 10, 2025
    • An implementation of a vanilla RLAIF pipeline, utilizing GPT-2-Large for the summarization task with the TL;DR dataset.
      Python
      1100 Updated Feb 5, 2025
    • Official code of "Exploring the Personality Traits of LLMs through Latent Features Steering"
      Python
      31600 Updated Jan 30, 2025
    • Train SAEs for your LLM and visualize them in one place
      Python
      0700 Updated Nov 4, 2024
    • MLLM_KE

      Public
      MLLM_KE
      0000 Updated Sep 9, 2024
    • SEATv2

      Public
      Python
      1100 Updated Sep 1, 2024
    • 58310 Updated Aug 22, 2024
    • CoreScheduler: A High-Performance Scheduler for Large Model Training
      C++
      5200 Updated Aug 21, 2024
    • Find the most efficient way for a specific large language model to learn a specific task
      1300 Updated Aug 19, 2024
    • FViT

      Public
      Jupyter Notebook
      1310 Updated Aug 18, 2024
    • SEAT

      Public
      Python
      1300 Updated Aug 18, 2024
    • FVLC

      Public
      Jupyter Notebook
      2500 Updated Aug 18, 2024
    • Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
      Python
      10200 Updated Aug 8, 2024
    • Jupyter Notebook
      1500 Updated May 10, 2024
    • adv-ntk

      Public
      [ICLR 2024] Official repository for "Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach"
      Python
      1200 Updated Feb 4, 2024