Skip to content
@PRIME-RL

PRIME-RL

Researching scalable (RL) methods on language models.

Pinned Loading

  1. P1 P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    69 3

  2. SimpleVLA-RL SimpleVLA-RL Public

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    Python 1.2k 70

  3. Entropy-Mechanism-of-RL Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    Python 408 15

  4. RL-Compositionality RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    Python 56 5

  5. TTRL TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    Python 953 65

  6. PRIME PRIME Public

    Scalable RL solution for advanced reasoning of language models

    Python 1.8k 101

Repositories

Showing 7 of 7 repositories
  • SimpleVLA-RL Public

    SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

    PRIME-RL/SimpleVLA-RL’s past year of commit activity
    Python 1,234 MIT 70 44 1 Updated Jan 6, 2026
  • P1 Public

    P1: Mastering Physics Olympiads with Reinforcement Learning

    PRIME-RL/P1’s past year of commit activity
    69 3 2 0 Updated Dec 29, 2025
  • RL-Compositionality Public

    FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

    PRIME-RL/RL-Compositionality’s past year of commit activity
    Python 56 Apache-2.0 5 2 0 Updated Nov 7, 2025
  • TTRL Public

    [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning

    PRIME-RL/TTRL’s past year of commit activity
    Python 953 MIT 65 13 0 Updated Sep 26, 2025
  • Entropy-Mechanism-of-RL Public

    The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.

    PRIME-RL/Entropy-Mechanism-of-RL’s past year of commit activity
    Python 408 15 2 0 Updated Jul 11, 2025
  • PRIME Public

    Scalable RL solution for advanced reasoning of language models

    PRIME-RL/PRIME’s past year of commit activity
    Python 1,794 Apache-2.0 101 8 1 Updated Mar 18, 2025
  • ImplicitPRM Public

    Repo of paper "Free Process Rewards without Process Labels"

    PRIME-RL/ImplicitPRM’s past year of commit activity
    Python 168 Apache-2.0 11 12 0 Updated Mar 14, 2025