Packages
Packages developed in BayJarvis - autonomous agent, deep learning, reinforcement learning, preference learning, retrieval-augmented generation and more.
Sort by most recent release · downloads this week · stars
nanoPPO by jamesliu
An efficient implementation of the Proximal Policy Optimization (PPO) algorithm with linear and attention policy for reinforcement learning.
⭐ 6
Latest: 0.15.post2 on 28th November 2023
nanoDPO by jamesliu
A nimble and innovative implementation of the Direct Preference Optimization (DPO) algorithm with Causal Transformer and LSTM model, inspired by the paper of DPO in fine-tuning unsupervised Language Models
⭐ 5
Latest: 0.1.post1 on 25th November 2023
nChain by jamesliu
a flexible and efficient implementation to create LLM bots over extensible dataset.
⭐ 2
Latest: 0.13.post4 on 9th November 2023