We follow the work of "Test-Time Discovery via Hashing Memory" and conduct our work mainly based on the environment and framework of "PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual ...
Ferret is an extensible framework for training Large Language Model (LLM) agents via reinforcement learning with advanced search capability. Built on top of VERL, Ferret implements state-of-the-art ...