Skip to content

Support LLM-guided Self-Refinement MCTS #42

@YanSong97

Description

@YanSong97

Feature request

Support LLM-guided Self-Refinement MCTS inference method. It has the following features:

  • LLM-as-Judge to provide review
  • Proposer LLM generates rewriting of the answer, taking the review into consideration
  • Perform self-refinement
  • Utilized in various forms across research projects on LLM reasoning.

Motivation

more diverse exploration in tree search

Your contribution

Submitting a PR

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions