sampling
Implement a pipeline for Best-of-N (BoN) sampling: using a verifier LLM, score each of the N generated answers individually on a scale from 0 to 1, then pick the answer with the highest verification score.
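A minimal sketch of the BoN selection step, under assumptions: `best_of_n`, `generate`, and `verify` are hypothetical names, not part of any specific library. `generate` stands in for one call to the answer-generating LLM and `verify` for one call to the verifier LLM that returns a score. Clamping to [0, 1] and breaking ties in favour of the earliest sample are choices made for this sketch, not requirements from the task.

```python
from typing import Callable, List, Tuple


def best_of_n(
    prompt: str,
    generate: Callable[[str], str],       # hypothetical: one sampled answer per call
    verify: Callable[[str, str], float],  # hypothetical: verifier score for (prompt, answer)
    n: int = 4,
) -> Tuple[str, List[Tuple[str, float]]]:
    """Sample n answers, score each one independently, return the top-scoring one."""
    if n < 1:
        raise ValueError("n must be at least 1")

    scored: List[Tuple[str, float]] = []
    for _ in range(n):
        answer = generate(prompt)
        # Each answer is scored on its own; the verifier never sees the other candidates.
        raw_score = float(verify(prompt, answer))
        # Assumption: clamp to [0, 1] in case the verifier output drifts off the scale.
        score = min(1.0, max(0.0, raw_score))
        scored.append((answer, score))

    # max() returns the first maximal element, so ties go to the earliest sample.
    best_answer, _ = max(scored, key=lambda pair: pair[1])
    return best_answer, scored
```

Returning the full list of `(answer, score)` pairs alongside the winner makes it easier to inspect how the verifier ranked the candidates. In a real pipeline the two callables would wrap the actual model calls, and the verifier's text output would need to be parsed into a float before it reaches this function.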