prompting.validators.reward.open_assistant#

Module Contents#

Classes#

OpenAssistantRewardModel

class prompting.validators.reward.open_assistant.OpenAssistantRewardModel(device)#

Bases: prompting.validators.reward.reward.BaseRewardModel

Parameters:

device (str) –

property name: str#
Return type:

str

reward_model_name: str = 'OpenAssistant/reward-model-deberta-v3-large-v2'#
reward_single(prompt, completion, name)#
Parameters:
  • prompt (str) –

  • completion (str) –

  • name (str) –

Return type:

prompting.validators.reward.reward.BaseRewardEvent

get_rewards(prompt, completions, name)#
Parameters:
  • prompt (str) –

  • completions (List[str]) –

  • name (str) –

Return type:

List[prompting.validators.reward.reward.BaseRewardEvent]