Surge RLHF

Preference and reward data for reinforcement learning from human feedback.

API entry from apis.yml

apis.yml Raw ↑
aid: surge-ai:surge-rlhf
name: Surge RLHF
description: Preference and reward data for reinforcement learning from human feedback.
humanURL: https://www.surgehq.ai/products
tags:
- RLHF
- Preference Data
properties:
- type: Documentation
  url: https://www.surgehq.ai/products