Supervised Fine-Tuning
user
).
assistant
).
source_id
that uniquely identifies the model
that generated the response.rewrite
annotation is added to the list.
Message’s annotations
include the ratings for each dimension.
annotations
at the turn level, specifies preference related or aggregated information. Some common examples are:
justification
for why a certain response is better/v2/task
.