pub async fn completions(
request: &CompletionRequest,
) -> Result<CompletionObject, LlamaCoreError>
Expand description
Given a prompt, the model will return one or more predicted completions along with the probabilities of alternative tokens at each position.