I've always wondered how stop tokens fit in here. Does the LLM generate a probability for "stop" alongside every other token in the vocabulary? Or is stopping handled heuristically by the outer loop that generates the output tokens sequentially?

The API docs talk about letting you specify your own stop token (like "<!-->") but I don't think "token" is meant in the same sense here.

Yes, the model has a special EOS (end-of-sequence) token which it emits to end the output. It is part of the probability distribution that the model predicts, so it competes with every ordinary token at each step.
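A minimal sketch of that sampling loop, using a toy stand-in for the model's forward pass (the vocabulary, logits, and `EOS_ID` here are all made up for illustration):

```python
import math
import random

# Hypothetical toy vocabulary: ids 0..3 are ordinary tokens, id 4 is EOS.
EOS_ID = 4

def fake_logits(context):
    # Stand-in for a real model's forward pass. EOS starts out unlikely;
    # after 5 generated tokens we push its logit up so generation halts.
    logits = [1.0, 0.5, 0.2, 0.1, -5.0]
    if len(context) >= 5:
        logits[EOS_ID] = 10.0  # EOS becomes overwhelmingly likely
    return logits

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def generate(max_new_tokens=20, seed=0):
    rng = random.Random(seed)
    out = []
    for _ in range(max_new_tokens):
        probs = softmax(fake_logits(out))
        # EOS is sampled from the same distribution as every other token.
        token = rng.choices(range(len(probs)), weights=probs)[0]
        if token == EOS_ID:
            break  # stopping is just sampling the EOS token
        out.append(token)
    return out

print(generate())
```

The outer loop also enforces a hard cap (`max_new_tokens`) as a fallback, which is the heuristic part; but in the normal case the model itself decides to stop by assigning probability mass to EOS. API-level "stop sequences" are a separate mechanism layered on top: the server cuts off generation when the decoded text contains a given string, regardless of token boundaries.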
