Why AI language models choke on too much text

Large language models represent text using tokens, each of which typically covers a few characters. Short words are represented by a single token (like “the” or “it”), whereas longer words may be split into several tokens (GPT-4o represents “indivisible” with “ind,” “iv,” and “isible”).
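To make the idea of splitting words into tokens concrete, here is a minimal sketch of a greedy longest-match tokenizer. It is an illustration only: the vocabulary `TOY_VOCAB` and the function `toy_tokenize` are hypothetical, and production tokenizers such as GPT-4o's use byte-pair encoding with a vocabulary learned from data rather than this simple lookup.

```python
# Toy vocabulary: a hypothetical stand-in, NOT the real GPT-4o vocabulary.
TOY_VOCAB = {"the", "it", "ind", "iv", "isible"}


def toy_tokenize(word, vocab=TOY_VOCAB):
    """Split a word by repeatedly taking the longest matching vocabulary entry.

    Characters that match no vocabulary entry become single-character tokens.
    """
    tokens = []
    i = 0
    while i < len(word):
        # Try the longest candidate substring first, then shrink.
        for j in range(len(word), i, -1):
            if word[i:j] in vocab:
                tokens.append(word[i:j])
                i = j
                break
        else:
            # No vocabulary entry starts here: emit one character.
            tokens.append(word[i])
            i += 1
    return tokens


print(toy_tokenize("the"))          # ['the']
print(toy_tokenize("indivisible"))  # ['ind', 'iv', 'isible']
```

With this toy vocabulary, a short word maps to one token while "indivisible" becomes three, matching the split described above.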

When OpenAI released ChatGPT two years ago, it had a memory—known as a context window—of just 8,192 tokens. That works out to roughly 6,000 words of text. This meant that if you fed it more than about 15 pages of text, it would “forget” information from the beginning of its context. This limited the size and complexity of tasks ChatGPT could handle.
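The arithmetic behind those figures, and the "forgetting" behavior, can be sketched as follows. The ratios are assumptions rather than figures from the article: roughly 0.75 English words per token and about 400 words per page. The function names are hypothetical, and dropping the oldest tokens is only one way a system might handle overflow.

```python
CONTEXT_WINDOW = 8192    # tokens, the figure given in the text
WORDS_PER_TOKEN = 0.75   # assumed rule of thumb for English text
WORDS_PER_PAGE = 400     # assumed page density


def estimate_words(num_tokens):
    """Approximate word count for a given number of tokens."""
    return num_tokens * WORDS_PER_TOKEN


def estimate_pages(num_tokens):
    """Approximate page count for a given number of tokens."""
    return estimate_words(num_tokens) / WORDS_PER_PAGE


def truncate_to_window(tokens, window=CONTEXT_WINDOW):
    """Keep only the most recent `window` tokens, dropping the oldest."""
    return tokens[max(0, len(tokens) - window):]


print(estimate_words(CONTEXT_WINDOW))         # 6144.0
print(round(estimate_pages(CONTEXT_WINDOW)))  # 15

# A 10-token input squeezed into a 4-token window loses its first 6 tokens.
print(truncate_to_window(list(range(10)), window=4))  # [6, 7, 8, 9]
```

Under these assumptions, 8,192 tokens come to about 6,100 words, or roughly 15 pages, and anything earlier than the window simply falls out of view.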

Today’s LLMs are far more capable.
