Why AI can't spell 'strawberry'

Amanda Silberling

Updated 27 August 2024 at 2:34 pm·5-min read

How many times does the letter "r" appear in the word "strawberry"? According to formidable AI products like GPT-4o and Claude, the answer is twice.

Large language models (LLMs) can write essays and solve equations in seconds. They can synthesize terabytes of data faster than humans can open up a book. Yet, these seemingly omniscient AIs sometimes fail so spectacularly that the mishap turns into a viral meme, and we all rejoice in relief that maybe there's still time before we must bow down to our new AI overlords.

https://twitter.com/RobDenBleyker/status/1828157720736002527

The failure of large language models to understand the concepts of letters and syllables is indicative of a larger truth that we often forget: These things don't have brains. They do not think like we do. They are not human, nor even particularly humanlike.

Most LLMs are built on transformers, a kind of deep learning architecture. Transformer models break text into tokens, which can be full words, syllables, or letters, depending on the model.

“LLMs are based on this transformer architecture, which notably is not actually reading text. What happens when you input a prompt is that it’s translated into an encoding,” Matthew Guzdial, an AI researcher and assistant professor at the University of Alberta, told TechCrunch. “When it sees the word 'the,' it has this one encoding of what 'the' means, but it does not know about ‘T,’ ‘H,’ ‘E.’”

This is because the transformers are not able to take in or output actual text efficiently. Instead, the text is converted into numerical representations of itself, which is then contextualized to help the AI come up with a logical response. In other words, the AI might know that the tokens "straw" and "berry" make up "strawberry," but it may not understand that "strawberry" is composed of the letters "s," "t," "r," "a," "w," "b," "e," "r," "r," and "y," in that specific order. Thus, it cannot tell you how many letters -- let alone how many "r"s -- appear in the word "strawberry."

This isn't an easy issue to fix, since it's embedded into the very architecture that makes these LLMs work.

https://twitter.com/petergyang/status/1765611617960747246

TechCrunch's Kyle Wiggers dug into this problem last month and spoke to Sheridan Feucht, a PhD student at Northeastern University studying LLM interpretability.

“It’s kind of hard to get around the question of what exactly a ‘word’ should be for a language model, and even if we got human experts to agree on a perfect token vocabulary, models would probably still find it useful to ‘chunk’ things even further,” Feucht told TechCrunch. “My guess would be that there’s no such thing as a perfect tokenizer due to this kind of fuzziness.”

This problem becomes even more complex as an LLM learns more languages. For example, some tokenization methods might assume that a space in a sentence will always precede a new word, but many languages like Chinese, Japanese, Thai, Lao, Korean, Khmer and others do not use spaces to separate words. Google DeepMind AI researcher Yennie Jun found in a 2023 study that some languages need up to 10 times as many tokens as English to communicate the same meaning.

“It’s probably best to let models look at characters directly without imposing tokenization, but right now that’s just computationally infeasible for transformers,” Feucht said.

Image generators like Midjourney and DALL-E don't use the transformer architecture that lies beneath the hood of text generators like ChatGPT. Instead, image generators usually use diffusion models, which reconstruct an image from noise. Diffusion models are trained on large databases of images, and they're incentivized to try to re-create something like what they learned from training data.

Asmelash Teka Hadgu, co-founder of Lesan and a fellow at the DAIR Institute, told TechCrunch, "Image generators tend to perform much better on artifacts like cars and people’s faces, and less so on smaller things like fingers and handwriting."

This could be because these smaller details don't often appear as prominently in training sets as concepts like how trees usually have green leaves. The problems with diffusion models might be easier to fix than the ones plaguing transformers, though. Some image generators have improved at representing hands, for example, by training on more images of real, human hands.

“Even just last year, all these models were really bad at fingers, and that’s exactly the same problem as text,” Guzdial explained. “They’re getting really good at it locally, so if you look at a hand with six or seven fingers on it, you could say, ‘Oh wow, that looks like a finger.’ Similarly, with the generated text, you could say, that looks like an ‘H,’ and that looks like a ‘P,’ but they’re really bad at structuring these whole things together.”

That's why, if you ask an AI image generator to create a menu for a Mexican restaurant, you might get normal items like "Tacos," but you'll be more likely to find offerings like "Tamilos," "Enchidaa" and "Burhiltos."

As these memes about spelling "strawberry" spill across the internet, OpenAI is working on a new AI product code-named Strawberry, which is supposed to be even more adept at reasoning. The growth of LLMs has been limited by the fact that there simply isn't enough training data in the world to make products like ChatGPT more accurate. But Strawberry can reportedly generate accurate synthetic data to make OpenAI's LLMs even better. According to The Information, Strawberry can solve the New York Times' Connections word puzzles, which require creative thinking and pattern recognition to solve and can solve math equations that it hasn't seen before.

Meanwhile, Google DeepMind recently unveiled AlphaProof and AlphaGeometry 2, AI systems designed for formal math reasoning. Google says these two systems solved four out of six problems from the International Math Olympiad, which would be a good enough performance to earn as silver medal at the prestigious competition.

It's a bit of a troll that memes about AI being unable to spell "strawberry" are circulating at the same time as reports on OpenAI's Strawberry. But OpenAI CEO Sam Altman jumped at the opportunity to show us that he's got a pretty impressive berry yield in his garden.

The Independent
Father-daughter duo finally crack ‘alien’ signal sent from Mars
What coded message conveys is still up for debate and discussion
The Independent
Nasa spots shocking ‘green spots’ on Mars
Perseverance rover found surprising marks after examining a part of the Martian surface
Futurism
Once You Notice This Weird Thing About James Webb Space Telescope Images, You Won't Be Able to Unsee It
Pointy Perfection Ever notice something about those images captured by the James Webb Space Telescope — other than the fact that they look absolutely incredible? Because if you've felt they were different somehow to other deep space snapshots, you're not wrong. Like a good movie director, the James Webb really knows how to have its […]
The Independent
Scientists thought a warming Earth led to the age of the dinosaurs. That might be wrong
The extinction event wiped out three-quarters of all life on Earth more than 200 million years ago
The Guardian
Lost Maya city with temple pyramids and plazas discovered in Mexico
Archaeologists draw on data from forest monitoring project to discover city potentially founded before AD150
The Independent
Newly discovered Himalayan snake species with ‘dozens of teeth’ named after Leonardo DiCaprio
The Anguiculus dicaprioi is a copper-coloured snake with a short head, large nostrils and ‘dozens of teeth’
PA Media: Science
New 3D fossil discovered preserved in fool’s gold
The new species has been named after arthropod expert Greg Edgecombe of London’s Natural History Museum.
The Independent
X-ray scans unravel mystery of 3,000-year-old Egyptian ‘locked mummy’
Scans reveal ancient Egyptian aristocrat was as in her 30s or early 40s at time of death
Associated Press
'Halloween comet' breaks apart after flying close to the sun
A recently discovered comet that some stargazers had hoped to see during Halloween week has disintegrated before the day of ghosts and ghouls. NASA confirmed Tuesday its sun-observing spacecraft captured the moment when the comet Atlas broke into chunks this week as it passed close to the sun. Astronomers have been tracking the so-called Halloween comet, also known as C/2024 S1, since it was discovered in September by a telescope in Hawaii.
Futurism
Watch Astronauts Give a Rare Tour of China's Luxurious Space Station
Chinese astronauts on board the country's Tiangong space station have given us a rare glimpse into what life is like roughly 260 miles above the surface. As seen in an almost seven-minutes-long video shared by Chinese state-owned news agency CCTV, members of the current Shenzhou-18 crew give an extensive tour of their temporary abode. […]
LoveEXPLORING
Ranked: The most terrifying extinct animals that once roamed our planet
From fearsome land mammals to deadly sea creatures, we rank the most terrifying animals that once roamed Earth.
CNN
Students discover and publish unexpected proof for 2,000-year-old mathematical theory
Ne’Kiya Jackson and Calcea Johnson have published a paper on a new way to prove the 2000-year-old Pythagorean theorem. Their work began in a high school math contest.
Sky News
NASA identifies potential landing sites for historic manned mission to the moon
NASA has identified nine potential landing spots for next year's Artemis mission, when people will land on the moon for the first time in more than 50 years. "Finding the right locations for this historic moment begins with identifying safe places for this first landing and then trying to match that with opportunities for science from this new place on the Moon," Jacob Bleacher, NASA's chief exploration scientist, said. The moon's south pole has never been explored by a crewed mission and has permanently shadowed areas that could preserve resources such as water.
Evening Standard
Mayan city discovered by accident centuries after it disappeared under jungle in Mexico
The settlement appears to contain more than 6,700 structures including a ballcourt, a dam, and houses
The Independent
Scientists decode when and how kissing evolved in humans
Behaviour likely emerged from early humans sucking lips to remove parasites while grooming, researchers say
Evening Standard
Apple Intelligence and iOS 18.1 finally arrives ...Tech & Science Daily podcast
Plus, a fully automated transcript of today’s episode
HowStuffWorks
Are Snakes With Legs a Real Thing?
Have you ever wondered if snakes used to have legs? Believe it or not, snakes didn’t always slither on the ground like they do today. In fact, once upon a time, snakes with legs really did roam Earth, and scientists have found some pretty incredible clues to prove it.
AFP
Three-person crew blasts off for China's Tiangong space station
Three Chinese astronauts including the country's only woman spaceflight engineer blasted off on a "dream" mission to the Tiangong space station in the early hours of Wednesday.She is the third Chinese woman to take part in a crewed mission.
The Hill
Laser archeology finds lost Maya cities hidden under forests
Laser imaging of the rainforests of Mexico’s Yucatan peninsula have turned up thousands of ancient Maya structures — and an entire previously unknown city, a new study has found. By flying aircraft over jungle in the Mexican state of Campeche and pummeling the trees with laser pulses, scientists have shown that beneath the forest lie the…
The Independent
Lost Silk Road cities rediscovered by scientists in mountains of Uzbekistan
The cities were found in a mountainous region – it’s unusual for settlements of this time to be at such high altitudes

Bag No7 gift set worth £136 for just £39 with this discount code

Why AI can't spell 'strawberry'

Latest stories

Father-daughter duo finally crack ‘alien’ signal sent from Mars

Nasa spots shocking ‘green spots’ on Mars

Once You Notice This Weird Thing About James Webb Space Telescope Images, You Won't Be Able to Unsee It

Scientists thought a warming Earth led to the age of the dinosaurs. That might be wrong

Lost Maya city with temple pyramids and plazas discovered in Mexico

Newly discovered Himalayan snake species with ‘dozens of teeth’ named after Leonardo DiCaprio

New 3D fossil discovered preserved in fool’s gold

X-ray scans unravel mystery of 3,000-year-old Egyptian ‘locked mummy’

'Halloween comet' breaks apart after flying close to the sun

Watch Astronauts Give a Rare Tour of China's Luxurious Space Station

Ranked: The most terrifying extinct animals that once roamed our planet

Students discover and publish unexpected proof for 2,000-year-old mathematical theory

NASA identifies potential landing sites for historic manned mission to the moon

Mayan city discovered by accident centuries after it disappeared under jungle in Mexico

Scientists decode when and how kissing evolved in humans

Apple Intelligence and iOS 18.1 finally arrives ...Tech & Science Daily podcast

Are Snakes With Legs a Real Thing?

Three-person crew blasts off for China's Tiangong space station

Laser archeology finds lost Maya cities hidden under forests

Lost Silk Road cities rediscovered by scientists in mountains of Uzbekistan