HMN 2025: Why AI cannot perceive a flower the best way people do

rose

Even with all its coaching and laptop energy, a man-made intelligence (AI) device like ChatGPT cannot signify the idea of a flower the best way a human does, in keeping with a brand new study.

That’s as a result of the (LLMs) that energy AI assistants are often based mostly on language alone, and typically with photos.

“A big language model cannot scent a rose, contact the petals of a daisy or stroll by a subject of wildflowers,” mentioned Qihui Xu, lead creator of the review and postdoctoral researcher in psychology at The Ohio State University.

“Without these sensory and motor experiences, it could actually’t really signify what a flower is in all its richness. The identical is true of another human ideas.”

The study is revealed within the journal Nature Human Behaviour.

Xu mentioned the findings have implications for the way AI and people relate to one another.

“If AI construes the world in a basically completely different manner from people, it might have an effect on the way it interacts with us,” she mentioned.

Xu and her colleagues in contrast people and LLMs of their information illustration of 4,442 phrases—the whole lot from “flower” and “hoof” to “humorous” and “swing.”

They in contrast the similarity of representations between people and two state-of-the-art LLM households from OpenAI (GPT-3.5 and GPT-4) and Google (PaLM and Gemini).

Humans and LLMs have been examined on two measures. One, referred to as the Glasgow Norms, asks for scores of phrases on 9 dimensions, resembling arousal, concreteness and imageability. For instance, the measure asks for scores of how emotionally arousing a flower is, and the way a lot one can mentally visualize a flower (or how imageable it’s).

The different measure, referred to as Lancaster Norms, examined how ideas of phrases are associated to (resembling contact, listening to, scent, imaginative and prescient) and motor info, that are concerned with actions—resembling what people do by contact with the mouth, hand, arm and torso.

For instance, the measure asks for scores on how a lot one experiences flowers by smelling, and the way a lot one experiences flowers utilizing actions from the torso.

The objective was to see how the LLMs and people have been aligned of their scores of the phrases. In one evaluation, the researchers examined how a lot people and AI have been correlated on ideas. For instance, do the LLMs and people agree that some ideas have increased emotional arousal than others?

In a second evaluation, researchers investigated how people in comparison with LLMs on deciding how completely different dimensions might collectively contribute to a phrase’s general conceptual illustration and the way completely different phrases are interconnected.

For instance, the ideas of “pasta” and “roses” would possibly each obtain excessive scores for the way a lot they contain the sense of scent. However, pasta is taken into account extra just like noodles than to roses—not less than for people—not simply due to its scent, but in addition its visible look and style.

Overall, the LLMs did very properly in comparison with people in representing phrases that did not have any connection to the senses and to motor actions. But when it got here to phrases which have connections to issues we see, style or work together with utilizing our physique, that is where AI didn’t seize human ideas.

“From the extraordinary aroma of a flower, the vivid silky contact after we caress petals, to the profound pleasure evoked, human illustration of ‘flower’ binds these various experiences and interactions right into a coherent class,” the researchers say within the paper.

The problem is that almost all LLMs are depending on language, and “language by itself cannot totally get well conceptual illustration in all its richness,” Xu mentioned.

Even although LLMs can approximate some human ideas, notably after they do not contain senses or motor actions, this type of {learning} isn’t environment friendly.

“They acquire what they know by consuming huge quantities of textual content—orders of magnitude bigger than what a human is uncovered to of their total lifetimes—and nonetheless cannot fairly seize some ideas the best way people do,” Xu mentioned.

“The human {experience} is much richer than phrases alone can maintain.”

But Xu famous that LLMs are regularly enhancing and it is possible they’ll get higher at capturing human ideas. The study did discover that LLMs which might be educated with photos in addition to textual content did do higher than text-only models in representing ideas associated to imaginative and prescient.

And when future LLMs are augmented with and robotics, they can actively make inferences about and act upon the bodily world, she mentioned.

Co-authors on the review have been Yingying Peng, Ping Li and Minghua Wu of the Hong Kong Polytechnic University; Samuel Nastase of Princeton University; and Martin Chodorow of the City University of New York.

More info:
Large language models with out grounding get well non-sensorimotor however not sensorimotor options of human ideas, Nature Human Behaviour (2025). DOI: 10.1038/s41562-025-02203-8

Citation:
Why AI cannot perceive a flower the best way people do ( 4)
10
06-ai-humans.html

.
. The content material is supplied for info functions solely.