John McWhorter has a book about this called The Language Hoax: Why the World Looks the Same in Any Language, in which he's extremely skeptical of Sapir-Whorf, particularly the sort of stoner linguistics ("what if we're all, like, made of language, maaaaaan") that turns up in what seems to be a biannual Popular Science article.
In fact, time perceived as volumetric rather than as a length is the specific example he uses, and he includes the actual research. There are extremely specific cases like this where the effect does hold up, but it is so minute that it's difficult even to measure properly. In most cases it doesn't hold up at all. These tiny, barely measurable cases are nonetheless often used as evidence for frankly nonsensical claims, so it's important not to extrapolate to "Japanese uses the same word for blue and green, so Japanese people must be colourblind", which is where people often take this.
The strict interpretation of Sapir-Whorf, where language determines thought, is obviously nonsense. But the weak interpretation of Sapir-Whorf, where language merely influences thought to at least some degree, is obviously true (which is why Sapir-Whorf is pretty useless as a statement either way). The fact that there are minute differences between language speakers is because language is highly malleable and very difficult to police, so speakers tend to alter their language specifically to make it easier to think about things that are important to them (in the same way that we as programmers restructure code and rename variables to make the program easier to understand). In addition, global human societies are connected enough that useful linguistic concepts are rapidly disseminated into every language.
English now just catches all the words it has no definitions for and brings them into the language. A lot of Buddhist terminology made it in this way in the 19th century: English had no direct translations, so the words were simply given English definitions, effectively adding them to the language.
Even if 100% of humans spoke English as their primary language, we would still have words like Dharma, samsara, nirvana etc.
English will eventually take from every culture every significant word that lacks a direct translation, and those words will be understood by their English definitions by the majority of the global population.
We will still call that amalgamation of global languages English.
Don't overlook that language is fundamental to the process of discovering the "truth" here, as is culture. For example (of culture): if someone were to suggest we clean up our language while discussing the matter, the notion would be rejected absolutely.
> extremely skeptical of Sapir-Whorf, particularly the sort of stoner linguistics "what if we're all, like, made of language maaaaaan"
I don't get how this kind of skepticism can persist amid the current "arguably alive" LLM hype. LLMs are a machine-executable form of the strong Sapir-Whorf hypothesis: things that "think" and speak solely through the English language and the English language alone (in other languages they sound a lot like machine translation).
Is that the right way to think of LLMs (that they "think in English")?
I think a better way to think of them is as an n-dimensional "meaning space" in which words/phrases are n-vectors, where n is a very large number and each dimension carries semantic meaning. It may be the case that this meaning space is pretty much the same across all natural languages, which would be evidence that Sapir-Whorf is false and that differences between natural languages are largely cosmetic.
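The "meaning space" idea can be sketched with a toy example. These vectors are invented for illustration (real models learn hundreds or thousands of dimensions from data), but the mechanics are the same: words are points in a vector space, and cosine similarity measures how close their meanings are, regardless of which surface language the word came from.

```python
from math import sqrt

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Toy 4-dimensional "meaning space"; the dimensions and values here
# are made up, purely to illustrate the geometry.
embeddings = {
    "dog":    [0.90, 0.10, 0.00, 0.30],  # English
    "perro":  [0.88, 0.12, 0.02, 0.30],  # Spanish word for "dog"
    "banana": [0.10, 0.90, 0.40, 0.00],
}

# If meaning space is shared across languages, translations land close
# together, while unrelated words land far apart.
print(cosine(embeddings["dog"], embeddings["perro"]))   # close to 1.0
print(cosine(embeddings["dog"], embeddings["banana"]))  # much lower
```

In a real multilingual embedding model, the interesting empirical question is exactly the one raised above: whether "dog" and "perro" actually land near each other, or whether each language carves the space differently.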
That'll be the official explanation, but I've yet to see a working LLM that doesn't speak in translated American.
As one possible counterexample, I've seen one of the 7B models insist on using a Chinese verb in a Japanese sentence. While that's fascinating in itself, it's not exactly in line with the "differences between languages are cosmetic and we just don't realize it" narrative.
LLMs are basically always worse in languages other than English. I'm not sure whether that's just dataset volume, dataset quality, or the GPT architecture being inherently English-centric, but LLMs don't have a universal subconscious wrapped in a superficial English frontend over UG, of the kind that would support !(sapir-whorf). So far, LLMs are a kind of English-based thinking machine (if we recognize their apparent behavior as "thinking").
Below are just cherry-picked search results, selected largely by whether the last few lines of the abstracts support my narrative, but it's a problem obvious enough that the rest of the world just knows.
0: "Not All Languages Are Created Equal in LLMs: Improving Multilingual Capability by Cross-Lingual-Thought Prompting": https://arxiv.org/abs/2305.07004
1: "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries": https://arxiv.org/abs/2310.13132
2: "Do Moral Judgment and Reasoning Capability of LLMs Change with Language? A Study using the Multilingual Defining Issues Test": https://arxiv.org/abs/2402.02135
3: "Should We Respect LLMs? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance": https://arxiv.org/abs/2402.14531
4: "Exploring Multilingual Human Value Concepts in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?": https://arxiv.org/abs/2402.18120
Thanks for this. It looks like a great read. As an immigrant I was always mildly annoyed by this idea.
Like, yes, my language has a ton of words for all possible familial relationships (for example, different words for a maternal uncle and a paternal uncle), but that's because familial relationships are important in my culture, and that's reflected in the language.
Language is the reflection of culture, not the other way around.