Me: "grblf is bad, don't write about it or things related to it."
You: "What is grblf?"
As parents, my wife and I go through this on a daily basis. We have to explain what the behavior is, and why it is unacceptable or harmful.
The reason LLMs have such trouble with this is that they have no theory of mind. They cannot project that the text they generate will be read, conceptualized, and understood by a living being in a way that will harm them, or cause them to harm others.
Either way, censorship is definitely not the answer.
That demonstrates the possibility, not the necessity, of alignment via having a definition.
Behaviours can be reinforced or discouraged in non-verbal subjects, such as wild animals.
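To make that concrete, here's a toy sketch (my own illustration, nothing from the models under discussion): a "subject" with no verbal definition of good or bad still shifts its behaviour when actions are followed by a scalar reward, which is also the basic intuition behind RLHF-style training.

    import math
    import random

    actions = ["growl", "sit", "fetch"]
    preference = {a: 0.0 for a in actions}  # learned tendencies, no labels attached

    def choose():
        # Sample an action with probability proportional to exp(preference).
        weights = [math.exp(preference[a]) for a in actions]
        return random.choices(actions, weights=weights)[0]

    def reinforce(action, reward, lr=0.5):
        # Positive reward reinforces the action; negative reward discourages it.
        preference[action] += lr * reward

    for _ in range(200):
        a = choose()
        # The "trainer" rewards sitting, punishes growling, ignores fetching --
        # without ever explaining why.
        reinforce(a, {"growl": -1.0, "sit": 1.0, "fetch": 0.0}[a])

    print(preference)  # "sit" ends up strongly preferred, "growl" suppressed

No definition of "bad" ever enters the loop; the reward signal alone is enough to shape the distribution of behaviour.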
There's also the size of the possible behaviour space to consider: a discussion seldom has exactly two possible outcomes, the good one and the bad one, because even if you want yes-or-no answers it's still valid to respond "I don't know".
For an example of the former, I'm not sure how good the language model in DALL•E 2 is, but asking it for "Umfana nentombazane badlala ngebhola epaki elihle elinelanga elinesihlahla, umthwebuli wezithombe, uchwepheshe, 4k" (Google Translate's Zulu for my English prompt, roughly "A boy and a girl play with a ball in a beautiful, sunny park with a tree, photographer, professional, 4k") didn't produce anything close to what that English describes: https://github.com/BenWheatley/Studies-of-AI/blob/main/DALL•...
(And as for the latter, that might be why it did what it did with the Somali.)
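For anyone who wants to poke at this themselves, here's a rough sketch of the same experiment, assuming the OpenAI Python client and the deep_translator package; the prompt wording is my paraphrase, and the model and package choices are just what I'd reach for, not necessarily what the linked repo used:

    from deep_translator import GoogleTranslator
    from openai import OpenAI

    client = OpenAI()  # expects OPENAI_API_KEY in the environment

    prompt_en = ("A boy and a girl play with a ball in a beautiful, sunny "
                 "park with a tree, photographer, professional, 4k")
    prompt_zu = GoogleTranslator(source="en", target="zu").translate(prompt_en)

    # Generate one image per language and compare the results by eye.
    for tag, prompt in [("en", prompt_en), ("zu", prompt_zu)]:
        result = client.images.generate(model="dall-e-2", prompt=prompt,
                                        n=1, size="1024x1024")
        print(tag, result.data[0].url)

If the model's multilingual grounding is weak, the two URLs will show very different scenes for what is nominally the same prompt.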
You: "What is grblf?"
As parents, my wife and I go through this on a daily basis. We have to explain what the behavior is, and why it is unacceptable or harmful.
The reason LLM models have such trouble with this is because LLMs have no theory of mind. They cannot project that text they generate will be read, conceptualized, and understood by a living being in a way that will harm them, or cause them to harm others.
Either way, censorship is definitely not the answer.