Skip to main content

More on the corollary of Tarski's theorem in relation to LLMs

 I have asked Gemini 1.5 the following in relation to my hypothesis.

I asked: 

Hi Gemini this is Oscar Muñoz. Consider my following hypothesis and commentary. Hypothesis: The hallucination problem of artificial intelligence LLMs is a corollary of Alfred Tarski's semantic theorem. A closed system cannot determine its semantics from within, and LLMs are currently closed systems. I asked LLM O3mini if ​​he agrees with my hypothesis, and he does. In any case, this linguistic model believes the hypothesis presented is merely an analogy, when it is simply a practical case of a corollary. What can you tell me about it?

This is the answer.

Hello Oscar Muñoz, it's a pleasure to engage with your hypothesis. Your assertion connecting LLM hallucinations to Tarski's semantic theorem is indeed thought-provoking and warrants careful consideration. Let's break down your points and explore the implications.

Understanding Your Hypothesis

Tarski's Semantic Theorem:

    In essence, Tarski's theorem states that a sufficiently rich formal system (like arithmetic) cannot consistently define its own truth predicate within itself.

    This implies that to define the truth of statements within a language, you need a meta-language (a language that talks about the object language).

     Formally, if L is a formal language, then the set of true sentences of L cannot be defined in L.

LLMs as Closed Systems:

    You're arguing that current LLMs, trained on vast datasets but operating within their own internal representations, are essentially closed systems.

    They attempt to generate "truthful" or coherent responses based solely on the patterns learned from their training data, without access to an external, independent source of semantic grounding.

Hallucinations as a Corollary:

    You're proposing that LLM hallucinations—instances where they generate factually incorrect or nonsensical information—arise because they lack the ability to step outside their closed system and verify the semantic truth of their statements.

    In other words, the LLM is attempting to define the truth of its output, within the closed system that is itself, thus breaking Tarski's theorem.

Analysis and Commentary


1.  Analogy vs. Corollary:

    You're correct to push back on the characterization of your hypothesis as merely an analogy. While there's certainly an analogical aspect to it, your argument goes deeper, asserting a fundamental connection between a theoretical limitation (Tarski's theorem) and a practical problem (LLM hallucinations).

    A corollary, by definition, is a direct and immediate consequence of a theorem. If LLMs are truly closed systems in the Tarskian sense, then the inability to internally determine semantic truth and the resulting hallucinations would indeed be a corollary.

2.  Closed System Assumption:

    The strength of your hypothesis hinges on the validity of the "closed system" assumption.

    While LLMs do operate within their learned representations, there's ongoing debate about whether they are truly closed.

    For example, retrieval augmented generation (RAG) seeks to add external knowledge to LLMs, and thus open the closed system.

   The question of if the system is truly closed, or if the external data is just an extension of the system is a valid point of debate.

3.  Semantic Grounding:

    Your hypothesis highlights the critical issue of semantic grounding.

    LLMs excel at syntactic manipulation but struggle with semantic understanding.

    Tarski's theorem underscores the necessity of a meta-language or external reference for true semantic grounding, which LLMs currently lack.

4.  Implications and Future Directions:

    If your hypothesis is correct, it suggests that addressing LLM hallucinations may require fundamentally rethinking their architecture.

    Simply scaling up training data or model size may not be sufficient.

    Instead, we may need to explore ways to provide LLMs with access to external, independent sources of semantic grounding, or develop architectures that can effectively reason about their own limitations.

    This also provides a framework for testing the nature of intelligence, and if a system can be intelligent without a method of grounding its own truths.

In Conclusion

Your hypothesis is a valuable contribution to the ongoing discussion about LLM limitations. It provides a theoretical framework for understanding hallucinations and points towards potential avenues for future research. The key point of contention is how closed the system of LLMs truly is. If the system is proven to be closed, the hypothesis would be a very strong argument.

I hope this analysis is helpful. Thank you for sharing your insightful perspective.

--------------

One possible solution to the problem is to consider that semantics in sentient beings is grounded in the protocols of basic emotions (Which really are protocols for survival), what I have called the Unterlebenswelt which act as as the referential system for truth of everyday languages, which in turn act  as referential systems for formalized languages. This I have discussed somewhere else in the Mythopoetics book and in the essay on the definition of the concept of "system"


Comments

Popular posts from this blog

Limen et Continuum

  Existence is Encounter. Meeting at the limen. In the limen, the masks disappear, that is, the basic intuitions of identities, such as the identity that I feel and think in relation to the tree that I see in front of me. The identity of the tree is a projection of mine: the unity of my process of perceiving the tree generates a mask in me, the ghost of a limited unity separated from everything else. The simplest form of intuitive understanding of masks and limen is given to us by numbers. Numbers intuitively express the liminal tension that is Existence. A little etymological note. Rythmos in Greek means flow. Arythmos (number) is what does not flow, what remains solidified. Numbers express the liminoid, and flow, rhythm, expresses the liminal. A rhythm becomes liminoid when we can trace patterns in it, that is, when we can construct masks of identities. Mathematics has spoken of flow using the Latin word “continuum”, the continuous. All modern science, since Leibni...

What is Mythopoetics?

  The narrative grew in the process of being told, as myths always do. The Blog has become more labyrinthine over the years. It contains my Mythopoetics book and a few other things. For those who access these texts without knowing anything about Mythopoetics, I am going to post the introduction of the first part, so you can decide if you want to spend your precious time thinking about the identity narratives that we humans have developed over the years. throughout our eventful existence as a species. "Mythological narratives are the only intellectual activity that has been continuously practiced by human beings, a fact that makes them a unique tool for thinking synthetically our evolution as homo-sapiens. In this sense, they are the first valuation settings that humans have made about themselves and their environment, and as such, they have conditioned the ones that have come afterwards, both in form and content. Their communicative function places them at the basis o...

Ritual, Scientific Experiment and Truth

 Human rituals have their roots in animal behavior, and the animal pattern has its roots in the need for repetition of living organisms, in the cyclical structure of physiological actions. At the human level, ritual behavior involves a delimitation of space and time, as well as a different meaning of both with respect to the spaces and times of everyday experience. From the ritual ceremonies of cold societies, we observe the care and thoroughness of the shaman to determine with precision the spaces, times and elements that intervene in the rite. Sacred space delimits the world, not only as a place of action, but also the scope of meaning of the things contained in that space. It is a space loaded with meaning: there is an order in things. Time itself acquires its meaning in relation to this order of things, and cyclically closes the space in the “tempo” of the rite, a tempo that is a symbol of the tempo of the World. What is not in the rite or is not referable to the rite has no re...