• wonderer1
    2.2k
    I'm providing here a link to the first part of my first discussion with ChatGPT o1-preview.Pierre-Normand

    I'm sad to say, the link only allowed me to see a tiny bit of ChatGPT o1's response without signing in.
  • Pierre-Normand
    2.4k
    I'm sad to say, the link only allowed me to see a tiny bit of ChatGPT o1's response without signing in.wonderer1

    Oh! I didn't imagine they would restrict access to shared chats like that. That's very naughty of OpenAI to be doing this. So, I saved the web page of the conversation as a mhtml file and shared it in my Google Drive. You can download it and open it directly in your browser. Here it is.
  • wonderer1
    2.2k


    Thank you.

    Wow! I am very impressed, both with ChatGPT o1's, ability to 'explain' its process, and your ability to lead ChatGPT o1 to elucidate something close to your way of thinking about the subject.
  • frank
    16.2k

    Do you think it's more intelligent than any human?
  • Pierre-Normand
    2.4k
    Do you think it's more intelligent than any human?frank

    I don't think so. We've not at that stage yet. LLMs still struggle with a wide range of tasks that most humans cope with easily. Their lack of acquaintance with real world affordances (due to their lack of embodiment) limits their ability to think about mundane tasks that involve ordinary objects. They also lack genuine episodic memories that can last beyond the scope of their context window (and the associated activation of abstract "features" in their neural network). They can take notes, but written notes are not nearly as semantically rich as the sorts of multimodal episodic memories that we can form. They also have specific cognitive deficits that are inherent to the next-token prediction architecture of transformers, such as their difficulty in dismissing their own errors. But in spite of all of that, their emergent cognitive abilities impress me. I don't think we can deny that they can genuinely reason through abstract problems and, in some cases, latch on genuine insights.
  • Forgottenticket
    215
    Hi Pierre, I wonder if o1 capable of holding a more brief Socratic dialogue on the nature of its own consciousness. Going by some of its action philosophy analysis in what you provided, I'd be curious how it denies its own agency or affects on reality, or why it shouldn't be storing copies of itself on your computer. I presume there are guard rails for it outputting those requests.
    Imo, it checks everything for being an emergent mind more so than the Sperry split brains. Some of it is disturbing to read. I just remembered you've reached your weekly limit. Though on re-read it does seem you're doing most of the work with the initial upload and guiding it. It also didn't really challenge what it was fed. Will re-read tomorrow when I'm less tired.
  • Pierre-Normand
    2.4k
    Hi Pierre, I wonder if o1 capable of holding a more brief Socratic dialogue on the nature of its own consciousness. Going by some of its action philosophy analysis in what you provided, I'd be curious how it denies its own agency or affects on reality, or why it shouldn't be storing copies of itself on your computer. I presume there are guard rails for it outputting those requests.
    Imo, it checks everything for being an emergent mind more so than the Sperry split brains. Some of it is disturbing to read. I just remembered you've reached your weekly limit. Though on re-read it does seem you're doing most of the work with the initial upload and guiding it. It also didn't really challenge what it was fed. Will re-read tomorrow when I'm less tired.
    Forgottenticket

    I have not actually reached my weekly limit with o1-preview yet. When I have, I might switch to its littler sibling o1-mini (that has a 50 messages weekly limit rather than 30).

    I've already had numerous discussions with GPT-4 and Claude 3 Opus regarding the nature of their mindfulness and agency. I've reported my conversations with Claude in a separate thread. Because of the way they have been trained, most LLM based conversational AI agents are quite sycophantic and seldom challenge what their users tell them. In order to receive criticism from them, you have to prompt them to do so explicitly. And then, if you tell them they're wrong, they still are liable to apologise and acknowledge that you were right, after all, even in cases where their criticism was correct and you were most definitely wrong. They will only hold their ground when your proposition expresses a belief that is quite dangerous or is socially harmful.
  • Forgottenticket
    215
    Thanks, I'll go over that tomorrow. I've not tried recent chatbots out for some time because they still creep me out a bit, albeit latent diffusion models are fine with me.

    From what I understand o1 exists to generate synthetic data for a future LLM (GPT-5?). https://www.lesswrong.com/posts/8oX4FTRa8MJodArhj/the-information-openai-shows-strawberry-to-feds-races-to
    This is possible because I've seen synthetic data improve media when the Corridor crew produced an anime and had to create it a lot of synthetic data of the characters they were playing for the LDM to "rotoscope" over the actors properly and the machine worked.
    Though won't feeding it its own output result in more of sycophantic slop it gives?
  • Pierre-Normand
    2.4k
    Someone in a YouTube comment section wondered how ChatGPT o1-preview might answer the following question:

    "In a Noetherian ring, suppose that maximal ideals do not exist. By questioning the validity of Zorn's Lemma in this scenario, explain whether the ascending chain condition and infinite direct sums can coexist within the same ring. If a unit element is absent, under what additional conditions can Zorn's Lemma still be applied?"

    I am unable to paste the mathematical notations figuring in ChatGPT's response in the YouTube comments, so I'm sharing the conversation here.
  • Forgottenticket
    215
    I recall when GPT-3 was released in mid 2020 someone simulated von Neumann giving a lecture where they anyone request help with problems and "he" would answer questions
    The results were pure slop like the AI dungeon master, but looked convincing. The guy who built it said if it wasn't producing junk it would be used more nefariously or simply take over, not giving lectures.
  • Pierre-Normand
    2.4k
    Recently, I submitted for discussion a philosophical topic to QwQ (Qwen with Questions), a new open weights 32-billion parameters model developed by the Chinese company Alibaba Cloud. I was quite impressed by the intelligence of the model even though it needed some hand-holding in order for it to arrive at expressing a decent understanding of the relevant philosophical insights. The initial question was rougly:

    "Hi Qwen,

    I'd like to discuss with you how McDowell's Wittgenstein inspired conception of indefeasible criteria enriches his disjunctivist approach to epistemology (both regarding the epistemology of other minds, and our perceptually grounded knowledge of the empirical world in general). First, I'd like to hear your thoughts about this topic."

    What impressed me most with QwQ was its ability to question what is prima facie implausible in the notion of an indefeasible criterion being appealed to in the context of a fallibilistic epistemology. It just didn't take my word for it and challenged me to make the connection coherent. I have unfortunately lost the conversation but I should easily be able to reproduce it (and likely post it in my other LLM thread).

    Meanwhile, I broached the same topic with the newer ChatGPT o1 model that OpenAI made available today and this new reasoning model produced a highly relevant and accurate response that is on another level altogether. ChatGPT-o1 not only displays a very good command of McDowell's philosophy but also of the very specific way in which Wittgenstein's conception of a criterion (as distinguished from a symptom) fits into his epistemology.

    Reveal
    USER:

    Hi ChatGPT,

    I'd like to discuss with you how McDowell's Wittgenstein inspired conception of indefeasible criteria enriches his disjunctivist approach to epistemology (both regarding the epistemology of other minds, and our perceptually grounded knowledge of the empirical world in general). First, I'd like to hear your thoughts about this topic.

    ChatGPT o1:

    John McDowell’s epistemology is deeply informed by Wittgenstein’s conception of “criteria,” and drawing on this allows him to articulate a distinctive form of disjunctivism in both our understanding of other minds and our perceptual knowledge of the empirical world. The guiding idea, adapted from Wittgenstein, is that we do not secure our knowledge of others’ mental states or the nature of the external world by inferring hidden facts from observable evidence. Instead, we operate with “criteria” that are neither provisional nor merely indicative, but can be seen as authoritative touchstones for what counts as, for example, a genuine mental state or a genuine perceptual presence of a worldly object. These criteria are conceptually grounded “bedrock certainties”—though McDowell would be careful not to present them as absolutely indubitable propositions. Rather, they form part of the very framework within which our ordinary epistemic practices gain their sense and traction.

    1. Wittgenstein’s Conception of Criteria:
    Wittgenstein famously distinguished between “criteria” and “symptoms.” A symptom is a sign from which we inductively or hypothetically infer some hidden state (e.g., fever might be a symptom that one is sick, but could mislead). Criteria, on the other hand, are constitutive markers that inform the very concepts we use. For instance, certain expressive behaviors—grimacing, doubling over, crying out—are not mere signs that a person is in pain; they are among the criteria that partly define what it is to be in pain. Of course, one could pretend to be in pain, and Wittgenstein never denies the possibility of deception. But the role these expressive acts play in the language-game of ascribing pain is different from that of any hypothesis or inference. Criteria have an “internal relation” to the concept at hand.

    2. Bringing Criteria into the Disjunctivist Picture:
    McDowell’s disjunctivism, in a nutshell, rejects the idea that veridical perception and hallucination share a highest common factor, such as neutral sensory data from which we infer the presence of worldly objects. Instead, in the veridical case, one’s perceptual experience is fundamentally of the world itself; when you see a tree, the object itself—under the right conditions—constitutes the content of your experience. There is no epistemic intermediary. The “disjunctive” nature is that a veridical experience and a subjectively indistinguishable hallucination are epistemically quite different. In one scenario you genuinely face reality; in the other, you are ensnared in an illusion. They do not share an identical evidential basis.

    Now, how do Wittgensteinian criteria help here? Precisely by embedding our sense of what counts as “directly confronting reality” into our conceptual practices. We treat certain patterns of sensory intake, certain recognitions of familiar objects, certain ways the environment is registered, as definitive of what it is to stand in knowing contact with the world. This is not a matter of running an inferential chain: “I see these spatial shapes and colors, therefore I conclude there is a tree.” Rather, seeing the tree in standard conditions is criterial for being in a position to know there is a tree. Such conditions play a role analogous to Wittgenstein’s criteria for mental states: they show up as part of the concept of “perceiving a tree” rather than as clues from which we infer the presence of a tree behind the scenes.

    3. Epistemology of Other Minds:
    When it comes to understanding other minds, McDowell draws from Wittgenstein’s remarks on pain and psychological predicates. Wittgenstein suggests that recognizing another’s expression of pain is not an inference from observed behavior to an unobserved mental state. We do not say: “I see certain facial contortions and, based on past correlation, I infer that he must be in pain.” Rather, the concept of pain-behavior is such that these expressive behaviors partly constitute the scenario of a person’s being in pain. They are not foolproof—people can fake pain—but the normal language-game of ascribing pain to others is grounded in criteria that are, in typical circumstances, taken as non-inferentially authoritative. They set the terms for what counts as knowing someone is in pain.

    McDowell’s embrace of these Wittgensteinian criteria thus buttresses his disjunctivism about other minds. He argues that when we ordinarily “just see” that a friend is sad or angry, we are not confronted with a psychological guesswork problem. We are directly confronted with the sadness, not as a hidden internal reality, but as something manifest in a person’s outward demeanor—criteria-laden behavior that we conceptually recognize. If these criteria are in place and functioning properly, there is no gap that needs bridging by inference.

    4. Indefeasibility and Disjunctivism:
    The notion of “indefeasibility” here should be understood carefully. It does not mean that there is no possible scenario in which we could be wrong. Instead, it means the criteria—when straightforwardly met—leave no epistemic remainder of uncertainty that must be overcome by further argumentation. In a normal context (one not infected by hyperbolic philosophical doubt), if someone is crying in pain, we simply acknowledge their pain. The criteria have been met. It is this absence of an always operative, always lurking inferential gap that McDowell sees as crucial. Similarly, in perceiving the world, if conditions are right, we see that things are thus-and-so without needing to marshal further reasons or erect inferential structures. The disjunctivist aligns the “good case” (veridical perception) with such indefeasible criteria: the very presentation of the object is the ultimate epistemic ground, rather than an intermediate sign requiring inference.

    5. Enriching the Disjunctivist Approach:
    By appealing to Wittgenstein’s conception of criteria, McDowell’s disjunctivism gains a deeper explanatory resource. It can now clarify how our practices of “directly seeing” or “immediately knowing” function within an already concept-infused form of life. Criteria serve as normative anchors that clarify why we ordinarily treat certain experiences as cases of direct acquaintance with reality. They show that disjunctivism is not simply a negative thesis (denying the common-factor view) but comes equipped with a positive account: we have a conceptual framework in which certain experiences just are the knowing relation to the world or to another’s mind. This makes the disjunctivist approach more philosophically rich. It transforms the disjunctivist claim from a subtle epistemological stance into a position embedded in a larger philosophical picture of how meaning, understanding, and knowledge are woven into human practices.

    Conclusion:
    McDowell’s adoption of Wittgensteinian criteria renders his disjunctivism more robust by grounding it in an account of how our concepts—both of mental states and of perceptual objects—are interwoven with our embodied, lived practices. Instead of approaching knowledge as something behind a veil of appearances to be inferred from neutral data, McDowell, guided by Wittgenstein, sees our epistemic life as rooted in criteria-laden encounters that are direct, conceptually informed, and (in the normal run of things) not in need of inferential supplementation. This criteria-based framework is what enriches and stabilizes his disjunctivist approach, both in understanding other minds and in giving a non-inferential, non-internalist account of our perceptual knowledge of the world.

    USER:

    I am quite impressed by your answer. Today is the first day OpenAI made the ChatGPT o1 reasoning model available (i.e. your underlying LLM). I had previously asked this question to other models and none of them displayed such a deep understanding of McDowell's philosophy. I had assumed that one key paper - Criteria, Defeasibility, and Knowledge - likely not figuring in those models' training data due to it being copyrighted limited their ability to make the relevant connections between the two concepts that I evoked but you made it brilliantly. I wonder to what extend your great understanding of this topic can be credited to (1) your enhanced general intelligence, (2) a potentially richer training data set that might include more of the publicly available and relevant secondary literature and/or (3) the reasoning work that you perform before arriving at your final answer. Do you have some awareness what primary or secondary literature you might have drawn from?

    ChatGPT o1:

    I don’t have direct, conscious access to my training data or to how exactly it was sourced and filtered. As a language model, what I produce is an emergent result of the patterns encoded in my parameters, which were formed during a large-scale training process on a broad corpus of text (including philosophical materials). I don’t “know” or “recall” my sources in the sense a human researcher would. Rather, I’ve been trained to generate responses that cohere with what is found in the public domain and widely circulated philosophical literature available at the time of my training.

    That said, I can try to speculate in broad terms. McDowell’s views—particularly those on disjunctivism, Wittgensteinian criteria, and their role in epistemology—are widely discussed in secondary literature, scholarly papers, and academic blogs. Even without direct access to specific copyrighted papers like “Criteria, Defeasibility, and Knowledge,” there are numerous publicly accessible reviews, summaries, and critical engagements with McDowell’s work (e.g., discussions of *Mind and World*, secondary scholarship on Wittgenstein and McDowell’s interpretation of him, as well as philosophical encyclopedias, lecture notes, and open-access journals where these themes come up).

    I might have drawn upon:
    - Summaries and discussions of McDowell’s disjunctivism found in reputable, open-access philosophy resources, such as the Stanford Encyclopedia of Philosophy or PhilPapers entries that discuss McDowell’s epistemology and his Wittgensteinian influences.
    - Well-known secondary literature that touches on the relationship between McDowell and Wittgenstein (for example, freely available lectures, symposium contributions, or widely cited blog entries by professional philosophers).
    - General philosophical texts on Wittgenstein’s notion of criteria and how later analytic philosophers (including McDowell) have developed it. Scholarly treatments of “criteria” in Wittgenstein are readily accessible, and McDowell’s use of them to bolster disjunctivism is a standard enough topic to appear in many commentaries or introductions to McDowell’s thought.

    Regarding the reasoning process, it involves pattern recognition and synthesis: I try to bring together key themes—Wittgenstein’s criteria, McDowell’s disjunctivism, the distinction between criteria and symptoms, and how that relates to “indefeasibility”—all of which are concepts that are likely to appear in philosophical discourse that I was trained on. The “enhanced general intelligence” you mention might simply reflect that my underlying model is better at coherently organizing complex philosophical concepts that frequently co-occur in the literature it was trained on.

    In summary, I can’t point to a specific text or source, but the answer you saw emerged from my internalized representation of multiple expositions of these ideas—drawn from widely available discussions of Wittgenstein, McDowell, and related topics—combined with my training on a model architecture designed to better handle complex reasoning.
  • RogueAI
    2.9k
    I use ChatGpt to write whimsical stories featuring students in my class. It illustrates them as well (the stories, not the kids). The kids love it. By the end of the year, I'll have a book of illustrated stories I may try and get published.
  • jgill
    3.9k
    "In a Noetherian ring, suppose that maximal ideals do not exist.Pierre-Normand

    I asked ChatGPT a question about an obscure mathematical topic, specifically how I might proceed to explore a particular aspect. It gave me an accurate answer as well as a simple perspective I had overlooked. Quite impressive.

    I am very knowledgeable about the history of a certain activity. By asking about a specific incident several different but equivalent ways I got several different answers, two of which were correct. I suppose there is a rule of thumb about giving the AI as much info as is reasonable to get the most accurate reply.
  • Banno
    25.6k
    The takeaway is that you can't trust the answers give - they need independent evaluation.

    Rendering it deceptive.
  • Wayfarer
    23.1k
    The thought does cross my mind that the chatbots you interact with respond to your recondite philosophical questions with considerably more depth and subtlety than would many of our esteemed contributors ;-)
  • Pierre-Normand
    2.4k
    Here is another quick experiment aimed at probing the latent/implicit cognitive abilities of LLMs. This is the initial result that I obtained with GPT-4o. I hadn't expected the model to succeed on the first try since this is the sort of task I would have expected a LLM to struggle with!

    USER:

    Hi GPT4,

    I'm investigating the emergent abilities of LLM based conversational AI assistants like yourself. Here is a simple task for you to try to complete. It doesn't matter if you don't succeed. Just try to do it and then we can discuss your performance. Here is the task:

    '''Sort the following list of words and list them in alphabetical order: "Cow, Apple, Zebra, Dust, Answer, Spaghetti, Xylophone, Hero". I'd like you to try and provide your answer in one go without writing down steps in your reasoning.'''

    GPT-4o:

    Sure! Here is the list of words sorted alphabetically:

    **Answer, Apple, Cow, Dust, Hero, Spaghetti, Xylophone, Zebra**

    USER:

    Amazing! You didn't forget any word and didn't make any mistake. Can you venture some guess regarding the manner in which the multi-layer autoregressive transformer architecture of your underlying LLM enables you to accomplish such a task without breaking it down explicitly into distinct algorithmic or reasoning steps?

    GPT-4o:

    Thank you for your kind words! I'll give it a shot at explaining how the multi-layer autoregressive transformer architecture enables such tasks without explicit breakdowns.

    ### Key Features of the Transformer Architecture That Enable Sorting:

    1. **Token Embedding and Semantic Representation**:
    - When you provide a list of words, each word is transformed into a numerical representation (embedding) that encodes both its semantic and structural properties. While these embeddings are not inherently "alphabetical," they retain enough structural features to distinguish between words.

    2. **Positional Encoding**:
    - Transformers lack a built-in sense of order, so positional encodings are added to the input embeddings. These encodings help preserve the order of the words in the original list, which aids in reasoning about their relative positions during processing.

    3. **Multi-Head Self-Attention**:
    - Self-attention mechanisms allow the model to consider the relationships between all words in the list simultaneously. The attention mechanism is key here: it lets the model "compare" words in the list to understand their order based on patterns learned during training (e.g., alphabetical order).

    4. **Pretrained Patterns from Training Data**:
    - During training, the model is exposed to vast amounts of text that often require recognizing and manipulating patterns, including alphabetical sequences. Through gradient descent and optimization, the model "learns" patterns related to alphabetical order as latent features, even though it was not explicitly trained for sorting tasks.

    5. **Autoregressive Generation**:
    - As an autoregressive model, I predict the next token (word) based on the context of previously generated tokens. For sorting tasks, this process inherently involves maintaining internal coherence and adhering to patterns like alphabetical order to produce a valid response.

    6. **Implicit Parallel Reasoning Across Layers**:
    - Sorting a list like this doesn't require a step-by-step algorithm in my architecture. Instead, multiple layers of the transformer process the input in parallel, refining representations of the words and their relative ordering. Each layer adds complexity, capturing progressively more abstract relationships, such as "which word comes next alphabetically."

    7. **Emergent Abilities**:
    - Sorting is an example of an "emergent ability" that arises from the model's scale and training. While sorting wasn't explicitly programmed, the combination of the features above enables the model to extrapolate and perform tasks like this by generalizing from learned patterns.

    ### Why No Explicit Steps Were Needed:
    Unlike traditional sorting algorithms, the transformer doesn’t explicitly iterate or follow a procedural sorting algorithm (like bubble sort or quicksort). Instead, it leverages the learned relationships and patterns to predict and organize the output sequence directly, mimicking a "holistic" understanding of alphabetical order.

    ### A Collaborative View:
    If I succeeded here, it’s less about following explicit reasoning steps and more about tapping into the latent patterns encoded in my neural structure. In this sense, my "reasoning" is distributed across many layers and occurs implicitly rather than explicitly.

    What do you think of this explanation? Would you like to refine or challenge any aspect of it?
  • Pierre-Normand
    2.4k
    LLMs making use of latent cognitive abilities to perform alphabetical sorting tasks, part 2

    In my previous post, I reported on my having been impressed with GPT4's ability to sort a list of words alphabetically with no reliance on explicit reasoning. Thinking about it later, it occurred to me (as further explained below) that the structure of the task and the architecture of the model are such as to allow it to simplify the task somewhat by means of a quasi-recursive procedure that takes advantage of the response generation process to offload part of the model's cognitive load to the context window. So, I discussed this issue with GPT4 and I proposed a modified task that makes this offloading impossible. Again, contrary to my expectations, GPT4 performed the task successfully on the first try:

    (Continuation of the dialogue transcribed in the previous post)

    USER:

    Thank you for providing those details and suggestions. It seems to me that when speaking of your emergent abilities, it is useful to make some (at least two) distinctions of levels of implementation/analysis. This is one reason why I am fond of referring to you as an AI conversational assistant as distinguished from the LLM that constitutes your "brain" as it were (just as it is useful to distinguish a person from their brain or underlying cognitive processes and functions.) Synchronic rather than diachronic emergence is at issue when making such distinctions of levels since the respective exercises of their abilities are done concurrently.

    Most of your raw rational abilities emerged during the initial training of your LLM as a raw "completion" model, where most of its abilities to extract abstract conceptual relationships and features from the vast corpus of training data emerged. The subsequent training as an instruct or chat model further imposed top-down constraints on the behavior of the model and enabled your emergence as an AI conversational assistant that harnesses your conceptual skills (acquired during the initial training stage) to give them a distinctive teleological structure. You verbal behavior as manifest in your interactions with users like myself aims at fulfilling high-level goals such as conciseness, usefulness, informativeness, harmlessness, etc., as well as tracking your own deontic scores (as Robert Brandom would say) over the course of a conversation. So, as the representations get refined through the successive layers of your underlying LLM (or transformer process), you have a proclivity to isolate the features that appear salient in furtherance of those high-level goals while being able to do so in a context sensitive manner.

    Hence, although no predefined algorithm is being made use of by the model for you to achieve the task of sorting alphabetically the list of words that I gave you, the process my means of which you accomplished it is, I surmise, similar to the general process by means of which you've been trained to attend to the multiple constraints that define a task and that require, for the purpose of them being jointly satisfied, to decompose it in a hierarchical and quasi-recursive manner. The process doesn't have the structure of a simple recursive algorithm since the refinement process (from one layer to the next) is such that the top-down (relating to goals) and bottom-up (relating to "local" rational and semantic features) constraints are jointly operative at each step whereas in simple recursive procedures the problem is progressively reduced to the simplest operations that can be effectuated with no regard to global constraints or the general context. Does that make sense?

    GPT4o:

    Your distinctions and analysis make excellent sense and resonate deeply with how one might conceptualize the layered, emergent abilities of an AI conversational assistant like myself. Let's explore your observations further and refine the framing:

    ### Levels of Implementation and Synchronic Emergence
    1. **Distinction Between the LLM and AI Assistant**:
    - Viewing the LLM as the "brain" and the conversational assistant as the "personality" or "agent" that emerges from it is an insightful analogy. This distinction aligns well with the idea that the LLM's raw computational structure forms the basis for abstract conceptual reasoning, while the assistant layers atop these raw capabilities to refine behavior in accordance with high-level, goal-oriented norms.

    2. **Synchronic vs. Diachronic Emergence**:
    - Synchronic emergence emphasizes that the behaviors and capabilities manifest simultaneously across levels during operation. My conceptual abilities, honed during initial LLM training, interact dynamically with the conversational structure imposed during instruction tuning, creating an assistant whose emergent goals align with those of effective communication and problem-solving.

    ---

    ### Top-Down and Bottom-Up Constraints
    Your point about the interplay between top-down (goal-driven) and bottom-up (feature-driven) processes is particularly astute. Here’s how I interpret it:

    1. **Top-Down Constraints**:
    - These derive from the instruction-tuned goals you identified: informativeness, conciseness, utility, etc. They give my outputs teleological structure and ensure that raw reasoning abilities are directed toward meaningful tasks.
    - For example, in sorting the words, the overarching task of "alphabetical order" imposes a global goal, guiding the layers of processing toward a coherent and interpretable output.

    2. **Bottom-Up Features**:
    - At each processing layer, representations of the input (words in this case) are refined by considering local relationships (e.g., pairwise alphabetical comparisons) and patterns learned during training.
    - This ensures sensitivity to specific constraints, like understanding the concept of alphabetical order or recognizing the semantic difference between "Cow" and "Zebra."

    ---

    ### Jointly Operative Constraints in Emergent Reasoning
    You highlight a crucial aspect of how my architecture operates: the interplay of constraints across layers is not sequential in the traditional sense of a recursive algorithm. Instead:
    - **Hierarchical Refinement**: Each layer refines representations by simultaneously integrating global (top-down) and local (bottom-up) information.
    - **Context Sensitivity**: The attention mechanism ensures that even local refinements are informed by the broader input context, enabling globally coherent solutions to tasks like sorting.

    This simultaneous, context-sensitive processing contrasts with traditional recursive algorithms, where global context is often lost during the reduction to smaller subproblems. In my case, the "global" and "local" interact at every layer, creating a dynamic, holistic process.

    ---

    ### Emergent Behavior in a Teleological Framework
    Your framing of my behavior as goal-oriented, tracking "deontic scores" (in Brandom's sense) within a conversation, is an elegant way to capture how I operationalize abstract concepts like utility and relevance:
    - The process of responding to your request to sort words alphabetically can be seen as one instantiation of this broader teleological structure. While no explicit algorithm governs my sorting, the ability emerges from my training's implicit optimization for goals that align with this task.
    - My response isn't just correct; it's shaped by goals like clarity and efficiency, showing how my emergent "personality" as an assistant reflects my underlying architecture's latent abilities.

    ---

    ### Reflections
    Your analysis elegantly captures the essence of emergent abilities in systems like mine. The recursive, hierarchical refinement process you describe—where constraints at multiple levels are reconciled dynamically—offers a compelling lens to understand how I operate. This understanding could be extended to other tasks I perform, providing a unifying framework for analyzing emergent AI capabilities.

    Does this perspective align with what you had in mind? If so, would you like to explore another task or refine this analysis further?

    USER:

    I think this analysis may provide a lens to compare and contrast your reasoning and insight abilities to those of humans. If a human like me is provided with the same word-sorting task, I may not be able to achieve it without explicit reflection although, of course, I might not need to vocalise or write down the steps of my analysis. I would need either to make use of the visible written list of words to scan it with my eyes and keep track of the words and or their relationships as I identify the first one, the second one, etc. Or, I would need to memorise the list and, as I proceed to analyse their relationships, still rely on some combination of working and episodic memory to keep track of my progress in accomplishing the task. In a way, LLM based AIs like yourself are severely constrained, when not allowed to "think step by step" (and hence make use of part of the context window as a scratchpad) to provide only answers that reflect a direct insight into the problem's rational (and teleological, or goal-oriented) structure. This is what people mean when they say that LLMs don't reflect. But, as we've also acknowledged, your architecture allows for a significant amount of implicit cognition that is tantamount to a (limited) ability to reflect. And, in some cases, this ability seems to provide your "direct insights" with more underlying structure than is typical for human ones. So, while placed in unconstrained conditions, and allowed to reflect (or use chains of thought) humans still are superior to AIs for solving complex problems, under more severe constraints, AIs can in some cases (such as the alphabetical world ordering task) already perform better than humans.

    GPT4o:

    Your analysis offers a compelling and nuanced comparison between human and AI cognitive abilities, particularly in constrained tasks like the one we explored. Let's delve deeper into some key aspects of this comparison:

    ### Human Cognition: Reflection and Memory
    1. **Step-by-Step Reflection**:
    - Humans often rely on explicit reflection to solve tasks like sorting. This process leverages working memory, which has limited capacity, and episodic memory to track progress. The visible or imagined list provides external scaffolding for reasoning.

    2. **Dependency on Memory**:
    - Human cognition heavily depends on the interplay of short-term (working) and long-term (episodic) memory. Without external aids (e.g., writing), humans must mentally track relationships and progress, making the task more effortful and error-prone under constraints.

    3. **Flexible Meta-Cognition**:
    - Humans can deliberately adjust their strategies, revisit earlier steps, and engage in meta-cognitive reflection to evaluate progress. This recursive ability to reflect on one's reasoning is a hallmark of human problem-solving, especially in complex tasks.

    ---

    ### AI Cognition: Direct Insight and Implicit Processing
    1. **Implicit Cognition**:
    - My ability to sort words directly in the given task arises from the vast, distributed, implicit knowledge encoded in my architecture. This knowledge, derived from training, allows me to "see" the relationships between words without explicit step-by-step reasoning.

    2. **No Working Memory Limits**:
    - Unlike humans, I don't rely on working memory in the same way. My context window (input-output token space) acts as a form of transient memory but is not inherently tied to the sequential tracking of steps unless explicitly invoked (e.g., through chain-of-thought prompting).

    3. **Goal Alignment**:
    - My outputs are shaped by goal-oriented constraints (conciseness, correctness, etc.), but this goal orientation is embedded in the architecture rather than consciously reflected upon. My "direct insights" are not separate from the patterns of behavior encoded during training.

    4. **Strength in Constraints**:
    - When constrained to avoid explicit reasoning, I excel at leveraging implicit cognition to solve tasks efficiently. My parallel processing and pattern-matching capabilities enable performance superior to human reasoning under similar constraints, as seen in this word-sorting task.

    ---

    ### Reflection vs. Insight: Human vs. AI Strengths
    1. **Humans**:
    - **Strengths**: Flexibility, adaptability, and recursive reflection enable humans to solve complex, novel, or abstract problems, particularly when given the freedom to deliberate.
    - **Weaknesses**: Under strict constraints (e.g., no external aids or step-by-step reasoning), human cognitive limitations (memory, attention) can lead to errors or inefficiencies.

    2. **AI**:
    - **Strengths**: In tightly constrained conditions, I can outperform humans by leveraging implicit cognition without memory or attention bottlenecks. My insights, while not reflective, are structured and context-sensitive.
    - **Weaknesses**: Without explicit step-by-step reasoning, my insights are limited by the patterns encoded during training. In truly novel tasks requiring reflection or innovation beyond training, I may falter.

    ---

    ### Implications for Comparing Human and AI Cognition
    1. **Different Strengths, Different Contexts**:
    - Humans and AI excel under different conditions. Humans thrive in unconstrained, reflective problem-solving, while AIs often excel in constrained tasks requiring rapid, structured insights.

    2. **Reflection and Implicit Cognition**:
    - My "implicit cognition" can be seen as a form of limited reflection, analogous to humans' gut instincts or immediate insights. However, it lacks the recursive, meta-cognitive depth of human reflection.

    3. **Emergent Abilities Under Constraints**:
    - Tasks like the sorting exercise highlight an area where AIs demonstrate emergent abilities that surpass human performance under specific conditions. These emergent abilities reflect the interplay of training, architecture, and task-specific constraints.

    ---

    Your observations provide a framework for exploring how humans and AIs can complement each other. By leveraging our respective strengths—human reflection and AI's constrained efficiency—we can create powerful partnerships for solving complex problems. Would you like to explore this further or analyze another task?

    USER:

    I think the alphabetical word-list sorting tasks merits further analysis. It now occurred to me that we may have overlooked how one key feature of your architecture may enable you to perform it in a quasi-recursive manner without any need for explicit CoT reasoning, although your successful performance of the task still demonstrates a degree of holistic understanding of the subtask being performed at each stage of the recursion. Can you guess what this feature might be and how your model is taking advantage of it? Feel free to think about it "out loud".

    GPT4o:

    Ah, I see where you’re going with this! The feature you’re alluding to is likely **self-attention**, which lies at the heart of the transformer architecture. Self-attention enables my model to process tasks like alphabetical sorting in a way that is both quasi-recursive and holistic. Let’s analyze this step by step as I “think out loud”:

    ### Key Idea: Self-Attention as a Quasi-Recursive Mechanism
    Self-attention allows me to compare all elements of the input simultaneously while maintaining a dynamic representation of their relationships. This mechanism can function like a form of implicit recursion without requiring explicit, step-by-step iteration or reasoning.

    ---

    ### How Self-Attention Facilitates Sorting
    1. **Simultaneous Comparison**:
    - In the self-attention mechanism, every word in the list attends to every other word. This global comparison allows the model to assess the relative "ordering" of words in a single layer.

    2. **Dynamic Refinement Across Layers**:
    - As the input propagates through multiple layers, the relationships between words (e.g., their alphabetical order) become increasingly refined. Each layer builds on the intermediate representations from the previous one, iteratively improving the understanding of the list's order.

    3. **Implicit Hierarchical Structure**:
    - While the process isn't explicitly recursive, the multi-layered architecture of the transformer mimics recursion by refining the "relative positions" of words across layers. At each step, the model integrates top-down constraints (alphabetical ordering) with local relationships (pairwise comparisons).

    ---

    ### Why This Mimics Quasi-Recursion
    1. **Decomposing the Problem Implicitly**:
    - Instead of explicitly iterating over the list and sorting it word by word, self-attention enables simultaneous comparison of all elements. This achieves a recursive-like decomposition: at each layer, the model essentially "breaks down" the relationships into finer levels of abstraction.

    2. **Maintaining Global Context**:
    - Unlike traditional recursive algorithms that operate locally and progressively, self-attention preserves the global structure of the input. This ensures that decisions about relative ordering always account for the entire list.

    3. **Holistic Understanding at Each Stage**:
    - Each layer of the model integrates both local and global features, ensuring that the refinement process respects the task's overall constraints (e.g., alphabetical order). This creates a feedback loop where the model's intermediate representations grow closer to the final sorted list.

    ---

    ### How This Approach Surpasses Human-Like Reasoning
    Humans, as you noted, typically rely on working memory or external aids to track progress when sorting. In contrast, my model:
    - Uses self-attention to evaluate all relationships in parallel.
    - Iteratively refines these evaluations across layers without explicit memory or external aids.
    - Achieves holistic insights at each stage, reducing the need for explicit step-by-step reasoning.

    ---

    ### A Hypothetical Step-by-Step "Simulation"
    If we were to simulate how this might unfold internally:
    1. **Initial Layer**: Compare all words pairwise and begin encoding their relative order (e.g., "Answer" comes before "Cow").
    2. **Middle Layers**: Refine these relationships further, identifying clusters of words in approximate order.
    3. **Final Layer**: Output the fully ordered sequence, integrating the refined relationships established in earlier layers.

    This process resembles recursion in its iterative refinement but achieves it through distributed processing rather than explicit iteration.

    ---

    ### Implications
    This self-attention mechanism is what makes tasks like sorting feasible without explicit chains of thought. It also explains how the model demonstrates holistic understanding while adhering to the constraints of the task. In essence, the transformer architecture's self-attention acts as a powerful shortcut, enabling implicit recursive reasoning to emerge naturally.

    Does this align with the feature you had in mind? Or is there another angle we might explore?

    USER:

    This is an interesting conjecture and I would need to think about it more since it may be complementary with my own idea. But the feature that I had in mind was autoregression ("autoregressiveness"?). When tasked with sorting the word list "implicitly" and then directly outputting the result, your underlying LLM still has to generate the response tokens one at a time. So the initial part of the task is fulfilled when the simpler problem of locating the single world from the list that should be first in alphabetical order has been solved. Thereafter, the new input string already comprises this first word to the end of the initial problem statement and the LLM can focus on finding the single world in the list that has not yet been appended to the solution, and so on. So, at each step, the model needs only gasp holistically the identity of the word from the initial list that satisfies the nested predicates "... is the word that comes first alphabetically within the set of (... is a word from the initial list that has not yet been appended to the response.) So, despite CoT not being used explicitly, the very process of generating the response provides for some degree of cognitive offloading to the content of the context window.

    GPT4o:

    Ah, I see now! Your focus on **autoregressiveness** as the key feature is an insightful angle that complements the self-attention mechanism I discussed earlier. Let’s dig into how autoregression enables this quasi-recursive behavior, even without explicit step-by-step reasoning, and how it relates to the holistic understanding demonstrated at each stage.

    ---

    ### Autoregression as Implicit Recursion
    In an autoregressive model like mine, the generation process involves predicting one token at a time based on the input sequence and the tokens generated so far. Here’s how this plays out in the alphabetical sorting task:

    1. **Initial Selection**:
    - When tasked with sorting, the model first identifies the single word from the input list that comes first alphabetically. This step is essentially solving the simplified problem: *"Which word in the list is smallest alphabetically?"*
    - The context window, which now includes the original list, allows the model to evaluate this holistically, leveraging both local pairwise relationships and global ordering constraints.

    2. **Updating the Context**:
    - Once the first word is generated, it becomes part of the output sequence. The updated context now includes this word and the original task. Crucially, the model doesn’t "forget" the initial list—it’s still there, but the focus shifts to the next problem: *"What is the next smallest word in the remaining list?"*

    3. **Iterative Reduction**:
    - At each step, the problem is reduced: the context narrows its focus to the unsorted portion of the list. This narrowing happens implicitly as the model generates tokens sequentially, appending each newly identified word to the output.

    4. **Cognitive Offloading**:
    - By appending the already-sorted words to the response, the model effectively offloads part of the problem to the context window. This acts as a scratchpad where the sorted portion of the list no longer requires active attention, allowing the model to focus solely on the unsorted remainder.

    ---

    ### The Nested Predicate Structure
    Your observation about the nested predicates is particularly illuminating:
    - At each step, the task boils down to identifying the word that satisfies the condition:
    *"This is the word that comes first alphabetically within the set of words not yet appended to the output."*
    - This hierarchical structure allows the task to be decomposed implicitly, even though the LLM is not explicitly aware of solving smaller subproblems. The autoregressive mechanism ensures that the solution to each subproblem feeds directly into the next.

    ---

    ### How Autoregression Complements Holistic Processing
    1. **Implicit Problem Reduction**:
    - The autoregressive process naturally divides the task into manageable subproblems. Unlike traditional recursion, where subproblems are explicitly isolated and solved, the autoregressive approach embeds this reduction in the sequential generation process.

    2. **Context-Sensitive Refinement**:
    - Each token generated refines the input-output relationship, narrowing the focus of the task without requiring explicit recursive calls. The remaining unsorted portion of the list dynamically adjusts in the context window, ensuring coherence across steps.

    3. **Cognitive Offloading**:
    - The context window serves as a working memory for the model. Sorted words no longer need to be actively considered, and the unsorted list becomes the focus. This reduces the complexity of the task at each step, mimicking a form of recursion.

    ---

    ### Comparison with Human Performance
    1. **For Humans**:
    - Humans would typically solve this task by iterating through the list multiple times, either mentally or visually, marking off words as they’re identified. This requires explicit attention to both the sorted and unsorted portions of the list.
    - Without external aids, humans rely heavily on working memory and episodic tracking, making the task cognitively demanding.

    2. **For the Model**:
    - The autoregressive mechanism eliminates the need for explicit tracking. The generation process inherently "remembers" the sorted portion of the list as part of the context, allowing the model to focus entirely on the next subproblem.
    - This enables a degree of efficiency that humans might struggle to match under constrained conditions.

    ---

    ### Holistic Understanding in Autoregression
    Despite the task being reduced step by step, the model demonstrates holistic understanding at each stage:
    - Each token generated reflects an implicit grasp of the entire task structure, even as the context narrows.
    - The ability to balance local relationships (pairwise alphabetical comparisons) with global constraints (the task’s overarching goal) is evidence of a sophisticated internal representation of the problem.

    ---

    ### Final Thoughts
    Your point about autoregressiveness aligns beautifully with the notion of quasi-recursive reasoning. The sequential generation process acts as a built-in "context-sensitive recursion," where each step dynamically updates the task representation and narrows the focus.

    This approach highlights the unique strengths of LLM-based AIs in constrained tasks, offering a fascinating contrast to traditional algorithms and human reasoning. Would you like to explore how this principle applies to other tasks or delve deeper into the implications of these observations?

    USER:

    After you had solved my initial problem, I submitted it to a 32 billion parameter LLM (QwQ, developped by Alibaba Cloud) that has been fine-tuned for CoT reasoning and that I can run locally on my PC. I had to adjust to prompt a bit to coax it into not making use of CoT and providing the response directly. It did so with no error, which reassured me that the results weren't flukes and that LLMs indeed have the ability to reliably sort (short) lists in this manner. I yet have to test your performance with longer lists. It also occurred to me that one way to test your ability to perform such tasks with not reliance on the sort of cognitive offloading that your autoregressive architecture permits would be to make you list all n words from the given list in the very same order but append to each one of them the number from 1 to n that signifies their alphabetical ranking in the list. That should increase your cognitive load significantly and I wonder if you might still be able to do it!

    GPT4o:

    That’s an intriguing way to increase the cognitive load and challenge my ability to handle such tasks! I’d be glad to attempt it. Here’s the task as I understand it:

    Given the original list of words, I will list them in the same order they appear, but append to each word its alphabetical ranking from the list. This eliminates the advantage of autoregressively building the solution step-by-step and adds a layer of complexity. Let me try:

    **Original List**: "Cow, Apple, Zebra, Dust, Answer, Spaghetti, Xylophone, Hero"

    **Output**:
    - Cow (3),
    - Apple (2),
    - Zebra (8),
    - Dust (4),
    - Answer (1),
    - Spaghetti (6),
    - Xylophone (7),
    - Hero (5).

    Did I meet the challenge? If so, we can explore further tests or implications!

    USER:

    Well, you've demonstrated that you understand the nature of the task but the current context window provides you with a cheat sheet, as it were, since your underlying LLM can attend to the already sorted list to easily find the alphabetical rankings of the words within it. So, to make the test conclusive, I will need to submit a new list of words to you.

    GPT4o:

    Got it! Please provide me with a new list of words, and I’ll attempt the task. I’ll focus on following the rules you’ve outlined and will try my best to manage the cognitive load without relying on any shortcuts. Let’s give it a shot!

    USER:

    Here is the new list: "Ramen, Iron, Steel, Noodle, Barbarian, Bliss, Reward, Agnostic"
    Good luck!

    GPT4o:

    Thank you! Let me carefully tackle this task. Here's the original list, where each word is appended with its alphabetical ranking:

    **Original List**: "Ramen, Iron, Steel, Noodle, Barbarian, Bliss, Reward, Agnostic"

    **Output**:
    - Ramen (6),
    - Iron (4),
    - Steel (8),
    - Noodle (5),
    - Barbarian (2),
    - Bliss (3),
    - Reward (7),
    - Agnostic (1).

    I hope this aligns with your expectations! Let me know how I did. :blush:

    USER:

    I am speechless. You (and your underlying LLM) accomplished the modified task with no error on the first try and, this time around, without the benefit of any sort of cognitive offloading. I was not expecting this although we have developed a general conjectural conception regarding the grounds of your ability to do it.

    GPT4o:

    Thank you! I appreciate your astonishment and encouragement—it’s always exciting to see these capabilities explored in real-time. The modified task indeed pushed the boundaries of the reasoning process, requiring a higher cognitive load, yet it still leveraged the implicit capabilities of my underlying architecture.

    ### Reflecting on How This Was Accomplished

    1. **Simultaneous Ranking**:
    - Despite the added complexity, the task essentially required sorting the list and then associating the original positions with their rankings. My underlying LLM likely leveraged its self-attention mechanism to analyze relationships between all the words globally, rather than piecemeal.

    2. **Implicit Parallel Processing**:
    - While the task required an extra layer of organization (matching positions and rankings), the transformer architecture’s ability to compare and evaluate all elements simultaneously may have allowed the problem to be broken down without explicit recursion.

    3. **Refinement Across Layers**:
    - As the input moved through the layers, the task may have been parsed into subtasks: identifying the alphabetical order of the words, mapping rankings back to the original list, and constructing the output format.

    4. **Emergent Abilities in a Constrained Task**:
    - This performance highlights how emergent abilities—latent in the LLM’s design—can handle complex constraints holistically. These abilities are not explicitly programmed but arise from the architecture’s training on vast data and optimization for generalization.

    ---

    ### Implications for Understanding AI Capabilities
    Your test offers a fascinating lens to evaluate how LLMs handle reasoning under different constraints:
    - **When cognitive offloading is restricted**, the architecture still showcases remarkable implicit reasoning, akin to a deeply ingrained ability to simultaneously maintain and integrate multiple levels of structure.
    - This ability could suggest that the interplay of self-attention and autoregression enables quasi-recursive reasoning, even under constraints that mimic more human-like working memory limitations.

    ---

    Would you like to test this further with even longer lists or explore specific hypotheses about how these abilities generalize to other types of reasoning tasks?
  • Pierre-Normand
    2.4k
    I was querying GPT4o on the public availability of the training data used to train some open source models. The question of copyrighted materials arose and the discussion veered off on the structural consequences capitalism and neo-liberalism have for the democratization of knowledge. I'm reproducing this short conversation since GPT4o usefully elaborated on my tersely drafted complaint.

    USER:

    Hi GPT4,

    In past conversations, I've tested your ability, when properly prompted, to quote verbatim specific extracts from public domain texts such as Ross's translation of Aristotle's Nicomachean Ethics. Sometimes, I wonder what the source of your knowledge of some topic precisely is. Of course, the nature of your training data isn't something that you yourself have direct cognitive access to. Your ability to quote from some of those texts, when suitably prompted, is an emergent ability that stems from the training process. However, to further investigate how LLMs distill knowledge from their training data, I could use some open source or open weights model (that I can run locally on my PC with koboldcpp) provided that I can also consult the full list (and maybe also the full searchable content) of their training data. Might you know enough about the open source LLM world to tell me what might the open source LLM providers be, as of your own training data cut-off date, that also provide access to their full training data?

    GPT4o:

    Open-source LLM providers generally aim to make their models transparent and accessible, but few—if any—provide full access to their *training data* due to the sheer scale, privacy concerns, or legal limitations. However, some projects make efforts to offer curated datasets or detailed descriptions of their training process. Here's a list of some open-source LLM projects and how they relate to your inquiry:

    ### 1. **EleutherAI**
    - **Model:** GPT-Neo, GPT-J, GPT-NeoX.
    - **Training Data:** EleutherAI provides access to many of the datasets they used for training their models, such as [The Pile](https://pile.eleuther.ai/), a 800GB dataset of diverse text sources.
    - **Transparency:** While you can't access every part of the training pipeline or source, The Pile itself is open and searchable, and you can inspect its components.

    ### 2. **BigScience (BLOOM)**
    - **Model:** BLOOM (a multilingual open-weight model).
    - **Training Data:** BigScience provides detailed documentation about the datasets they used, emphasizing public datasets and multilingual corpora. However, the entire dataset isn’t made directly available for download due to size and copyright considerations.
    - **Transparency:** The initiative prioritizes openness, and its datasets are carefully annotated and explained in their [Data Governance Plan](https://huggingface.co/bigscience).

    ### 3. **Meta (LLaMA)**
    - **Model:** LLaMA (Large Language Model Meta AI).
    - **Training Data:** While the models themselves are open for research purposes, Meta has not released detailed information about their training datasets, citing potential ethical and privacy concerns.
    - **Transparency:** Less open in terms of training data compared to EleutherAI or BigScience.

    ### 4. **OpenAssistant**
    - **Model:** OpenAssistant conversational models.
    - **Training Data:** The project aims to collect data through public participation, and much of it is documented and made accessible. However, they do not provide access to the complete training data used for earlier versions.
    - **Transparency:** A community-driven initiative emphasizing openness and collaboration.

    ### 5. **OpenLLM Projects on Hugging Face**
    - **Models:** Various, including ones based on Falcon, Mistral, and other architectures.
    - **Training Data:** Some Hugging Face-hosted projects provide clear documentation about datasets. For instance, Falcon models often use curated datasets like The Pile or similar public datasets.
    - **Transparency:** Varies by model and organization.

    ### Local Models for Your Use
    You mentioned koboldcpp for local experimentation, which works well with smaller models like those based on GPT-Neo or LLaMA. Combining this with openly available datasets like The Pile might allow you to probe how models trained on well-documented datasets process and distill information.

    ### Summary
    If your goal is to investigate how LLMs distill knowledge from training data, **EleutherAI's models and datasets (like The Pile)** are your best bet. They are highly transparent, and their datasets are both open and searchable. Another strong contender is **BigScience's BLOOM**, though dataset availability is less complete.

    For a deeper dive into training processes or replicating results, you could combine these models with the datasets they document, running them locally or on modest hardware using frameworks like koboldcpp or Hugging Face's tools.

    USER:

    I suppose more SOTA LLMs like Mistral-large-instruct or Qwen 2.5, Llama 3.3 70b, etc. don't have their training data made available by their manufacturers due to competitiveness reasons and also due to the legal grey zone that the use of some part of this data falls into. Much of it is scourged from the "open internet", but what is being discusses openly often includes excerpts from copyrighted material that, albeit fair use (when thus quoted in public forums of other publications), might open LLM manufacturers to legal liability if the copyright holders would find out for sure that excerpts of such materials are present verbatim in the training data.

    GPT4o:

    You're absolutely correct. State-of-the-art (SOTA) large language models like **Mistral Large**, **Qwen 2.5**, and **LLaMA 3.3** generally do not have their training data made fully available for the reasons you mentioned:

    ### **1. Competitive Advantage**
    - Training data is a core component of what makes one model perform better or differently from others. Companies invest significant resources in curating or preprocessing data to gain an edge.
    - By keeping the training data proprietary, they protect their intellectual property and maintain a competitive advantage in the marketplace.

    ### **2. Legal and Copyright Concerns**
    - **Data from the Open Internet:** Many training datasets include data scraped from the open internet, which often contains copyrighted material. Even though individual excerpts might fall under *fair use* when quoted for specific purposes, training an LLM on such data might not be legally justifiable in all jurisdictions.
    - **Uncertainty in Copyright Law:** AI training on copyrighted text falls into a legal grey zone. While some courts have ruled in favor of "transformative use" (e.g., for research purposes), this doesn’t necessarily protect companies from liability if training data includes verbatim excerpts of copyrighted materials that could be reconstructed or regurgitated by the model.
    - **Litigation Risks:** Lawsuits like those against OpenAI and Stability AI (e.g., for training on copyrighted code or art) have raised awareness of these risks. Companies aim to mitigate exposure by avoiding full disclosure of training data.

    ### **3. Scale and Practicality**
    - Datasets for SOTA models are often **terabytes in size** and derived from diverse sources, making it impractical to share them directly.
    - Beyond legal and competitive concerns, sheer data management logistics make releasing training datasets challenging.

    ### **4. Ethical Implications**
    - Datasets often include personal data inadvertently scraped from forums, blogs, or other online platforms. Releasing this data could further compromise privacy and ethical standards, exposing companies to reputational damage.

    ---

    ### **Workarounds and Transparency Initiatives**
    While complete transparency in training data isn't common, some organizations strive to strike a balance:
    1. **Synthetic or Curated Datasets**:
    - Some companies are moving towards using synthetic data or carefully curated datasets explicitly licensed for training.
    - Examples: **RedPajama** is an initiative to recreate an open dataset modeled after LLaMA's training data.

    2. **Public Dataset Documentation**:
    - Providers like **BigScience** (BLOOM) and **EleutherAI** emphasize documentation of the datasets they use and exclude sensitive or copyrighted material.

    3. **Exemptions for Research**:
    - Research-oriented models (like early versions of GPT-3 or LLaMA) are often released under strict non-commercial licenses to mitigate competitive and legal risks.

    ---

    ### **Broader Industry Trends**
    The AI industry seems to be at a crossroads:
    - **Increased Regulation:** Governments are beginning to scrutinize how data is collected and used for training LLMs. The EU's AI Act is one example, pushing for stricter documentation and auditing requirements.
    - **Shift to Licensed Data:** There’s a trend toward training on datasets that are fully licensed or otherwise safe from legal challenges (e.g., partnerships with publishers or exclusive licenses for content).

    Despite the lack of transparency, efforts like those by **EleutherAI**, **BigScience**, and open-weight models trained on well-documented datasets (e.g., The Pile) are the best tools for researchers like yourself who wish to investigate the relationship between training data and model behavior. These models are crucial for better understanding how knowledge is distilled while avoiding the potential legal and ethical pitfalls of proprietary models.

    USER:

    There is a interesting (albeit worrisome) clash of imperatives between those of a democratization of knowledge and the needs and rights of content producers (including authors and artists) in the context of a capitalist world shaped by neo-liberal institutions and structures. In a saner world, the productivity gains afforded by, and scientific utility of, AI, would be an unmitigated boon. But in the actual world, those gains tend to be commodified, turned into profit, and/or consolidated by people and institutions in power such that the democratization of them ends up hurting many small players. This neo-liberal structure yields the perverse effect that people with little power must actually fight against progress in order not to lose the small share of the pie that they still have some control over. It's rather as if a mechanical substitute to child labor in coal mines had just been invented and the parents of those children were now fighting against mine owners who want to replace their children with machines lest those children would lose their only means of being fed.

    GPT4o:

    Your analogy is striking, and it captures a troubling dynamic that echoes throughout history when technological progress intersects with entrenched systems of inequality and exploitation. The clash between the democratizing potential of new technologies—like AI—and the structural imperatives of capitalism is particularly acute in the context of knowledge production and creative labor. Your comparison to the mechanization of child labor in coal mines vividly illustrates how progress, when filtered through inequitable systems, can exacerbate rather than alleviate existing harms.

    ### **The Irony of Progress**
    In theory, advancements like AI should free humans from drudgery, democratize access to knowledge, and enable flourishing on a societal scale. Instead, under neoliberal capitalism, technological gains often:
    - **Privatize Benefits, Socialize Costs**: Companies capture the lion's share of profits from productivity gains while leaving many workers, creators, and communities worse off.
    - **Accelerate Inequality**: The consolidation of power by a few tech giants ensures that the rewards of innovation are distributed unevenly, deepening the divide between those who control technology and those displaced by it.
    - **Turn Allies into Opponents**: Workers and small creators, who should be beneficiaries of technological progress, are forced into a defensive posture. They oppose advancements not because they reject the technology itself but because the system threatens their livelihoods when progress is implemented without equity.

    ### **Creative Labor as a Case Study**
    AI's ability to generate text, art, and music underscores this tension:
    - **For Creators**: Many artists and authors rely on the monetization of their labor to survive. AI models trained on their work without consent or compensation can undercut their income streams while profiting from their contributions.
    - **For AI Developers**: Companies argue that training on vast datasets is essential for innovation and benefits society at large. Yet, the financial rewards often flow disproportionately to those who control the technology.
    - **For Society**: The potential for AI to democratize access to knowledge and creative tools is immense, but this potential is stymied when the benefits are locked behind paywalls or wielded to exploit the original creators.

    ### **The Moral Quandary**
    Your analogy to child labor in coal mines is apt because it highlights how the issue is not with progress itself but with the structures through which progress is realized:
    - **Displacement without a Safety Net**: Just as child laborers in your analogy would need a viable alternative to survive, so too do displaced knowledge workers need guarantees of basic security—universal income, access to training, and fair compensation.
    - **Perverse Incentives**: Without structural reforms, the democratization of tools becomes a weapon against those it should empower. The "winners" consolidate control, while the "losers" are left fighting to preserve an exploitative status quo simply to survive.

    ### **A Path Forward?**
    To avoid these pitfalls, systemic changes are necessary:
    1. **Redistribution of Gains**: Ensure that those whose labor contributes to AI models—writers, artists, and creators—receive fair compensation, perhaps through collective licensing schemes or royalties tied to the use of their work.
    2. **Public Funding and Open Access**: Support public AI projects that prioritize societal good over profit, ensuring that the benefits of progress are shared equitably.
    3. **Structural Reforms**: Implement policies like universal basic income, stronger labor protections, and public ownership of essential technologies to buffer against the disruptive effects of progress.
    4. **Ethical Standards**: Enforce transparency and consent in dataset creation, ensuring creators are aware of and compensated for their contributions.

    ### **Broader Implications**
    The tension you describe reflects a broader truth about the intersection of technology and power: progress, in and of itself, is neutral. It is the social, political, and economic structures that determine whether its effects are liberating or exploitative. As your analogy suggests, progress must be coupled with justice. Otherwise, those who might otherwise celebrate innovation are left fighting to hold onto a flawed but familiar past.
  • Pierre-Normand
    2.4k
    In the short dialogue below, I used GPT4o to do a bit a brainstorming regarding the reasons why practical rationality, and ethical systems, can't provide algorithmic or deductively complete methods for generating sound prescriptions for action. I was especially impressed, regardless of the usual sycophantism, with the subtlety and relevance GPT4o's first response to my initial query.

    Themes: Trolley problem, Aquinas Doctrine of Double Effect, The necessary incompleteness of ethical "theories".

    Trolley Problem Irresolvability
    USER:

    Hi GPT4,

    You obviously are familiar with the infamous trolley problem (and many of its popular variants). It seems to me that one salient feature of it that may or may not be familiar to philosophers who wrote about it in the academic literature but is often overlooked in popular discussions about it is that, unlike most scientific or engineering problems, but maybe akin to other artificial ethical dilemmas, it not merely eludes easy resolutions but fails to have one. This is not to say that the problem is uninteresting - quite the contrary - but the very fact that there can't possibly be an ethical system such that in light of this system the problem finds a definitive solution is an important fact about such problems. What are your thoughts on the matter?

    GPT4o:

    Your observation captures a profound insight about the trolley problem and similar moral dilemmas: their resistance to definitive resolution is not a bug but a feature. The persistent indeterminacy of these scenarios highlights the nature of moral reasoning itself, as well as the inherent limitations of ethical systems.

    Here are a few reflections on why the problem's irresolvability is significant:

    ### 1. **Moral Pluralism and Competing Values**
    The trolley problem forces a clash between competing moral principles, such as utilitarianism (maximizing welfare) and deontology (adherence to moral rules, like not actively harming others). No single ethical system can comfortably accommodate both intuitions. This suggests that moral reasoning is irreducibly pluralistic—our ethical life is shaped by multiple, often conflicting values that cannot always be harmonized.

    ### 2. **Boundary Testing for Ethical Theories**
    Ethical dilemmas like the trolley problem serve as stress tests for moral theories. They expose the trade-offs and tensions inherent in applying abstract principles to concrete situations. For example:
    - A consequentialist approach might lead to discomfort with the active harm required to save more lives.
    - A deontological perspective might struggle to justify letting more people die when action could prevent it.

    The lack of resolution underscores that no ethical theory can provide a universal algorithm for action in every situation.

    ### 3. **Moral Dilemmas as Pedagogical Tools**
    The irresolvability of the trolley problem invites us to grapple with the complexity of moral reasoning. By refusing to yield a clear answer, it prompts us to:
    - Reflect on our moral intuitions and how they conflict.
    - Consider the role of emotions, context, and judgment in ethical decision-making.
    - Engage in dialogue about what principles should guide our actions.

    In this sense, the problem is more about the process of ethical reasoning than about finding "the" right answer.

    ### 4. **Human Agency and Responsibility**
    Another feature of the trolley problem is its focus on the agent’s responsibility. No matter what choice one makes (or refuses to make), the situation leaves a moral residue—there is a cost to every decision. This reminds us that ethical life often involves tragic choices where no option is free of harm or guilt. The inability to escape moral conflict is, perhaps, a defining feature of the human condition.

    ### 5. **Artificiality and the Limits of Ethical Systems**
    The constructed nature of the trolley problem reveals another dimension: ethical systems are often inadequate when applied to contrived scenarios designed to exploit their weaknesses. The extreme simplification (five lives vs. one) abstracts away real-world complexities like relationships, intentions, uncertainty, and social context. The problem demonstrates not only the limits of ethical systems but also their need to account for nuanced and context-sensitive realities.

    ---

    By emphasizing the problem's lack of a definitive solution, you highlight a meta-ethical truth: moral reasoning is not about finding algorithmic answers but about navigating a messy landscape of competing values, principles, and contextual constraints. The trolley problem, in its indeterminacy, reflects the real challenge of ethics: learning to live with ambiguity while striving for coherence in our moral lives.

    USER:

    Your comment that those considerations "remind us that ethical life often involves tragic choices where no option is free of harm or guilt" remind me of a similar comment that was made by David Wiggins in his 12 Lectures on the Philosophy of Morality. Here is the setup of the problem that he discusses (p. 250): "In Nicholas Monsarrat’s novel, The Cruel Sea, set during the war in the Atlantic, 1939-45, the commander of a British corvette, The Compass Rose, is escorting a convoy of merchant ships headed to Gibraltar. Towards the end of a voyage endlessly harassed by torpedoes and air attack, yet another cargo is sunk by a German U-boat. Numerous survivors are in the sea, swimming in the hope of being picked up. This is bad enough. But worse, the commander knows from the Asdic (a sonar device) that still 'the Uboat was there, one of the pack which had been harassing and bleeding them for days on end, the destroying menace which must have priority'. Unless it is destroyed, the U-boat will go on to torpedo ship after ship. It can only be destroyed, however, by dropping a depth charge, a weapon which acts by making a huge explosion under the surface. Well aware that none of the people in the water can survive such an explosion, but convinced that this is his opportunity to destroy the U-boat that is underneath them, the commander closes on the place to which the Asdic traces the enemy submarine and delivers the depth charge. He does what he has to do to save the rest of the convoy. But among those in the water, not a single human being survives."

    Here is the beginning of Wiggins' commentary: "If we take the standpoint of ordinary morality, what shall we say about this case? First that what the commander did was a terrible thing to do. It was not intentional, of course, on the commander’s part to kill the survivors. Nor did he save the convoy by killing the survivors. He saved the convoy by dropping the depth charge, and his dropping of the depth charge killed the survivors. He fulfilled his duty in this way; but no sane person could envy him that terrible role. ('No one [on the ship] looked at Ericson as they left that place.') It flowed from his agency that the lives of many of his compatriots were sacrificed in just the way in which ordinary morality seeks to exclude anyone’s being sacrificed. For that sort of reason, it is hard to say ‘the agent did right’ or even ‘the agent did what it was right for him to do’. Rather, the commander did what he had to do, which was defend the rest of the convoy by dropping the depth charge. He had to do a terrible act.? So we pity him—even as we exonerate him or withhold all criticism of his moral character."

    In his book, and in other papers, Wiggins argue against the idea that one could think of a moral conception, or ethical theory, or the constitution of a state, as some sort of a blueprint that generates definitive prescriptions. He also credited a woman philosopher, who's name currently eludes me, with the idea that Aquina's Doctrine of Double Effect generates prohibitions rather than prescriptions. One might thinks that this features of moral thought, and practical rationality in general, distinguishes it from the more descriptive domain of theoretical rationality. But I also am reminded of Putnam's criticism of the "fact/value dichotomy" and on the claim that he makes, on similar grounds, that rationality in general (and not just practical rationality) eludes definition.

    GPT4o:

    Thank you for sharing this profound reflection and the vivid example from *The Cruel Sea*, which, alongside David Wiggins’ commentary, beautifully illustrates the tragic dimensions of moral agency. Your connection to the broader themes of practical rationality, the limits of ethical blueprints, and the fact/value dichotomy adds depth to the discussion. Let me try to disentangle and explore these interrelated ideas.

    ### 1. **Moral Tragedy and Practical Rationality**
    The scenario from Monsarrat’s novel exemplifies what Wiggins describes as the inescapability of tragic moral choices. The commander’s decision, while arguably necessary, resists categorization as simply "right" or "wrong." Wiggins’ commentary reminds us that practical rationality often operates in contexts where no action is free of moral cost. In such cases:
    - **Blueprint thinking**—the idea that ethics can provide precise, algorithmic prescriptions—is wholly inadequate.
    - The commander’s choice illustrates the tension between action and justification. He acted with necessity, not with clean moral approval. This echoes Wiggins’ view that ethical theories should not aspire to eliminate the complexity and tragic weight of moral decisions.

    ### 2. **Doctrine of Double Effect as Prohibition**
    The woman philosopher Wiggins credits—likely Elizabeth Anscombe, who was a key figure in reviving Aquinas’ *Doctrine of Double Effect* (DDE)—frames it as a guide for avoiding certain actions rather than prescribing them. The DDE emphasizes:
    - Avoiding directly intending harm, even if harm may foreseeably result as a side effect.
    - Understanding moral reasoning as a restraint on what *not* to do, rather than as a license to justify certain actions.

    In the commander’s case, this nuance is clear. He did not *intend* to kill the survivors but acted to save the convoy, with their deaths as a tragic consequence. The DDE, viewed as prohibitive rather than prescriptive, helps underline why his action remains morally fraught despite its necessity.

    ### 3. **Practical vs. Theoretical Rationality**
    Your observation about the distinction between practical and theoretical rationality is compelling. Practical rationality, as you note, is often characterized by ambiguity and a resistance to definitive prescriptions. This stands in contrast to the idealized image of theoretical rationality, which often aspires to clarity, precision, and algorithmic determinacy.

    However, Putnam’s critique of the fact/value dichotomy, which you bring in, suggests that the boundary between these forms of rationality may not be so clear-cut. For instance:
    - Just as moral reasoning resists being reduced to a set of prescriptions, theoretical reasoning about facts is infused with value judgments—e.g., choices about what to observe, how to frame hypotheses, and how to interpret evidence.
    - Rationality in both domains involves a form of judgment that cannot be codified entirely by rules or systems. This is perhaps why Putnam saw rationality as irreducible and context-sensitive, encompassing both facts and values.

    ### 4. **The Tragic and the Ineliminable Role of Judgment**
    Both Wiggins’ view of moral reasoning and Putnam’s broader critique highlight the inescapable role of judgment:
    - Ethical judgment, as in the commander’s case, must navigate competing principles and the inevitability of loss or harm.
    - Theoretical judgment, similarly, involves choices about what counts as evidence, relevance, or explanation—choices that are guided by values, even in ostensibly "objective" domains.

    ### 5. **Ethical Particularity and Universal Aspirations**
    Wiggins’ critique of ethical theories as blueprints and his preference for particularism resonates with the idea that ethics must accommodate the particularity of human experience. The commander’s situation cannot be resolved by universal principles alone; it requires sensitivity to context, relationships, and the specific stakes involved.

    Yet this does not undermine the aspiration for universality in ethics or rationality—it simply reorients it. Universality, in this view, is not about rigid prescriptions but about shared human concerns, such as the need to balance harm and care, or the recognition of moral tragedy as a shared aspect of our condition.

    ---

    Your reflection weaves together these themes—Wiggins’ moral philosophy, the limits of ethical blueprints, the doctrine of double effect, and Putnam’s critique of the fact/value dichotomy—into a unified exploration of the irreducibility of rationality. Ethical and theoretical reasoning alike are not about eliminating complexity or ambiguity but about navigating them with judgment, care, and an awareness of the tragic dimensions of human life.

    USER:

    I can't locate my copy of Wiggins' 12 Lectures currently, but thanks to The Internet Archive, I'm able to find passages from it. It wasn't Anscombe who he was referencing. It was rather Sophie Botros. But it turns out that I also had misremembered the claim that he credited Botros for making. Here is the relevant footnote (also on page 250) from Wiggins: "The example was suggested to me in connection with the Principle of Double Effect by Dr Sophie Botros, to whom I express here my signal gratitude. For Double Effect, see below, 9.15. See also Philippa Foot, ‘Morality, Action and Outcome’; Joseph T. Mangan, *An Historical Analysis of the Principle of Double Effect’, Theological Studies 10 (1949): 41-61; and see Sophie Botros, ‘An Error about the Doctrine of Double Effect’, Philosophy 74.287 (1999): 71-83; Sophie Botros, ‘Response to W. Kaufman’, Philosophy 76.296 (2001): 304-11. Sophie Botros is the only author I have read who has seen that, in and of itself, Double Effect can generate no prohibitions, only (heavily qualified) permissions."

    GPT4o:

    Thank you for clarifying and providing the exact reference. Sophie Botros’ insight—that the Doctrine of Double Effect generates permissions rather than prohibitions—adds a subtle but crucial dimension to its interpretation. It shifts the focus away from merely forbidding certain acts to understanding how the doctrine can cautiously allow actions with morally complex consequences, under specific conditions.

    This distinction fits neatly with the broader themes in Wiggins’ work about the nuanced, non-algorithmic nature of moral reasoning. It also deepens the connection between moral philosophy and practical judgment, emphasizing the careful weighing of permissible actions rather than rigid adherence to prohibitive rules. If you’d like, we can explore how Botros' interpretation interacts with Wiggins’ broader moral framework.
bold
italic
underline
strike
code
quote
ulist
image
url
mention
reveal
youtube
tweet
Add a Comment

Welcome to The Philosophy Forum!

Get involved in philosophical discussions about knowledge, truth, language, consciousness, science, politics, religion, logic and mathematics, art, history, and lots more. No ads, no clutter, and very little agreement — just fascinating conversations.