• General_Effort@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    3 months ago

    Neural nets, including LLMs, have almost nothing to do with statistics. There are many different methods in Machine Learning. Many of them are applied statistics, but neural nets are not. If you have any ideas about how statistics are at the bottom of LLMs, you are probably thinking about some other ML technique. One that has nothing to do with LLMs.

      • General_Effort@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        3 months ago

        That’s where the almost comes in. Unfortunately, there are many traps for the unwary stochastic parrot.

        Training a neural net can be seen as a generalized regression analysis. But that’s not where it comes from. Inspiration comes mainly from biology, and also from physics. It’s not a result of developing better statistics. Training algorithms, like Backprop, were developed for the purpose. It’s not something that the pioneers could look up in a stats textbook. This is why the terminology is different. Where the same terms are used, they don’t mean quite the same thing, unfortunately.

        Many developments crucial for LLMs have no counterpart in statistics, like fine-tuning, RLHF, or self-attention. Conversely, what you typically want from a regression - such as neatly interpretable parameters with error bars - is conspicuously absent in ANNs.

        Any ideas you have formed about LLMs, based on the understanding that they are just statistics, are very likely wrong.

        • Kethal@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          3 months ago

          “such as neatly interpretable parameters”

          Hahaha, hahahahahaha.

          Hahahahaha.

    • ZILtoid1991@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      3 months ago

      Software developer here, the more I learn about neural networks, the more they seem like very convoluted statistics. They also just a simplified form of neurons, and thus I advise against overhumanization, even if they’re called “neurons” and/or Alex.