• freagle@lemmygrad.ml
    6 days ago

    I do think there’s something uniquely wasteful about floating point arithmetic, which is why we need specialized processors for it, and there is something uniquely wasteful about crypto and LLMs, both in terms of electricity and in terms of waste heat. I agree that generative AI for solving problems is definitely better than crypto, and it’s better than using generative AI to produce creative works, do advertising and marketing, etc.

    But it’s not without its externalities, and putting that in an unmonitored iterative loop at scale requires us to at least consider the costs.

    • ☆ Yσɠƚԋσʂ ☆@lemmygrad.mlOP
      6 days ago

      Eventually we will most likely see specialized chips for this, and there are already analog chips being produced for neural networks which are a far better fit. There are selection pressures to improve this tech even under capitalism, since companies running models end up paying for the power usage. And then we have open source models with people optimizing them to run things locally. Personally, I find it mind blowing that we can already run local models on a laptop that perform roughly as well as models that required a whole data centre to run just a year ago. It’s hard to say when all the low-hanging fruit will be picked and improvements will start to plateau, but so far it’s been really impressive to watch.

      • freagle@lemmygrad.ml
        6 days ago

        Yeah, there is something to be said for changing the hardware. Producing the models is still expensive even if running the models is becoming more efficient. But DeepSeek shows us even production is becoming more efficient.

        What’s impressive to me is how useful the concept of the stochastic parrot is turning out to be. It doesn’t seem to make a lot of sense, at first or even second glance, that choosing the most probable next word in a sentence based on the statistical distribution of word usages across a training set would actually be all that useful.

        I’ve used it for coding before and it’s obvious that these things are most useful at reproducing code tutorials or code examples and not at all for reasoning, but there are a lot of code examples and tutorials out there that I haven’t read yet and never will read. The ability of a stochastic parrot to reproduce that code using human language as its control input is impressive.

        • ☆ Yσɠƚԋσʂ ☆@lemmygrad.mlOP
          6 days ago

          I’ve been amazed by this idea ever since I learned about Markov chains, and arguably LLMs aren’t fundamentally different in nature. It’s simply a huge token space encoded in a multidimensional matrix, but the fundamental idea is the same. It’s really interesting how you start getting emergent properties when you scale something conceptually simple up. It might say something about the nature of our own cognition as well.
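
To make the Markov chain idea concrete, here is a minimal sketch of a word-level chain. The toy corpus and the function names (`build_chain`, `most_likely_next`, `generate`) are made up for illustration, and this is nowhere near how an LLM is actually built. It only shows the "pick the next word from observed follower counts" idea at its smallest scale.

```python
import random
from collections import Counter, defaultdict


def build_chain(text):
    """Count, for each word, which words follow it in the text."""
    words = text.split()
    chain = defaultdict(Counter)
    for current, following in zip(words, words[1:]):
        chain[current][following] += 1
    return chain


def most_likely_next(chain, word):
    """Greedy choice: the most frequent follower, or None if the word has none."""
    followers = chain.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


def generate(chain, start, length=8, seed=0):
    """Walk the chain, sampling each next word in proportion to its count."""
    rng = random.Random(seed)
    out = [start]
    for _ in range(length - 1):
        followers = chain.get(out[-1])
        if not followers:
            break  # dead end: this word was never followed by anything
        candidates = list(followers.keys())
        weights = list(followers.values())
        out.append(rng.choices(candidates, weights=weights, k=1)[0])
    return " ".join(out)


# Hypothetical toy corpus, chosen only for illustration.
corpus = "the cat sat on the mat and the cat ate the fish"
chain = build_chain(corpus)

print(most_likely_next(chain, "the"))  # cat ("the" is followed by "cat" twice)
print(generate(chain, "the", length=6))
```

The chain here only looks at one previous word. An LLM conditions on a much longer context and learns its probabilities rather than counting them, but the generation loop has the same shape: look at what came before, pick a next token from a distribution, repeat.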

          • darkernations@lemmygrad.ml
            6 days ago

            You mentioned Markov chains. For a layman when it comes to mathematics (someone who would need to brush up on basic calculus), would you know any good books (I was thinking textbooks?) or resources for learning the maths, with a view to better understanding LLMs/GenAI later down the line?

            • ☆ Yσɠƚԋσʂ ☆@lemmygrad.mlOP
              6 days ago

              A few books that are fairly accessible depending on your math level.

              Basic Math for AI is written for people with no prior AI or advanced math knowledge. It aims to demystify the essential mathematics needed for AI, and gives a broad beginner-friendly introduction.

              https://www.goodreads.com/book/show/214340546-basic-math-for-ai

              Mathematics for Machine Learning is a bit more academic than Hinton’s book, and it covers linear algebra, vector calculus, probability, and optimization, which are the pillars of LLM math.

              https://www.goodreads.com/book/show/50419441-mathematics-for-machine-learning

              Naked Statistics: Stripping the Dread from the Data is phenomenal for building an intuitive understanding of probability and statistics, which are often the most intimidating subjects for beginners.

              https://www.goodreads.com/book/show/17986418-naked-statistics

                  • darkernations@lemmygrad.ml
                    5 days ago

                    I edited my original comment to leave out part of why I posted it, but there’s no point in self-censorship, as the thread has devolved into tone policing and strawmanning; that is just further evidence of how privileged Westerners and those who benefit from Western hegemony are.

                    AI under capitalism is coming for all our jobs through automation, as multiple learned Marxists predicted. But instead of taking the scientific socialist approach of understanding why (i.e. the cause is capitalism, not the technology itself, and we should use the technology to advance socialism for the betterment of humanity), there are bad-faith reactionary Nietzschean takes and so much self-importance about why allowance should be made for their hypocritical sensibilities, and about why they should be considered part of the vanguard despite wilful ignorance, when I suspect their class consciousness on the world stage is woefully inadequate for people who call themselves Marxists.

                    Well done for continuing to post despite the spiteful ignorance.

                    (I personally asked about the maths because AI is coming for my job too and I want to better understand the technology, and I feel that, for me, this is a more serious way to do it. Marxism is a science and we should not be burying our heads in the sand. Thanks again. Also, the Dialectical Dispatches blog is an interesting read too.)