• brianpeiris@lemmy.ca
    link
    fedilink
    English
    arrow-up
    0
    ·
    4 days ago

    The word “intelligence” is doing a lot of heavy lifting here. LLMs lack any mechanism for true logical reasoning, and they always will by nature. This is why they fail at simple questions like “the car wash test”. It’s also why agents are expensive; They just flail around in token hungry “reasoning loops” until they happen to come across a correct solution. And it’s why Claude Opus 4.8 (High) only scores 1.5% on the ARC-AGI-3 benchmark at a cost of $10,000.

    This kind of panic is just part of the hype. Wake me up when real intelligence arrives.

    • architect@thelemmy.club
      link
      fedilink
      English
      arrow-up
      0
      ·
      3 days ago

      Computers are not actually intelligent. What are they but big calculators? They are removing jobs and barely doing anything better! They take up a whole room! No one needs that!

      The internet is hardly worth anything. It’s nothing but scams and porn! You can’t believe anything you read on it! It’s overhyped nonsense! No one is going to use that!

      Meanwhile in 2026: LLMs have no logical reasoning and that will never change! They can’t answer simple questions! They just flail around and produce slop!

      You guys don’t see it? Really?

      • brianpeiris@lemmy.ca
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 days ago

        I’m confident enough about this that I’ve registered a prediction on Long Bets.

        “No LLM-based AI will surpass 70% on the ARC-AGI-3 leaderboard, with a cost of $1000 or less, before June 2028.” - https://longbets.org/973/

        I’m curious if you’d really disagree with the premise, and would you (or anyone here on Lemmy) be willing to put money down to challenge the bet? (Long Bets always donates any winnings to a registered non-profit of the winner’s choice, though it’s a $200 minimum).

        Are you saying that LLMs can currently reason? How do you explain their low score on ARC-AGI-3? Do you think Transformer LLM architectures will be capable of reasoning within the next two years without some new breakthrough? What mechanism in the architecture allows them to reason?

    • LesserAbe@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      3 days ago

      It doesn’t need to be our new cyber god for it to matter. All the software developers I know use AI as a tool. One person I know who heads a department at a publicly traded company you’ve probably heard of said they’re not hiring more junior developers, and is worried about his long term career outlook.

      That’s concerning right? Capitalists who have a choice between hiring ten people or hiring one person who supervises an AI doing the work of nine are going to choose the cheaper option, and pocket the difference.

      To say that LLMs are not useful to me indicates a lack of familiarity. I’m not taking about bullshit “write this email for me” type stuff, I mean like “write this web app for me” “find this type of document in this massive trove of documents” “troubleshoot this technical issue”. And it can do it.

      LLMs are not replacing humans, but they are reducing labor required for many tasks. Who receives the benefit from that reduced labor? Right now, owners. That’s why I support open source AI or preferably democratic-entity-only open source AI.

      • brianpeiris@lemmy.ca
        link
        fedilink
        English
        arrow-up
        1
        ·
        2 days ago

        Companies are only shooting themselves in the foot in the long term if they stop hiring junior engineers, and most of that work is not being replaced, it’s being shifted to the senior engineers who now have to babysit AIs that can’t actually do the job for any extended period of time. If you’re accepting AI code into a codebase without thorough review, then you’re also shooting yourself in the foot in the long term, because even the senior engineers won’t know the codebase after a while. If you’re doing thorough reviews in order to catch the AI bugs, well then you’re probably better off coding it yourself correctly in the first place, unless you’ve already allowed your skills to atrophy.

        Do you really think AIs are reasoning when you ask them to troubleshoot technical issues? You may be lucky if the issue is already in their training data, but anything even slightly novel, and the AI is just going to bullshit an answer, and I guess you’re going to follow it blindly, since you don’t know enough to come up with an answer yourself.

        Besides all that, how is open source AI going to stop junior developers from losing their jobs?