• dantheclamman@lemmy.world
    link
    fedilink
    English
    arrow-up
    33
    ·
    edit-2
    1 day ago

    I expect that many companies will realize that an LLM is a tool that can help with certain tasks, but cannot fully replace most workers because there is a ton of context in people’s jobs that cannot be condensed into language. Similarly, many companies are now realizing with token-based pricing that frontier LLMs are not cost effective in many applications. You shouldn’t have a natural gas fired data center in Tennessee running a model to proofread your emails. You can have a local model do so MUCH more cheaply. This will leave the speculative data center companies holding the bag on a lot of hardware and capacity that is not actually needed.

    • dil@piefed.zip
      link
      fedilink
      English
      arrow-up
      2
      ·
      edit-2
      20 hours ago

      When training ai, I was always confused that they never gave a sht about tokens, like that was never something you picked any model over another for or reviewed, it often says to ignore the length but make note of how long it took, tbh that could be because of reasoning text not shown to the user

      • dantheclamman@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        19 hours ago

        That’s interesting to hear. I wish efficiency had been considered from the start. It seems like there has been a ton of waste. They should only do the calculations needed to achieve the user objective. But they were prioritizing market share over efficiency. Maybe they thought they could afford another couple years of subsidizing wasteful use to build market share, but it hasn’t turned out that way.

        • dil@piefed.zip
          link
          fedilink
          English
          arrow-up
          2
          ·
          2 hours ago

          The only time character or word count was even looked at was when it was specified in the prompt. Like if someone asked for 900-1000 words you would check to see if it fit as one of your tasks which was odd since I literally used word counter sites making that process something that could be easily automated.