You can use it all day and stay well below the quota. Small context, with the right model for the job. Surgical precision.
But… At some point you shut off your brain, use the most expensive model on the highest reasoning level with your whole codebase as context and just wait for tens of minutes while it burns all the tokens. To speed this up you then send six agents to tackle the same problem from all angles.
You can use it all day and stay well below the quota. Small context, with the right model for the job. Surgical precision.
But… At some point you shut off your brain, use the most expensive model on the highest reasoning level with your whole codebase as context and just wait for tens of minutes while it burns all the tokens. To speed this up you then send six agents to tackle the same problem from all angles.