

rest were hallucinations
I’m having trouble parsing whatcha mean here if they were coding tasks. The code didn’t run? Ran but had 0 functionality? If they were non-coding tasks, then agreed, I didn’t notice it being significantly more accurate. Though I did appreciate the larger vocab. I wasn’t gonna be able to afford to keep using it once it went to API pricing anyway.
I was also confused. It’s oddly worded. From Anthropic’s news post:
As far as I can tell, this review preceded the gov order, and it was at this point that Katie Moussouris was involved.