You don’t need 170+ GB of VRAM. Whole model can be run at around 1 token/second on a modern hardware from an ssd. Which is slow, don’t get me wrong, but it still somewhat useable.
Upd.Once again, for those who use AI because struggles to read: it is slow, but it is usable. Which is, by definition, means that you don’t need 170+ GB of VRAM to run this model. Period. It runs from ssd. That is a fact.
You don’t need 170+ GB of VRAM. Whole model can be run at around 1 token/second on a modern hardware from an ssd. Which is slow, don’t get me wrong, but it still somewhat useable.
Upd.Once again, for those who use AI because struggles to read: it is slow, but it is usable. Which is, by definition, means that you don’t need 170+ GB of VRAM to run this model. Period. It runs from ssd. That is a fact.