No, but the alternative in some places is no reasoning model at all. It's like phones: nobody wants an old cell phone, or a new phone with an old chip, but often that's all that's affordable in some places.
If we can get something working, then improving it will come.
You can have questions that are not urgent. It's like Cursor: I'm fine with the slow version up to a certain point. I launch the request, then I alt-tab to something else.
Yes, it's slower, but for free (or cheap) that's acceptable.
Only interactive use cases need high tps. If you just want a process running somewhere ingesting and synthesizing data, it’s fine. It’s done when it’s done.
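
To put rough numbers on that, here is a back-of-the-envelope sketch. The token count, the tps figure, and the "patience" thresholds are all made-up assumptions for illustration, not measurements of any real model or device:

```python
def estimated_seconds(total_tokens: int, tokens_per_second: float) -> float:
    """Wall-clock time to generate total_tokens at a steady tokens_per_second."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return total_tokens / tokens_per_second


def tolerable(total_tokens: int, tokens_per_second: float, patience_seconds: float) -> bool:
    """True if the job finishes within how long the user is willing to wait."""
    return estimated_seconds(total_tokens, tokens_per_second) <= patience_seconds


# Hypothetical numbers: a 2000-token answer on slow hardware doing 5 tokens/s.
wait = estimated_seconds(2000, 5.0)            # 400 seconds
interactive_ok = tolerable(2000, 5.0, 30)      # someone staring at the screen
background_ok = tolerable(2000, 5.0, 3600)     # a job you check on within the hour
print(wait, interactive_ok, background_ok)
```

Same speed, same output, and whether it's usable depends entirely on whether someone is sitting there waiting for it.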