Ok speed (202.7 tok/s) and value (1.25 -> 2.50) look great, with pretty decent i...

pzo · 2026-05-01T09:18:36 1777627116

The problem with speed is that they are usually very fast for the first few weeks and then suddenly much slower. They pulled the same trick when they advertised Grok 4 Fast (dropped from 200 tps to 60 tps).

polski-g · 2026-05-01T11:38:44 1777635524

Grok 4.1 is still 110tps. The only other model that comes close is Gemini at 85tps.

victorbjorklund · 2026-05-01T09:28:50 1777627730

Wow. That is a big drop.

Cakez0r · 2026-05-01T11:28:08 1777634888

202.7 tok/s is only OK speed? Which providers are you using that are significantly better than that?

mritchie712 · 2026-05-01T11:50:13 1777636213

for reference, it's the 2nd fastest model tracked in the "Highlights" section of https://artificialanalysis.ai/

Cakez0r · 2026-05-01T11:55:09 1777636509

Yes, it's incredibly fast. OpenRouter is clocking 60 tokens per second, which is on par with the likes of Sonnet, Opus, GPT 5.5.

goldenarm · 2026-05-01T12:03:33 1777637013

That section misses Cerebras and Groq which are up to 5x faster.

Havoc · 2026-05-01T12:23:31 1777638211

Very different tech and limitations though so wouldn’t make sense to compare 1:1 I think

goldenarm · 2026-05-01T12:56:34 1777640194

What are the limitations?

gslepak · 2026-05-01T13:29:51 1777642191

Much smaller context

mythz · 2026-05-01T13:28:57 1777642137

I said speed was great. Cerebras and Groq can provide better performance, as can the Fast versions of Cursor's Composer and Claude.

The reported speed, like benchmarks, is only a number on paper. We'll see how it holds up in real-world usage; so far OpenRouter is only reporting 73 tps [1].

[1] https://openrouter.ai/x-ai/grok-4.3

lukewarm707 · 2026-05-01T14:03:26 1777644206

i really don't trust openrouter numbers.

i use byok and see responses fail on openrouter while they work perfectly at the provider. the provider is often listed as 'down' and it's very clearly up on the original api and serving requests.

cerebras quotes oss 120b at 3000tps and it is under 800 on openrouter.

same with fireworks, i am getting much higher numbers not on openrouter. but recently i think fireworks deepseek is kind of spotty, the main provider i know that just doesn't go down is vertex and they charge 2-3x the rest

XCSme · 2026-05-01T16:57:29 1777654649

Their stats look ok, but when I tested it[0], it was 4x slower than 4.20.

[0]: https://aibenchy.com/compare/x-ai-grok-4-20-medium/x-ai-grok...

energy123 · 2026-05-01T14:07:53 1777644473

Value should be calculated some other way, like cost per task completion or something.
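
A rough sketch of what that could look like, assuming the "1.25 -> 2.50" above means dollars per million input and output tokens. The token counts and success rate below are made-up illustration values, not measurements, and the function names are hypothetical:

```python
def cost_per_attempt(input_tokens, output_tokens, in_price_per_m, out_price_per_m):
    """Dollar cost of one attempt, given per-million-token prices."""
    return (input_tokens * in_price_per_m + output_tokens * out_price_per_m) / 1_000_000


def cost_per_completed_task(attempt_cost, success_rate):
    """Expected dollars spent per successful task, retrying until success.

    With independent attempts the expected number of tries is 1 / success_rate.
    """
    if not 0 < success_rate <= 1:
        raise ValueError("success_rate must be in (0, 1]")
    return attempt_cost / success_rate


# Hypothetical task: 20k input tokens, 4k output tokens, 70% success rate.
attempt = cost_per_attempt(20_000, 4_000, in_price_per_m=1.25, out_price_per_m=2.50)
per_task = cost_per_completed_task(attempt, success_rate=0.7)
print(f"per attempt: ${attempt:.4f}, per completed task: ${per_task:.4f}")
```

Under this measure a cheaper model that fails more often, or burns more tokens per attempt, can end up more expensive per finished task than a pricier one.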

catcowcostume · 2026-05-01T09:32:54 1777627974

[flagged]

kuboble · 2026-05-01T09:51:44 1777629104

I don't remember the source of the quote.

But debating whether the models are intelligent is similar to debating whether a car can walk.

You can offload to the model a lot of work that until recently we thought required intelligence. The more of those tasks the model can do, and the better it does them, the fairer it is to call it intelligence*

NitpickLawyer · 2026-05-01T10:15:28 1777630528

"The question of whether a computer can think is no more interesting than the question of whether a submarine can swim." - Edsger Dijkstra

MrDrDr · 2026-05-01T09:34:52 1777628092

Please elaborate.

IshKebab · 2026-05-01T10:29:53 1777631393

Some people have this strange idea that only "whatever humans do" counts as intelligence, despite the fact that a) we don't really have a clue what humans do, and b) "intelligence" is definitely not that strictly defined.

I think they're just trying to feel like they know some important truth that other people don't.

MrDrDr · 2026-05-02T17:30:43 1777743043

Agreed. I see this debate as an active discussion as to what intelligence is, not how it's currently (poorly) defined. This is a philosophical discussion, and there is no correct answer, but IMO some answers will prove more useful than others. I would like to define intelligence as the ability to solve problems. Lots of other life forms have this ability, and it's clear that LLMs also have it. Now, while they may not be poetic (in the literal sense of the word), or conscious, in that 'they' do not experience the world, I think there is a strong case for arguing they conform to a meaningful definition of intelligence. They solve problems.

nesk_ · 2026-05-01T09:43:50 1777628630

Prediction is not intelligence.

mirekrusin · 2026-05-01T09:54:21 1777629261

Misprediction is?

exe34 · 2026-05-01T09:44:24 1777628664

What does intelligence mean to you?