kajecounterhack · 2026-06-04T23:40:46 1780616446

Isn't this just solved by better student teacher ratios, which you could totally have in public schools if they were funded better and societally we valued teachers more?

What are private schools doing that you couldn't implement in public schools with adequate political will and money?

programjames · 2026-06-05T00:41:58 1780620118

Your question is easily resolved by looking up how much American schools are funded, compared to historical funding, other countries' funding, and their relative successes.

NoMoreNicksLeft · 2026-06-05T15:40:39 1780674039

Outcomes aren't any better with lower ratios. The best-funded public schools have higher funding than anywhere else in the world and still have poor outcomes... it's not a funding problem. And it's difficult to "value teachers" when we learn of these outcomes; it runs counter to human nature.

Private schools (excepting the truly elite 0.01%, meant for the children of billionaires and statesmen) are nothing more than public schools dressed up in $20,000/yr tuition so that the upper middle class can feel special. They draw personnel from the same pool of teachers, and they use the same textbooks and pedagogy. They are essentially public schools with a new label. But that you think I might be talking about private schools shows how you can't even really think about alternatives. You don't have the mental language to do so.

kajecounterhack · 2026-06-04T04:19:29 1780546769

> I felt frustrated that the professors didn't ever teach. They had slides. They read off slides, verbatim. They explained things sometimes if you asked them, but most often in a very elitist and condescending tone

+10000. The goddamn slides. If I were a student now going to engineering school, I'd basically take the slides and throw them into NotebookLM and get way better lectures. Then I'd ask claude or GPT all my hard questions. Hell, I'd get the PDF version of my textbooks and do the same.

The number of lectures actually worthy of your time was so low.

beej71 · 2026-06-04T04:55:33 1780548933

I try to lecture as little as possible. No slides. Quick highlights discussion of the reading, maybe a coding demo, and then students work on coding challenges in class, in groups if they want. I circulate and help out. I'm lucky to have small class sizes at this university. I couldn't pull it off in a class of 300.

kajecounterhack · 2026-06-04T23:47:09 1780616829

Wow Beej it's you!! I loved your guide to network programming in undergrad <3 you're probably not part of the problem here, lol.

a96 · 2026-06-08T09:34:56 1780911296

Reading this I was just thinking I've seen that nick/name somewhere.

Then I noticed several bookmarks with the same name staring at me from the browser sidebar.

kajecounterhack · 2026-06-03T21:48:11 1780523291

Have you found Gemma 4 31B better than Qwen 3.6 27B Q8? I just started using Qwen + Pi agent and it's great, but "which model works best" is still totally crowdsourced and I was going off of peoples' opinions on reddit. Would love to hear more opinions if people have them.

embedding-shape · 2026-06-03T22:37:50 1780526270

> Have you found Gemma 4 31B better than Qwen 3.6 27B Q8?

Which quant of Gemma? For coding, Qwen seems to be pretty far ahead. Gemma generally seems to have a "vaster" set of knowledge, but armed with a search tool that doesn't really matter, and Qwen 3.6 has been really great for all sorts of tool calling. I mostly do programming and related things though, fwiw.

> I was going off of peoples' opinions on reddit

It's extremely astroturfed all over the place, especially the larger subreddits, and especially the one related to a specific animal in a specific location. It's sad, as early on it was a great resource, but now it's mostly paid posts and a race to the bottom, with lots of piling, and all the knowledgeable people I used to recognize are nowhere to be found.

xenophonf · 2026-06-03T23:06:13 1780527973

It took me way too long to realize you were referring to r/localllama.

MoonWalk · 2026-06-03T23:41:03 1780530063

Why the obfuscation in the first place?

embedding-shape · 2026-06-04T10:24:11 1780568651

Just a bit of flair. Also, a bunch of people have "keyword watchers" set up for various terms, so when you mention certain things on HN, reddit and elsewhere, you get commenters who enter the conversation not because of the context or larger conversation, but because the single term/thing they care deeply about was mentioned, and it just gets very boring to read the whole attackers/defenders comments over and over again. But ultimately I just did it like that because it was more fun to write it like that.

MoonWalk · 2026-06-08T16:56:52 1780937812

But it renders the comment baffling to those who have never heard of that forum. I'm on here and Reddit quite a bit, and never heard of it.

zozbot234 · 2026-06-04T00:10:01 1780531801

I'm not sure that GP is correct, many people in that forum tend to hate Qwen for closing up many of their more recent models and leaving the whole local inference community 'stranded' on their older releases.

julianlam · 2026-06-04T04:34:43 1780547683

Are you sure? Prior to today the sub seems to be pretty partial to Qwen.

kajecounterhack · 2026-06-04T04:08:40 1780546120

That was definitely not the subreddit where I got my info.

thangalin · 2026-06-03T22:35:55 1780526155

Yes. I'm using Gemma-4 31B (gemma-4-31B-it-assistant.Q4_K_M.gguf) with llama.cpp to attribute quotations throughout chapters of my sci-fi novel. I started with Qwen3, but couldn't get it to work. Qwen3 TTS Voice Design, on the other hand, is incredible (Qwen3-TTS-12Hz-1.7B-VoiceDesign). I'm using both for an audiobook generator that produces a variety of voices.

Screens:

* https://i.ibb.co/TBBV5nJk/kl-01.png (voice design)

* https://i.ibb.co/nNvvKDyV/kl-02.png (quotation attributions)

khimaros · 2026-06-04T17:55:41 1780595741

building something similar: https://github.com/khimaros/autiobook

qingcharles · 2026-06-04T06:36:29 1780554989

Gemma 4 31B is enormously impressive. You get 1000 requests/day for free on Google's API and another 1000/day off OpenRouter. Only problem is you get 503 like crazy.

kajecounterhack · 2026-05-18T20:48:31 1779137311

You should probably disclose that you're the author of swival.dev, but nice project :)

kajecounterhack · 2026-03-18T20:55:09 1773867309

+10000 that Azure is a steaming pile of shit. Like what's this -- `azcopy` is broken at head, and the working one doesn't guarantee correctness after a copy (99.6% copied successfully! good luck figuring out what went wrong!). Compare that to migrating data with GCS or S3 -- they provide first class tools that do it right quickly (aws-cli, gsutil).

Want a VM? You'll also need this network security group, network interface, network manager, ip, virtual network... and maybe it'll be connected to the internet so you can SSH in? Compare to GCP or EC2 -- you just pick an instance and start it. You can SSH in directly, or even do it in the browser.

Billing is also a nightmare: if you're running a startup, AWS and Google make it relatively easy to see how many credits you have left. The Azure dashboard makes you navigate a maze, and the button to click that says "Azure Credits" is _invisible_ for 30s until ostensibly some backend system finds your credits, then it magically shows up. Most people don't wait around and just assume there's no button.

And if you click it, maybe you will happen to be in the correct billing profile, maybe not! Don't get confused: billing profile and billing scope are different concepts too! And in your invoice, costs just magically get deducted, until they don't. No mention of any credits. Credits inaccessible through API (claude tried everything).

VMs, bucket storage, and copying data are the _simplest_ parts of the stack. Why would anyone bother trying to use other services if they can't get these right?

They literally give startups 2x the credits of GCP and 20x the credits of AWS, and nobody wants to use them.

jiggawatts · 2026-03-18T21:11:40 1773868300

Azcopy is especially bad: the team that looks after it is made up entirely of junior developers who obstinately refuse to listen to feedback.

Its documentation title is "Copy or move data to Azure Storage by using AzCopy v10" but it can’t actually do trivial operations like “move” because the devs are too scared to write code that deletes files: https://github.com/Azure/azure-storage-azcopy/issues/1650#is...

I recommend switching to “rclone” instead to avoid the frustration. It won't fill your entire system disk up with unnecessary log files unlike azcopy, which is a significant source of production server outages where I work because of this default behaviour.

kajecounterhack · 2026-01-30T22:49:02 1769813342

It has utility though: unlike the dollars in your mattress, it can't be printed into oblivion by your central bank. It is relatively portable, and people have flocked to it as a store of value especially during periods of socioeconomic instability when assets are going down and gov't spending is going up. It's tradeable for fiat in any country, so it allows you to bring value along if you relocate.

Its price reflects that utility and like any modern asset, a lot of speculation. You can speculate on whether it's more or less useful given current events -- nothing wrong with speculating that it is only going to be increasingly useful.

mapontosevenths · 2026-01-30T23:04:33 1769814273

You're right that it has utility, but being fungible doesn't imply that it is automatically an investment.

Speculation is not the same as investment, and it is still completely non-productive.

kajecounterhack · 2026-01-30T23:21:56 1769815316

Agree it doesn't generate wealth. It's explicitly a store of wealth.

Investment is a weird term because most people would consider keeping cash or cash equivalents (gold) to be investments, even if they don't generate wealth. Cash is also an opinion, in terms of the market.

michaelmrose · 2026-01-30T23:46:58 1769816818

An investment creates a return

fuzzfactor · 2026-01-31T02:59:42 1769828382

Roger, sometimes positive, sometimes negative.

kajecounterhack · 2025-09-12T22:28:01 1757716081

They are used in thin-film solar panel development. Not sure anyone has cracked the big problem with them, which is durability.

kajecounterhack · 2025-08-16T09:00:29 1755334829

I tried mapping back to closest token embeddings. Here's what I got:

    global_step = 1377; phase = continuous; lr = 5.00e-03; average_loss = 0.609497
    current tokens: ' Superman' '$MESS' '.");' '(sentence' '");' '.titleLabel' ' Republican' '?-'

    global_step = 1956; phase = continuous; lr = 5.00e-03; average_loss = 0.589661
    current tokens: ' Superman' 'marginLeft' 'iers' '.sensor' '";' '_one' '677' '».'

    global_step = 2468; phase = continuous; lr = 5.00e-03; average_loss = 0.027065
    current tokens: ' cited' '*>(' ' narrative' '_toggle' 'founder' '(V' '(len' ' pione'

    global_step = 4871; phase = continuous; lr = 5.00e-03; average_loss = 0.022909
    current tokens: ' bgcolor' '*>(' ' nomin' 'ust' ' She' 'NW' '(len' ' pione'

"Republican?" was kind of interesting! But most of the strings were unintelligible.

This was for classifying sentiment on yelp review polarity.
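
For readers unfamiliar with the "map back to the closest token" step, here is a minimal sketch of one way it could be done. It assumes cosine similarity as the distance measure and uses a toy three-entry vocabulary; the function names and the vectors are hypothetical, not the code that produced the log above.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def nearest_tokens(prompt_vectors, vocab_embeddings):
    """For each learned prompt vector, return the vocabulary token
    whose embedding has the highest cosine similarity to it."""
    return [
        max(vocab_embeddings, key=lambda tok: cosine(vec, vocab_embeddings[tok]))
        for vec in prompt_vectors
    ]

# Toy vocabulary (made-up 3-d embeddings, for illustration only).
vocab = {
    " Superman": [1.0, 0.0, 0.0],
    " cited": [0.0, 1.0, 0.0],
    "(len": [0.0, 0.0, 1.0],
}
# Hypothetical learned prompt vectors that sit between real tokens.
learned = [[0.9, 0.2, 0.1], [0.1, 0.1, 0.8]]

print(nearest_tokens(learned, vocab))
```

Because the learned vectors are free to drift anywhere in embedding space, the nearest token can be a poor description of what the vector actually encodes, which may be why most of the decoded strings look unintelligible.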

DoctorOetker · 2025-08-16T22:18:18 1755382698

During the prompt embedding optimization, the embeddings are allowed to take on any vector in embedding space. Instead, one could use a continuous penalty for superposing tokens:

Consider one of the embedding vectors in the input tensor: nothing guarantees it's exactly on, or close to, a specific token. Hence the probabilities with respect to each token form a distribution. Ideally that distribution should be one-hot (lowest entropy); in the worst case all tokens have equal probability (highest entropy). So just add a loss term penalizing the entropy of the quasitokens, to push them toward actual token values.
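
A rough sketch of such an entropy term, in plain Python. The comment does not say how the per-token probabilities are defined, so this assumes a softmax over dot products with the vocabulary embeddings; `temperature`, `weight`, and all function names are hypothetical choices for illustration.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def token_entropy(vec, vocab_matrix, temperature=1.0):
    """Entropy of the distribution over vocabulary tokens induced by one
    prompt vector (assumption: softmax of dot-product similarities)."""
    scores = [
        sum(a * b for a, b in zip(vec, row)) / temperature
        for row in vocab_matrix
    ]
    probs = softmax(scores)
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def entropy_penalty(prompt_vectors, vocab_matrix, weight=0.1, temperature=1.0):
    """Mean entropy over all prompt positions, scaled by a weight.
    This value would be added to the task loss."""
    mean_entropy = sum(
        token_entropy(v, vocab_matrix, temperature) for v in prompt_vectors
    ) / len(prompt_vectors)
    return weight * mean_entropy

# Toy vocabulary of three orthogonal token embeddings.
vocab_matrix = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

smeared = [0.0, 0.0, 0.0]    # equally close to every token: maximal entropy
on_token = [10.0, 0.0, 0.0]  # strongly aligned with token 0: near one-hot

print(token_entropy(smeared, vocab_matrix))
print(token_entropy(on_token, vocab_matrix))
```

In a real training loop the same quantity would be computed with an autodiff framework so that its gradient can flow into the prompt embeddings; the weight trades off task loss against how strongly the vectors are pulled toward actual tokens.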

mattnewton · 2025-08-16T16:32:41 1755361961

Do the nearest tokens have a similar classification score?

kajecounterhack · 2025-06-29T21:15:05 1751231705

I'm similarly puzzled by "uncured bacon" which afaik still uses naturally occurring nitrites. How they're allowed to call it uncured when it's clearly still cured is beyond me.

kajecounterhack · 2025-06-23T09:47:22 1750672042

A lot of people use them together (cursor for IDE and claude code in the terminal inside the IDE).

In terms of performance, their agents differ. The base models their agents use are the same, but how they look at your codebase, how they decide to farm tasks out to lesser models, and how they connect to tools all differ.