I was actually wondering when this would happen. Was just a matter of time, would have expected this move a couple of months ago.
The title is a bit misleading in a way (and I almost didn't click through for a start). That said, it is an interesting essay dealing with the topics of intelligence, problem solving etc. I'm not sure I agree with everything in it, but that's still good food for thought.
Indeed, if we weaken the learning loop by using coding assistants then we might feel we go faster while we're building up the maintenance cliff. We need to have an understanding of the system.
I'm happy to see I'm actually very much aligned with one of the "Attention Is All You Need" co-authors. The current industry trend of "just scale the transformer architecture" is indeed stifling innovation and actual research. That said I find ironic that he talks about freedom to explore... well this is what public labs used to be about, but we decided to drastically reduce their funding and replace that with competition between startups. It's no surprise we have very myopic views on problems.
ETH Zurich keeps making progress on its model. It's exciting and nice to see an ethical offering develop in that space. It shows that when there is political will it can be treated as proper infrastructure.
There's some truth in this piece. We never quite managed to really have a semantic web because knowledge engineering is actually hard... and we publish mostly unstructured or badly structured data. LLMs are thus used as a brute force attempt at layering some temporary and partial structure on top of otherwise unstructured data. They're not really up to the task of course but it gives us a glimpse into what could have been.
Apparently in the age of people using LLMs for their tests, there is a bias toward mockist tests being produced. It's a good time to remind why you likely don't want them in most cases and limit the use of mocks to consider fakes and checking system state instead.
Interesting stuff about the mathematics behind how embedding spaces work in LLMs.
OK, this is an interesting way for the Darwin Award to branch. Some of the 2025 nominees are indeed funny. Now I wonder which ones will win the award!
We can expect more misleading papers to be published by the big LLM providers. Don't fall in the trap, wait for actually peer reviewed papers from academia. Unsurprisingly the results aren't as good there.
Long but interesting chapter which shows how GPUs architecture works and the differences with TPUs. This is unsurprisingly written in the context of large models training.
Honestly, it took much longer than I expected. Now you know that GitHub has really become a conduit for Microsoft's AI initiatives.
Running interesting models locally gets more and more accessible.
ETH Zurich spearheading an effort for more ethical and cleaner open models. That's good research, looking forward to the results.
Interesting comparison, indeed would a clock like this be useful?
And one more... it's clearly driven by an architecture pattern used by all vendors. They need to get their acts together to change this.
There's always been disinformation in time of wars. The difference is the scale and speed of producing fake images now.
Good reminder that professional translators aren't gone... on the contrary. There's so many things in languages that you can't handle with a machine.
I recognize myself quite a bit in this opinion piece. It does a good job going through most of the ethical and practical reasons why you don't need LLMs to develop and why you likely don't want to.
OK, this is a serious and long paper. It shows quite well how over reliance on ChatGPT during the learning phase on some topics impacts people. It's mesurable both from their behavior and through EEG. Of course, it'd require more such studies with larger groups. Still those early signs are concerning.