We already had reproducibility issues in science. With models that can produce hundreds of "novel" results in a single paper, how can we possibly keep up with checking that all the produced data is correct? This is a real challenge.
Interesting research into how models relate to each other. This becomes especially important as the use of synthetic data increases.
Interesting research, this gives a few hints at building tools to bring some more transparency to the ideologies pushed by models. They're not unbiased, that much we know, so characterising the biases is important.
Interesting new proof about the relationship between P and PSPACE. Let's see where this leads.
Or how the current neural networks obsession is poisoning scientific fields. There was already a reproducibility crisis going on and it looks like it's been getting worse. The incentives are clearly wrong and that shows.
This is very interesting research. This confirms that LLMs can't be trusted on any output they make about their own inference. The example about simple maths is particularly striking: the actual inference and what the model outputs when asked about its inference process are completely different.
Now for the topic dearest to my heart: It looks like there's some form of concept graph hiding in there which is reapplied across languages. Now we don't know if a particular language influences that graph. I don't expect the current research to explore this question yet, but looking forward to someone tackling it.
This is interesting research. It shows nice prospects for WebAssembly's future as a virtualization and portability technology. I don't think we'll see all of the claims in the discussion section realized though.
I like this kind of research as it also says something about our own cognition. The results comparing two models and improving them are fascinating.
Interesting study even though it bears some important limitations. Still, it seems to indicate that one shouldn't rest on one's laurels and should keep practicing cognitive skills even when older (actually, it might be best to get started in one's 20s at the latest).
Friendly reminder that AI was also supposed to be a field about studying cognition... There are so many things we still don't understand that the whole "make it bigger and it'll be smart" obsession looks like it's creating missed opportunities to understand ourselves better.
Interesting research, looking forward to the follow-ups to see how it evolves over time. For sure, the number of issues is still way too high to build trustworthy systems around search and news.
This is clearly pointing in the direction of UX challenges around LLM use. For some tasks the user's critical thinking must be fostered, otherwise bad decisions will ensue.
Wondering what a Ph.D. is about? This is a good illustrated summary.
We're still struggling with how to modularize our code. Sometimes we should go back to the basics: this paper by Parnas from 1972 basically gave us the insights needed to modularize programs properly.
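The core insight of that paper, information hiding, is easy to sketch: a module should expose operations, not its internal data representation, so the representation (a "design decision likely to change") can be swapped without touching callers. A minimal illustration in Python (names and example are mine, not from the paper):

```python
# Parnas-style information hiding: callers use the interface below
# and never see the internal layout, so the hidden design decision
# (here, a plain list) can change without breaking any caller.

class LineStore:
    """Stores lines of text behind a small operational interface."""

    def __init__(self) -> None:
        self._lines: list[str] = []   # hidden representation choice

    def add(self, line: str) -> None:
        self._lines.append(line)

    def line(self, index: int) -> str:
        return self._lines[index]

    def count(self) -> int:
        return len(self._lines)


store = LineStore()
store.add("hello")
store.add("world")
print(store.count())   # 2
print(store.line(1))   # world
```

If `_lines` later becomes a memory-mapped file or a rope, only `LineStore` changes; every caller of `add`, `line`, and `count` keeps working, which is exactly the decomposition criterion Parnas argues for.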
Interesting research about the feasibility of parallelizing compilers on the GPU. I wonder how far this will go.
Fascinating research about side-channel attacks. Learned a lot about them and about website fingerprinting here. Also interesting: the explanation of how using machine learning models can actually get in the way of properly understanding the side channel an attack really exploits, which can prevent developing genuinely useful countermeasures.
Very interesting research. Looks like we're slowly moving away from the "language and thinking are intertwined" hypothesis. This is probably the last straw for Chomsky's theory of language. It served us well, but neuroscience indicates it's time to leave it behind.
Now this is an interesting paper. Neurosymbolic approaches are starting to go somewhere now. This is definitely helped by the NLP abilities of LLMs (which should be used only for that). The natural-language-to-Prolog idea makes sense; now it needs to be more reliable. I'd be curious to know how often the multiple-try path is exercised (the paper doesn't quite focus on that). More research is required, obviously.
Now the impact seems clear, and this is mostly bad news. This reduces the production of public knowledge, so everyone loses. Ironically, it also means less public knowledge available to train new models. At some point their only avenue to fine-tune their models will be user profiling, which will be private... I have a hard time seeing how we won't end up stuck with another surveillance apparatus providing access to models running on outdated knowledge. This will lock so many behaviors and decisions in place.
Of course I recommend reading the actual research paper. This article is a good summary of the consequences though. LLMs definitely can't be trusted with formal reasoning, including basic maths. This is a flaw in the way they are built; the path forward is likely merging symbolic and sub-symbolic approaches.