Search: [research] - ervin's web review

Unbundling Profile: MIT Libraries - SPARC

It's good to see major institutions like this get out of contracts with scientific publishing companies. Those unfortunately became mostly parasitic. Open access should be the norm for research.

research · copyright · open-access

August 21, 2024 at 8:23:02 AM GMT+2 * · permalink

·

https://sparcopen.org/our-work/big-deal-knowledge-base/unbundling-profiles/mit-libraries/

·

AI models collapse when trained on recursively generated data | Nature

More discussion about models collapse. The provenance of data will become a crucial factor to our ability to train further models.

tech · data · ai · machine-learning · gpt · research

July 25, 2024 at 9:57:57 AM GMT+2 * · permalink

·

https://www.nature.com/articles/s41586-024-07566-y

·

On the Paradox of Learning to Reason from Data

Further clues that transformer models can't learn logic from data.

tech · ai · machine-learning · gpt · research

June 24, 2024 at 9:58:36 AM GMT+2 * · permalink

·

https://arxiv.org/abs/2205.11502

·

Scalable MatMul-free Language Modeling

Interesting paper showing a promising path to reduce the memory and workload of transformer models. This is much more interesting than the race to the gigantic size.

tech · ai · machine-learning · gpt · research

June 24, 2024 at 9:49:13 AM GMT+2 * · permalink

·

https://arxiv.org/abs/2406.02528

·

Alice in Wonderland: Simple Tasks Showing Complete Reasoning Breakdown in State-Of-the-Art Large Language Models

Another cruel reminder that basic reasoning is not to be expected from LLMs. Here is a quote from the conclusion of the paper which makes it clear:
"We think that observations made in our study should serve as strong reminder that current SOTA
LLMs are not capable of sound, consistent reasoning, as shown here by their breakdown on even such a simple task as the presented AIW problem, and enabling such reasoning is still subject of basic research. This should be also a strong warning against overblown claims for such models beyond being basic research artifacts to serve as problem solvers in various real world settings, which are often made by different commercial entities in attempt to position their models as a strong mature product for end-users. [...] Observed breakdown of basic reasoning capabilities, coupled with such public claims (which are also based on standardized benchmarks), present an inherent safety problem. Models with insufficient basic reasoning are inherently unsafe, as they will produce wrong decisions in various important scenarios that do require intact reasoning."

tech · ai · gpt · machine-learning · safety · research

June 6, 2024 at 10:22:51 AM GMT+2 * · permalink

·

https://arxiv.org/abs/2406.02061

·

"AI now beats humans at basic tasks": Really?

Nice article. It's a good reminder that the benchmarks used to evaluate generative AI systems have many caveats.

tech · ai · machine-learning · gpt · research · benchmarking · criticism

May 6, 2024 at 10:07:15 AM GMT+2 * · permalink

·

https://aiguide.substack.com/p/ai-now-beats-humans-at-basic-tasks

·

Hello OLMo: A truly open LLM

This is how it should be done. This one comes with everything needed to reproduce the results. This is necessary to gain insights into how such models work internally.

tech · ai · machine-learning · gpt · open-access · research

April 10, 2024 at 3:49:58 PM GMT+2 * · permalink

·

https://blog.allenai.org/hello-olmo-a-truly-open-llm-43f7e7359222

·

Software eco-design: investigating and reducing the energy consumption of software

More work about eco-design of software. This is definitely welcome. I found this work a bit weak on the state of the art and the interview parts (10 people in the same company). But the field is so nascent that it's to be expected I guess, PhD students have to do with what they have access to. Unsurprisingly this shows a great lack of proper tools to tackle the measurement problem. This thesis shows interesting prospects to reduce variations in measurements though, some of the proposed guidelines might help but cannot offset the hardware heterogeneity completely... The parts focusing on practical advices around Java use and deployment are interestingly easy to apply though. You need to take into account the context of your application to make the right choices of course.

tech · performance · energy · ecology · java · research

April 8, 2024 at 5:32:00 PM GMT+2 * · permalink

·

https://theses.hal.science/tel-03429300/document

·

CACM Is Now Open Access – Communications of the ACM

This is great news, more scientific papers from the past decades will be accessible to everyone.

tech · science · research

March 2, 2024 at 10:51:23 PM GMT+1 * · permalink

·

https://cacm.acm.org/news/cacm-is-now-open-access-2/

·

How Big is YouTube? - Ethan Zuckerman

An important question for proper statistics about the content itself. Surprisingly harder to get an answer to it than one would think.

tech · social-media · google · research

December 23, 2023 at 4:14:08 PM GMT+1 * · permalink

·

https://ethanzuckerman.com/2023/12/22/how-big-is-youtube/

·

Keep CALM and CRDT On

Interesting research, this shows opportunities to push CRDTs to the next level.

tech · distributed · crdt · research

August 29, 2023 at 8:42:12 AM GMT+2 * · permalink

·

https://www.vldb.org/pvldb/vol16/p856-power.pdf

·

A Cookbook of Self-Supervised Learning

Very comprehensive (didn't read it all yet) guide about self-supervised learning. It'll likely become good reference material.

tech · ai · machine-learning · research

June 27, 2023 at 10:00:22 AM GMT+2 * · permalink

·

https://arxiv.org/abs/2304.12210

·

The Shape of Code » Software effort estimation is mostly fake research

We got a problem with research around software estimates. This won't help us get better at it as an industry...

tech · estimates · research

June 17, 2023 at 11:55:56 PM GMT+2 · permalink

·

https://shape-of-code.com/2021/01/17/software-effort-estimation-is-mostly-fake-research/

·

We are drowning in information while starving for wisdom | Realize Engineering

This is indeed very much true... there's a clear crisis in research. It turned into a hamster wheel of publishing articles at a constantly faster pace. The incentives are misguided which pushes that behavior to even have a career. Meanwhile, knowledge building suffers.

research · knowledge · ethics

December 22, 2022 at 9:08:57 AM GMT+1 * · permalink

·

https://realizeengineering.blog/2021/01/20/we-are-drowning-in-information-while-starving-for-wisdom/

·

Rights retention: one small step for academics, one giant leap for global access to knowledge – Walled Culture

The rebellion against the academic publishers is still going on. Hopefully this will really change soon. That cartel of publishers needs to go back to its rightful place.

scihub · research · academia · commons · open-access

December 13, 2021 at 10:41:57 AM GMT+1 * · permalink

·

https://walledculture.org/rights-retention-one-small-step-for-academics-one-giant-leap-for-global-access-to-knowledge/

·