Another cruel reminder that basic reasoning is not to be expected from LLMs. Here is a quote from the conclusion of the paper which makes it clear:
"We think that observations made in our study should serve as strong reminder that current SOTA
LLMs are not capable of sound, consistent reasoning, as shown here by their breakdown on even such a simple task as the presented AIW problem, and enabling such reasoning is still subject of basic research. This should be also a strong warning against overblown claims for such models beyond being basic research artifacts to serve as problem solvers in various real world settings, which are often made by different commercial entities in attempt to position their models as a strong mature product for end-users. [...] Observed breakdown of basic reasoning capabilities, coupled with such public claims (which are also based on standardized benchmarks), present an inherent safety problem. Models with insufficient basic reasoning are inherently unsafe, as they will produce wrong decisions in various important scenarios that do require intact reasoning."
Definitely this; it's not the first time we've seen such a hype cycle around "AI". When it bursts, the technology that fueled it simply stops being called "AI". I wonder how long this one will last, though.
No, your model won't get smarter just by throwing more training data at it... on the contrary.
It is indeed sad to see another platform turn against its users. This was once a place to nurture young artists... it's now just another ad-driven platform full of AI-generated scams.
Definitely too much hype around large models right now. It overshadows the more useful specialized models.
"Open" is unsurprisingly only in the name... this company is really just a cult.
A training-data crisis is looming for large language models. They'll sooner or later run out of genuine content to use... and the generated toxic waste will end up in the training data, probably leading to dismal results.
Interesting how much extra performance you can squeeze out of the GPU by going back to how the hardware actually works.
Interesting questions and state of the art around model "unlearning". This became important due to the opacity of the data sets used to train some models. It'll also be important more generally for managing models over time.
Nice article. It's a good reminder that the benchmarks used to evaluate generative AI systems have many caveats.
Interesting take on why people see more in LLM-based systems than there really is. The parallels with psychics' and mentalists' tricks are well thought out.
This is how it should be done: the release comes with everything needed to reproduce the results. That is necessary to gain insight into how such models work internally.
Wondering how to design a coding assistant? Here is an in-depth explanation of the choices made by one of the solutions out there. There's quite a lot of processing before and after actually running inference with the LLM.
All the good reasons why the productivity gains from code assistants are massively overestimated. Use them, why not, but with a light touch.
An AI-supercharged scam. I guess we'll see more of those.
You should be mindful of the dependencies you add. Even more so when the name of the dependency has been proposed by a coding assistant.
Excellent work improving Llama execution speed on CPU. It probably covers all the tricks of the trade for accelerating this compute kernel.
Smaller models with smarter architectures and low-bit quantized models are two avenues for more efficient use. I'm really curious how far they'll go. This article focuses on low-bit quantized models, and the prospects are interesting.
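To give an idea of what "low-bit quantization" means in general, here is a minimal sketch of symmetric round-to-nearest quantization (a hypothetical 4-bit example for illustration, not the specific scheme from the article): weights are mapped to small integers with a single scale factor, shrinking storage at the cost of some precision.

```python
# Minimal illustration of symmetric low-bit quantization.
# 4-bit signed integers cover [-7, 7]; one scale factor per tensor.

def quantize(weights, bits=4):
    """Map floats to integers in [-(2^(bits-1)-1), 2^(bits-1)-1]."""
    qmax = 2 ** (bits - 1) - 1              # 7 for 4-bit
    scale = max(abs(w) for w in weights) / qmax
    return [round(w / scale) for w in weights], scale

def dequantize(quantized, scale):
    """Recover approximate floats from integers and the scale."""
    return [q * scale for q in quantized]

weights = [0.12, -0.5, 0.33, 0.07]
q, scale = quantize(weights)
approx = dequantize(q, scale)
# Per-element reconstruction error is bounded by scale / 2.
```

Real schemes add refinements (per-block scales, asymmetric ranges, outlier handling), but the storage-versus-precision trade-off is the same idea.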
Wondering where some of the biases of image-generating AI models come from? This is an excellent deep dive into one of the most successful data sets used to train said models. And it was curated by... statistical models, not humans. This unsurprisingly amplifies biases all the way to the final models.
This is an excellent piece, I highly recommend reading it.
Definitely this. It might ultimately change the abstraction levels at which we code... but the skills will still be needed. Natural language is too ambiguous for the task.