Search: [ai] - ervin's web review

Why extracting data from PDFs is still a nightmare for data experts - Ars Technica

So much data trapped in PDFs indeed... Unfortunately VLM are still not reliable enough to be unleashed without tight validation of the output.

tech · ai · machine-learning · gpt · ocr · computer-vision

March 12, 2025 at 5:56:36 PM GMT+1 * · permalink

·

https://arstechnica.com/ai/2025/03/why-extracting-data-from-pdfs-is-still-a-nightmare-for-data-experts/

·

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs

I like this kind of research as it also says something about our own cognition. The results comparing two models and improving them are fascinating.

tech · ai · machine-learning · gpt · cognition · research

March 8, 2025 at 11:27:46 AM GMT+1 * · permalink

·

https://arxiv.org/abs/2503.01307

·

Microsoft is reportedly plotting a future without OpenAI

Are we surprised? Not really... This kind of struggle was an obvious outcome from the heavy dependencies between both companies.

tech · ai · machine-learning · gpt · microsoft · business

March 8, 2025 at 11:20:10 AM GMT+1 * · permalink

·

https://techstartups.com/2025/03/07/microsoft-is-plotting-a-future-without-openai/

·

What does “PhD-level” AI mean? OpenAI’s rumored $20,000 agent plan explained

Here we go for a brand new marketing stunt from OpenAI. You can also tell the pressure is rising since all of this is still operating at a massive loss.

tech · ai · machine-learning · gpt · marketing · business

March 8, 2025 at 10:52:23 AM GMT+1 * · permalink

·

https://arstechnica.com/ai/2025/03/what-does-phd-level-ai-mean-openais-rumored-20000-agent-plan-explained/

·

The Empty Promise of AI-Generated Creativity

Sure it makes generating content faster... but it's indeed so bland and uniform.

tech · ai · machine-learning · culture · criticism

March 4, 2025 at 7:58:31 AM GMT+1 * · permalink

·

https://hey.paris/posts/genai/

·

AI versus the brain and the race for general intelligence

Friendly reminder that AI was also supposed to be a field about studying cognition... There's so many things we still don't understand that the whole "make it bigger and it'll be smart" obsession looks like it's creating missed opportunities to understand ourselves better.

tech · ai · machine-learning · gpt · cognition · neuroscience · science · research

March 4, 2025 at 7:17:05 AM GMT+1 * · permalink

·

https://arstechnica.com/science/2025/03/ai-versus-the-brain-and-the-race-for-general-intelligence/

·

Structured data extraction from unstructured content using LLM schemas

This is one of the handful of uses where I'd expect LLMs to shine. It's nice to see some tooling to make it easier.

tech · ai · machine-learning · gpt · nlp · data

March 1, 2025 at 1:49:15 PM GMT+1 * · permalink

·

https://simonwillison.net/2025/Feb/28/llm-schemas/

·

exo: Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Early days but this looks like an interesting solution to democratize the inference of large models.

tech · foss · ai · machine-learning · gpt

February 19, 2025 at 10:36:29 AM GMT+1 * · permalink

·

https://github.com/exo-explore/exo

·

Can I ethically use LLMs?

I like this paper, it's well balanced. The conclusion says is all: if you're not actively working on reducing the harms then you might be doing something unethical. It's not just a toy to play with, you have to think about the impacts and actively reduce them.

tech · ai · machine-learning · gpt · ethics

February 18, 2025 at 11:40:02 AM GMT+1 * · permalink

·

https://ntietz.com/blog/can-i-ethically-use-llms/

·

Groundbreaking BBC research shows issues with over half the answers from Artificial Intelligence (AI) assistants

Interesting research, looking forward to the follow ups to see how it evolves over time. For sure the number of issues is way to high still to make trustworthy systems around search and news.

tech · ai · machine-learning · gpt · reliability · research

February 18, 2025 at 9:56:49 AM GMT+1 * · permalink

·

https://www.bbc.com/mediacentre/2025/bbc-research-shows-issues-with-answers-from-artificial-intelligence-assistants

·

ChatGPT’s Political Views Are Shifting Right, a New Analysis Finds

This might be accidental but this highlights the lack of transparency on how those models are produced. It also means we should get ready for future generation of such models to turn into very subtle propaganda machines. Indeed even if for now it's accidental I doubt it'll be the case much longer.

tech · ai · machine-learning · gpt · politics

February 18, 2025 at 9:51:21 AM GMT+1 * · permalink

·

https://gizmodo.com/chatgpts-political-views-are-shifting-right-a-new-analysis-finds-2000562328

·

Notes: AI Copilot Code Quality

People really need to be careful about the short term productivity boost... If it kills maintainability in the process you're trading that short term productivity for a crashing long term productivity.

tech · ai · machine-learning · copilot · productivity · maintenance

February 17, 2025 at 9:06:23 AM GMT+1 * · permalink

·

https://kracekumar.com/post/ai_copilot_code_quality_paper/

·

AI is Stifling Tech Adoption

This is definitely a problem. It's doomed to influence how tech are chosen on software projects.

tech · ai · machine-learning · gpt · copilot · programming · innovation

February 16, 2025 at 9:12:07 AM GMT+1 * · permalink

·

https://vale.rocks/posts/ai-is-stifling-tech-adoption

·

How to Backdoor Large Language Models

The security implications of using LLMs are real. With the high complexity and low explainability of such models it opens the door to hiding attacks in plain sight.

tech · ai · machine-learning · gpt · copilot · security

February 16, 2025 at 9:06:44 AM GMT+1 * · permalink

·

https://blog.sshh.io/p/how-to-backdoor-large-language-models

·

The skill of the future is not 'AI', but 'Focus'

This is an interesting way to frame the problem. We can't rely too much on LLMs for computer science problems without loosing important skills and hindering learning. This is to be kept in mind.

tech · ai · machine-learning · copilot · programming · focus · learning · criticism

February 12, 2025 at 11:10:10 PM GMT+1 * · permalink

·

https://www.carette.xyz/posts/focus_will_be_the_skill_of_the_future/

·

Poisoning for propaganda: rising authoritarianism makes LLMs more dangerous

Of course it would be less of a problem if explainability was better with such models. It's not the case though, so it means they can spew very subtle propaganda. This is bound to become even more of a political power tool.

tech · ai · machine-learning · gpt · politics · criticism

February 11, 2025 at 7:40:00 AM GMT+1 * · permalink

·

https://www.baldurbjarnason.com/2025/poisoning-for-propaganda/

·

Microsoft Study Finds AI Makes Human Cognition “Atrophied and Unprepared”

This is clearly pointing in the direction of UX challenges around LLM uses. For some tasks the user's critical thinking must be fostered otherwise bad decisions will ensue.

tech · ai · machine-learning · gpt · ux · cognition · research

February 10, 2025 at 10:42:36 PM GMT+1 * · permalink

·

https://www.404media.co/microsoft-study-finds-ai-makes-human-cognition-atrophied-and-unprepared-3/

·

The LLM Curve of Impact on Software Engineers

Again it's definitely not useful for everyone... it might even be dangerous for learning.

tech · ai · machine-learning · gpt · copilot · productivity · learning

February 7, 2025 at 9:05:10 AM GMT+1 * · permalink

·

https://serce.me/posts/2025-02-07-the-llm-curve-of-impact-on-software-engineers

·

Bad idea: “Artificial Intelligence” automatically improves productivity

Be wary of the unproven claims that using LLMs necessarily leads to productivity gains. The impacts might be negative.

tech · ai · machine-learning · gpt · copilot · programming · productivity

February 7, 2025 at 8:34:36 AM GMT+1 * · permalink

·

https://jchyip.medium.com/bad-idea-artificial-intelligence-automatically-improves-productivity-0829fcf2146c

·

Chatbot Software Begins to Face Fundamental Limitations | Quanta Magazine

When you put the marketing claims aside, the limitations of those models become obvious. This is important, only finding the root cause of those limitations can give a chance to find a solution to then.

tech · ai · machine-learning · gpt · mathematics · logic

February 2, 2025 at 6:13:55 PM GMT+1 * · permalink

·

https://www.quantamagazine.org/chatbot-software-begins-to-face-fundamental-limitations-20250131/

·