Let's hope security teams don't get saturated with low-quality security reports like this...
Another lawsuit making progress against OpenAI and their shady practices.
Nice vision model. It looks like it strikes an interesting balance between performance and memory consumption, and it seems doable to run cheaply and on premises.
More shady practices to try to save themselves. Let's hope it won't work.
The water problem is obviously hard to ignore. This piece does a good job illustrating how large the impact is.
Good reminder that models shouldn't be used as a service except maybe for prototyping. This has felt obvious to me since the beginning of this hype cycle... but here we are, people are still falling into the trap today.
More signs of the generative AI companies hitting a plateau...
It shouldn't be, but it is a big deal. Having such a training corpus openly available is one of the big missing pieces for building models.
This is an interesting and balanced view. Also nice to see that local inference is really getting closer. This is mostly a UI problem now.
Do you, like me, find the Open Source AI Definition weak on the training data information side? You'd be right, and there's a reason for it... it's probably hiding quite a bit of open washing for the larger models. This is a good explanation of the motives and consequences.
I definitely like the approach of having vectorisation in the RDBMS directly. It means one less moving part and less complexity at the application level to keep everything in sync. In this case it's a Postgres extension.
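The article's extension isn't named here, but as a rough sketch of what keeping vectors in the database itself can look like, here is a minimal example assuming the pgvector extension and the psycopg2 driver (both my assumptions, with a hypothetical local database):

```python
# Rough sketch: embeddings stored next to the rows they describe, queried in SQL.
# Assumptions not in the original comment: the pgvector extension, the psycopg2
# driver, and a hypothetical local database named "app".
import psycopg2

conn = psycopg2.connect("dbname=app")
cur = conn.cursor()

cur.execute("CREATE EXTENSION IF NOT EXISTS vector;")
cur.execute("""
    CREATE TABLE IF NOT EXISTS documents (
        id bigserial PRIMARY KEY,
        body text NOT NULL,
        embedding vector(3)  -- toy dimension, real embeddings are much larger
    );
""")
cur.execute(
    "INSERT INTO documents (body, embedding) VALUES (%s, %s::vector);",
    ("hello world", "[0.10, 0.20, 0.30]"),
)

# Nearest-neighbour search runs in the same database as the data itself,
# so there is no separate vector store to keep in sync.
cur.execute(
    "SELECT body FROM documents ORDER BY embedding <-> %s::vector LIMIT 5;",
    ("[0.10, 0.20, 0.25]",),
)
print(cur.fetchall())

conn.commit()
cur.close()
conn.close()
```

The appeal is exactly what the comment says: the embeddings live next to the rows they describe, so the application doesn't have to keep a separate vector store consistent with the relational data.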
Nice initiative from the OSI. It is timely; such a definition was surely needed. The data information part seems fairly weak though... you could certainly make a system which doesn't respect the four freedoms that way.
This is what you get when you make bots spew text based on statistics without a proper knowledge base behind them.
More marketing announcement than real research paper. Still, it's nice to see smaller models being optimized to run on mobile devices. This will get interesting when it's all local-first and coupled with symbolic approaches.
This is still an important step with LLMs. Just because the models are huge doesn't mean tokenizers disappeared or that you no longer need to clean up your data.
Using the right metaphors will definitely help with the conversation in our industry around AI. This proposal is an interesting one.
More signs of the current bubble being about to burst?
Now this is an interesting paper. Neurosymbolic approaches are starting to go somewhere. This is definitely helped by the NLP abilities of LLMs (which should be used only for that). The natural language to Prolog idea makes sense; now it needs to become more reliable. I'd be curious to know how often the multiple-try path is exercised (the paper doesn't quite focus on that). More research is obviously required.
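Not the paper's pipeline, just a back-of-the-envelope sketch of the general natural-language-to-Prolog loop as I understand it, with a stubbed-out translation step standing in for the LLM and SWI-Prolog accessed through the pyswip package (all assumptions on my part):

```python
# Sketch of the general idea only: the language model does the language work
# (faked here with a lookup table), Prolog does the reasoning, and malformed
# or empty answers trigger another try. None of this is taken from the paper.
from pyswip import Prolog

def llm_translate(question: str, attempt: int) -> str:
    # Stand-in for the LLM; in the real pipeline this would be a model call
    # that emits a Prolog goal for the question.
    canned = {"Who is a parent of liam?": "parent(X, liam)"}
    return canned.get(question, "malformed(")  # unknown questions yield bad Prolog

def answer(question: str, max_tries: int = 3):
    prolog = Prolog()
    prolog.assertz("parent(noah, liam)")  # tiny knowledge base for the example
    for attempt in range(max_tries):
        goal = llm_translate(question, attempt)
        try:
            results = list(prolog.query(goal))
        except Exception:
            continue  # bad translation: this is the "multiple-try" path
        if results:
            return results
    return None

print(answer("Who is a parent of liam?"))
```

The part I'd like to see measured is precisely how often that retry branch gets taken in practice.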
Now the impact seems clear and this is mostly bad news. It reduces the production of public knowledge, so everyone loses. Ironically it also means less public knowledge available to train new models. At some point their only avenue to fine-tune their models will be user profiling, which will be private... I have a hard time seeing how we won't end up stuck with another surveillance apparatus providing access to models running on outdated knowledge. This will lock so many behaviors and decisions in place.
Finally a path forward for logic programming? An opportunity to evolve beyond Prolog and its variants? Good food for thought.