65 private links
Looks like a nice kit to add to your tool belt. Does some handy checks if you have a Postgres database to manage.
A bit of a shameless plug toward the end. That said the explanations of why Cloudflare is banking on Rust so much or how the recent downtime could have been avoided are spot on.
Error handling is not easy. Having simple rules to apply for complex systems is a good thing. Of course the difficulty is to apply them consistently.
Interesting point of view. Indeed, you probably want things to not be available 100% of the time. This forces you to see how resilient things really are.
Interesting approach for a project to collect such traps in there dependencies like this.
This is a good look at the reasons behind throttling. If you accept a less naive model than "preventing abuse", you can build a better throttling strategy.
If the funding dries up... we'll have another AI winter on our hands indeed.
I find the title misleading. Still, this is a good exploration of how to treat unwrap() and expect() in Rust code.
Illustrated with the Clojure ecosystem, bit there's nothing inherently specific to the language here. If you want to ensure stability to your users, you need to manage your APIs properly and this article put forward a couple of interesting ideas.
This is almost by definition. The post mortem needs to be wisely crafted to look also at previous incidents and the actions to mitigate them.
At some point the complexity is high enough that you indeed need more tools than only handcrafted tests to discover bugs.
Interesting rambling and exploration. What would a computer built to last a century look like?
Interesting research, looking forward to the follow ups to see how it evolves over time. For sure the number of issues is way to high still to make trustworthy systems around search and news.
This highlights quite well the limits of the models used in LLMs.
This is an important trait to have for a developer. If you're content of things working without knowing why and how they work, you're looking for a world of pain later.
Interesting point. You likely need to be careful with fallback modes especially in distributed systems. They might bring even more issues when the system is already under stress.
I'm obviously not in love with the complexity this type of architecture brings. That being said, this thesis brings an interesting approach to better detect failure scenarios in such systems.
Interesting reason which would explain the Selenium flakiness. It's just harder to write tests with race conditions using Playwright.
How to avoid drowning in errors when getting serious about monitoring. Finding class of errors and treating them one by one will definitely help.
A good reminder of everything which might go wrong when connectivity is bad. Most tools let you down in such a case.