Search: [performance] - ervin's web review

Ice and Fire: How to read icicle and flame graphs

Good explanation of how flame graphs are produced and how to read them. Gives a few tips on what to look for to optimize.

tech · performance · optimization

May 15, 2023 at 10:20:44 AM GMT+2 * · permalink

·

https://www.polarsignals.com/blog/posts/2023/03/28/how-to-read-icicle-and-flame-graphs/

·

Is sequential IO dead in the era of the NVMe drive? — Jack Vanlightly

Interesting exploration on the performance of SSDs regarding write patterns. Turns out sequential IO is still a thing, just for a different reason than with good old HDDs.

tech · storage · performance

May 10, 2023 at 10:03:30 PM GMT+2 * · permalink

·

https://jack-vanlightly.com/blog/2023/5/9/is-sequential-io-dead-in-the-era-of-the-nvme-drive

·

Cloud exit pays off in performance too

Very interesting to see that move to owned hardware... turns out that not only the invoice is smaller in their case but the performances are much better as well.

tech · infrastructure · performance

May 4, 2023 at 8:21:41 AM GMT+2 · permalink

·

https://world.hey.com/dhh/cloud-exit-pays-off-in-performance-too-4c53b697

·

Nine ways to shoot yourself in the foot with PostgreSQL

Nice set of tips, I knew a few but not all of them. The discussion around CTEs is interesting.

tech · databases · postgresql · performance

April 26, 2023 at 8:54:55 AM GMT+2 * · permalink

·

https://philbooth.me/blog/nine-ways-to-shoot-yourself-in-the-foot-with-postgresql

·

Measuring the Impact of False Sharing

Nice exploration of false sharing on performances in several hardware scenarii. A couple of surprises along the way.

tech · multithreading · performance

April 24, 2023 at 8:23:27 AM GMT+2 * · permalink

·

https://alic.dev/blog/false-sharing.html

·

Defining interfaces in C++: concepts versus inheritance – Daniel Lemire's blog

Shouldn't come as a surprise if you paid attention to C++ evolutions for the past 30 years. We're now reaping the fruits though, so it's really become easy to keep both options in sight when designing. This is especially important for performance sensitive code.

Nothing really new here (apart from the "how easy it is these days!")... Still it needs to be reminded on a regular basis. :-)

tech · c++ · performance

April 21, 2023 at 10:48:09 AM GMT+2 * · permalink

·

https://lemire.me/blog/2023/04/20/defining-interfaces-in-c-concepts-versus-inheritance/

·

Load Balancing

Nice post explaining the common algorithms used for load balancing. Each having their own trade offs of course. Well done with tiny simulations.

tech · web · server · performance · http

April 18, 2023 at 5:43:29 PM GMT+2 * · permalink

·

https://samwho.dev/load-balancing/

·

Polars for initial data analysis, Polars for production

Polars really looks like a nice alternative to Pandas with a nice upgrade path from data exploration to production.

tech · data-science · pandas · polars · performance

April 7, 2023 at 6:20:37 PM GMT+2 * · permalink

·

https://pythonspeed.com/articles/polars-exploratory-data-analysis-vs-production/

·

Making Python 100x faster with less than 100 lines of Rust

Nice walk through for a use of PyO3 to make some Python code much faster. Nice to see how useful py-spy turn out to be in such scenarii as well.

tech · rust · python · profiling · performance

March 30, 2023 at 4:20:47 PM GMT+2 * · permalink

·

https://ohadravid.github.io/posts/2023-03-rusty-python/

·

A discussion between Casey Muratori and Robert C. Martin about Clean Code

Very interesting conversation between Uncle Bob and one of the recent critics of his work regarding performance. I like how he admits some faults in the way he presents things and try to improve for later rather than trying to be right. Some people should learn from that. There's clearly a tension between performance and what is described in Clean Code, it'd be pointless to deny it.

tech · architecture · performance · craftsmanship

March 9, 2023 at 7:35:24 AM GMT+1 * · permalink

·

https://github.com/unclebob/cmuratori-discussion/blob/main/cleancodeqa.md

·

Perf engineering with Python 3.12

perf now available also to Python programs. This definitely can be useful for proper profiling.

tech · python · performance · profiling

February 24, 2023 at 9:55:03 AM GMT+1 * · permalink

·

https://www.petermcconnell.com/posts/perf_eng_with_py12/

·

AMD CEO: The Next Challenge Is Energy Efficiency - IEEE Spectrum

Interesting position from AMD regarding the race on the next super computers. They're all being caught up by energy efficiency so it'll need to be addressed both at the processor architecture level but also at the software architecture level. How we design our computing tasks will matter more and more.

tech · cpu · performance · energy · architecture

February 23, 2023 at 11:21:57 AM GMT+1 * · permalink

·

https://spectrum.ieee.org/amd-eyes-supercomputer-efficiency-gains

·

The Real C++ Killers (Not You, Rust) | HackerNoon

A bit of a sarcastic tone but a few good point in there. Also shows interesting alternatives to C++ to squeeze every ounce of performance out of your code whatever the platform it runs on. Of the three options explored I knew only about Numba really.

tech · performance · python · c++

February 15, 2023 at 8:57:54 AM GMT+1 * · permalink

·

https://hackernoon.com/the-real-c-killers-not-you-rust

·

The Elusive Frame Timing | by Alen Ladavac | Medium

Excellent analysis and explanation of the stutter problem people experience with game engines. It's an artifact of the graphics pipeline becoming more asynchronous with no way to know when something is really displayed. Extra graphics APIs will be needed to solve this for real.

tech · 3d · performance

January 14, 2023 at 1:07:20 PM GMT+1 * · permalink

·

https://medium.com/@alen.ladavac/the-elusive-frame-timing-168f899aec92

·

Performance of WebAssembly runtimes in 2023 | Frank DENIS random thoughts.

Time to look a bit at the maze of WebAssembly runtimes. Good overview on how they currently perform and how well they are documented or easy to use.

tech · performance · webassembly

January 5, 2023 at 9:12:12 AM GMT+1 * · permalink

·

https://00f.net/2023/01/04/webassembly-benchmark-2023/

·

The Bitter Truth: Python 3.11, Cython, C++ Performance | Agents and Robots

Python is getting faster but is still far from what you can get with C++ of course. That said, for simulations you likely don't want everything in Python or in C++. Part of the challenge is to split the subsystems properly and use C++ where it matters.

tech · simulation · python · c++ · performance

December 28, 2022 at 2:10:58 PM GMT+1 * · permalink

·

https://medium.com/agents-and-robots/the-bitter-truth-python-3-11-vs-cython-vs-c-performance-for-simulations-babc85cdfef5

·

Auto-vectorization: How to get beaten by compiler optimization — Java JIT!

Don't underestimate performance of the generated code when a JIT is in the picture. Very good example with the JVM just there.

tech · java · optimization · performance

December 15, 2022 at 9:48:55 AM GMT+1 * · permalink

·

https://itnext.io/auto-vectorization-how-to-get-beaten-by-compiler-optimization-java-jit-vector-api-92c72b97fba3

·

Faster hardware is a bad first solution to slow software

Don't bank it all on faster hardware, make sure your software isn't slow first. Otherwise it'll bring quite some hidden costs.

tech · performance · optimization

December 15, 2022 at 8:13:01 AM GMT+1 * · permalink

·

https://pythonspeed.com/articles/fixing-performance-with-hardware

·

WebAssembly: Go vs Rust vs AssemblyScript :: Ecostack — a developer blog

Little simple benchmark of WebAssembly performances for the most common languages found there. Careful to the payload size though.

tech · webassembly · performance

December 1, 2022 at 9:02:21 AM GMT+1 * · permalink

·

https://ecostack.dev/posts/wasm-tinygo-vs-rust-vs-assemblyscript/

·

I/O is no longer the bottleneck

Definitely this, we have to stop pointing disk I/O so much for performance issues. This is just not really slow anymore. Obviously network is a different story.

tech · performance

November 27, 2022 at 10:15:01 AM GMT+1 * · permalink

·

https://benhoyt.com/writings/io-is-no-longer-the-bottleneck/

·

Cache invalidation really is one of the hardest problems in computer science – Surfing Complexity

Nice summary on the false sharing problem with caches and how it can impact your performances in multithreaded contexts.

tech · performance · multithreading

November 26, 2022 at 5:42:39 PM GMT+1 * · permalink

·

https://surfingcomplexity.blog/2022/11/25/cache-invalidation-really-is-one-of-the-hardest-things-in-computer-science/

·

Internals of sets and dicts | Fluent Python, the lizard book

Interesting deep dive on how sets and dicts are implemented in CPython. There are a couple of interesting tricks in there.

tech · python · performance · optimization

November 24, 2022 at 1:01:45 PM GMT+1 * · permalink

·

https://www.fluentpython.com/extra/internals-of-sets-and-dicts/#_footnoteref_6

·

Is the fediverse about to get Fryed? (Or, “Why every toot is also a potential denial of service attack”) – Aral Balkan

There are indeed a few architectural problems with the Fediverse as it is. Can this be solved? Hopefully yes.

tech · architecture · fediverse · performance · social-media

November 15, 2022 at 6:06:01 PM GMT+1 * · permalink

·

https://ar.al/2022/11/09/is-the-fediverse-about-to-get-fryed-or-why-every-toot-is-also-a-potential-denial-of-service-attack/

·

Performance Optimizations Can Have Unexpectedly Large Effects When Combined With Caches

Interesting take about how performance optimizations can sometimes leverage even more performance gains than you would expect.

tech · performance · optimization

November 15, 2022 at 6:05:00 PM GMT+1 * · permalink

·

https://justinblank.com/notebooks/performanceoptimizationscanhaveunexpectedlylargeeffectswhencombinedwithcaches.html

·

Early speed optimizations aren’t premature

Good reminder that "premature" doesn't mean "early". Poor Knuth is so often badly quoted in the context of optimization that it's really sad. The number of times I see "early pessimisation" on the pretense of avoiding "premature optimization". Such a waste...

tech · programming · optimization · performance

October 29, 2022 at 2:22:12 PM GMT+2 * · permalink

·

https://pythonspeed.com/articles/premature-optimization/

·

Making python fast for free - adventures with mypyc – MeadSteve's Dev Blog

This is good news, this provide more venues for improving performances in Python modules next to switching to compiled Rust with something like PyO3. There's clearly a case to be more for not having to rewrite when the codebase was already mostly Python.

tech · python · performance · mypy · compiler · type-systems

September 28, 2022 at 9:43:23 AM GMT+2 * · permalink

·

https://blog.meadsteve.dev/programming/2022/09/27/making-python-fast-for-free/

·

Accelerate Python code 100x by import taichi as ti | Taichi Docs

This has some interesting promises in terms of performance using Python. Looks a bit like a CUDA for Python... to be seen how it fares in practice.

tech · python · performance

September 9, 2022 at 11:38:22 AM GMT+2 * · permalink

·

https://docs.taichi-lang.org/blog/accelerate-python-code-100x

·

Is premature optimization the root of all evil? | Secret Weblog

Let's put this quote back in its context, shall we?

tech · optimization · performance

August 29, 2022 at 8:28:00 AM GMT+2 * · permalink

·

https://blog.startifact.com/posts/is-premature-optimization-the-root-of-all-evil/

·

Twenty years of Valgrind | Nicholas Nethercote

One of the best developer tools around for analysis and profiling. I'm glad it exists, saved me a few times.

tech · profiling · performance · history

July 27, 2022 at 8:37:31 AM GMT+2 * · permalink

·

https://nnethercote.github.io/2022/07/27/twenty-years-of-valgrind.html

·

Investigating Managed Language Runtime Performance

Wow, this is a very good exploration of the performances of several common languages and runtimes. This is one of the most thorough I've seen. A good resource for deciding what to pick.

tech · performance · c++ · go · python · java · javascript

July 24, 2022 at 12:25:12 AM GMT+2 * · permalink

·

https://www.usenix.org/publications/loginonline/investigating-managed-language-runtime-performance

·

⚡️ The computers are fast, but you don't know it

And this is why you likely need to optimize your data pipelines at some point. There are plenty of levers available.

tech · programming · python · c++ · optimization · performance · data-science

June 17, 2022 at 9:40:27 AM GMT+2 * · permalink

·

http://shvbsle.in/computers-are-fast-but-you-dont-know-it-p1/

·

How fast are Linux pipes anyway?

Excellent deepdive about pipes, on the path to optimization we see how perf is used, how memory is managed by the kernel etc. Very thorough.

tech · linux · memory · optimization · performance · unix · system

June 3, 2022 at 7:40:43 AM GMT+2 * · permalink

·

https://mazzo.li/posts/fast-pipes.html

·

Google has been DDoSing SourceHut for over a year

Debatable "feature", bad implementation, dubious community handling... Clearly not a good example to follow from the Go space.

tech · google · go · performance · complexity

May 26, 2022 at 12:38:00 PM GMT+2 * · permalink

·

https://drewdevault.com/2022/05/25/Google-has-been-DDoSing-sourcehut.html

·

Magic-trace collects and displays high-resolution traces of what a process is doing

This looks like a very interesting tracing tool for debugging and profiling purposes.

tech · debugging · profiling · tracing · performance

April 23, 2022 at 3:23:09 PM GMT+2 * · permalink

·

https://github.com/janestreet/magic-trace#------magic-trace

·

Memray is a memory profiler for Python

That looks like a very interesting tool for larger Python based projects. Definitely need a way to profile memory use in there.

tech · python · memory · profiling · performance

April 21, 2022 at 9:31:50 AM GMT+2 * · permalink

·

https://bloomberg.github.io/memray/

·

Numbers Every Programmer Should Know By Year

Oh this is really neat! This is a good way to visualize how it evolved over time, I find the period starting in 2005 especially interesting.

tech · cpu · hardware · networking · ssd · performance

March 4, 2022 at 11:18:18 AM GMT+1 * · permalink

·

https://colin-scott.github.io/personal_website/research/interactive_latency.html

·

Static B-Trees - Algorithmica

Really cool optimizations for B-Trees. Once the layout is reworked this is a neat way to use SIMD as well.

tech · optimization · performance · SIMD · algorithm · b-tree

February 18, 2022 at 9:47:15 AM GMT+1 * · permalink

·

https://en.algorithmica.org/hpc/data-structures/s-tree/

·

How Prime Video updates its app for more than 8,000 device types - Amazon Science

Interesting use of WebAssembly for fast and very portable code. Also especially interesting is the care in the move to the new software architecture.

tech · webassembly · rust · amazon · performance · javascript

January 30, 2022 at 12:17:51 AM GMT+1 * · permalink

·

https://www.amazon.science/blog/how-prime-video-updates-its-app-for-more-than-8-000-device-types

·

Five Easy to Miss PostgreSQL Query Performance Bottlenecks

Interesting tips for potential bottlenecks in your queries.

tech · databases · performance · postgresql

January 26, 2022 at 12:10:48 PM GMT+1 * · permalink

·

https://pawelurbanek.com/postgresql-query-bottleneck

·

How vectorization speeds up your Python code

Not necessarily unknown paths to squeeze more performance out of Python. Still it's nice to have those options measured and listed in the same post.

tech · python · performance

January 20, 2022 at 1:24:48 PM GMT+1 * · permalink

·

https://pythonspeed.com/articles/vectorization-python/

·

You don't need that CORS request - Nick Olinger

Good reminder that CORS can have an impact regarding the performance of your application.

tech · http · cors · performance

January 4, 2022 at 3:28:29 PM GMT+1 * · permalink

·

https://nickolinger.com/blog/2021-08-04-you-dont-need-that-cors-request/

·

How a Single Line of Code Made a 24-core Server Slower Than a Laptop

Good reminder on how a shared atomic can become a huge bottleneck in multi-CPU setups.

tech · multithreading · performance · profiling

January 1, 2022 at 3:13:46 PM GMT+1 * · permalink

·

https://pkolaczk.github.io/server-slower-than-a-laptop/

·

Profiling and improving the runtime of a large pytest test suite | Niklas Meinzer

Mostly about the general approach on how to profile this kind of things. Still a couple of interesting pytest specific tips in here.

tech · python · tests · performance · profiling

September 20, 2021 at 9:53:04 AM GMT+2 * · permalink

·

https://www.niklas-meinzer.de/post/2019-07_pytest-performance/

·

Working Around a Case Where the Postgres Planner Is "Not Very Smart" - Heap

Interesting exploration and workaround for the Postgres query planner.

tech · databases · postgresql · performance

August 10, 2021 at 9:37:48 AM GMT+2 * · permalink

·

https://heap.io/blog/when-the-postgres-planner-is-not-very-smart

·

gProfiler is a system-wide profiler, combining multiple sampling profilers

This looks like an interesting full system profiler.

tech · profiling · performance

June 28, 2021 at 9:08:31 AM GMT+2 * · permalink

·

https://github.com/granulate/gprofiler

·

The price of dynamic memory: Allocation - Johny's Software Lab

Interesting piece covering: how a memory allocator works, why it can be slow, how to use it the best way possible and how to pick an allocator for your project.

tech · memory · performance

May 11, 2021 at 10:46:14 AM GMT+2 * · permalink

·

https://johnysswlab.com/the-price-of-dynamic-memory-allocation/

·

Branch predictor: How many "if"s are too many? Including x86 and M1 benchmarks!

This is a very interesting deep dive in how branch predictors work. Also comparing timing profiles between different families of CPUs.

tech · cpu · performance

May 11, 2021 at 10:33:45 AM GMT+2 * · permalink

·

https://blog.cloudflare.com/branch-predictor/

·

The compiler will optimize that away | RoyalSloth

Excellent reminder about where the limit is for the compiler to optimize things. Nowadays it's mostly about the memory accesses and then it means that the design matters a lot. Object-oriented designs being far from optimal here. Data-oriented designs fare much better but are definitely less friendly for human brains to reason about them.

tech · performance · data-oriented · object-oriented

May 3, 2021 at 3:01:35 PM GMT+2 * · permalink

·

https://blog.royalsloth.eu/posts/the-compiler-will-optimize-that-away/

·

The Mobile Performance Inequality Gap, 2021 - Infrequently Noted

Very thorough analysis on the kind of web frontend performances you can expect for most people on mobile. Since we basically need to reduce the footprint of such frontends to make this sustainable again this is a very welcome article.

tech · performance · web · html · css · javascript · ecology

March 23, 2021 at 6:58:33 PM GMT+1 * · permalink

·

https://infrequently.org/2021/03/the-performance-inequality-gap/

·

How they SRE (Site Reliability Engineering)

Obviously didn't read it all but this is a very large knowledge repository of practices from many companies one can get inspired by to work on Site Reliability Engineering. It is especially comprehensive since it's not only about technical tips but also deals with hiring, team building and culture (which is almost as important if not more).

tech · production · reliability · devops · monitoring · performance

February 23, 2021 at 10:25:38 AM GMT+1 * · permalink

·

https://github.com/upgundecha/howtheysre

·