Indeed this. It's not only about payload size, it's also about CPU consumption. Our profession still assumes too readily that users will get faster CPUs on a regular basis.
It's nice to see a new benchmark being published, and this one seems to follow real-life scenarios. We can expect browser engine performance to increase as a result.
A response to "The Hunt for the Missing Data Type" article. There are indeed potential solutions, but they're not really used/usable in the industry right now. Maybe tomorrow.
Indeed, graphs are peculiar beasts. When dealing with graph related problems there are so many choices to make that it's hard or impossible to come up with a generic solution.
Interesting take, even though I'm not sure I buy it completely. Still, it's a compelling plea for aiming at power efficiency and squeezing performance out of software.
Interesting library if you have to do a lot of heavy analysis work on strings.
This is indeed an odd situation... there is no good explanation for why things are this way.
Nice exploration of the GitLab database schema. It highlights quite a few of the choices made with an eye on performance.
Very interesting approach to JSON parsing. Comes with a very thorough performance analysis.
Not necessarily practical advice for most of our daily code, but it exhibits interesting low-level details about argument passing. Might come in handy in a few cases, so worth keeping in mind.
A very precise and thorough article about GPU occupancy: what it is, how it relates to perceived performance, its potential relationship with cache thrashing, and the tools you can use to measure it on AMD GPUs.
Very nice collection of stories from the trenches of Firefox development. Lots of lessons learned to unpack about optimizing for the right thing, tooling, telemetry and so on.
This is unsurprisingly highly dependent on the actual code, not only on the hardware.
Seen this a bit too often indeed. When people learn about std::move they tend to sprinkle it everywhere, preventing proper optimizations. Its use should usually be fairly limited.
Good reminder that false sharing is a real thing. It's easier to encounter than you'd think once you start dabbling in multi-threading.
This is indeed a nice improvement. I hope they keep working in this direction.
Interesting exploration of web resource performance with and without bundling. It also digs into the reasons behind the observed behavior; definitely worth keeping in mind.
Looks like an interesting serialization framework. If it holds true to its claims it could be very useful in some places.
Will AMD really turn this around? Wait and see.
Very thorough paper on optimization techniques when dealing with GPUs. Can be a useful reference or starting point to then dig deeper. Should also help to pick the right technique for your particular problem.
A good reminder that depending on what happens in the kernel, the I/O time you were expecting might turn out to be purely CPU time.
Good list of things to keep in mind when thinking about performance. Of course, always measure with a profiler when you want to be really sure.
Interesting way to approximate how loaded a system is.
Interesting tale exploring how a change in includes impacted cache misses. Sneaky (and solved by more recent compilers).
The claim is huge. The story doesn't quite say how much really comes from Elixir and how much from the revised architecture. That being said, going for something like Elixir definitely has an impact on the architecture... could it be that it pushes for better patterns?
Another partial quote which led to misunderstanding. One should indeed think about performance early on.
Obvious advice perhaps, but so easily forgotten somehow...
OK, this could be big for Python. Let's see how they execute this plan. It carries some risks as well, but they seem well aware of them.
Interesting optimization on this somewhat common data structure.
Unsurprisingly, it's not as simple as it sounds. Type hints in Python can be used for various reasons, but performance is rarely the main motive. It would need other adjustments to the runtime. People are working on it, and this article is an interesting dive into how things work under the hood.
This compilation technique brings very interesting results. Hopefully it will find its way into some JIT compilers.
Interesting research turning to genetic algorithms to optimize bytecode handler dispatchers.
Deep dive into a proper benchmark and implementation for running 1M tasks on the Erlang runtime. Clearly the previous benchmark had room for improvement.
Nice and thorough workshop on vectorization: where it comes from, what it can do, and how you can write code that is easier for the compiler to vectorize.
Doesn't give the whole picture (memory isn't the only important parameter) but interesting results nonetheless. A few surprises in there; Java and C#, for instance, do much better than one might assume.
Interesting take. Will it lead to paying more attention to performance in software? Will it mean the rise of specialized CPUs? Time will tell.
Good explanation of how flame graphs are produced and how to read them. It also gives a few tips on what to look for when optimizing.
Interesting exploration on the performance of SSDs regarding write patterns. Turns out sequential IO is still a thing, just for a different reason than with good old HDDs.
Very interesting to see that move to owned hardware... turns out that in their case not only is the invoice smaller, but performance is much better as well.
Nice set of tips, I knew a few but not all of them. The discussion around CTEs is interesting.
Nice exploration of the impact of false sharing on performance in several hardware scenarios. A couple of surprises along the way.
Shouldn't come as a surprise if you've paid attention to C++'s evolution over the past 30 years. We're now reaping the fruits though, so it's become genuinely easy to keep both options in sight when designing. This is especially important for performance-sensitive code.
Nothing really new here (apart from the "how easy it is these days!")... Still it needs to be reminded on a regular basis. :-)
Nice post explaining the common algorithms used for load balancing, each with its own trade-offs of course. Well done with the tiny simulations.
Polars really looks like a nice alternative to Pandas with a nice upgrade path from data exploration to production.
Nice walkthrough of using PyO3 to make some Python code much faster. Nice to see how useful py-spy turns out to be in such scenarios as well.
Very interesting conversation between Uncle Bob and one of the recent critics of his work regarding performance. I like how he admits some faults in the way he presents things and tries to improve for later rather than trying to be right. Some people should learn from that. There's clearly a tension between performance and what is described in Clean Code; it'd be pointless to deny it.
perf is now also available to Python programs. This can definitely be useful for proper profiling.
Interesting position from AMD regarding the race for the next supercomputers. Everyone is being caught up by energy-efficiency constraints, so it will need to be addressed both at the processor architecture level and at the software architecture level. How we design our computing tasks will matter more and more.
A bit of a sarcastic tone but a few good points in there. It also shows interesting alternatives to C++ for squeezing every ounce of performance out of your code, whatever platform it runs on. Of the three options explored, I really only knew about Numba.
Excellent analysis and explanation of the stutter problem people experience with game engines. It's an artifact of the graphics pipeline becoming more asynchronous with no way to know when something is really displayed. Extra graphics APIs will be needed to solve this for real.