A Systems Engineer’s Guide to Benchmarking with RDTSC
A deep dive into rdtsc, instruction stream serialization, and memory fences for precise cycle-level performance measurement.
Most Popular
What Every Developer Should Know About GPU Computing
Oct 18, 2023 • Abhinav Upadhyay
How Unix Spell Ran in 64kB RAM
Jan 12 • Abhinav Upadhyay
How Many Lines of C it Takes to Execute a + b in Python?
Dec 6, 2023 • Abhinav Upadhyay
A Software Engineer's Guide to Reading Research Papers
Jan 28 • Abhinav Upadhyay
Recent posts
My Top 5 Favourite Features in Python 3.14
Exploring the concurrency, debugging, and performance upgrades that make Python 3.14 special.
Oct 11 • Abhinav Upadhyay
Understanding Weak References in Python
Understanding Python’s memory management with weak references
Sep 30 • Abhinav Upadhyay
Compiling Python to Run Anywhere
A guest post on building a Python compiler that generates optimized kernels while preserving the language’s simplicity.
Sep 23 • Abhinav Upadhyay and Yusuf Olokoba
What Makes System Calls Expensive: A Linux Internals Deep Dive
An explanation of how Linux handles system calls on x86-64 and why they show up as expensive operations in performance profiles
Sep 16 • Abhinav Upadhyay
How to Leverage the CPU’s Micro-Op Cache for Faster Loops
Measuring, analyzing, and optimizing loops using Linux perf, Top-Down Microarchitectural Analysis, and the CPU’s micro-op cache
Aug 15 • Abhinav Upadhyay