Subscribe
Sign in
Home
Podcast
Notes
CPython Internals
AI
Computer Architecture
Linux Internals
x86-assembly
Contact Me
Archive
About
How PyTorch Generates Random Numbers in Parallel on the GPU
A deep dive into Philox and counter-based RNGs
READ THE LATEST
Most Popular
View all
What Every Developer Should Know About GPU Computing
Oct 18, 2023
•
Abhinav Upadhyay
319
12
37
How Unix Spell Ran in 64kB RAM
Jan 12, 2025
•
Abhinav Upadhyay
104
2
14
How Many Lines of C it Takes to Execute a + b in Python?
Dec 6, 2023
•
Abhinav Upadhyay
56
5
6
A Software Engineer's Guide to Reading Research Papers
Jan 28, 2025
•
Abhinav Upadhyay
168
12
Recent posts
View all
x86 Addressing Modes, Part 1 — Immediate and Direct Access
The foundations of memory access: static allocation, addressing modes, and the first steps toward low-level thinking.
Nov 12, 2025
•
Abhinav Upadhyay
9
2
A Systems Engineer’s Guide to Benchmarking with RDTSC
A deep dive into rdtsc, instruction stream serialization, and memory fences for precise cycle-level performance measurement.
Oct 23, 2025
•
Abhinav Upadhyay
17
3
My Top 5 Favourite Features in Python 3.14
Exploring the concurrency, debugging, and performance upgrades that make Python 3.14 special.
Oct 11, 2025
•
Abhinav Upadhyay
19
2
4
Understanding Weak References in Python
Understanding Python’s memory management with weak references
Sep 30, 2025
•
Abhinav Upadhyay
13
1
Compiling Python to Run Anywhere
A guest post on building a Python compiler that generates optimized kernels while preserving the language’s simplicity.
Sep 23, 2025
•
Abhinav Upadhyay
and
Yusuf Olokoba
37
7
9
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts