Luke Lau
@lukel97.bsky.social
📤 102
📥 56
📝 11
LLVM at Igalia
reposted by
Luke Lau
about 1 month ago
"How NOT To Program an Out-of-order Vector Processor" slides are public.
static.sched.com/hosted_files...
1
1
1
reposted by
Luke Lau
Igalia
about 1 month ago
We're looking forward to the RISC-V Summit North America next week where Mikhail Gadelha (one of our compiler engineers) will be presenting "Unlocking 15% More Performance: A Case Study in LLVM Optimization for RISC-V". Be sure to catch his talk next Thurs
riscvsummit2025.sched.com/event/28OTp/...
0
10
5
reposted by
Luke Lau
Hong Kong Free Press HKFP
6 months ago
Police have deployed an armoured vehicle in Hong Kong's commercial heart, amidst an ongoing heavy security presence on the 36th anniversary of the Tiananmen Square crackdown. In full:
buff.ly/f4hVB50
In Pictures: HK police deploy armoured vehicle on Tiananmen anniversary
https://buff.ly/shYWuZG
0
14
12
reposted by
Luke Lau
Alex Bradbury
7 months ago
I'm delighted to see two of @igalia.com's projects for RISE highlighted at the RISC-V Summit Europe. Find out more about our work on both LLVM optimisation and testing/CI on the RISE blog (with more to come in the future!):
riseproject.dev/2025/05/08/p...
riseproject.dev/2024/10/15/w...
0
6
3
reposted by
Luke Lau
Alex Bradbury
8 months ago
We're looking forward to EuroLLVM next week in Berlin. Be sure to check out talks from my colleague @lukel97.bsky.social and myself on:
* Work to further improve RISC-V vector codegen (extending the VL Optimizer), and
* Work done with the support of RISE to improve RISC-V LLVM testing.
0
9
4
reposted by
Luke Lau
Paulo Matos
9 months ago
What if I told you 3DNow! square root reciprocals are defined for negative numbers?... Also the amazing FEX 2503 is out. Read about some of my work and the work of other FEX maintainers in the release notes:
fex-emu.com/FEX-2503/
#fex
#igalia
#gaming
#linux
#arm64
FEX 2503 Tagged
Here we are again, another month and some more cool changes with FEX. Let’s dive in and see what has changed!
https://fex-emu.com/FEX-2503/
1
4
2
reposted by
Luke Lau
Alex Bradbury
9 months ago
Some notes on ccache+LLVM. Summary: if you do a lot of builds across different checkouts/worktrees/builddirs, be sure to set the base_dir option and -DLLVM_USE_RELATIVE_PATHS_IN_DEBUG_INFO=ON
muxup.com/2025q1/ccach...
ccache for LLVM builds across multiple directories
TL;DR: ccache base_dir saves the day
https://muxup.com/2025q1/ccache-for-llvm-builds-across-multiple-directories
0
9
4
reposted by
Luke Lau
10 months ago
Hello you fine Internet folks! Today's article is on SiFive's P550 microarchitecture. The P550 core is one of the fastest RISC-V cores currently available and is claimed to be comparable to ARM's Cortex A75. Hope y'all enjoy!
old.chipsandcheese.com/2025/01/26/i...
open.substack.com/pub/chipsand...
Inside SiFive’s P550 Microarchitecture
RISC-V is a relatively young and open source instruction set. So far, it has gained traction in microcontrollers and academic applications. For example, Nvidia replaced the Falcon microcontrollers …
https://old.chipsandcheese.com/2025/01/26/inside-sifives-p550-microarchitecture/
0
12
5
reposted by
Luke Lau
Joyee Cheung
11 months ago
New blog post covering the mysterious 10ms startup regression of Node.js on macOS, the journey of investigating the issue with various performance tools, and figuring out the fix (which also helped make the binary smaller).
joyeecheung.github.io/blog/2025/01...
Executable loading and startup performance on macOS
Recently, I fixed a startup performance regression in Node.js on macOS after an extensive investigation. Along the way, I learned a lot about tools on macOS and Node.js compilation workflows that don’
https://joyeecheung.github.io/blog/2025/01/11/executable-loading-and-startup-performance-on-macos/
3
127
20
A Simple ELF
4zm.org/2024/12/25/a...
A Simple ELF - The Ivory Tower
The Ivory Tower is a blog about software engineering and development philosophy by Anders Sundman.
https://4zm.org/2024/12/25/a-simple-elf.html
11 months ago
0
0
0
reposted by
Luke Lau
Joyee Cheung
12 months ago
After two months of chasing, finally found out what's happening behind this mysterious startup time regression on macOS from Node.js v20.x - it's missing -fvisibility=hidden 😅 (I guess that's what happens when the build configs become dusty enough)
github.com/nodejs/node/...
build: build v8 with -fvisibility=hidden on macOS by joyeecheung · Pull Request #56275 · nodejs/node
V8 should be built with -fvisibility=hidden, otherwise the resulting binary would contain unnecessary symbols. In particular, on macOS, this leads to 5000+ weak symbols resolved at runtime, leading...
https://github.com/nodejs/node/pull/56275
3
59
10
reposted by
Luke Lau
Joe Cutler
12 months ago
Recently I came across this treatise by Stephen Dolan
github.com/ocaml/ocaml/...
Abnormally slow loop (25x) under OCaml 5 / macOS / arm64 · Issue #13262 · ocaml/ocaml
Hello, I am using macOS Ventura 13.6.7 with an Apple M2 Max processor. A loop that writes values into an integer array is about 20x slower with OCaml 5 than with OCaml 4. Using Array.set versus Arr...
https://github.com/ocaml/ocaml/issues/13262#issuecomment-2478765094
2
23
5
Trying to find the slowest possible RISC-V instruction. This single vlse8.v with a stride of 65536 bytes takes 66 million cycles on a Banana Pi F3. That's 0.04 seconds @1.6GHz
#risc-v
12 months ago
4
23
5
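The timing quoted in the vlse8.v post above can be sanity-checked with a quick conversion from cycles to seconds. This is only a sketch of the arithmetic: the 66 million cycles and the 1.6 GHz clock are the figures from the post, while the helper name `cycles_to_seconds` is made up for illustration.

```python
def cycles_to_seconds(cycles: int, clock_hz: float) -> float:
    """Convert a cycle count to seconds, assuming a fixed clock frequency."""
    return cycles / clock_hz


cycles = 66_000_000   # cycle count quoted in the post
clock_hz = 1.6e9      # 1.6 GHz, as quoted in the post

seconds = cycles_to_seconds(cycles, clock_hz)
print(f"{seconds:.3f} s")  # roughly 0.04 s, matching the post
```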
reposted by
Luke Lau
12 months ago
Here are some slightly tricky RVV mask patterns.
1
7
3