Suhaib Khan
@suhaibkhan.bsky.social
📤 177
📥 217
📝 144
Interested in HPC & large storage systems
reposted by
Suhaib Khan
Eric Neustadter (e)
24 days ago
No one remembers history. When IBM and Microsoft partnered on OS/2, IBM got upset when a MS employee rewrote some code to be smaller and more efficient because “negative lines of code!”. Measure the wrong thing and you’ll get the wrong behavior. Incentives matter. (╯°□°)╯︵ ┻━┻
3
31
9
AI is turning scientists into publishing machines—and quietly funneling them into the same crowded corners of research.
spectrum.ieee.org/ai-science-r...
Are Scientists Sacrificing Originality for Speed With the Use of AI?
New analysis suggests AI tools narrow the range of ideas explored
https://spectrum.ieee.org/ai-science-research-flattens-discovery
about 1 month ago
0
0
0
Nvidia’s New
#Rubin
Architecture Thrives on Networking. Some computations happen while data is en route.
spectrum.ieee.org/nvidia-rubin...
@spectrum.ieee.org
Nvidia’s Vera Rubin Architecture Thrives on Networking
Nvidia's Rubin GPU boasts 50 petaflops of 4-bit computation, but the real magic lies in its six new chips working together.
https://spectrum.ieee.org/nvidia-rubin-networking
about 1 month ago
1
3
0
AI Coding Assistants Are Getting Worse. Newer models are more prone to silent but deadly failure modes.
spectrum.ieee.org/ai-coding-de...
#AI
@spectrum.ieee.org
Newer AI Coding Assistants Are Failing in Insidious Ways
One AI coding assistant power user says the tools are hitting a plateau, and some are even declining. What's causing this unexpected twist in tech?
https://spectrum.ieee.org/ai-coding-degrades
about 1 month ago
0
1
0
reposted by
Suhaib Khan
Glenn K. Lockwood
about 2 months ago
The problem with saying "just tier to HDD if flash prices are too high" is HDDs are scarce too. HDD manufacturers opted not to invest in expanding fab capacity b/c they'd never recover the capex given declining HDD sales. HDDs are a myopic investment right now.
blocksandfiles.com/2026/01/09/s...
Sky-high flash prices: data reduction or tier to disk?
As SSD prices rise we wonder whether we’re seeing a panic-buying storage media price rise bubble or is AI-driven demand real? In either case, what should we do about it? SSDs cost more than disks so a...
https://blocksandfiles.com/2026/01/09/sky-high-flash-prices-data-reduction-or-tier-to-disk/
1
0
1
The Rubin platform uses codesign across six chips — the NVIDIA Vera CPU, NVIDIA Rubin GPU, NVIDIA NVLink™ 6 Switch, NVIDIA ConnectX®-9 SuperNIC, NVIDIA BlueField®-4 DPU and NVIDIA Spectrum™-6 Ethernet Switch — to slash training time and inference token costs.
nvidianews.nvidia.com/news/rubin-p...
NVIDIA Kicks Off the Next Generation of AI With Rubin — Six New Chips, One Incredible AI Supercomputer
NVIDIA today kickstarted the next generation of AI with the launch of the NVIDIA Rubin platform, comprising six new chips designed to deliver one incredible AI supercomputer.
https://nvidianews.nvidia.com/news/rubin-platform-ai-supercomputer
about 2 months ago
0
1
0
“AGI” has turned into a term of hype rather than a term with a precise meaning. Andrew Ng proposes a new version of the Turing Test, the Turing-#AGI Test.
www.deeplearning.ai/the-batch/is...
New Year Special! Hopes for 2026 from David Cox, Adji Bousso Dieng, Juan M. Lavista Ferres, Tanmay Gupta, Pengtao Xie, Sharon Zhou
The Batch AI News and Insights: Happy 2026! Will this be the year we finally achieve AGI? I’d like to propose a new version of the Turing Test...
https://www.deeplearning.ai/the-batch/issue-334/
about 2 months ago
0
0
0
reposted by
Suhaib Khan
Texas Advanced Computing Center at UT Austin
about 2 months ago
NASA's SPHEREx is creating a 3D map of the sky every six months over a two-year period, delivering a sweeping view of the cosmos. Behind the scenes, teams from NASA JPL, Caltech, the NEID project, and TACC built the mission’s first fully automated science data pipeline. Learn more:
bit.ly/4b7yufa
0
2
1
reposted by
Suhaib Khan
The Register
2 months ago
Nvidia spends $5B on Intel bailout, instantly gets $2.5B richer
Nvidia spends $5B on Intel bailout, instantly gets $2.5B richer
The deal negotiated in September locked Nvidia into a purchase price of $23 per share. Intel shares traded at $36 on Monday. Nvidia’s $5 billion Intel stock purchase is already worth $7.58 billion, turning the recently approved bailout of its rival into a shrewd financial play.…
http://dlvr.it/TQ3rPD
3
27
9
reposted by
Suhaib Khan
Sarah Neuwirth 🇪🇺👩🏼💻👩🏻🏫😄
2 months ago
📢 REX-IO 2026 Workshop: Call for Papers! 📢 I'm happy to announce the 6th Workshop on Re-envisioning Extreme-Scale I/O for Emerging Hybrid HPC Workloads at ACM HPDC 2026! Submissions are due March 31, 2026 (11:59PM AoE). 📜 🦖 Further info:
sites.google.com/view/rexio/
#HPC
#supercomputing
0
2
3
reposted by
Suhaib Khan
Torsten Hoefler 🇨🇭
2 months ago
NVIDIA explains why they used FP8 as a scaling factor in NVFP and not MXFP-style E8M0. The idea is to preserve the largest value in each block.
buff.ly/SjwVr6w
(16:00 ff) We found this to be very true in a study and managed to recover some MXFP accuracy with micro-rotations:
buff.ly/gXaCAhF
0
4
1
reposted by
Suhaib Khan
Kenneth Hoste (boegel)
2 months ago
Schedule for
#HPC
, Big Data, and Data Science devroom at FOSDEM'26 (on Sun 1 Feb 2026, in Brussels) has been published:
fosdem.org/2026/schedul...
FOSDEM 2026 - HPC, Big Data & Data Science
https://fosdem.org/2026/schedule/track/hpc-big-data-data-science/
0
13
11
A well written (mostly) non-technical
#AI
primer by
@thedeadline.bsky.social
www.hpcwire.com/2025/12/16/a...
A (Mostly) Non-Technical AI Primer - HPCwire
I have studied and watched artificial intelligence grow over the last forty years. Like many, back in 1968, I was inspired by HAL 9000 in Stanley Kubrick’s 2001: A Space Odyssey. The question on my mi...
https://www.hpcwire.com/2025/12/16/a-mostly-non-technical-ai-primer/
2 months ago
0
0
0
AI’s Wrong Answers Are Bad. Its Wrong Reasoning Is Worse.
spectrum.ieee.org/ai-reasoning...
Researchers Are Uncovering Fundamental Flaws in How AI Reasons
As AI takes on agent roles in critical fields, reasoning failures raise risks.
https://spectrum.ieee.org/ai-reasoning-failures
2 months ago
0
1
0
reposted by
Suhaib Khan
Glenn K. Lockwood
2 months ago
NERSC recently did a wholesale replacement of its FDR InfiniBand storage fabric to RoCE. The IB was a greenfield installation back when I started in 2015, and replacing it with a competing technology in production is quite the feat. Glad to hear it succeeded.
www.nersc.gov/news-and-eve...
Network Upgrades Pave the Way to a Faster Future | NERSC
The National Energy Research Scientific Computing Center (NERSC), a U. S.
https://www.nersc.gov/news-and-events/news/network-upgrades-pave-the-way-for-a-faster-future
0
5
2
IDC: $314.2B in GPU-accelerated systems in the first three quarters of 2025. How sustainable is this crazy server spending? ODMs are now pushing almost 60% of worldwide server revenues, up from 45% last year.
www.nextplatform.com/2025/12/16/h...
#AI
#HPC
@nextplatform.bsky.social
2 months ago
0
1
0
The best version of
#HPC
: NCSA director Bill Gropp on ‘imagining an HPC utopia’
www.eurekalert.org/news-release...
@ncsaatillinois.bsky.social
NCSA director Bill Gropp on ‘imagining an HPC utopia’
Nearing his upcoming partial retirement, NCSA Director Bill Gropp shares his vision on what a utopia could look like for high-performance computing.
https://www.eurekalert.org/news-releases/1109982
2 months ago
0
2
1
NVIDIA has acquired SchedMD — the leading developer of
#Slurm
, an open-source workload management system for
#HPC
and
#AI
blogs.nvidia.com/blog/nvidia-...
NVIDIA Acquires Open-Source Workload Management Provider SchedMD
NVIDIA will continue to distribute SchedMD’s open-source, vendor-neutral Slurm software, ensuring wide availability for high-performance computing and AI.
https://blogs.nvidia.com/blog/nvidia-acquires-schedmd/
2 months ago
0
2
0
#HPC
in Transition: Jack Dongarra will deliver the
#ISC26
Closing Keynote on Thursday, June 25, 2026.
2 months ago
0
1
1
reposted by
Suhaib Khan
CSCfi
3 months ago
The EuroHPC Federation Platform will bring
@eurohpc-ju.bsky.social
systems together, making it easier for researchers and industry to find, access and use computing resources across Europe from a single place. Next step: the launch of MyEuroHPC web interface in March 2026 🔗
csc.fi/en/blog/euro...
0
0
1
reposted by
Suhaib Khan
Glenn K. Lockwood
3 months ago
I wrote up my notes from
#SC25
. Have a look:
blog.glennklockwood.com/2025/12/sc25...
I’ll keep picking away at the editing, but would love to hear more from others about what stood out to them. I wasn’t at the conference itself as much this year as in the past, so I know I missed a lot.
#HPC
SC'25 recap
The annual SC conference was held last week, drawing over 16,000 registrants and 560 exhibitors to St. Louis, Missouri to talk ab...
https://blog.glennklockwood.com/2025/12/sc25-recap.html
3
24
13
reposted by
Suhaib Khan
Glenn K. Lockwood
3 months ago
A brave stake in the ground that defines what is (and isn’t) a parallel file system. I generally agree with Chris’ explanation. But I’m sure he’ll get hate from parallel storage elitists who don’t like how inclusive his take is.
0
3
1
DOE to build an integrated discovery platform by linking together
#supercomputers
and other facilities at its 17 National Laboratories with industry and academia drawing on the expertise of roughly 40,000 scientists, engineers, and technical staff.
www.theregister.com/2025/11/25/t...
Trump orders nationwide AI Genesis Mission to drive science
DOE told to build a unified research platform linking federal compute, datasets, and national labs
https://www.theregister.com/2025/11/25/trump_ai_genesis_mission/?utm_source=dlvr.it&utm_medium=bluesky
3 months ago
0
2
0
reposted by
Suhaib Khan
Rich Miller
3 months ago
NVIDIA will introduce liquid-cooled busbars into the racks for its Vera Rubin platform, part of a broader evolution of data center racks and power design to support more powerful AI computing.
open.substack.com/pub/datacent...
As Densities Soar, AI Racks Add Liquid-Cooled Busbars
NVIDIA, Meta Bring Liquid Cooling into Power Chain for Extreme-Density Racks
https://open.substack.com/pub/datacenterrichness/p/as-densities-soar-ai-racks-add-liquid?utm_source=share&utm_medium=android&r=5q9v
0
1
1
Intel: We’re simplifying the Diamond Rapids platform with a focus on 16 Channel processors and extending its benefits down the stack to support a range of unique customers and their use cases. (Source: Intel Spokesperson to STH)
www.servethehome.com/intel-cancel...
@servethehome.com.web.brid.gy
Intel Cancels its Mainstream Next-Gen Xeon Server Processors
A major next-generation Intel Xeon platform has been removed from the company's roadmap. We have the details on the Diamond Rapids shift
https://www.servethehome.com/intel-cancels-its-mainstream-next-gen-xeon-server-processors/
3 months ago
0
2
0
reposted by
Suhaib Khan
ajdecon
3 months ago
Silly thread for a Saturday: some of the
#HPC
clusters I’ve worked on over the years. First up is Cielo, a Cray XE6 I worked on at LANL! Which might actually be the prettiest supercomputer I’ve worked on.
1
8
3
reposted by
Suhaib Khan
ajdecon
3 months ago
A random interesting fact about the Cray XE6 racks: air exhausted up from the top of the rack! The result was that the lights above Cielo all started failing over time due to being blasted with hot air. And no good way to access them due to the size of the cluster. Clearly an early photo 😁
1
8
2
reposted by
Suhaib Khan
The Wall Street Journal
4 months ago
Exclusive: Amazon.com is joining Microsoft in supporting legislation that threatens to further limit Nvidia’s ability to export to China, a rare split between the chip designer and two of its biggest customers.
Amazon and Microsoft Back Effort That Would Restrict Nvidia’s Exports to China
The legislation in Washington would give tech leaders preferential access to chips at their data centers around the world.
https://on.wsj.com/47W4r78
1
15
7
reposted by
Suhaib Khan
Reuters
4 months ago
Exclusive: Samsung hikes memory chip prices by up to 60% as shortage worsens, sources say
reut.rs/49QjvFI
Exclusive: Samsung hikes memory chip prices by up to 60% as shortage worsens, sources say
Samsung Electronics this month raised prices of certain memory chips - now in short supply due to the global race to build AI data centres - by as much as 60% compared to September, two people with knowledge of the hikes said.
https://reut.rs/49QjvFI
0
18
13
reposted by
Suhaib Khan
Torsten Hoefler 🇨🇭
4 months ago
Very nice overview of the emerging UALink standard with nice features such as splitting packets in switches, in-network computing, high energy efficiency, and lowest silicon overhead:
buff.ly/AgLvC1g
I'll be joining a panel at SC25 contrasting UALink and UEC next Wed:
buff.ly/BeCMFcL
Join us there
Introducing the UALink 200G 1.0 Specification Webinar
The Ultra Accelerator Link™ (UALink™) Consortium is an open industry standard group dedicated to advancing the UALink specification. The Consortium recently released the UALink 200G 1.0…
https://buff.ly/AgLvC1g
0
4
2
DARPA’s Next-Generation Microelectronics Manufacturing (NGMM) program is building a packaging plant in Austin that is dedicated to 3D heterogeneous integration (3DHI).
spectrum.ieee.org/3d-heterogen...
@spectrum.ieee.org
@darpa.mil
Why Is DARPA Betting on 3D Heterogeneous Integration?
Can a 1980s-era fab in Austin transform the future of microelectronics with 3D heterogeneous integration?
https://spectrum.ieee.org/3d-heterogeneous-integration
4 months ago
0
3
1
If you can read an analog clock correctly, you are still outperforming
#AI
in that regard.
spectrum.ieee.org/large-langua...
@spectrum.ieee.org
AI Struggles to Read Analog Clocks Correctly
AI struggles with analog clocks. What does this reveal about its limitations in image analysis?
https://spectrum.ieee.org/large-language-models-reading-clocks
4 months ago
0
1
0
reposted by
Suhaib Khan
Doug Eadline
4 months ago
Uhm, there is a typo in the headline, remove the "s" from insane. That should fix it.
1
5
1
reposted by
Suhaib Khan
Drew Jolly
4 months ago
Paderborn's new Otus
#supercomputer
features 142,656 processor cores, including AMD “Turin” and
#Nvidia
H100
#GPUs
, and 5PB of storage managed with IBM Spectrum Scale (formerly GPFS) file system.
ow.ly/mnlX50XqGAs
‘Otus’ Now Open for Business at Germany's PC2 - HPCwire
The Paderborn Center for Parallel Computing (PC2) in Germany this week opened its newest and largest supercomputer for business. Otus, which sports more than 142,000 processor cores, will be used to r...
https://ow.ly/mnlX50XqGAs
0
2
1
Andrew Ng:
#AI
has stark limitations, and despite rapid improvements, it will remain limited compared to humans for a long time.
#AI
is amazing, but it has unfortunately been hyped up to be even more amazing than it is.
www.deeplearning.ai/the-batch/is...
Safer (and Sexier) Chatbots, Better Images Through Reasoning, The Dawn of Industrial AI, and more...
The Batch AI News and Insights: I recently received an email titled “An 18-year-old’s dilemma: Too late to contribute to AI?” Its author, who gave...
https://www.deeplearning.ai/the-batch/issue-327/
4 months ago
0
0
0
reposted by
Suhaib Khan
CSCfi
4 months ago
Europe takes a major step in research connectivity! A new terabit network will link supercomputers across the continent, including EuroHPC’s
@lumi-supercomputer.eu
located in CSC’s data center in Kajaani 🚀 🔗
csc.fi/en/news/tera...
0
5
4
reposted by
Suhaib Khan
Jan Gray
4 months ago
Are 2030 AI hyperscalers capital constrained, power constrained, DRAM constrained, flash constrained, compute constrained, software constrained, or :-) demand constrained?
1
5
1
reposted by
Suhaib Khan
Rich Miller
4 months ago
Racks filled with GPUs and liquid cooling gear can now weigh 6,000 pounds or more, requiring new approaches to address human safety and investment protection. Google, Meta, and Microsoft are turning to robotics to safely move these huge racks.
open.substack.com/pub/datacent...
Data Centers Turn to Robots to Haul Multi-Ton Racks
Hyperscalers, OCP Ramp Up Robotics Teams for Worker Safety, Productivity
https://open.substack.com/pub/datacenterrichness/p/data-centers-turn-to-robots-to-haul?utm_source=share&utm_medium=android&r=5q9v
0
0
1
reposted by
Suhaib Khan
IEEE Spectrum
4 months ago
Scammers have a new way of getting into your pockets: by targeting your
#AI
assistant. They use prompt engineering, embedding code in emails that trick AI tools into taking malicious actions. Learn how to protect your digital presence.
spectrum.ieee.org/ai-agent-phi...
1
8
5
reposted by
Suhaib Khan
Torsten Hoefler 🇨🇭
4 months ago
Can we build an
#AI
#Climate
Scientist? Asked at the ADIA Lab Symposium in Abu Dhabi last week - now online at
buff.ly/6igSeyg
:-). Much work to be done - this is outlining some directions of indicative results with a lot of potential to accelerate AI for Science.
0
1
1
reposted by
Suhaib Khan
IEEE Spectrum
4 months ago
AI excels in complex tasks but falters at reading analog clocks—what does this tell us about its limitations?
AI Struggles to Read Analog Clocks Correctly
AI struggles with analog clocks. What does this reveal about its limitations in image analysis?
https://spectrum.ieee.org/large-language-models-reading-clocks?share_id=9039133
0
8
5
reposted by
Suhaib Khan
Tobias Mann
4 months ago
Nvidia's biggest scale up domain is 72 GPUs. Google's is 9,216 TPUs. Historically TPUs have trailed on FLOPS, memory, & bandwidth. That's no longer the case with Ironwood. Google has a Blackwell-class TPU with absurd scale. More on
@theregister.com
⬇️
www.theregister.com/2025/11/06/g...
TPU v7, Google's answer to Nvidia's Blackwell is nearly here
Chocolate Factory's homegrown silicon boasts Blackwell-level perf at massive scale
https://www.theregister.com/2025/11/06/googles_ironwood_tpus_ai/
0
5
1
reposted by
Suhaib Khan
Semiconductor News by Dylan Martin
4 months ago
Exclusive: Intel is losing a data center AI executive who previously helped lead the company’s Gaudi accelerator chip efforts and is now headed for a job at AMD, CRN has learned.
www.crn.com/news/compone...
Exclusive: Intel Is Losing A Data Center AI Executive To AMD
Intel is losing a data center AI executive who previously helped lead the company’s Gaudi accelerator chip efforts and is now headed for a job at AMD, CRN has learned.
https://www.crn.com/news/components-peripherals/2025/exclusive-intel-is-losing-a-data-center-ai-executive-to-amd
0
0
2
reposted by
Suhaib Khan
Torsten Hoefler 🇨🇭
4 months ago
Collaborator and friend Dan Alistarh talks at ETH about using the new NvFP4 and MXFP4 block formats for inference. Some going from "terrible" accuracy to acceptable using micro rotations to smoothen outliers in blocks.
arxiv.org/abs/2509.23202
Great collaboration and cool stuff
0
1
1
reposted by
Suhaib Khan
Glenn K. Lockwood
4 months ago
Google recently posted a promo for using their managed Lustre service to accelerate inferencing via KV caching. Raises questions: 1. What ever happened to Google Managed DAOS (ParallelStore)? It performs better than Lustre. 2. Does Gemini use this? Unlikely. See
glennklockwood.com/garden/atten...
attention
Attention is the mathematical operation within a transformer that allows different parts of the input to figure out how important they are to each other ...
https://glennklockwood.com/garden/attention#ring-attention
0
3
1
OpenAI spreads the imaginary wealth beyond Microsoft with $38B AWS deal. Amazon deal still dwarfed by $250B Azure commitment made as part of OpenAI's for-profit transformation.
www.theregister.com/2025/11/03/o...
OpenAI signs $38B cloud computing deal with AWS
Amazon deal still dwarfed by $250B Azure commitment made as part of OpenAI's for-profit transformation
https://www.theregister.com/2025/11/03/openai_inks_38b_deal_with_aws/?utm_source=dlvr.it&utm_medium=bluesky
4 months ago
0
0
0
reposted by
Suhaib Khan
The Wall Street Journal
4 months ago
Silicon Valley’s biggest companies are already planning to pour $400 billion into artificial intelligence efforts this year. They all say it’s nowhere near enough.
Big Tech Is Spending More Than Ever on AI and It’s Still Not Enough
Meta, Alphabet, Microsoft and Amazon have all said they will increase spending in 2026. But investors have given mixed signals.
https://on.wsj.com/3X1nBTU
3
16
11
reposted by
Suhaib Khan
Rich Miller
4 months ago
The largest hyperscale operators say demand for AI services is filling data centers as fast as they can build them, with several saying they are compute-constrained. As a result, they expect to build even more data center space in 2026.
datacenterrichness.substack.com/p/hyperscale...
Hyperscale Building Boom Poised to Continue
Microsoft, Google, Meta and AWS Describe Strong Demand for New Services
https://datacenterrichness.substack.com/p/hyperscale-building-boom-poised-to
1
0
1
reposted by
Suhaib Khan
IEEE Spectrum
4 months ago
Each time a new AI training benchmark is introduced, the fastest training time gets longer. Then, hardware improvements gradually bring the execution time down, only to get thwarted again by the next benchmark. Then the cycle repeats itself.
AI Model Growth Outpaces Hardware Improvements
AI training races are heating up as benchmarks get tougher.
https://spectrum.ieee.org/mlperf-trends?share_id=9028068
1
6
3
Diamond Blankets Will Keep Future Chips Cool. Growing a micrometers-thick layer of diamond inside advanced chips spreads out the heat and drops the temperature more than 50°C.
spectrum.ieee.org/diamond-ther...
@spectrum.ieee.org
Can Diamonds Solve the Chip Heat Dilemma?
Stanford's diamond innovation could redefine chip cooling, making electronics more efficient and powerful.
https://spectrum.ieee.org/diamond-thermal-conductivity
4 months ago
0
1
0