xjdr
@xjdr.bsky.social
👤 1091
👥 91
📝 15
hot takes, linear Algebra, JAX apologist, Raconteur
I have become radicalized
10 months ago
10
54
0
so far the experience has been pretty good here but the default feeds are _terrible_. feels like it's going to take a few weeks to whip these feeds into shape with mutes and "show less like these" plus lots of likes. Following feed is good but i need to follow a lot more people
10 months ago
15
98
4
very interesting work and it reminds me a bit of this paper. Tokenizers and RoPE must die. after samplers, i am on to those next ...
arxiv.org/abs/2407.036...
10 months ago
9
78
12
i keep forgetting to include this cause i always assume people do this by default. Any time there is an exponent or a norm, you should be working in the highest practical precision
10 months ago
0
25
1
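A minimal JAX sketch of the point above, using RMSNorm as the example of "a norm": the input stays in bf16, the reduction and the reciprocal square root run in float32, and the result is cast back. The function name `rms_norm`, the `eps` value, and the shapes are illustrative assumptions, not code from any repo mentioned in this feed.

```python
import jax
import jax.numpy as jnp


def rms_norm(x, weight, eps=1e-6):
    """RMSNorm that does its reduction in float32 and returns x's dtype.

    Hypothetical helper for illustration only.
    """
    orig_dtype = x.dtype
    # Upcast before squaring: the sum of squares is where bf16 loses precision.
    x32 = x.astype(jnp.float32)
    mean_sq = jnp.mean(jnp.square(x32), axis=-1, keepdims=True)
    # rsqrt is also evaluated in float32.
    y = x32 * jax.lax.rsqrt(mean_sq + eps)
    # Apply the scale in float32, then cast back once at the end.
    return (y * weight.astype(jnp.float32)).astype(orig_dtype)


x = jax.random.normal(jax.random.PRNGKey(0), (4, 512), dtype=jnp.bfloat16)
w = jnp.ones((512,), dtype=jnp.bfloat16)
out = rms_norm(x, w)
```

The only cast back to bf16 happens after the whole normalization, so the low-precision rounding is applied once to the final values rather than to every intermediate.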
the BigVision repo is my current reference impl for gemma and ViT. such an underrated repo
@giffmana.bsky.social
and team are doing the lord's work
github.com/google-resea...
github.com/google-resea...
big_vision/big_vision/models/ppp/gemma.py at main · google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. - google-research/big_vision
https://github.com/google-research/big_vision/blob/main/big_vision/models/ppp/gemma.py
10 months ago
3
92
12
now that people are paying attention again, here is your periodic reminder. Always run in bf16. always apply RoPE and attention softmax in float32 (as shown here)
github.com/xjdr-alt/ent...
10 months ago
4
77
9
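A rough JAX sketch of what the reminder above could look like in practice: q, k, v live in bf16, while the RoPE rotation and the attention softmax are computed in float32 and cast back. This is not taken from the linked repo. The names `apply_rope` and `attention`, the split-halves rotation convention, `theta=10000.0`, and the `[seq, heads, head_dim]` layout are all assumptions made for illustration.

```python
import jax
import jax.numpy as jnp


def apply_rope(x, positions, theta=10000.0):
    """Rotary embedding for x of shape [seq, heads, head_dim], done in float32."""
    head_dim = x.shape[-1]
    # Frequencies and angles in float32: bf16 cannot hold large positions exactly.
    exponents = jnp.arange(0, head_dim, 2, dtype=jnp.float32) / head_dim
    inv_freq = 1.0 / (theta ** exponents)
    angles = positions.astype(jnp.float32)[:, None] * inv_freq[None, :]
    cos = jnp.cos(angles)[:, None, :]  # [seq, 1, head_dim / 2]
    sin = jnp.sin(angles)[:, None, :]
    x32 = x.astype(jnp.float32)
    # Split-halves convention (an assumption; some codebases interleave pairs).
    x1, x2 = jnp.split(x32, 2, axis=-1)
    rotated = jnp.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)
    return rotated.astype(x.dtype)


def attention(q, k, v):
    """Causal attention over bf16 q, k, v of shape [seq, heads, head_dim]."""
    seq, _, head_dim = q.shape
    positions = jnp.arange(seq)
    q = apply_rope(q, positions)
    k = apply_rope(k, positions)
    # Matmul stays in bf16; the Python float scale does not promote the dtype.
    scores = jnp.einsum("qhd,khd->hqk", q, k) * (head_dim ** -0.5)
    # Upcast before masking and softmax so exp and the sum run in float32.
    scores = scores.astype(jnp.float32)
    causal = jnp.tril(jnp.ones((seq, seq), dtype=bool))
    scores = jnp.where(causal, scores, jnp.finfo(jnp.float32).min)
    probs = jax.nn.softmax(scores, axis=-1)
    # Cast the probabilities back to bf16 for the value matmul.
    return jnp.einsum("hqk,khd->qhd", probs.astype(v.dtype), v)


kq, kk, kv = jax.random.split(jax.random.PRNGKey(0), 3)
shape = (16, 4, 64)  # [seq, heads, head_dim], arbitrary example sizes
q = jax.random.normal(kq, shape, dtype=jnp.bfloat16)
k = jax.random.normal(kk, shape, dtype=jnp.bfloat16)
v = jax.random.normal(kv, shape, dtype=jnp.bfloat16)
out = attention(q, k, v)
```

Only the two numerically sensitive steps are upcast, so the large matmuls keep bf16 throughput and memory use.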
reposted by
xjdr
Alexander Doria
10 months ago
So first version of an ml anon starter pack.
go.bsky.app/VgWL5L
Kept half-anons (like me and Vic). Not all anime pfp, but generally drawn.
10
63
22
i'm trying to follow as many of my old moots as possible and new people as i find them. some of y'all changing your pfp is just mean spirited (i'm lazy and learned people's pfps not names)
10 months ago
8
35
1
Well this looks shockingly professional. I may have to put on a tie to post here
10 months ago
6
31
0