vb
@reach-vb.hf.co
๐ค 2796
๐ฅ 103
๐ 56
GPU Poor @ Hugging Face | F1 fan
Qwen released QvQ 72B OpenAI o1 like reasoning model on Hugging Face with Vision capabilities - beating GPT4o, Claude Sonnet 3.5 ๐ฅ
about 1 year ago
4
17
3
BOOOOM! Meta released Llama 3.3 70B - 128K context, multilingual, enhanced tool calling, outperforms Llama 3.1 70B and comparable to Llama 405B ๐ฅ Comparable performance to 405B with 6x LESSER parameters โก
about 1 year ago
1
28
3
Introducing Indic-Parler TTS - Trained on 10K hours of data, 938M params, supports 20 Indic languages, emotional synthesis, apache 2.0 licensed! ๐ฅ w/ fully customisable speech and voice personas! Try it out directly below or use the model weights as you want! ๐ฎ๐ณ/acc
about 1 year ago
4
35
3
you can just do things - ask AI to create your SQL queries and execute them right in your browser! ๐ฅ let your creativity guide you - powered by qwen 2.5 coder 32b โก available on all 254,746 public datasets on the hub! go check it out today! ๐ค
loading . . .
about 1 year ago
1
30
2
reposted by
vb
Simon Willison
about 1 year ago
This demo of structured data extraction running on an LLM that executes entirely in the browser (Chrome only for the moment since it uses WebGPU) is amazing My notes here:
simonwillison.net/2024/Nov/29/...
add a skeleton here at some point
4
182
25
Fuck it! Structured Generation w/ SmolLM2 running in browser & WebGPU ๐ฅ Powered by MLC Web-LLM & XGrammar โก Define a JSON schema, Input free text, get structured data right in your browser - profit!!
loading . . .
about 1 year ago
4
107
14
reposted by
vb
Jeremy Howard
about 1 year ago
FYI, here's the entire code to create a dataset of every single bsky message in real time: ``` from atproto import * def f(m): print(m.header, parse_subscribe_repos_message()) FirehoseSubscribeReposClient().start(f) ```
19
442
72
reposted by
vb
Mark Riedl
about 1 year ago
I have converted a portion of my NLP Online Masters course to blog form. This is the progression I present that takes one from recurrent neural network to seq2seq with attention to transformer.
mark-riedl.medium.com/transformers...
loading . . .
Transformers: Origins
An unofficial origin story of the transformer neural network architecture.
https://mark-riedl.medium.com/transformers-origins-1db4bdfcb3d1
6
116
17
reposted by
vb
Omar Sanseviero
about 1 year ago
I'm disheartened by how toxic and violent some responses were here. There was a mistake, a quick follow up to mitigate and an apology. I worked with Daniel for years and is one of the persons most preoccupied with ethical implications of AI. Some replies are Reddit-toxic level. We need empathy.
add a skeleton here at some point
29
333
45
yo! nvidia finally released the weights for Hymba-1.5B - outperforms Qwen, and SmolLM2 w/ 6-12x less training trained ONLY on 1.5T tokens > massive reductions in KV cache size and improved throughput > combines Mamba and Attention in a hybrid parallel architecture with a 5:1 ratio and meta-tokens
about 1 year ago
1
29
2
reposted by
vb
Andi
about 1 year ago
Let's go! We are releasing SmolVLM, a smol 2B VLM built for on-device inference that outperforms all models at similar GPU RAM usage and tokens throughputs. SmolVLM can be fine-tuned on a Google collab and be run on a laptop! Or process millions of documents with a consumer GPU!
4
104
26
Smol TTS keeps getting better! Introducing OuteTTS v0.2 - 500M parameters, multilingual with voice cloning! ๐ฅ > Multilingual - English, Chinese, Korean & Japanese > Cross platform inference w/ llama.cpp > Trained on 5 Billion audio tokens > Qwen 2.5 0.5B LLM backbone > Trained via HF GPU grants
loading . . .
about 1 year ago
5
54
12
SmolLM - run, pre-train, fine-tune, evaluate SoTA fully open source LM ๐ฅ Run with Transformers, MLX, Transformers.js, MLC Web-LLM, Ollama, Candle and more! Apache 2.0 licensed codebase - go explore now!
about 1 year ago
1
36
2
Massive week for Open Source AI/ ML Mistral Pixtral & Instruct Large - ~123B, 128K context, multilingual, json + function calling & open weights Allen AI Tรผlu 70B & 8B - competive with claude 3.5 haiku, beats all major open models like llama 3.1 70B, qwen 2.5 and nemotron
about 1 year ago
3
55
11
Apple released blazingly fast CoreML models AND an iOS app to run them on iPhone! โก > S0 matches OpenAI's ViT-B/16 in zero-shot performance but is 4.8x faster and 2.8x smaller > S2 outperforms SigLIP's ViT-B/16 in zero-shot accuracy, being 2.3x faster, 2.1x smaller, and trained with 3x fewer data
about 1 year ago
2
43
4
Check out my new swanky handle! ๐ฆ - Drop your Hugging Face ID in the comments if you want the same!
about 1 year ago
4
19
0
LFG!! XGrammar: a lightning fast, flexible, and portable engine for structured generation! ๐ฅ > Accurate JSON/grammar generation > 3-10x speedup in latency > 14x faster JSON-schema generation and up to 80x CFG-guided generation GG MLC team is literally the best in the game and slept on! โก
about 1 year ago
2
52
4
๐จ UPDATE: New Whisper based model competing with Nvidia on Open ASR Leaderboard! ๐ฅ CrisperWhisper aims to transcribe every spoken word exactly as it is, including fillers, pauses, stutters and false starts Whisper Large V3 fine-tune - beats it by roughly ~1 WER margin โก
hf.co/spaces/hf-au...
about 1 year ago
1
18
3
OH WOW! The Whale aka DeepSeek is BACK!! New model, with complete reasoning outputs and a gracious FREE TIER too! ๐ฅ Here's a quick snippet of it searching the web for the right documentation, creating the JS files plus the necessary HTML all whilst handling Auth too โก
about 1 year ago
2
21
0
Great day for M/LLMs, just released Mistral & Pixtral Large - ~123B, 128K context, Multilingual, JSON + Function calling support & open weights! ๐ฅ Pixtral Large:
huggingface.co/mistralai/Pi...
Mistral Large:
huggingface.co/mistralai/Mi...
about 1 year ago
0
23
3
New spaces of the week! ๐ฅ > Qwen 2.5 Coder Artifacts > Flux Kolors Character > X Potrait > Text Behind Image ๐คฏ > DimensionX > MagicQuill > JanusFlow 1.3B > Netflix Recommentation Check them out at
hf.co/spaces
๐
about 1 year ago
0
7
0
What a brilliant week in Open Source!
about 1 year ago
1
5
1
๐จ Nexusflow released Athene v2 72B - competitive with GPT4o & Llama 3.1 405B Chat, Code and Math ๐ฅ > Arena Hard: GPT4o (84.9) vs Athene v2 (77.9) vs L3.1 405B (69.3) > Bigcode-Bench Hard: 30.8 vs 31.4 vs 26.4 > MATH: 76.6 vs 83 vs 73.8 Open science ftw! โก
about 1 year ago
0
5
1
Smol TTS models are here! OuteTTS-0.1-350M - Zero shot voice cloning, built on LLaMa architecture, CC-BY license! ๐ฅ > Pure language modeling approach to TTS > Zero-shot voice cloning > LLaMa architecture w/ Audio tokens (WavTokenizer) > BONUS: Works on-device w/ llama.cpp โก
loading . . .
about 1 year ago
1
4
0
reposted by
vb
Thomas Wolf
over 2 years ago
Our multimodal team is releasing IDEFICS! One year in the making, the first open SOTA visual LLM: images/text input, text output. Think multimodal ChatGPT! We release 2 sizes: 80B๐ณ & 9B๐ฟ๏ธ Read:
https://huggingface.co/blog/idefics
Try:
https://huggingface.co/spaces/HuggingFaceM4/idefics_playground
loading . . .
Introducing IDEFICS: An Open Reproduction of State-of-the-art Visual Langage Model
Weโre on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/blog/idefics
0
6
2
What a fantastic and ๐ paced weekend! โฅ๏ธ
over 2 years ago
0
1
0
Kinda hilarious that my Twitter feed is burning with HF and GitHub being down. At Bsky, people are sharing memes and cat pictures! โจ
over 2 years ago
0
1
0
you reached the end!!
feeds!
log in