@frascuchon.bsky.social
📤 9
📥 8
📝 3
reposted by
Ben Burtenshaw
12 months ago
who's fine-tuning LLMs for reasoning? This dataset has been trending for a few weeks and there's a list of models trained on it. - It has SFT formatted reasoning sequences, like those in o1. - You could incorporate these into post training to boost reasoning abilities.
loading . . .
O1-OPEN/OpenO1-SFT · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://buff.ly/3OP6lgu
1
22
4
reposted by
Ben Burtenshaw
12 months ago
came across this example in agent-as-a-judge from Meta. It uses agent-as-a-judge to evaluate the effectiveness of a DevAI app. - It's based on an open dataset. - It's more accurate than LLM as a judge - It explains its evaluation based on preferences, and requirements.
https://buff.ly/49tN6CQ
1
12
3
reposted by
David Berenstein
12 months ago
Learn local and private LLMs with Hugging Face 🤗 : Participation is free so join now! Even better, there are minimal GPU requirements and no paid services. Start with, Instruction Tuning, Preference Alignment, Parameter-efficient Fine-tuning. GitHub:
https://buff.ly/3ZCMKX2
1
13
3
I've just contributed 10 examples to this dataset:
data-is-better-together-fineweb-c.hf.space/share-your-p...
loading . . .
spa - español - Spanish
Join and contribute to the dataset spa - español - Spanish
https://data-is-better-together-fineweb-c.hf.space/share-your-progress?user_name=frascuchon&records_submitted=10&team_progress=0.00&dataset_name=spa+-+espa%C3%B1ol+-+Spanish&dataset_id=1b4a2f57-d4ae-4978-8144-d48e68e71fcc
12 months ago
0
1
0
reposted by
José Francisco Calvo
12 months ago
The great
@benburtenshaw.bsky.social
is running an open course on fine-tuning smol LLMs, and it’s seriously worth checking out. If you’re into AI or just curious about how these small language models work, this could be right up your alley. Don’t miss it—it’s super interesting!
#AI
#LLMs
#Learning
add a skeleton here at some point
0
3
3
reposted by
David Berenstein
12 months ago
👐 Open Image Preferences is an Apache 2.0 licensed dataset for text-to-image generation by the
@hf.co
community. This dataset contains 10K text-to-image preference pairs across image generation categories, using different model families and prompt complexities. Blog:
huggingface.co/blog/image-p...
loading . . .
Open Preference Dataset for Text-to-Image Generation by the 🤗 Community
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/blog/image-preferences
1
17
4
✨ Argilla 2.5.0 is live and it comes with webhook listener support to supercharge your workflows! 🚀
#AI
#MachineLearning
#Webhooks
#TechUpdate
12 months ago
1
8
2
you reached the end!!
feeds!
log in