This is one of the first few posts I've seen that uses Deepseek model to generate high quality datasets, which then can be used to train the ModernBERT models.
Really neat stuff! Once can easily replace the slower, expensive 3rd party LLM router with a fast, cheap & local model.
add a skeleton here at some point
12 months ago