Most genomic AI models use fixed rules to process DNA into chunks, imposing arbitrary boundaries on a sequence with its own biological structure.
Arnav Shah, Victor Li, and team developed dnaHNet, a tokenizer-free foundation model that learns its own segmentation from scratch.
about 2 months ago