DNA language models can easily identify statistical patterns in DNA sequences.
Applications range from predicting what different parts of the genome do to how genes interact with each other
Language models can have problems with “hallucination” whereby an output sounds sensible but is not rooted in truth.
The generative capabilities of DNA language models also allow researchers to predict how new mutations may arise in genome sequences. For example, scientists developed a genome-scale language model to predict and reconstruct the evolution of the SARS-CoV-2 virus.
https://bigthink.com/health/how-generative-ai-language-models-are-unlocking-the-secrets-of-dna/