The experiment shows that natural language processing, although it was developed to read and write human-language text, can learn at least some of the underlying principles of biology. The AI program, called ProGen, was developed by Salesforce Research and uses next-token prediction to assemble amino acids, one residue at a time, into the sequences of artificial proteins.
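To make "next-token prediction" concrete, here is a minimal sketch in Python. It is not ProGen's method: ProGen is a large language model, while this toy swaps in a simple bigram counter so the idea fits in a few lines. The training strings, the boundary tokens, and the function names `train_bigram` and `generate` are all assumptions made up for illustration. Only the 20-letter amino acid alphabet is standard.

```python
import random
from collections import defaultdict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes
START, END = "^", "$"  # hypothetical boundary tokens for this sketch


def train_bigram(sequences):
    """Count how often each token follows each other token."""
    counts = defaultdict(lambda: defaultdict(int))
    for seq in sequences:
        tokens = [START, *seq, END]
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts


def generate(counts, rng, max_len=50):
    """Sample one residue at a time, each conditioned on the previous token."""
    out, prev = [], START
    while len(out) < max_len:
        candidates = list(counts[prev].keys())
        weights = [counts[prev][c] for c in candidates]
        nxt = rng.choices(candidates, weights=weights, k=1)[0]
        if nxt == END:
            break
        out.append(nxt)
        prev = nxt
    return "".join(out)


# Made-up training strings for illustration, not real protein sequences.
toy_sequences = ["MKTAYIAK", "MKVLAAGI", "MSTAYLAK"]
model = train_bigram(toy_sequences)
sample = generate(model, random.Random(0))
print(sample)
```

The loop in `generate` is the part that carries over in spirit: the model repeatedly picks the next amino acid given what has been produced so far, and stops at an end token or a length limit. A language model of the kind described in the article would condition on the whole preceding sequence rather than only the previous residue.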
University of California, San Francisco. (2023, February 25). Limitless Possibilities – AI Technology Generates Original Proteins From Scratch. SciTechDaily. https://scitechdaily.com/limitless-possibilities-ai-technology-generates-original-proteins-from-scratch/
Original research: “Large language models generate functional protein sequences across diverse families” by Ali Madani, Ben Krause, Eric R. Greene, Subu Subramanian, Benjamin P. Mohr, James M. Holton, Jose Luis Olmos Jr., Caiming Xiong, Zachary Z. Sun, Richard Socher, James S. Fraser and Nikhil Naik, 26 January 2023, Nature Biotechnology.
DOI: 10.1038/s41587-022-01618-2