De Novo Backbone Design: Creating Protein Drugs with Novel Backbone Structures
GPDL is a high-throughput generative AI model for protein drug design, built on the ESM2 protein language model and the ESMFold structure predictor. On 24 standard design tasks it produces markedly more plausible backbones than RFdiffusion: the success rate is 8 percentage points higher and diversity is 33% higher. Combined with the GPD sequence generation algorithm, which improves diversity by 2.2 times and speed by 1.6 times relative to ProteinMPNN, it closes the design loop from "novel backbone" to "highly active sequence".
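The figures above mix three kinds of comparison: an absolute gain in percentage points, a relative gain in percent, and a multiplicative factor. The minimal Python sketch below shows how each would be applied to a baseline. The baseline values are hypothetical placeholders chosen only for illustration, not reported results, and reading "2.2 times" and "1.6 times" as multipliers on the ProteinMPNN baseline is an assumption.

```python
def add_percentage_points(baseline_rate: float, points: float) -> float:
    """Absolute change: baseline is a fraction in [0, 1], points are percentage points."""
    return baseline_rate + points / 100.0


def scale_by_percent(baseline: float, percent: float) -> float:
    """Relative change: raise the baseline by the given percent of itself."""
    return baseline * (1.0 + percent / 100.0)


def scale_by_factor(baseline: float, factor: float) -> float:
    """Multiplicative change: the new value is factor times the baseline."""
    return baseline * factor


# Hypothetical baselines (NOT from the source), used only to show the arithmetic.
baseline_success_rate = 0.50   # assumed RFdiffusion success rate, as a fraction
baseline_diversity = 100.0     # assumed diversity score, arbitrary units
baseline_speed = 10.0          # assumed sequences per second for ProteinMPNN

gpdl_success_rate = add_percentage_points(baseline_success_rate, 8)
gpdl_diversity = scale_by_percent(baseline_diversity, 33)
gpd_diversity = scale_by_factor(baseline_diversity, 2.2)
gpd_speed = scale_by_factor(baseline_speed, 1.6)

print(f"success rate: {baseline_success_rate:.2f} -> {gpdl_success_rate:.2f}")
print(f"backbone diversity: {baseline_diversity:.1f} -> {gpdl_diversity:.1f}")
print(f"sequence diversity: {baseline_diversity:.1f} -> {gpd_diversity:.1f}")
print(f"sequence speed: {baseline_speed:.1f} -> {gpd_speed:.1f}")
```

The distinction matters when reading the comparison: an 8 percentage point gain is the same size whatever the baseline, whereas a 33% gain scales with the baseline value.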