A disease-specific language model for variant pathogenicity in cardiac and regulatory genomics

Katsanis, SH & Katsanis, N. Periodic genetic tests and the future of clinical genome. Nat. Gennett priest. 14415-426 (2013).
Cocchi, E., Nestor, JG & Gharavi, AG Clinical Genetic examination in adult patients with kidney disease. clinic. J. AM. Suk. Navol. 151497 (2020).
XIE, W., Chen, B. & Wong, J. Genomic Medicines: The next Waves? Nat. Reverend drug Discove. 22691-692 (2023).
Ni, e. And others. Exposure of expanded heart muscle infection in first -class relatives at risk. J. AM. Cole. Cardol. 812059-2071 (2023).
To me, c. M. And others. CAG Repeat Polyglutamine Long Specifies the Time of Huntington’s disease. cell 178887-900 (2019).
Musunuru, K. et al. The genetic test of inherited cardiovascular disease: a scientific statement from the American Heart Association. circus. Genome. Precis. Med. 130067 (2020).
Richards, S. And others. Standards and Guidelines to explain the sequence variables: a common consensus recommendation from the American College of Medical Genetics, Porn and the Association of Molecular Diseases. heredity. Med. 17405-423 (2015).
Fraser, c. And others. Prediction variable disease with deep gym models for evolutionary data. nature 59991-95 (2021).
Reva, B., Antipin, Y. & Sander, C. Prediction of the functional effect of protein mutations: app on cancer genome. Nuclear acids are accurate. 39118 (2011).
Rentzsch, P., Witten, D., Cooper, GM, Shendure, J. & KIRRHER, M. CADD: predicting narrow variables throughout the human genome. Nuclear acids are accurate. 47886-894 (2019).
Landrum, mj et al. Clinvar: The General Archive of Relations between the contrast of sequence and the human apparent pattern. Nuclear acids are accurate. 42980-985 (2014).
Rao, RM et al. MSA transformer. in International Conference for Illorinary Learning (Eds Meyla, M. & Zhang, T.) 8844-8856 (PMLR, 2021).
Cheng, J. et al. Exact prediction of the alternative effect at the protein level with Alphamissense. sciences 3817492 (2023).
Brands, n. , Goldman, c. Nat. heredity. 551512-1522 (2023).
Chowdhury, R. et al. Mono -sequential protein structure uses a language model and deep learning. Nat. Biotechnology. 401617-1623 (2022).
Michaud, JM, MADani and A. Nat. Biotechnology. 401576-1577 (2022).
Manolio, Ta and others. Find the lost inheritance of complex diseases. nature 461747-753 (2009).
McNallly, Em, Barefield, Dy & Puckelwartz, MJ the genetic scene of heart muscle illness and its role in heart failure. Cell metabolism. 21174-182 (2015).
Zhang, x. et al. Path -changing predictions of the disease changing greatly improves the changing interpretation in inherited heart conditions. heredity. Med. 2369-79 (2021).
Han, L., Kashyap, Al, FinIn, T., Mayfield, J. & Weese, J in The second joint conference on lexical and accounting connotations (* SEM) (EDS DIAB, M. Et Al.) Vol. 1, 44-52 (Association of Mass, 2013).
Reimers, n in The facts of the 2019 conference on experimental methods in addressing the natural language and the ninth international conference on the treatment of natural language (Eds Padó, S. & Huang, R.) 3982–3992 (Computer Linguistics Association, 2019).
Zhou, z. et al. DNABERT-2: Effective foundation model and multi-species meter. in The facts of the twelfth international conference on learning representations (ED. Kim, B.) (ICLR, 2024).
Chung, R. And others. A multi -transmission test to identify EXON reveals that an unaccounted part of rare genetic variables causes bonding disorders with large effects. mall. cell 73183-194 (2019).
RIVES, A. Et al. The biological structure and function arise from the scaling of non -supervising learning to 250 million protein sequences. Brook. Natl Acad. Sci. USA 1182016239118 (2021).
Lynn, g. And others. Prediction of an evolutionary scale of the protein structure at the atomic level with a language model. sciences 3791123-1130 (2023).
ATA, Sk et al. Modern developments in the network -based methods of genetic prediction of diseases. Short. The vital form. 22303 (2021).
Chatzianastsis, M., Vazirgiannis, M. & ZHANG, Z. Multi -layer neuropathic network canalable for genetic prediction of cancer. Biomatic informatics 39643 (2023).
Jumper, J. et al. Predicting with a very accurate protein structure with alphafold. nature 596583-589 (2021).
Finn, RD et al. PFAM protein database: Towards a more sustainable future. Nuclear acids are accurate. 44279-285 (2016).
McLachlan, GJ & Krishnan, T. Em algorithm and additions (John Waili and his children, 2007).
Novakovsky, c. Dxter, n. Nat. Gennett priest. 24125-137 (2023).
Zhan, H. & Zhang, Z. Dataset for Dyna: A specific language model for changing diseases. Zenudo (2024).
Zhan, H., Moore, J Zenudo (2024).
2025-03-24 00:00:00