µµ¼¿ä¾à

[Global Technology Breifings]

ë‹¨ë°±ì§ˆ ë””ìžì¸ì— ì¸ê³µì§€ëŠ¥ì´ ì‚¬ìš©ë˜ë‹¤

By Jin Sub Lee, NATURE COMPUTATIONAL SCIENCE, May 4, 2023

ì¸ê³µì§€ëŠ¥ì€ ì†Œë¹„ìž ì¤‘ì‹¬ ë° ì¼ìƒì ì¸ ë¹„ì¦ˆë‹ˆìŠ¤ ì• í”Œë¦¬ì¼€ì´ì…˜ì„ ìžë™í™”í•˜ëŠ” ë° ìƒë‹¹í•œ ì˜í–¥ì„ ë¯¸ì¹˜ê² ì§€ë§Œ, ë³µìž¡ì„±ìœ¼ë¡œ ì¸í•´ ì¸ê°„ì´ ë°œì „í• ìˆ˜ ì—†ëŠ” ì˜ì—ì—ì„œ ê°€ìž¥ í° ê¸°ì—¬ë¥¼ í• ê²ƒì´ë‹¤.

ì´ê²ƒì´ ë°”ë¡œ ë‹¨ë°±ì§ˆ ë””ìžì¸ì— ì¸ê³µì§€ëŠ¥ì„ ì‚¬ìš©í•˜ëŠ” ê²ƒê³¼ ê´€ë ¨ëœ ìƒˆë¡œìš´ ê²°ê³¼ê°€ ë§¤ìš° í¥ë¯¸ë¡œìš´ ì´ìœ ì´ë‹¤.

â€˜ë„¤ì´ì²˜ ì»´í“¨í…Œì´ì…”ë„ ì‚¬ì´ì–¸ìŠ¤(Nature Computational Science)â€™ ì €ë„ì€ ìµœê·¼ í† ë¡ í† ëŒ€í•™ì—ì„œ ì¸ê³µì§€ëŠ¥ ì‹œìŠ¤í…œì„ ì‚¬ìš©í•˜ì—¬ ìžì—°ì—ì„œ ë°œê²¬ë˜ì§€ ì•ŠëŠ” ë‹¨ë°±ì§ˆì„ ìƒì„±í•˜ëŠ” ì—°êµ¬ ê²°ê³¼ë¥¼ ë°œí‘œí–ˆë‹¤.

ì´ ì¸ê³µì§€ëŠ¥ ì‹œìŠ¤í…œì€ ê·¸ë¦¼ ì¸ê³µì§€ëŠ¥ ì†Œí”„íŠ¸ì›¨ì–´ ë‹¬ë¦¬(DALL-E)ì™€ ë¯¸ë“œì €ë‹ˆ(Midjourney)ì™€ ê°™ì€ ì¸ê¸° ìžˆëŠ” ì´ë¯¸ì§€ ìƒì„± í”Œëž«í¼ê³¼ ë™ì¼í•œ ê¸°ìˆ ì¸ ìƒì„± í™•ì‚°ì„ ì‚¬ìš©í•œë‹¤.

ì´ ì‹œìŠ¤í…œì€ ì™„ì „ížˆ ìƒˆë¡œìš´ ì¹˜ë£Œìš© ë‹¨ë°±ì§ˆì„ ë³´ë‹¤ íš¨ìœ¨ì ì´ê³ ìœ ì—°í•˜ê²Œ ê°œë°œí•˜ëŠ” ì†ë„ë¥¼ ë†’ì¼ ê²ƒì„ ì˜ˆê³ í•˜ê³ ìžˆë‹¤.

ì¦‰, ì´ ëª¨ë¸ì€ ì´ë¯¸ì§€ í‘œí˜„ì„ ì‹œìž‘ìœ¼ë¡œ ë§¤ìš° ë¹ ë¥¸ ì†ë„ë¡œ â€˜ì™„ì „ížˆ ìƒˆë¡œìš´â€™ ë‹¨ë°±ì§ˆì„ ìƒì„±í•˜ëŠ” ë°©ë²•ì„ í•™ìŠµí•œë‹¤.

ê·¸ë¦¬ê³ ì´ë ‡ê²Œ ìƒì„±í•˜ëŠ” ëª¨ë“ ë‹¨ë°±ì§ˆì€ ìƒë¬¼ ë¬¼ë¦¬í•™ì ìœ¼ë¡œ ì‹¤ì œì ì¸ ê²ƒì²˜ëŸ¼ ë³´ì¸ë‹¤. ì¦‰, ì„¸í¬ ë‚´ì—ì„œ íŠ¹ì • ê¸°ëŠ¥ì„ ìˆ˜í–‰í• ìˆ˜ ìžˆëŠ” êµ¬ì„±ìœ¼ë¡œ ì ‘í˜€ ìžˆë‹¤(folding)ëŠ” ì˜ë¯¸ì´ë‹¤.

ë‹¨ë°±ì§ˆì€ 3ì°¨ì› ëª¨ì–‘ìœ¼ë¡œ ì ‘ížˆëŠ” ì•„ë¯¸ë…¸ì‚° ì‚¬ìŠ¬ë¡œ ë§Œë“¤ì–´ì§€ë©°, ì´ëŠ” ë‹¤ì‹œ ë‹¨ë°±ì§ˆ ê¸°ëŠ¥ì„ ê²°ì •í•œë‹¤.

ê¸°ì¡´ ë‹¨ë°±ì§ˆì´ ì–´ë–»ê²Œ ì ‘ížˆëŠ”ì§€ ë” ìž˜ ì´í•´í•˜ë©´ì„œ ì—°êµ¬ìžë“¤ì€ ìžì—°ì—ì„œëŠ” ìƒì„±ë˜ì§€ ì•ŠëŠ” ì ‘íž˜ íŒ¨í„´ì„ ì„¤ê³„í•˜ê¸° ì‹œìž‘í–ˆë‹¤.

ê·¸ëŸ¬ë‚˜ ê°€ìž¥ í° ë„ì „ì€ ê°€ëŠ¥í•˜ê³ ê¸°ëŠ¥ì ì¸ ì ‘íž˜ì„ ìƒìƒí•˜ëŠ” ê²ƒì´ì—ˆë‹¤. ì–´ë–¤ ì ‘íž˜ì´ ì‹¤ì œ ë‹¨ë°±ì§ˆ êµ¬ì¡°ì—ì„œ ìž‘ë™í•˜ëŠ”ì§€ ì˜ˆì¸¡í•˜ëŠ” ê²ƒì€ ë§¤ìš° ì–´ë µë‹¤.

í•˜ì§€ë§Œ ì—°êµ¬ìžë“¤ì€ ë‹¨ë°±ì§ˆ êµ¬ì¡°ì˜ ìƒë¬¼ ë¬¼ë¦¬í•™ ê¸°ë°˜ í‘œí˜„ê³¼ ì´ë¯¸ì§€ ìƒì„± ê³µê°„ì˜ í™•ì‚° ë°©ë²•ì„ ê²°í•©í•¨ìœ¼ë¡œì¨ ì´ ë¬¸ì œë¥¼ í•´ê²°í•˜ê¸° ì‹œìž‘í–ˆë‹¤.

ì—°êµ¬ìžë“¤ì´ í”„ë¡œí…Œì¸SGM(ProteinSGM)ìœ¼ë¡œ ë¶€ë¥´ëŠ” ìƒˆë¡œìš´ ì‹œìŠ¤í…œì€ ê¸°ì¡´ ë‹¨ë°±ì§ˆì˜ ì´ë¯¸ì§€ì™€ ìœ ì‚¬í•œ í‘œí˜„ì„ ëŒ€ëŸ‰ìœ¼ë¡œ ëŒì–´ì™€ì„œ, ê·¸ êµ¬ì¡°ë¥¼ ì •í™•í•˜ê²Œ ì¸ì½”ë”©í•œë‹¤.

ì—°êµ¬ìžë“¤ì€ ì´ëŸ¬í•œ ì´ë¯¸ì§€ë¥¼ ìƒì„± í™•ì‚° ëª¨ë¸ì— ìž…ë ¥í•˜ì—¬ ê° ì´ë¯¸ì§€ê°€ ëª¨ë‘ ë…¸ì´ì¦ˆê°€ ë ë•Œê¹Œì§€ ì ì°¨ì ìœ¼ë¡œ ë…¸ì´ì¦ˆë¥¼ ì¶”ê°€í•œë‹¤.

ì´ ëª¨ë¸ì€ ì´ë¯¸ì§€ì— ë…¸ì´ì¦ˆê°€ ì–´ë–»ê²Œ ì¦ê°€í•˜ëŠ”ì§€ ì¶”ì í•œ í›„, í”„ë¡œì„¸ìŠ¤ë¥¼ ì—ìœ¼ë¡œ ì‹¤í–‰í•˜ì—¬ ë¬´ìž‘ìœ„ í”½ì…€ì„ ì™„ì „ížˆ ìƒˆë¡œìš´ ë‹¨ë°±ì§ˆì— í•´ë‹¹í•˜ëŠ” ì„ ëª…í•œ ì´ë¯¸ì§€ë¡œ ë³€í™˜í•˜ëŠ” ë°©ë²•ì„ í•™ìŠµí•œë‹¤.

ìƒˆë¡œìš´ ë‹¨ë°±ì§ˆì„ í…ŒìŠ¤íŠ¸í•˜ê¸° ìœ„í•´ ì—°êµ¬ìžë“¤ì€ ë¨¼ì € ë”¥ë§ˆì¸ë“œ(DeepMind) ì†Œí”„íŠ¸ì›¨ì–´ ì•ŒíŒŒí´ë“œ 2ì˜ ê°œì„ ëœ ë²„ì „ì¸ ì˜¤ë©”ê°€í´ë“œ(OmegaFold)ë¥¼ ì„ íƒí–ˆë‹¤.

ë‘ í”Œëž«í¼ ëª¨ë‘ ì¸ê³µì§€ëŠ¥ì„ í™œìš©í•˜ì—¬ ì•„ë¯¸ë…¸ì‚° ì„œì—´ì„ ê¸°ë°˜ìœ¼ë¡œ ë‹¨ë°±ì§ˆ êµ¬ì¡°ë¥¼ ì˜ˆì¸¡í•œë‹¤.

ì˜¤ë©”ê°€í´ë“œë¥¼ í†µí•´ ì—°êµ¬ìžë“¤ì€ ê±°ì˜ ëª¨ë“ ìƒˆë¡œìš´ ì„œì—´ë“¤ì´ ê·¸ë“¤ì´ ì›í•˜ëŠ” ë‹¨ë°±ì§ˆ êµ¬ì¡°ë¡œ ì ‘ížˆëŠ” ê²ƒì„ í™•ì¸í–ˆë‹¤.

ì´í›„ ì—°êµ¬ìžë“¤ì€ ì‹œí—˜ê´€ ì‹œí—˜ì„ í†µí•´, ê·¸ êµ¬ì¡°ê°€ ë‹¨ìˆœí•œ í™”í•™ í™”í•©ë¬¼ì˜ ëˆì´ ì•„ë‹Œ ë‹¨ë°±ì§ˆìž„ì„ í™•ì¸í–ˆë‹¤.

ì˜¤ë©”ê°€í´ë“œì™€ì˜ ë§¤ì¹ê³¼ ì‹¤í—˜ì‹¤ì—ì„œì˜ ì‹¤í—˜ í…ŒìŠ¤íŠ¸ë¥¼ í†µí•´ ì—°êµ¬ìžë“¤ì€ ì´ë“¤ì´ ì ì ˆí•˜ê²Œ ì ‘ížŒ ë‹¨ë°±ì§ˆìž„ì„ í™•ì‹ í• ìˆ˜ ìžˆì—ˆë‹¤.

ì´ë“¤ì€ ìžì—° ì–´ë””ì—ë„ ì¡´ìž¬í•˜ì§€ ì•ŠëŠ” ì™„ì „ížˆ ìƒˆë¡œìš´ ë‹¨ë°±ì§ˆ ì ‘íž˜ì´ í™•ì¸ë˜ëŠ” ê²ƒì„ ë³´ê³ ë†€ëž„ ìˆ˜ë°–ì— ì—†ì—ˆë‹¤.

ì´ ì—°êµ¬ë¥¼ ê¸°ë°˜ìœ¼ë¡œ, ë‹¤ìŒ ë‹¨ê³„ëŠ” ì¹˜ë£Œ ê°€ëŠ¥ì„±ì´ ê°€ìž¥ ë†’ì€ í•ì²´ ë° ê¸°íƒ€ ë‹¨ë°±ì§ˆì— ëŒ€í•œ ê°œë°œì´ë‹¤. ì´ ì—°êµ¬ì™€ ê·¸ ê²°ê³¼ëŠ” ì—°êµ¬ìžë“¤ ë¿ë§Œ ì•„ë‹ˆë¼ ê´€ê³„ìž, ê´€ê³„ê¸°ì—… ë“±ì—ë„ ë§¤ìš° í¥ë¯¸ë¡œìš´ ê²ƒì´ ë ê²ƒì´ë‹¤.

To view or purchase this article, please visit:

https://www.nature.com/articles/s43588023-00440-3

[Global Technology Breifings]

Score-Based Generative Modeling for De Novo Protein Design

By Jin Sub Lee, NATURE COMPUTATIONAL SCIENCE, May 4, 2023

While artificial intelligence will make a substantial impact in automating consumer oriented and everyday business applications, its greatest contribution will be in areas where humans are unable to make progress because of complexity.

Thatâ€™s why new results related to using AI in protein design are so exciting.

The journal Nature Computational Science just published the results of research at the University of Toronto into using an artificial intelligence system to create proteins not found in nature.

This AI system uses generative diffusion, the same technology behind popular image-creation platforms such as DALL-E and Midjourney.

The system promises to speed drug developtirely new therapeutic proteins more efficient and flexible. The model learns to generate â€œfully newâ€ proteins, at a very high rate, starting from image representations.

And all the proteins it generates appear to be biophysically real, meaning they fold into configurations that enable them to carry out specific functions within cells.

Proteins are made from chains of amino acids that fold into three-dimensional shapes, which in turn dictate protein function.

With a better understanding of how existing proteins fold, researchers have begun to design folding patterns not produced in nature.

But a major challenge has been to imagine folds that are both possible and functional. Itâ€™s been very hard to predict which folds will be real and work in a protein structure.

By combining biophysics-based representations of protein structure with diffusion methods from the image generation space, the researchers have begun to address this problem.

The new system, which the researchers call ProteinSGM, draws from a large set of image-like representations of existing proteins that encode their structure accurately.

The researchers feed these images into a generative diffusion model, which gradually adds noise until each image becomes all noise.

The model tracks how the images become noisier and then runs the process in reverse, learning how to transform random pixels into clear images that correspond to fully novel proteins.

To test their new proteins, the researchers first turned to OmegaFold, an improved version of DeepMindâ€™s software AlphaFold 2.

Both platforms use AI to predict the structure of proteins based on amino acid sequences.

With OmegaFold, the team confirmed that almost all their novel sequences fold into the desired protein structures.

They then chose a smaller number to create physically in test tubes, to confirm the structures were proteins and not just stray strings of chemical compounds.

With matches in OmegaFold and experimental testing in the lab, they could be confident these were properly folded proteins.

They were amazed to see validation of these fully new protein folds that donâ€™t exist anywhere in nature.

Next steps based on this work include further development of ProteinSGM for antibodies and other proteins with the most therapeutic potential.

This will be a very exciting area for research and entrepreneurship.

To view or purchase this article, please visit:

https://www.nature.com/articles/s43588023-00440-3

Media Briefings