What can molecular assembly learn from catalysed assembly in living organisms?

Zhi-Chao Lei§ abc, Xinchang Wang§ d, Liulin Yang *a, Hang Qu a, Yibin Sun e, Yang Yang a, Wei Li bc, Wen-Bin Zhang e, Xiao-Yu Cao a, Chunhai Fan f, Guohong Li bc, Jiarui Wu ghi and Zhong-Qun Tian *a
aState Key Laboratory of Physical Chemistry of Solid Surfaces, Collaborative Innovation Center of Chemistry for Energy Materials (iChEM), Innovation Laboratory for Sciences and Technologies of Energy Materials of Fujian Province (IKKEM), College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, P. R. China. E-mail: zqtian@xmu.edu.cn; llyang@xmu.edu.cn
bNational Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, P. R. China
cUniversity of Chinese Academy of Sciences, Beijing 100049, P. R. China
dSchool of Electronic Science and Engineering, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, P. R. China
eBeijing National Laboratory for Molecular Sciences, Key Laboratory of Polymer Chemistry & Physics of Ministry of Education, Center for Soft Matter Science and Engineering, College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, P. R. China
fSchool of Chemistry and Chemical Engineering, Frontiers Science, Center for Transformative Molecules and National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai 200240, P. R. China
gKey Laboratory of Systems Biology, Center for Excellence in Molecular Cell Science, Shanghai Institute of Biochemistry and Cell Biology, Chinese Academy of Sciences, Shanghai, 200031, P. R. China
hSchool of Life Science and Technology, ShanghaiTech University, Shanghai, 201210, P. R. China
iKey Laboratory of Systems Health Science of Zhejiang Province, School of Life Science, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Hangzhou, 310024, P. R. China

Received 8th August 2023

First published on 17th January 2024


Abstract

Molecular assembly is the process of organizing individual molecules into larger structures and complex systems. The self-assembly approach is predominantly utilized in creating artificial molecular assemblies, and was believed to be the primary mode of molecular assembly in living organisms as well. However, it has been shown that the assembly of many biological complexes is “catalysed” by other molecules, rather than relying solely on self-assembly. In this review, we summarize these catalysed-assembly (catassembly) phenomena in living organisms and systematically analyse their mechanisms. We then expand on these phenomena and discuss related concepts, including catalysed-disassembly and catalysed-reassembly. Catassembly proves to be an efficient and highly selective strategy for synergistically controlling and manipulating various noncovalent interactions, especially in hierarchical molecular assemblies. Overreliance on self-assembly may, to some extent, hinder the advancement of artificial molecular assembly with powerful features. Furthermore, inspired by the biological catassembly phenomena, we propose guidelines for designing artificial catassembly systems and developing characterization and theoretical methods, and review pioneering works along this new direction. Overall, this approach may broaden and deepen our understanding of molecular assembly, enabling the construction and control of intelligent assembly systems with advanced functionality.


image file: d3cs00634d-p1.tif

Zhi-Chao Lei

Zhi-Chao Lei received his BS in 2012 and PhD in 2018 from Xiamen University under the supervision of Prof. Zhong-Qun Tian and Prof. Chaoyong Yang. Then he worked as a postdoc at Peking University from 2019 to 2021. Following that, he joined the Institute of Biophysics, Chinese Academy of Science, as a research associate in Prof. Guohong Li's group. His research interests include catassembly, assembly of biomacromolecules and chromatin biology.

image file: d3cs00634d-p2.tif

Xinchang Wang

Xinchang Wang received his PhD in physical chemistry in 2017 under the supervision of Prof. Zhong-Qun Tian and Prof. Xiaoyu Cao from Xiamen University. He conducted his postdoctoral research at Xiamen University during 2018–2020 under the supervision of Prof. Zhong-Qun Tian and Prof. Zhong Chen. He joined Xiamen University in 2020 as Associate Professor. His research interests include supramolecular chemistry, microfluidic NMR and parahydrogen-induced hyperpolarization.

image file: d3cs00634d-p3.tif

Liulin Yang

Liulin Yang is an associate professor in the College of Chemistry and Chemical Engineering at Xiamen University. He obtained his PhD degree from Xiamen University in 2011 and subsequently conducted postdoctoral research at Tsinghua University (2012–2015) and the University of Twente, MESA+ Institute in the Netherlands (2015–2018). He is the co-author of more than forty papers. His research interests include molecular assembly kinetics and the development of controllable assembly strategies (e.g., catassembly) for constructing complex assembly systems.

image file: d3cs00634d-p4.tif

Hang Qu

Hang Qu is currently a postdoctoral researcher in Materials Innovation Factory, University of Liverpool. He received his double BS degree in chemistry and mathematics from Nankai University. In 2021, he obtained his PhD from Xiamen University under the supervision of Prof. Zhong-Qun Tian and Prof. Xiaoyu Cao. His research focuses on the controllable assembly of functional materials through the integration of mathematics and high-throughput automation.

image file: d3cs00634d-p5.tif

Xiao-Yu Cao

Xiaoyu Cao is a full professor in the college of chemistry and chemical engineering at Xiamen University. He received his PhD degree from the University of Strasbourg (2009), and conducted postdoctoral research (2008–2011) at Eindhoven University of Technology in the Netherlands. He is the co-author of more than fifty papers. He has constructed the Face-Rotating Polyhedra based on the constitutional dynamic chemistry, proposed the concept of catassembly, and developed the high-resolution microfluidics-NMR technique for the study of assembly kinetics. His current research interests also include the complexity science.

image file: d3cs00634d-p6.tif

Zhong-Qun Tian

Zhong-Qun Tian is a full professor at Xiamen University. He is a member of the Chinese Academy of Sciences and a Fellow of the Royal Society of Chemistry, the International Society of Electrochemistry and the World Academy of Sciences. He serves as a member of the advisory board of over fifteen international journals, and an associate editor of Science in China-Chem. and Chem. Soc. Rev. He has received several awards from UK, USA, France, ISE, Japan and China. His main research interests are surface-enhanced Raman spectroscopy, spectro-electrochemistry, nano-chemistry and catassembly.


1. Introduction

Molecular assembly involves organizing individual molecules to form larger, intricate and functional structures and systems.1–4 In living organisms, essential structures such as the cell membrane, cytoskeleton, organelle, and chromatin are all formed through molecular assemblies. Furthermore, critical cellular activities such as DNA replication, RNA transcription, and protein folding all involve molecular assembly processes.4,5 Inspired by nature, artificial molecular assembly aims to create novel materials beyond individual molecules by precisely assembling them.6,7 Self-assembly is the most widely used strategy in this field, where pre-existing components spontaneously form an organized structure or pattern through non-covalent interactions among the components themselves.3,8,9 This strategy has achieved great success and enabled the fabrication of numerous functional materials that have wide application across various fields.10,11

However, the artificial materials obtained through self-assembly of building blocks are still too primitive in both structure and function compared to the multicomponent, hierarchical assemblies in living organisms. As illustrated in Fig. 1, increasing complexity generally leads to increasing functionality, even the emergence of novel properties at new levels of complexity. The self-assembly approach seems to have limitations in constructing such complex assemblies, and over-reliance on it may hinder the advancement of molecular assembly research (vide infra). These concerns have motivated a shift in the research paradigm of molecular assembly, with a need to explore alternative assembly methods beyond the self-assembly of building blocks.12,13


image file: d3cs00634d-f1.tif
Fig. 1 In the development of molecular assemblies, increasing functionality generally comes with increasing complexity. To construct highly functional and intricate molecular assemblies, alternative assembly methods beyond self-assembly of building blocks are necessary. (Items in this panel (from left to right, from top to bottom): liposome, crown ether; rotaxane, peptide, DNA; peptide supramolecular polymer, DNA origami, haemoglobin; RuBisCO, nucleosome, GroEL–GroES, microtubule; chromosome, carboxysome, E. coli, phage.)

Molecular assembly in living organisms was traditionally thought to occur mainly through self-assembly.14 However, more and more discoveries have challenged this view.15,16 Studies have shown that in living organisms, many macromolecular complexes and even organisms themselves are assembled through “catalysis” of other molecules rather than self-assembly. These “catalysing” molecules efficiently assist the assembly of molecular building blocks in a controllable manner and do not remain in the final structures. Moreover, research has shown that certain molecules can facilitate the disassembly or reshaping of existing molecular assemblies.17,18 These findings resonate with the previously proposed concept of “catassembly” (catalysed-assembly), in which helping species called “catassemblers” accelerate molecular assembly similar to but more complicated than catalysts in chemical reactions.12,19,20 In this review, we summarize these biological examples of catassembly and discuss the underlying mechanisms. By drawing inspiration from nature, we suggest potential design principles and research methodologies for artificial catassembly systems, anticipating advancing the functional complex molecular assembly.

2. Molecular assembly beyond self-assembly

2.1. Molecular assembly based on self-assembly

Based on the pioneering works in supramolecular chemistry21,22 and self-assembled monolayers,23,24 a wide variety of building blocks such as small molecules,4 peptides,25 DNA,26 polymers,27,28 and nanoparticles29,30 have been synthesized and assembled into various structures with different morphologies and properties.

Most of these molecular assemblies are constructed by self-assembly of building blocks, which has two major steps: first, the rational design of building blocks and interactions between them; second, these building blocks are mixed under specific conditions, usually in solution, and assembled into thermodynamically stable products.9 The resulting molecular assemblies have either a finite number of subunits, such as molecular knots31 and host–guest complexes,32 or an infinite number of subunits, such as self-assembled monolayers33,34 and supramolecular polymers.35–37 Self-assembly is a simple, efficient and highly productive strategy, making it the most widely used approach for molecular assembly. It has led to the development of many functional materials for application in drug delivery,38 light harvest,39 electronics,40 and sensing.41,42

2.2. Molecular assembly beyond self-assembly

However, self-assembly has inherent limitations, which stem from the aspects of kinetics, thermodynamics and complexity. The over-reliance on it may impede the fast development of artificial molecular assembly.

First, the processes of complex and hierarchical molecular assemblies are usually controlled by kinetics with many energy barriers in the potential energy surface, and easily get stuck in some intermediate states. For example, the autonomous assembly of DNA and RNA is often impeded by high energy barriers; therefore annealing or a “catalyst” is required to allow the system to cross the barrier.26,43,44

Second, from the perspective of thermodynamics, the autonomous molecular assembly can only occur when the change in Gibbs free energy is negative. However, many molecular assembly processes involve a positive change in Gibbs free energy and require energy input. Such examples are abundant in living organisms. For instance, the assembly of microtubules requires the consumption of GTP.45

Third, the construction of hierarchical molecular assemblies with multiple components involved poses significant challenges for self-assembly. When the assembly occurs in a single pot, undesired interactions between subunits from different modules can result in incorrect assemblies. To prevent such promiscuous interactions, the design of suitable subunits with orthogonal interactions is necessary. However, that could be extremely difficult, especially in multicomponent assembly systems. As illustrated in Fig. 2, it is extremely hard to obtain multicomponent, hierarchical molecular assemblies solely by self-assembly of building blocks, and almost impossible to achieve the emergence of advanced properties.


image file: d3cs00634d-f2.tif
Fig. 2 To create complex and hierarchical molecular structures, we need more assembly methods than self-assembly. (A) Simple molecular structures can be easily obtained by self-assembly. (B) However, self-assembly may fail or produce errors when dealing with hierarchical and complex structures. (C) By using external aids, especially catassemblers, hierarchical and complex molecular architectures can be built in a much significantly efficient way, leading to the emergence of new functions.

To overcome the limitations of self-assembly, alternative strategies have been proposed, such as assisted assembly,8,12 controlled self-assembly,46 and multistep assembly, expanding the toolbox for molecular assembly.13 Assisted assembly employs external factors to regulate and control the assembly process. External assistance can be classified into two categories: physical and chemical ones. Physical assistance includes methods such as heating, stirring, ultrasonication, light, electrical field, and magnetic field. Chemical assistance includes methods such as adding or removing solvents, enzymes, templates, and chiral centres. These assistance strategies are devised and employed both deliberately and inadvertently. For a particular assembly process, multiple types of assistance can be coordinated to create a more efficient and effective assembly process. Nevertheless, most researchers still habitually refer to these assisted methods as “assisted self-assembly”. This term is quite confusing and fails to clearly distinguish the relative contribution of self-assembly versus assisted assembly.12

2.3. Catassembly

In 2014, we proposed a new assisted assembly strategy called catassembly (short for catalysed-assembly), in which catassemblers act as “catalysts” for the assembly process. The catassemblers can accelerate the assembly and/or guide it towards a specific pathway, i.e., high efficiency and/or selectivity. Once the assembly is complete, the catassemblers are absent from the final product.12 The verb and adjective forms of catassembly are catassemble and catassembled. (Additional discussions to elucidate the concept of catassembly are available in the ESI.)

Unlike self-assembly, which relies on pre-existing components spontaneously arranging into assemblies, catassembly employs catassemblers to actively drive and direct the assembly process (Fig. 2). Analogous to catalysis in covalent synthesis, catassembly enhances controllability over the assembly system.12 Compared to self-assembly and other assisted assembly strategies, catassembly offers several key advantages. First, the catassembler can change the potential energy surface to lower the energy barrier and speed up the assembly process (Movie S1 in the ESI). Second, the catassembler is reusable, allowing for the assembly of multiple subunits with just one equivalent of the catassembler. Third, catassembly is expected to be highly controllable. A catassembler can select specific subunits and guide their assembly in a multi-component system. In an assembly system with multiple pathways, a catassembler can direct a specific pathway leading to the target product. Moreover, using different catassemblers with the same subunits may result in products with different structures. Collectively, these advantages make catassembly a promising strategy for more efficient and controllable molecular assembly.

In the current state of the art of the field of molecular assembly, a few examples of catassembly have been reported.47–50 In contrast, catassembly has been widely found to play vital roles in living organisms, especially in the construction of many essential biomacromolecular complexes.15,16 In the following section, we will present several typical biological catassembly cases as examples.

3. Catassembly in living organisms

Unlike artificial molecular assemblies in chemistry-based research, biological molecular assemblies often consist of multiple and complex subunits, which interact with each other in precise and intricate ways. Therefore, the assembly process is more complex and usually requires the “catalysis” of other molecules, i.e., catassembly, to form hierarchical structures and even generate new properties (Fig. 2). These features endow biological molecular assemblies with powerful functionalities.

3.1. Catassemblers for protein folding

The functions of proteins depend on the correct folding of their structures, which can be regarded as an intramolecular assembly. Proteins were once thought to fold autonomously to their native state, guided by intrinsic information embedded in the sequence of amino acid residues.14 However, subsequent investigations have revealed a more complex picture of protein folding.51,52 Many proteins have multiple folding pathways, and some of these pathways lead to misfolding. Moreover, since proteins are in a crowded environment, the surrounding molecules can affect their folding pathways. When misfolded, the newly synthesized peptides will expose hydrophobic residues, leading to promiscuous interactions between them and the formation of aggregates. External pressures such as heat shock can also aggravate this process. To avoid the risk of misfolding, a mechanism is needed to ensure correct folding.

Molecular chaperones have been discovered that can assist proteins to fold correctly and do not remain in final structures. They can be divided into two categories: ATP-independent and ATP-dependent.51,53 Most ATP-independent chaperones act as “holdases”, binding to proteins and protecting them from forming aggregates.53 ATP-dependent chaperones include GroEL–GroES, Hsp70, and Hsp90, which can all assist the folding of proteins when fuelled by ATP.51

GroEL–GroES (Hsp60–Hsp10) is the most well-known ATP-dependent chaperone (Fig. 3).54–56 Through biochemical and structural biology methods, the detailed working mechanisms have been unveiled. Newly synthesized peptides are unfolded and have many hydrophobic residues exposed. GroEL with ADP bound has a hydrophobic cavity that can accommodate these newly synthesized hydrophobic peptides. After the allosteric binding of ATP and GroES to GroEL, the cavity changes from hydrophobic to hydrophilic, providing an optimal environment for protein folding and exposing its hydrophilic surface. Afterwards, ATP is hydrolysed to ADP, causing the cavity to change back to hydrophobic and therefore the dissociation of GroES. The folded protein is then released, and the catassembler GroEL–GroES starts a new cycle. This process is a typical example of catassembly. With the assistance of catassemblers, the folding pathway leading to the native state is favoured, and the pathways leading to aggregates are hindered.


image file: d3cs00634d-f3.tif
Fig. 3 Molecular chaperones act as catassemblers for the folding (intramolecular assembly) of proteins. (A) In a complex and crowded environment, the folding of newly synthesized peptides can easily fail and lead to aggregates. (B) The molecular chaperones GroEL–GroES can function as catassemblers that facilitate the folding of proteins using ATP as an energy source. (For the sake of simplicity, only one chamber of GroEL is shown.)

It has been shown that prokaryotes have one chaperone network to play the roles of assisting the newly synthesized peptide to fold and stress-denatured protein to refold, whereas eukaryotes have two distinct chaperone networks: one is linked with the translation process and has a function in de novo protein folding, and the other is stress-inducible and has a role in stress-denatured protein refolding.57 For example, in S. cerevisiae, the CLIPS chaperone network containing SSB1/2 has been identified to function in de novo protein folding, whereas the Hsp chaperone network containing Hsp82 and SSA-family members functions in the refolding of stress-denatured proteins.57 That means molecular chaperones can function in both “good” and “bad” times for cells.58

3.2. Catassemblers for the assembly of biomacromolecular complexes

Besides their roles in assisting protein folding, catassemblers in living organisms can also assist in the assembly of biomacromolecular complexes composed of multiple subunits, such as enzymes, architectural proteins, and chromatin. These catassemblers have various mechanisms: some catassemblers can protect unstable building blocks from forming unwanted interactions; some are indispensable for the formation of key intermediates in hierarchical assembly; and some can consume energy to assemble molecular architectures. It is necessary to note that in many cases, multiple catassemblers are required to work synergistically on an assembly line to complete the assembly process. This is distinctly different from chemical assembly using small and simple molecules.
3.2.1. Catassemblers protecting building blocks from undesired aggregation. In living organisms, the building blocks of complex structures are sometimes initially unstable and require protection until assembly is complete. The formation of pili is a typical example. Many Gram-negative bacteria have organelles called pili on their surface, which play roles in bacterial adhesion, colonization, and localization.59 Pili are molecular assemblies composed of thousands of subunits. The subunits of pili are secreted to the periplasm, where they are not fully folded, unstable, and tend to form aggregates (Fig. 4A). To avoid aggregation, the formation of pili is catassembled by several different types of machinery,59,60 including the most studied chaperone–usher pathway.61–64
image file: d3cs00634d-f4.tif
Fig. 4 The assembly of pilus in Gram-negative bacteria is an example of catassembly, which involves the catassemblers of the chaperone and usher. (A) Without the catassemblers, the subunits would fold incorrectly and aggregate. (B) The chaperone helps to fold the nascent subunits and transport them to the outer membrane, where the usher assists the subunits to join into the pilus and releases the chaperone by subunit displacement.

Currently, four chaperone–usher machineries have been identified for different types of pili.63 They share similar catassembly mechanisms (Fig. 4B).63 The chaperone molecule binds the subunit and helps to fold it, preventing the formation of aggregates. The chaperone then delivers the subunit to the usher, a pore protein located on the outer membrane. With the assistance of the usher, the subunit on the usher replaces the chaperone because the interactions between subunits are stronger than those between the subunit and the chaperone. This displacement leads to the assembly of a new subunit to the pili and the release of the chaperone. The chaperone then diffuses to the inner membrane to fetch another subunit and begin a new assembly cycle (Fig. 4B). Through this mechanism, the pili grow into an organelle that contains thousands of subunits outside the membrane. Without the protection and assistance of catassemblers, only aggregates will be obtained.

3.2.2. Catassemblers promoting the formation of assembly intermediates. RuBisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) is an important enzyme in photosynthesis. It catalyses the carboxylation of ribulose-1,5-bisphosphate, which is the first step of photosynthesis. RuBisCO is probably the most abundant enzyme on Earth, found in plants and many other living creatures.65 The most common form I RuBisCO is a hexadecamer complex that contains eight RbcL subunits and eight RbcS subunits.

When trying to express RbcL and RbcS from plants in E. coli, no active RuBisCO complexes but only random aggregates are formed.66 That means these building blocks cannot self-form the RuBisCO complex solely. Recent research found that the assembly of RuBisCO is a typical example of catassembly that needs the participation of a series of catassemblers. Only when several chaperones are co-expressed, a fully active RuBisCO complex can be assembled.67–70

With the determination of the structures of a series of intermediates, the detailed catassembly mechanisms are revealed (Fig. 5).65,69 Catassemblers Raf1 or RbcX can bind and stabilize the RbcL subunits, facilitating the formation of either RbcL2Raf12 or RbcL2RbcX4 intermediates. Without these catassemblers, the formation of the intermediates is not possible, and the subsequent recognition of other subunits cannot occur. Following their formation, these intermediates further assemble into the complex of RbcL8Raf18 or RbcL8RbcX16. The binding sites of RbcS to RbcL are different from those of Raf1 and RbcX. The binding of RbcS to RbcL8Raf18 or RbcL8RbcX16 will finally form RuBisCO and induce structural changes in RbcL, leading to the allosteric release of Raf1 or RbcX. The catassemblers Raf1 or RbcX will then assist in the assembly of other subunits. Throughout the process, the catassemblers function as indispensable components of the intermediates. Without them, the aggregate pathways rather than the correct assembly pathway would be chosen.


image file: d3cs00634d-f5.tif
Fig. 5 The catassembly of RuBisCO. The chaperone helps to fold RbcL, and then the catassemblers Raf1 and RbcX facilitate the formation of RbcL dimers and RbcL octamers. After RbcS binds allosterically, RuBisCO is formed and the catassemblers are released.
3.2.3. Catassemblers including the ATP-fuelled process. Nitrous oxide reductase (N2OR) is an enzyme found in P. stutzeri that can catalyse the reduction of N2O to N2.71 The catalytic core of N2OR contains a [4Cu[thin space (1/6-em)]:[thin space (1/6-em)]2S] cluster called Cuz.72 The Cuz centre cannot form without the N2OR protein backbone. Cu+ ions need to be assembled one by one onto the N2OR backbone, which cannot happen autonomously (Fig. 6A). Because Cu+ is toxic, this assembly takes place outside the membrane.73 Recently, Müller et al. found that this step is assisted by two membrane proteins NosL and NosDFY (Fig. 6B).73 First, NosL captures the Cu+ ion. When NosDFY is in a nucleotide-free state, NosL binds it and transfers the Cu+ ion to its NosD domain. Subsequently, when ATP binds to the NosY domain of NosDFY, it causes the NosD domain to rotate and release NosL. The rotated NosD then interacts with N2OR and delivers the Cu+ to it. After the hydrolysis of ATP to ADP, NosDFY leaves NosZ and returns to its unbound state. In this cycle, Cu+ is transferred from NosL to N2OR, completing the assembly. The entire assembly process is an ATP-powered catassembly. Through this elegant way, bacteria can assemble Cu+ one by one onto the N2OR backbone outside of the membrane without poisoning the cell.
image file: d3cs00634d-f6.tif
Fig. 6 (A) The assembly of Cu+ to N2OR cannot occur spontaneously. (B) The catassembly of N2OR powered by ATP. The NosL outside of the cell membrane catches the Cu+ ion and transfers it to NosDFY. Powered by ATP, NosDFY rotates and delivers the Cu+ ion to the N2OR protein. NosL and NosDFY act as catassemblers for the incorporation of Cu+ into N2OR.
3.2.4. Catassemblers working synergistically in an assembly line. The genetic information in eukaryotes is stored in chromatin, a molecular assembly of negatively charged DNA and positively charged histones. The basic unit of chromatin is the nucleosome, which has about 147 bp DNA wrapped around histone octamers.74 Nucleosomes form a beads-on-string structure and are packaged into highly ordered structures.75

As one of the most important molecular assemblies in cells, the assembly of chromatin primarily occurs through catassembly. Each nucleosome contains two copies of histones H2A, H2B, H3, and H4. However, direct mixing of histone octamers with DNA in vitro does not generate chromatin but precipitation, due to the promiscuous and irregular histone–DNA and histone–histone interactions. In 1978, Laskey et al. first discovered that the addition of Nap1 enables chromatin formation, and Nap1 is not present in established chromatin.76 Nap1 is the first molecular chaperone ever discovered.

Since then, more than 30 histone chaperones have been identified.77–79 In the current model, histone chaperones CAF1, ASF1, FACT, and MCM2 work synergistically to form an assembly line for building new chromatin after DNA replication. They cooperate to recycle the old histones, deliver new histones to the replication fork, and deposit them into the new chromatin (Fig. 7). CAF1 and ASF1 function in the assembly of new histones. And MCM2 and FACT work in the recycling of old histones. Most histone chaperones have no similarities in sequence but share the ability to bind histones. This binding shields the positively charged histones and protects them from forming improper interactions with DNA and other proteins. The histone chaperones then deliver the histones and assist their assembly with DNA to form nucleosomes.


image file: d3cs00634d-f7.tif
Fig. 7 During DNA replication, catassemblers like FACT, MCM2, CAF1, ASF1, etc. form an assembly line for assembling nucleosomes. These catassemblers can reuse old histones, and deliver and deposit new histones to the newly synthesized DNA to form nucleosomes.

Histone chaperones are not permanent components of chromatin, but rather play a role in building, restoring, and adjusting chromatin structures. Some histone chaperones, such as Nap1 and Hsp90, can bind to all histones, while others show specific selectivity for certain histones. For example, CAF1 only selects the H3.1–H4 tetramer, while HIRA can only bind to the H3.3–H4 tetramer.80,81 This selectivity means that a specific chaperone can build the nucleosome with a specific structure, which in turn affects gene transcription.

3.3. Catassembly widely exists in living organisms

Table 1 summarizes catassemblers found in a wide range of species such as viruses, bacteria, plants, and animals. Looking back on how to give different names like chaperone, chaperonin or the assembly complex in the past several decades, it seems to be more appropriate to collectively refer to them as catassemblers. They are involved in many biological activities, including chromatin formation, virus assembly, and assembly of biomacromolecular complexes. Many catassembled complexes, such as the nucleosome, proteasome, and spliceosome, are essential for living organisms as listed in Table 1.
Table 1 Catassemblers and their products in living organisms
Products Catassemblers Mechanisms
Assembly Chaperone–usher pili62,63 PapD–PapH, FimC–FimD, Caf1M–Caf1A, CfaA–CfaC Protect unstable subunits, delivery subunits, assist assembly
Type IV pili60 PilQ,TcpC, PilM, PilN, PilO, PilP, TcpR, TcpD, TcpS,PilC, PilG, TcpE
Chromatin78 Nap1, ASF1, MCM2, CAF1, NASP, HJURP, SET, NCL, HIRA, FACT, DAXX–ATRX, etc.
Phage tail83–85 gp57a, gp38, gpG, gpGT, gp17.5, gp17.5*
MHC-I86,87 Tapasin, TAPBPR
Myosin88,89 UNC-45
Outer membrane proteins90,91 BAM complex
RuBisCO65 Cpn60, Cpn20, RbcX, Raf1, Raf2, BSD2 Facilitate the formation of key intermediates
Proteasome82,92,93 pba1, pba2, pba3, pba4, ump1
HIV94,95 IP6
snRNPs96 SMN complex
N2OR73 NosL, NosD ATP-fuelled assembly/folding
Type IV pili60 PilB, PilF, TcpT
Folding Protein51,56 GroEL–GroES, Hsp90, Hsp70
RNA97,98 DEAD-box proteins
Protein53,99 Spy, SurA, Skp, SecB, Hsp27, ttHsp40, PAT Binding induced folding
RNA98 Hfq, CspA, NCp


Different types of catassemblers can have different roles in the assembly process. Some may act passively to avoid incorrect subunit–subunit interactions, while others play an active role. The binding of catassemblers to subunits may be allosteric or use the same binding sites as other subunits. As mentioned above, catassemblers may function through three mechanisms. First, catassemblers can bind and protect the immature subunit. Secondly, catassemblers are essential for the formation of key intermediates. Thirdly, catassemblers may act as “workers” on the assembly line fuelled by external energy.

One catassembler can have multiple functions. For example, a histone chaperone can either protect histones or function as a “worker” on the assembly line. Raf1 can both stabilize RbcL and participate in key intermediates.65 On the other hand, the formation of biomacromolecular complexes might need the cooperation of multiple catassemblers. For instance, the assembly of the proteasome requires the coordination of 10 chaperones.82

One may wonder why such complex mechanisms are necessary, and why self-assembly is insufficient in these cases. The complexity of the subunits, the assembly pathways, and the environment may provide some clues.

First, the initial structures of the subunits are unstable, and they do not adopt their final stable conformations until they are incorporated into the complexes. Therefore, catassemblers are needed to stabilize the subunits and make them compatible for assembly. Otherwise, the self-assembly of these immature subunits may lead to unwanted products. For example, the structure of RbcL is unstable until it binds to the catassemblers, and it adopts its final conformation only when the RuBisCO complex is formed.65

Second, the biological assembly process involves numerous subunits and complex assembly pathways, in which specific subunits must be added at the right time and position. Such complex processes cannot be achieved solely by self-assembly. The catassemblers act as protecting groups in covalent synthesis to ensure that the correct building blocks are added to the complex in the right order and result in the correct final product. For instance, in the catassembly of the proteasome, catassemblers pba1–4 can bind to the intermediates of the core particle and prevent them from interacting with another module called the lid before the core particle matures.82

Finally, the assembly process of biological macromolecules takes place in a complex and crowded environment, where they are exposed to other irrelevant molecules and subunits. These surrounding molecules or subunits may interfere with the interactions between the intended subunits. In such complex conditions, self-assembly alone cannot achieve targeted molecular assembly with desired functions, and catassemblers are required. For example, chaperonin molecules like GroEL–GroES can isolate the newly synthesized peptides from the complex environment and allow them to fold inside their cavity.51

3.4. Summary

The ubiquitous catassembly in living organisms may have been a selection of natural evolution, as it can efficiently facilitate the formation of various molecular assemblies, thereby smartly promoting the emergence of unique properties. Compared with the self-assembly strategy mostly applied in chemistry presently, the catassembly strategy may not be so straightforward, costing additional energy and materials, and introducing additional complexity to the system paradoxically. However, it has been shown that a great variety of catassemblers play dispensable roles in precisely achieving intricate targeted assemblies at specific times and locations in complex environments. Comprehensive learning from them can inspire the invention of more sophisticated and functional artificial molecular assemblies.

4. Noncanonical catassembly in living organisms

One of the important features of molecular assemblies in living organisms could be their dynamic nature, as they are constantly regulated to implement advanced functions. These molecular assemblies can disassemble completely, partially disassemble and reassemble synergistically, resulting in changes in their structure, composition, and function. For example, changes in the chromatin structure can modulate transcription levels of target genes. These processes usually involve the disruption, reformation, and rearrangement of noncovalent interactions, which are not always spontaneous and require assistance from other molecules. Notably, reassembly and disassembly often require more assistance and especially input of energy than assembly.

Although these processes are not typical assembly processes, they share the same physico-chemical nature in the formation, disruption, and rearrangement of noncovalent interactions by catassemblers. These catassemblers bring indispensable controllability and new functions to the assembly. Without them, these complicated processes cannot be achieved. Accordingly, they could be called noncanonical catassemblers, and the corresponding processes can be called catalysed-disassembly, catalysed-reassembly or noncanonical catassembly.

4.1. Catassemblers for disassembly

While the formation of biomolecular assemblies has been extensively studied, the disassembly of molecular complexes has received much less attention. Disassembly, the reverse process of assembly, is also important in cells. For instance, the disassembly of nucleosomes exposes DNA for the binding of other factors, the disassembly of cofactors inactivates enzymes, and the unfolding and degradation of folded proteins recycles the used biomaterials.

In cells, the degradation of proteins occurs through the ubiquitin–proteasome system. Wasted proteins are tagged with ubiquitin and caged into the proteasome for degradation. However, some stable proteins can only be degraded once they are unfolded. The unfolding breaks noncovalent interactions and costs energy. In S. cerevisiae, Cdc48 has been found to collaborate with Ufd1 and Npl4 to unfold proteins fuelled by ATP (Fig. 8).17,100 Cdc48 is a hexamer complex that has a double-ring structure with a pore at the centre. The ubiquitin attached to the protein to be unfolded is first recognized by Cdc48. The Cdc48 hexamer has 12 ATPases, and the hydrolysis of ATP makes the ring structure act as a “conveyer belt” that drags the ubiquitin inside the ring. The continuous rotation of Cdc48 makes the protein pass through the small pore and unfold. The unfolded protein is then released and transferred to the proteasome for further degradation. Therefore, Cdc48 is a noncanonical catassembler for catalysed disassembly.


image file: d3cs00634d-f8.tif
Fig. 8 Cdc48 and Ufd-Npl4 can function as uncanonical catassemblers that unfold proteins by continuous rotation fuelled by ATP.

4.2. Catassemblers for reshaping the assembly state

The structure of established assemblies can be reshaped without changing the subunits, which requires the partial disassembly of the complex and reassembly to a new state. This reshaping of the structure can endow assemblies with new properties. The reshaping of molecular assemblies usually requires energy input with the help of catassemblers.

The remodeler-assisted sliding of the histone octamer on DNA is a typical example of ATP-fuelled reshaping of assemblies.18,101,102 There are four families of remodelers, ISWI, SWI/SNF, CHD, and INO80, which all have a shared ATPase domain and can remodel nucleosomes. One important function of the remodelers is to make histone octamers slide on DNA. This process consumes ATP and may have two effects on chromatin: first, it makes the nucleosome-occupied DNA available and creates space for the binding of other factors; second, sliding of the nucleosome in a condensed direction compacts chromatin (Fig. 9).


image file: d3cs00634d-f9.tif
Fig. 9 Remodeler is an uncanonical catassembler that can facilitate the sliding of histone on DNA, which reshapes the assembly state of chromatin, e.g., makes the chromatin much more compacted.

The structures of a series of remodelers have been solved, revealing a possible shared twist-based mechanism for their actions.102,103 The target nucleosome is bound by a remodeler in an open state. The binding of ATP to the remodeler changes it to a closed state and induces a twist defect at the entry site of DNA, corresponding to a slight disassembly. Upon hydrolysis of ATP, the remodeler returns to the open state, and the twist defect is transferred to the exit side of DNA, accompanied by a reassembly process. Each cycle of this action results in the sliding of the histone octamer by 1 bp of DNA. Through this elaborate mechanism, catassemblers reshape the chromatin structures.

4.3. Catassemblers for subunit replacement

Catassemblers can also facilitate the replacement of subunits in established complexes, leading to changes in both structure and function. This process requires the removal of old subunits and the addition of new ones. It usually cannot occur autonomously and consumes energy. Catassembler-assisted histone exchange is a representative example. By replacing canonical histones with specific histone variants, the nucleosome structure can be altered and epigenetic markers can be installed on chromatin. Two remodelers, SWR1 and INO80, have been shown to facilitate the exchange between H2A.Z and H2B in nucleosomes, but they have opposite roles (Fig. 10A).104,105 SWR1 promotes the eviction of the H2A–H2B dimer and replaces it with H2A.Z–H2B, while INO80 facilitates the exchange of H2A.Z–H2B to H2A–H2B. Both processes require the hydrolysis of ATP.
image file: d3cs00634d-f10.tif
Fig. 10 (A) INO80 and SWR1 are ATP-fuelled noncanonical catassemblers that assist in the replacement of histone subunits in nucleosomes. (B) By consuming ATP, RecA can facilitate the exchange of strands between ssDNA and dsDNA.

The structures of both INO80–nucleosome and SWR1–nucleosome have been resolved, revealing part of the mechanisms involved.106–108 INO80 binding to the nucleosome disassembles 15 bp DNA at SHL6 and disrupts DNA–H2A interactions. SWR1 binding to the nucleosome is similar to other remodelers, as it distorts 1 bp DNA and changes the nucleosome structure. The binding of both INO80 and SWR1 destabilizes the nucleosomes and speeds up the replacement of subunits. However, detailed mechanisms for removal and addition are still unclear.

RecA is an important protein for the homologous recombination repair of DNA.109,110 As shown in Fig. 10B, RecA acts as a noncanonical catassembler to accelerate DNA strand exchange. RecA binds to ssDNA with higher affinity when ATP is present. When the ATP–RecA–ssDNA complex encounters dsDNA with the same sequence, a complex with a triple helix is formed. The invading strand gradually displaces the original strand. After the hydrolysis of ATP, RecA and the original strand dissociate.

4.4. Catassemblers for holding molecular assembly

The nucleosome is a natural suppressor of gene transcription. During elongation, RNA polymerase needs to cross nucleosomes. It was previously thought that the nucleosome undergoes a disassembly and re-assembly process during this process. However, a complete disassembly of the nucleosome would result in the loss of histones and their associated epigenetic information. A more likely scenario is that the nucleosome undergoes a partial disassembly and reassembly process during transcription. The histone chaperone FACT (facilitates chromatin transcription) has been found to play an important role in this process, allowing for the passage of RNA polymerase while maintaining the integrity of the nucleosome (Fig. 11).111
image file: d3cs00634d-f11.tif
Fig. 11 During the migration of RNA polymerase on DNA, the nucleosomes form a natural hurdle. Single-molecular studies have shown that FACT acts as a noncanonical catassembler to disassemble and reassemble nucleosomes during this process. Reproduced with permission from Ref. 112. Copyright 2018, Elsevier.

Chen et al. found that FACT can facilitate both the disassembly and reassembly of nucleosomes using single molecular magnetic tweezers.112,113 When binding FACT, the nucleosome can be unfolded at a lower force (5 pN) compared to the typical 10-18 pN required. More importantly, FACT preserves the integrity of the nucleosome, preventing its destruction during stretching. These results demonstrate that FACT is a unique catassembler that can loosen the nucleosome while maintaining its integrity, indicating that it can facilitate the passage of RNA polymerase. Further research using cryo-TEM confirms that FACT can facilitate nucleosome disassembly in front of the replication fork and restoration after the replication fork.114–116

4.5. Summary

As analysed above, living organisms have diverse mechanisms to synergistically control and regulate a variety of noncovalent interactions. However, these mechanisms have not received adequate attention in artificial molecular assembly. In this context, noncanonical catassemblers play an important role in disrupting, reshaping, and holding molecular assemblies. Table 2 summarizes these noncanonical catassemblers and their functions, demonstrating the superior strategies employed by living organisms to integrate assembly, (partially) disassembly and reassembly. The comprehensive analysis may also broaden and deepen our understanding of molecular assembly based on biosystems.
Table 2 Noncanonical catassembly in living organisms
Catassemblers Functions Mechanisms
Cdc48, Ufd1, Npl417,100 Unfolding protein ATP-fuelled unfolding
PilT60 Disassembly ATP-fuelled disassembly
Chromatin remodeler18,103 Reshaping nucleosome ATP-fuelled reshaping
INO80, SWR106–108 Histone exchange ATP-fuelled subunit replacement
RecA109,110 DNA strand displacement
FACT111,112 Hold nucleosome during transcription Hold subunits


5. What can we learn from biological catassembly?

The frontier of molecular assembly is moving toward the creation of complex functional matter consisting of multiple components and hierarchical structures.7,117 It is obvious that the traditional self-assembly strategy is not sufficient for constructing such complex systems. New assembly methods beyond self-assembly are highly desirable. Multiple-step non-covalent synthesis could partially solve the problems of complexity; however, it may be time-consuming. The development of complex molecular assemblies in general relies on several joint factors, including the design and synthesis of novel subunits, the advancement of new assembly methods, and the implementation of cutting-edge theoretical and characterization methods (Fig. 12).
image file: d3cs00634d-f12.tif
Fig. 12 The advancement of molecular assembly requires concerted efforts in the development of novel subunits, assembly methods, characterization and theoretical methods.

As discussed above, catassembly is a widely existing strategy for complex molecular assembly in living organisms. It has demonstrated its high efficiency in selecting assembly pathways, controlling the assembly with multiple subunits, and modulating the functionality of complex assemblies. However, only a few examples of artificial catassemblies have been reported. In the following section, we propose possible models for artificial catassembly, highlight some primitive works, and discuss the challenges in system design, characterization techniques, and theoretical approaches. We also anticipate the potential applications of catassembly in constructing functional complex assemblies.

5.1. Suggested models for artificial catassembly

In the development of artificial catassembly, the design of new building blocks is crucial. In living organisms, various biomacromolecules have controllable structural variations, which endow them with responsive properties. Structural changes can be achieved by binding with other molecules, covalent modifications, and alterations in pH. For instance, enzymes can undergo structural changes and alterations in activity through the allosteric binding of activators and inhibitors.118 This controllable variability can bring additional responsiveness and control to the system, as clearly demonstrated in biological catassembly systems.

In many biological molecular assemblies, the building blocks of complex structures are sometimes initially unstable and require protection until the final complex forms. When a similar issue arises in artificial assembly systems, catassemblers can be designed or screened to stabilize unstable subunits.

Therefore, in the first suggested model (Fig. 13A), the catassembler can prevent subunits A from self-formation through irregular interactions, promoting A and B to bind accurately. Subunit B will replace the catassembler and release it from the complex.


image file: d3cs00634d-f13.tif
Fig. 13 Three possible models of artificial catassembly. (A) The catassembler acts as a protective agent, stabilizing subunit A and accelerating its folding and assembly with subunit B. (B) The catassembler functions as an activator. It activates subunit A by exposing the binding sites for subunit B, and then the allosteric assembly of A and B releases the catassembler. (C) The catassembler needs to be activated by energy input, and then it functions as an activator for the assembly of subunits A and B.

Catassemblers can act as activators for molecular assembly in the second model (Fig. 13B). The direct assembly of subunits A and B is hindered due to the concealment of their interaction sites. The introduction of a catassembler initiates the assembly process. The catassembler binds to subunit A in an allosteric manner, thereby exposing the binding sites on subunit A. Subsequently, the activated subunit A interacts with subunit B, forming the complex AB. The binding of A and B weakens the interactions between A and the catassembler, resulting in the release of the catassembler. The catassembler then proceeds to locate a new subunit A and starts a new cycle. This system is a responsive molecular assembly system. It can also be designed as a sensing system, where molecules to be detected act as catassemblers.

Furthermore, catassembly can be powered by external energy in the third model. This requires that the catassemblers be responsive to energy inputs. The energy inputs could be in the form of chemical energy, light, electrical energy, etc. Assuming a catassembler exists in a deactivated state, as shown in Fig. 13C, the energy input can activate the catassembler, initiating a catassembly cycle like Fig. 13B. After the assembly finishes, the deactivated catassembler is released from the product and can be reactivated by further energy input.

It should be noted that catassemblers differ from co-assembly components in that they do not become a part of the final product and can be reused, making catassembly more efficient. The leaving and recycling of catassemblers are crucial in this process. We proposed four possible leaving and recycling mechanisms for catassemblers. (1) The catassembler shares the binding site with other subunits and can be replaced by them at the end of the assembly process (Fig. 13A). (2) The completion of the assembly weakens the binding between the catassembler and the subunits, eventually causing the catassembler to detach (Fig. 13B). (3) The catassembler and the final assembly product undergo phase separation (for instance, precipitation), with the product separated into another phase and the catassembler remaining with residual subunits. (4) The recycling of the catassembler is powered by energy input.

In the construction of complex molecular assemblies involving multiple components and hierarchical structures, catassembly is increasingly necessary. Catassemblers can protect the subunits from undesired interactions, much like protecting groups in covalent synthesis. They can be removed at the appropriate time once the target intermediates or products are formed. All in all, catassemblers ensure that specific subunits join the assembly at the right time and in the right manner, making it possible to assemble complex molecular architectures in a single pot like a single cell.

One important aspect of molecular assembly which has long been overlooked is mass transportation. In biological science, the term “recruitment” is often used to describe the result of mass transport of specific species, but not the process itself. The mass transport within cells is distinct from that in artificial assembly systems which are mainly in dilute solution. Cells are compartmentalized into various organelles and membrane-less organelles, and loaded by other biomacromolecules, glucose, and salts, making the interior of cells a crowded environment.119–121 Molecular assembly relying on molecular diffusion and/or Brownian motion is inefficient in such a crowded environment.122

Many biological catassemblers can deliver subunits in some effective ways. For example, in the case of pili assembly, catassemblers can deliver subunits from the inner membrane to usher proteins on the outer membrane.63 Many histone chaperones have also been reported as carriers that deliver histones to replication forks or other target positions.78 Many “transporter” proteins for RNA have also been found.123 In this way, catassemblers can assist molecular assembly by accelerating mass transport. However, due to the lack of appropriate techniques, this function of catassemblers is poorly understood and highly deserves more attention.

By learning from biological assembly and proposing three preliminary models of catassembly, one can gain some initial insights into how to advance artificial assembly by introducing catassembly. The more complex the assembly, the greater the demand for using catassembly, and the more types of catassemblers are required.

5.2. Characterization and theoretical methods for catassembly

From a physicochemical perspective, precise regulation and control of the target molecular assembly must jointly operate in three dimensions: energy, time, and space. In the energy dimension, catassemblers can select a specific pathway on the potential energy surface, resulting in a specific product; or they can lower the energy barrier of assembly, accelerating the assembly process. In the time dimension, catassemblers can control when to start and stop the assembly process, or even decide the time when a specific subunit or new catassembler is added. In the space dimension, catassemblers can be designed with specific interaction sites to act as a protective agent or activator, and deliver the subunits to targeted positions for the efficient mass transportation.

Currently, when self-assembly fails to produce desired products, researchers tend to redesign the structure of subunits rather than investigate the assembly process to identify the cause of failure. By studying the assembly pathways, kinetics, and mechanisms, we can gain a deeper and broader understanding of the construction and function of molecular assembly. Catassembly systems are usually highly complex, featuring hierarchical structures, multi-site long-range interactions, and feedback networks. These features make the investigation of the catassembly mechanism challenging. To reveal the physicochemical landscape of these systems, characterization techniques with high temporal, spatial, and energy resolution, as well as theoretical approaches, are highly anticipated.

Due to the weak interactions involved in catassembly, characterization methods with high energy resolution are required to monitor the formation and breakage of non-covalent interactions from the atomic level to the molecular level and the macromolecular level. Each level of the catassembly process requires different characterization methods. At the atomic level, nuclear magnetic resonance (NMR) is a high-resolution, in situ method with various hardware innovations and pulse sequences that have been widely applied to study biological systems.124,125 At the molecular level, spectroscopic methods based on vibrational and electronic effects can provide valuable information, for instance, fluorescence, circular dichroism, Raman, and UV-Vis spectroscopy. At the macromolecular level, characterization methods mainly involve the usage of light, magnetism, and microscopes, such as optical or magnetic tweezers.126,127 Some characterization methods can provide information about assembly processes at multi-levels, for instance, mass spectroscopy. The choice of the characterization method depends on the information desired. For example, single molecular magnetic tweezers can measure the force inside biological molecular complexes at a resolution of 1 pN, which has been used in the study of catassembly.112,128–130

In addition, time resolution is also an important aspect of characterization techniques, especially for the study of kinetics. Catassembly typically features multiple pathways and complex kinetic processes. During the assembly process, different pathways create transient intermediates with low content and complex structures. These features of intermediates demand characterization methods that can deal with complex samples. To reveal the details of the process, characterization techniques with both high structural resolution and time resolution are very much needed. NMR has been applied to characterize the transient, high-energy state complex of a protein ensemble whose content is as low as 0.5% of the protein population.131 Moreover, NMR can reveal information on spatial distance and dynamics of intermediates, which is important for structural elucidation and mechanistic explanation. The marriage of microfluidic and NMR techniques significantly improves the time resolution of the NMR method, enabling NMR to study the fast kinetics processes of supramolecular systems with high structural and time resolutions.132–134 Cryo-TEM combined with advanced particle-picking methods provides insights into key intermediates of biomacromolecular complexes.135 Moreover, time-resolved cryo-TEM has also been developed to study the dynamic process of biological macromolecules.136–138 Other techniques, including mass spectroscopy, stopped-flow spectroscopy, and single-molecule fluorescence techniques, can also be helpful.

Complex catassembly systems usually present the following characteristics: multiple components, multiple interactions at multiple sites, coupling of short- and long-range interactions, complex assembly pathways, entropy-driven, or complementation of entropy and enthalpy.139,140 Especially, catassembly systems in living organisms are more complex and always in larger temporal and spatial scales than chemical reactions.141 All these features bring great challenges to the current theoretical foundations of catassembly and computing science. To address this challenge, developing new physicochemical and computing methods is crucial for the theoretical study of catassembly.

The theoretical framework of physical chemistry should emphasize the synergy and feedback of non-covalent interactions in multiple sites, which often leads to nonlinear changes in key physicochemical parameters in catassembly processes. In addition, mass and energy transfer in catassembly processes might occur at the same temporal and spatial scales, especially in living systems. Therefore, current reaction-diffusion models should be improved to explain the coupling mechanism of mass transfer and catassembly processes. In particular, special mass transfer methods, such as active transport122 and laminar and turbulent flow in microfluidics,142,143 might provide new insights into the design of catassembly systems coupled with mass transfer processes. More importantly, most catassembly systems in living organisms undergo non-equilibrium pathways and often reside in a complex crowded environment, requiring a breakthrough in the non-equilibrium statistical theory to fully understand these systems.144,145

From the perspective of computing methods, developing new density functional approximations is vital for analysing the synergy of non-covalent interactions.146,147 Furthermore, the integration of quantum mechanics, all-atom, and coarse-grained models into a theoretical framework is necessary to construct a multi-scale model with both precision and efficiency.148,149 This model might further be connected with kinetic Monte Carlo and grand-canonical Monte Carlo methods to search for the structure of active intermediates at different scales.150–152

5.3. Summary

In this part, we discuss three possible models of how catassemblers can accelerate and regulate the assembly process, including the leaving mechanisms of catassemblers. These models are only simple catassemblies with a few catassemblers and subunits involved. As the complexity of the system increases, the catalysed assemblies as a whole may play a role as the catassembler for the higher hierarchical system. Then other catassemblies of high level will emerge. The development of catassembly not only depends on the design of new systems, but also relies on the development of more powerful theoretical and characterization methods. The deepening of our understanding of the assembly process will accelerate the development towards complex functional and even intelligent assembly systems.

6. What has artificial catassembly achieved?

In recent years, many studies on artificial catassembly from different aspects have illustrated how the catassembler can accelerate the assembly process, select specific assembly pathways, and even reshape established assemblies, although the authors did not use the same term of catassembly and catassemblers.

6.1 Artificial catassemblers to accelerate the assembly process

Steric hindrances are a common occurrence that often obstruct the assembly of subunits (Fig. 13B). Fortunately, multiple reports have confirmed the effectiveness of catassemblers’ ability to remove these obstructions and accelerate subunit assembly.

The most abundant examples of artificial catassemblies come from the realm of DNA assembly. Most DNAs are assembled using annealing (self-assembly) to obtain desired products.26,43 Turberfield et al. were the first to demonstrate the isothermal formation of the DNA structure through the catassembly of hairpin subunits.48 The hairpin structure contains two domains: the loop domain, whose hybridization is fuel for the assembly; and the stem domain, which provides an energy barrier for the process.

These catassembly processes rely on the same mechanism. For example, in the work by Yin et al., the hairpin structures of subunits provide a high energy barrier that hinders the interaction between them (Fig. 14A).153 The catassembler strand can bind to subunit A and open the hairpin by strand displacement. This removes the kinetical obstruction and exposes the interaction sites, leading to the sequential binding of subunits B and C. Subunit C then competes with the catassembler strand and induces the release of the catassembler strand. The released catassembler can start a new cycle. Through this method, the isothermal assembly of the DNA three-way junction is achieved. The catassembler acts as an activator for the system. This catassembly case was further applied in building DNA tetrahedra, and used as a detection system for other molecules.153–155


image file: d3cs00634d-f14.tif
Fig. 14 Catassemblers to accelerate the assembly process. (A) The catassembly of a three-way junction DNA from hairpin subunits A, B, and C. Reproduced with permission from Ref. 153. Copyright 2008 Springer Nature. (B) The catassembly of the DNA double strand by several steps of strand displacement. Reproduced with permission from Ref. 162. Copyright 2010 Oxford University Press. (C) Electrons act as catassemblers for the recognition of host and guest molecules. Reproduced with permission from Ref. 167. Copyright 2022 Springer Nature.

One system that has been explored is the “hybridization chain reaction” (HCR), which is a process of molecular assembly rather than a reaction.156,157 In this process, two DNA subunits with hairpin-like structures can form supramolecular polymers by self-assembly. However, the long stem of the subunits creates a spatial obstacle that slows down the assembly. To overcome this, an initiator strand is added that opens up one subunit and triggers sequential polymerization of the other subunits. Although the initiator strands remain in the final product, one equivalent initiator can catalyse the assembly of thousands of subunits. Therefore, the initiator strand acts as a catassembler in the HCR. This strategy has been successfully applied in the amplification of signals,158 construction of hydrogels,159,160 and solving mazes.161

Zhang et al. developed a DNA catassembly process using strand displacement.49,162 In the example shown in Fig. 14B, the assembly of strands a and d is prevented by strands b and c. When the catassembler strand binds to strand a, strand c is released and binding sites for strand d are exposed. Strand d then competes with strand b and the catassembler strand, leading to their gradual release. Finally, the catassembler strand binds to strand a with only a few base pairs and releases autonomously. This strategy has been used in DNA circuits, constructing DNA nanotubes, building multiple enzyme systems and assembly of nanocrystals.163–166

Jiao et al. recently reported that electrons can be used as a catassembler to accelerate the recognition of host and guest molecules (Fig. 14C).167 In their system, both the host and guest molecules are radical cations. Coulombic interactions between cations make molecular recognition kinetically unfavourable. The addition of catalytic amounts of electrons partially reduces the host or guest molecules, lowering the energy barrier and accelerating their assembly. This smart mechanism seems to be a versatile strategy that can be widely used.168

6.2. Artificial catassemblers to select the assembly pathway

The other important role of catassemblers is to guide the specific assembly pathway and hinder other pathways, resulting in the obtention of specific products rather than other products.

Zhao et al. reported that the molecule TTC4L has poor solubility at pH levels below 7.0 and will self-assemble into amorphous precipitates. However, the addition of β-CD greatly enhances the solubility of TTC4L. Furthermore, the encapsulation of TTC4L by β-CD reduces collisions between TTC4L molecules. Although this slows down the assembly process, it also alters the assembly pathway of TTC4L, resulting in the formation of a micelle structure (Fig. 15A). These micelles eventually form precipitates and release the catassembler β-CD.169


image file: d3cs00634d-f15.tif
Fig. 15 Catassemblers assist in choosing the assembly pathway. (A) The molecule TTC4L forms microspheres with the assistance of β-CD. (B) The chiral cage CAAA-1 functions as a catassembler for the assembly of right-handed TPPS supramolecular polymers. Reproduced from Ref. 170 with permission from the Royal Society of Chemistry. (C) The catassembler α-CD can help the molecule C4AZO to choose the assembly pathway leading to right-handed supramolecular polymers. Reproduced from Ref. 171 with permission from the American Chemistry Society.

Catassemblers can also participate in the assembly of supramolecular polymers to enhance their enantioselectivity. For example, the self-assembly of TPPS molecules occurs slowly and produces both left- and right-handed supramolecular polymers. However, Wang et al. reported that the addition of a chiral cage, CAAA-1, greatly increases the assembly rate of TPPS and results in the formation of only right-handed polymers (Fig. 15B).170 The catassembler interacts strongly with TPPS monomers to regulate their chirality and has a weaker interaction with the polymer, allowing it to be released autonomously. Similarly, Zhi et al. found that α-CD can facilitate the formation of right-handed C4AZO supramolecular polymers. While the self-assembly of C4AZO produces enantiomers, the addition of α-CD accelerates the assembly rate of right-handed polymers and decreases the assembly rate of left-handed polymers, resulting in the right-handed product (Fig. 15C).171

6.3. Artificial catassemblers to reshape molecular assemblies

As shown above, catassemblers can also adjust the structure of formed molecular assemblies and help them reform into new assemblies.

As a linear biological polymer, RNA can misfold into structures with incorrect conformations. In nature, protein chaperones are available to correct misfolded RNA. Recently, we demonstrated that RNA can also function as a chaperone to solve the misfolding problem of RNA (Fig. 16A).172 During assembly along with synthesis, RNA can easily misfold and form misfolded products due to complex intramolecular interactions. Without assistance, it took several hours to weeks for the misfolded product to refold into the native product. With the assistance of a designed RNA strand as a chaperone, the refolding process can be accelerated by around 1000 times. This research provides evidence that RNA chaperones may exist in the RNA world. Similarly, many kinds of artificial molecular chaperones for proteins have also been developed.173,174


image file: d3cs00634d-f16.tif
Fig. 16 Catassemblers to reshape molecular assemblies. (A) Folding alongside synthesis may produce a misfolded product. The RNA chaperone strand can disrupt the wrong interactions in the misfolded structure and assist it to refold into the native state. (B) Fe atoms and C3PC molecules can self-assemble into Sierpiński triangle structures, which can be disrupted by CO2 and re-assembled into a more stable chain structure.

Xie et al. found that cytosine can self-assemble into a glassy state on the surface of Au(111).175 The cytosine molecules mainly form T-junction structures and contain several types of dimers, which are not the most energetically favourable structures. However, when the catassembler, water, is deposited onto the system, the established molecular assemblies reform into extended molecular-chain arrays containing several stable hydrogen-bonded dimers. DFT calculations show that water can break the weak interactions in the glassy state and facilitate reassembly into stable chain structures. Water molecules have low absorption energy and can easily desorb from the system.

Similarly, Li et al. reported that CO2 can function as a catassembler on surfaces (Fig. 16B).176 On the surface of Au(111), Fe atoms and C3PC molecules can self-assemble into Sierpiński triangle structures, with each Fe atom coordinated with three C3PC molecules. However, this is not the most stable compound that Fe and C3PC can form. The introduction of CO2 to the system disrupts the formed structures and allows them to reassemble into a chain structure, with each Fe atom coordinated with four C3PC molecules. The interaction between CO2 and Fe is weak, so a large dosage is needed. CO2 can easily desorb from the surface and is not present in the final structure.

6.4. Summary

The above pioneering works are either deliberately designed or accidentally discovered. They have shown the potential to construct various molecular assemblies or serve as detection systems based on molecular recognition. However, current catassembly is still in its early stage and requires further exploration of more advanced systems, such as non-equilibrium catassembly, that are driven by external chemical, electrical or optical energy. Some sub-branches such as electro-catassembly and photo-catassembly will be developed, and more functional catassembly systems will emerge. Moreover, the development of artificial catassembly may also deepen our insight into biological catassembly phenomena.

7. Perspective of catassembly in complexity science

Fifty years ago, P. W. Anderson published a famous essay ‘More Is Different’, where he stated that “the behaviour of large and complex aggregates of elementary particles…is not to be understood in terms of a simple extrapolation of the properties of new particles. Instead, at each level of complexity entirely new properties appear”.177 This should be the philosophy guiding the investigation of complex assembly systems.178

Living beings are such integrated systems featuring multi-component, multi-pathway, and multi-level structures. Due to these “multiples,” there is a significant “difference” between living organisms and artificial molecular assembly systems in terms of complexity and diversity of structures and functions. In living organisms, different molecules serve different purposes and work synergistically without interfering with each other as an integrated smart system. Some molecules are responsible for energy harvesting, others serve as carriers, writers, and erasers for information, some work as sensors, and others are involved in the formation and breakage of covalent bonds. Moreover, the operation and maintenance of life require continuous input and output of matter, energy, and information. As a result, the assembly in living organisms cannot solely be dependent on thermodynamic spontaneous processes, but rather on other processes such as catassembly and dissipative processes, thus generating higher functions with hierarchical structures. This is a typical manifestation of “more is different”.

The ultimate goal of artificial molecular assembly is to construct molecular systems that can function like living organisms and play a key role in bionics.179–181 From the perspective of complexity science, the whole is greater than the sum of its parts. In current molecular assembly systems, adding controlling components beyond building blocks will greatly increase the system's complexity. This may promote the emergence of novel functions unachievable through self-assembly alone. For example, introducing catassemblers into the system provides extra controllability over when, where and how molecular assembly occurs. These machineries can collectively promote correct noncovalent interactions among subunits, eliminate incorrect interactions, and restore subunits to a correct assembly state.

It can be expected that as artificial molecular assemblies become more complex, catassembly will become an increasingly efficient and important assembly strategy. Catassemblers can function as “builders” constructing functional assemblies; as “sentry” watching and reacting to external signals; as “transporters” assisting in mass transport; as “coordinators” for controlling systems spatiotemporally; or even as “doctors” repairing damaged parts. For example, as depicted as a tower bridge in Fig. 17, “builders” can build complex and functional assembly systems, and “coordinators” can reshape the structure of the tower bridge, switch its function when a ship (external signal) approaches, and restore the structure and function after that. Overall, catassemblers can play multiple indispensable roles in a well-run molecular assembly system with high complexity.


image file: d3cs00634d-f17.tif
Fig. 17 Cartoons of a tower bridge with multiple functions that illustrate that catassemblers can play several roles such as (A) “builders” and “transporters” constructing complex functional molecular systems, and then (B) “coordinators” and “operators” reshaping the system's structure and function in response to external signals and restoring them accordingly.

The study of catassembly may also initiate reforms in conventional catalysis theory especially in biobased complex systems. Similar to catassembly systems, the interactions between enzymes and substrates, as well as transition states, also involve multiple binding sites and synergistic weak interactions, along with feedback mechanisms.5,182 A profound understanding of the fundamental principles governing these interactions is also crucial for advancing catalytic methodologies, such as organic catalysis and supramolecular catalysis.183–185

Catassembly and catalysis can also be coupled coherently to make complementary advantages to efficiently build multi-level structures of assemblies.12 For example, the formation of the capsid of bacteriophage HK97 is a typical process of catassembly coupled with chemical reactions.186 As the coupled reactions continuously consume the assembled products, they push the assembly equilibrium in the positive direction. The assembly provides the sites required for coupling, further promoting the coupling process. More importantly, the stepwise coupling of catassembly and catalysis can efficiently and precisely construct flexible and stable assembly systems with hierarchical structures.12 Furthermore, the rational integration of chemical reactions (including spontaneous reactions, field-assisted reactions, and catalysis) and molecular assembly (including self-assembly, field-assisted assembly, and catassembly) can construct highly advanced functional molecular systems that involve many components and complex processes.

It should be emphasized that over the past few years, the advent of the artificial intelligence (AI) era has offered unprecedented opportunities to revolutionize the landscape of molecular assembly. With the breakthrough of algorithms and increasing computational power, AI is enabling the design of building blocks and prediction of their interactions. More importantly, AI can elucidate the pathways and mechanisms of molecular assembly processes, which are highly complex and remain poorly understood nowadays.187,188 Of particular interest is the potential of AI to understand and predict how catassemblers control and regulate molecular assembly processes.

On the one hand, a series of machine learning (ML) approaches based on the neural network, including AlphaFold, AlphaFold2 and RoseTTAFold, have achieved accurate prediction of protein structures from their sequences, and have been used to design proteins as assembly subunits.187,189–191 Furthermore, methods like AlphaFold-Multimer8 have been developed to identify the protein docking sites and produce accurate structures of heterodimeric protein complexes.192–194 These tools could be applied in the design of catassemblers and catassembly processes. In addition, advances in ML-based data mining methods enable the discovery of hidden patterns from heterogeneous big data with implicit noise, thereby consequently enhancing resolution and relieving chemists of time-consuming tasks.195,196 Introducing the deep learning methods into multiscale molecular modelling could also optimize the parameters of force fields, extract equilibrium properties of systems containing rare events and promote the accessible time and spatial scales of atomistic simulations.197–199

On the other hand, large language models (LLMs) are anticipated to extract chemical information from the literature reports and provide comprehensive insights into the design of molecular assembly systems.200–202 Leveraging LLMs alongside big data techniques could uncover commonalities among seemingly irrelevant catassembly systems in living organisms. Furthermore, data-driven experimentation by high-throughput automation and real-time kinetic monitoring is of utmost importance in navigating the complexities of molecular assembly, which can generate precise and reliable data for training models.203–205 Meanwhile, optimization methods, such as Bayesian optimization and evolutionary algorithms, could act as a “brain” to expedite the exploration of optimal experimental conditions and facilitate error-free autonomous workflows.203,206–208 AI-assisted and AI-driven methods will offer new avenues for transforming the paradigm of molecular assembly and catassembly research. The synergy between the emergent nature of AI and molecular assembly systems holds the great promise of creating advanced functionality and even emergence of intelligent molecular systems that transcend the capabilities of living organisms. This will also deepen our understanding of complexity science, leading to better ways to manage and control complex systems.

8. Conclusions

This review has stemmed from continual exploration and investigation on catassembly by us in the past decade. Through systematic investigation and literature search on the relevant chemistry and biology, we have gradually realized that the catassembly phenomenon is ubiquitous in biological assembly systems. The comprehensive understanding of catassembly in living organisms may open new avenues not only promoting the development of molecular assembly but also revealing some long-term unrevealed mysteries of highly functional biological systems.

Catassembly has been shown to be an efficient and highly selective strategy for synergistically controlling and manipulating various noncovalent interactions, especially in hierarchical molecular assemblies. We emphasize that canonical catassemblers can either accelerate assembly or guide assembly pathways; on the other hand, noncanonical catassemblers play a pivotal role in enabling reassembly and (partial) disassembly to achieve the smart assembly with highly responsible and versatile properties. Catassemblers can play multiple roles, such as builders, transporters, and coordinators, making catassembly an indispensable strategy in constructing and regulating complex assembly systems.

Accordingly, we propose design principles for artificial catassembly and discuss corresponding characterization and theoretical methods. We summarize three basic models of catassembly, including (1) catassemblers preventing undesirable aggregation and promoting correct assembly; (2) catassemblers activating building blocks to facilitate assembly; and (3) energy-powered catassembly processes. Along this direction, we propose four pathways for catassembler departure, including competitive replacement by building blocks, allosteric effects, phase separation and dissipative recycle of catassemblers.

We further discuss representative catassembly works in which catassemblers accelerate assembly, guide assembly pathways, and reshape assemblies. By learning from biological assembly and advanced artificial assembly, it is evident that the more complex the assembly, the greater the demand for catassembly, and the more types of catassemblers are required. These features are distinctively different from those of self-assembly as the most widely used strategy in molecular assembly for decades. Moreover, the rational combination of self-assembly with catassembly, one of the important branches of assisted-assembly, could be necessary to advance the new generations of artificial molecular assembly and its relevant materials and devices.

Although the concept of catassembly was originally inspired by catalysis chemistry about ten years ago, we have now realized, through studying catassembly in living organisms, that catassembly and catalysis belong to a big family but may have some distinct principles based on the famous essay “More Is Different” by P. W. Anderson. Catassembly could be a much more complicated process based on noncovalent interactions, featuring synergistic multivalent and multicomponent interactions, and long-range feedback. These features are essential to construct hierarchical structures which can eventually generate new properties and functions, which have yet to be the focus of chemical catalysis. Therefore, it could be highly desirable to make great efforts to develop and even establish fundamental principles, theoretical framework and characterization tools for catassembly. This approach may also initiate reforms in conventional catalysis theory and develop strategies integrating covalent synthesis and noncovalent assembly to construct complex molecular systems.

Undoubtedly, the advancement of molecular assembly into complexity science must require more and more people from different fields to assemble and work synergistically to bridge the gap between chemistry and life science. There is a good reason to be optimistic about the new era of AI. It offers unprecedented opportunities to revolutionize the landscape of molecular assembly. Large language models will extract the chemical and biological information from the literature and then uncover commonalities among seemingly irrelevant or unworkable catassembly systems, such as those found in living organisms. More importantly, AI can act as a “catassembler” to greatly improve the communication language and skills of chemists, biologists and physicists from diverse research fields and disciplines to advance the complexity science.

Author contributions

Zhi-Chao Lei: writing – original draft, writing – review & editing, conceptualization, visualisation. Xinchang Wang writing – original draft, writing – review & editing, visualisation. Liulin Yang: writing – original draft, writing – review & editing. Hang Qu: writing – original draft, writing – review & editing. Yibin Sun: writing – review & editing. Yang Yang: writing – review & editing. Wei Li: visualisation. Wen-Bin Zhang: writing – review & editing. Xiao-Yu Cao: writing – review & editing. Chunhai Fan: writing – review & editing. Guohong Li: supervision, writing – review & editing, funding acquisition. Jiarui Wu: writing – review & editing. Zhong-Qun Tian: supervision, writing – review & editing, funding acquisition.

Conflicts of interest

There are no conflicts to declare.

Acknowledgements

We thank Prof Xi Zhang, Prof Minghua Liu, Prof Meixiang Wang, Prof Ling-Ling Chen, Prof Zengyi Chang, Prof Pengye Wang, Prof Xin Xu, Prof Yiqin Gao, Prof Ping Chen, Dr Cen Zhou, and Dr Zhihao Li for the helpful discussion. We would also like to thank all the students, postdocs, and collaborators who have contributed to our catassembly research over the past decade. This work is supported by the National Natural Science Foundation of China (NSFC) (No. 21991130, 21991131, 21991133, 21991134).

Notes and references

  1. J.-M. Lehn, Angew. Chem. Int., Ed. Engl., 1988, 27, 89–112 CrossRef .
  2. H. Ringsdorf, B. Schlarb and J. Venzmer, Angew. Chem. Int., Ed. Engl., 1988, 27, 113–158 CrossRef .
  3. D. Philp and J. F. Stoddart, Angew. Chem. Int., Ed. Engl., 1996, 35, 1154–1196 CrossRef .
  4. J. W. Steed and J. L. Atwood, Supramolecular Chemistry, Wiley, Hoboken, NJ Chichester, West Sussex, 3rd edn, 2022 Search PubMed .
  5. D. L. Nelson, Lehninger Principles of Biochemistry, W.H. Freeman, Austin, 8th edn, 2021 Search PubMed .
  6. M. C. T. Fyfe and J. F. Stoddart, Acc. Chem. Res., 1997, 30, 393–401 CrossRef CAS .
  7. J.-M. Lehn, Science, 2002, 295, 2400–2403 CrossRef CAS PubMed .
  8. J. S. Lindsey, New J. Chem., 1991, 15, 153–180 CAS .
  9. G. M. Whitesides and B. Grzybowski, Science, 2002, 295, 2418–2421 CrossRef CAS PubMed .
  10. G. T. Williams, C. J. E. Haynes, M. Fares, C. Caltagirone, J. R. Hiscock and P. A. Gale, Chem. Soc. Rev., 2021, 50, 2737–2763 RSC .
  11. M. J. Webber, E. A. Appel, E. W. Meijer and R. Langer, Nat. Mater., 2016, 15, 13–26 CrossRef CAS PubMed .
  12. Y. Wang, H.-X. Lin, L. Chen, S.-Y. Ding, Z.-C. Lei, D.-Y. Liu, X.-Y. Cao, H.-J. Liang, Y.-B. Jiang and Z.-Q. Tian, Chem. Soc. Rev., 2014, 43, 399–411 RSC .
  13. G. Vantomme and E. W. Meijer, Science, 2019, 363, 1396–1397 CrossRef CAS PubMed .
  14. C. B. Anfinsen, Science, 1973, 181, 223–230 CrossRef CAS PubMed .
  15. R. J. Ellis, Trends Biochem. Sci., 2006, 31, 395–401 CrossRef CAS PubMed .
  16. R. J. Ellis, Philos. Trans. R. Soc., B, 2013, 368, 20110398 CrossRef PubMed .
  17. E. C. Twomey, Z. Ji, T. E. Wales, N. O. Bodnar, S. B. Ficarro, J. A. Marto, J. R. Engen and T. A. Rapoport, Science, 2019, 365, eaax1033 CrossRef CAS PubMed .
  18. C. R. Clapier, J. Iwasa, B. R. Cairns and C. L. Peterson, Nat. Rev. Mol. Cell Biol., 2017, 18, 407–422 CrossRef CAS PubMed .
  19. Y. Wang, H. Lin, S. Ding, D. Liu, L. Chen, Z. Lei, F. Fan and Z. Tian, Sci. Sin.: Chim., 2012, 42, 525–547 CAS .
  20. Z.-C. Lei, X. Wang, H. Qu, C. Zhou, Z. Li, L. Yang, X. Cao and Z.-Q. Tian, Sci. Sin.: Chim., 2020, 50, 1781–1800 Search PubMed .
  21. J. M. Lehn, Pure Appl. Chem., 1978, 50, 871–892 CrossRef CAS .
  22. J.-M. Lehn, Science, 1993, 260, 1762–1763 CrossRef CAS PubMed .
  23. R. G. Nuzzo and D. L. Allara, J. Am. Chem. Soc., 1983, 105, 4481–4483 CrossRef CAS .
  24. C. D. Bain, E. B. Troughton, Y. T. Tao, J. Evall, G. M. Whitesides and R. G. Nuzzo, J. Am. Chem. Soc., 1989, 111, 321–335 CrossRef CAS .
  25. S. Zhang, T. Holmes, C. Lockshin and A. Rich, Proc. Natl. Acad. Sci. U. S. A., 1993, 90, 3334–3338 CrossRef CAS PubMed .
  26. J. Chen and N. C. Seeman, Nature, 1991, 350, 631–633 CrossRef CAS PubMed .
  27. K. Wang, K. Amin, Z. An, Z. Cai, H. Chen, H. Chen, Y. Dong, X. Feng, W. Fu, J. Gu, Y. Han, D. Hu, R. Hu, D. Huang, F. Huang, F. Huang, Y. Huang, J. Jin, X. Jin, Q. Li, T. Li, Z. Li, Z. Li, J. Liu, J. Liu, S. Liu, H. Peng, A. Qin, X. Qing, Y. Shen, J. Shi, X. Sun, B. Tong, B. Wang, H. Wang, L. Wang, S. Wang, Z. Wei, T. Xie, C. Xu, H. Xu, Z.-K. Xu, B. Yang, Y. Yu, X. Zeng, X. Zhan, G. Zhang, J. Zhang, M. Q. Zhang, X.-Z. Zhang, X. Zhang, Y. Zhang, Y. Zhang, C. Zhao, W. Zhao, Y. Zhou, Z. Zhou, J. Zhu, X. Zhu and B. Z. Tang, Mater. Chem. Front., 2020, 4, 1803–1915 RSC .
  28. Z. Zhao, Y. Dong, Z. Duan, D. Jin, W. Yuan and D. Liu, Aggregate, 2021, 2, e95 CrossRef CAS .
  29. C. A. Mirkin, R. L. Letsinger, R. C. Mucic and J. J. Storhoff, Nature, 1996, 382, 607–609 CrossRef CAS PubMed .
  30. H. Lin, S. Lee, L. Sun, M. Spellings, M. Engel, S. C. Glotzer and C. A. Mirkin, Science, 2017, 355, 931–935 CrossRef CAS PubMed .
  31. J.-F. Ayme, J. E. Beves, D. A. Leigh, R. T. McBurney, K. Rissanen and D. Schultz, Nat. Chem., 2012, 4, 15–20 CrossRef CAS PubMed .
  32. L. Chen, S. N. Berry, X. Wu, E. N. W. Howe and P. A. Gale, Chem, 2020, 6, 61–141 CAS .
  33. A. Ulman, Chem. Rev., 1996, 96, 1533–1554 CrossRef CAS PubMed .
  34. J. C. Love, L. A. Estroff, J. K. Kriebel, R. G. Nuzzo and G. M. Whitesides, Chem. Rev., 2005, 105, 1103–1170 CrossRef CAS PubMed .
  35. T. F. A. De Greef, M. M. J. Smulders, M. Wolffs, A. P. H. J. Schenning, R. P. Sijbesma and E. W. Meijer, Chem. Rev., 2009, 109, 5687–5754 CrossRef CAS PubMed .
  36. T. Aida, E. W. Meijer and S. I. Stupp, Science, 2012, 335, 813–817 CrossRef CAS PubMed .
  37. L. Yang, X. Tan, Z. Wang and X. Zhang, Chem. Rev., 2015, 115, 7196–7239 CrossRef CAS PubMed .
  38. X. Hou, T. Zaks, R. Langer and Y. Dong, Nat. Rev. Mater., 2021, 6, 1078–1094 CrossRef CAS PubMed .
  39. H.-Q. Peng, L.-Y. Niu, Y.-Z. Chen, L.-Z. Wu, C.-H. Tung and Q.-Z. Yang, Chem. Rev., 2015, 115, 7502–7542 CrossRef CAS PubMed .
  40. H. Chen and J. F. Stoddart, Nat. Rev. Mater., 2021, 6, 804–828 CrossRef CAS .
  41. Z. Chen, Q. Wang, X. Wu, Z. Li and Y.-B. Jiang, Chem. Soc. Rev., 2015, 44, 4249–4263 RSC .
  42. J. Qi, T. Wu, W. Wang, H. Jin, S. Gao, S. Jiang, J. Huang and Y. Yan, Aggregate, 2022, 3, e173 CrossRef CAS .
  43. P. W. K. Rothemund, Nature, 2006, 440, 297–302 CrossRef CAS PubMed .
  44. S. M. Douglas, H. Dietz, T. Liedl, B. Högberg, F. Graf and W. M. Shih, Nature, 2009, 459, 414–418 CrossRef CAS PubMed .
  45. C. Conde and A. Cáceres, Nat. Rev. Neurosci., 2009, 10, 319–332 CrossRef CAS PubMed .
  46. M. D. Ward and P. R. Raithby, Chem. Soc. Rev., 2013, 42, 1619–1636 RSC .
  47. L. J. Prins, F. De Jong, P. Timmerman and D. N. Reinhoudt, Nature, 2000, 408, 181–184 CrossRef CAS PubMed .
  48. A. J. Turberfield, J. C. Mitchell, B. Yurke, A. P. Mills, M. I. Blakey and F. C. Simmel, Phys. Rev. Lett., 2003, 90, 118102 CrossRef CAS PubMed .
  49. D. Y. Zhang, A. J. Turberfield, B. Yurke and E. Winfree, Science, 2007, 318, 1121–1125 CrossRef CAS PubMed .
  50. H. Li, X. Xu, J. Shang, J. Li, X. Hu, B. K. Teo and K. Wu, J. Phys. Chem. C, 2012, 116, 21753–21761 CrossRef CAS .
  51. F. U. Hartl, A. Bracher and M. Hayer-Hartl, Nature, 2011, 475, 324–332 CrossRef CAS PubMed .
  52. A. L. Horwich and W. A. Fenton, Q. Rev. Biophys., 2020, 53, e4 CrossRef PubMed .
  53. R. Mitra, K. Wu, C. Lee and J. C. A. Bardwell, Annu. Rev. Biophys., 2022, 51, 409–429 CrossRef CAS PubMed .
  54. K. Braig, Z. Otwinowski, R. Hegde, D. C. Boisvert, A. Joachimiak, A. L. Horwich and P. B. Sigler, Nature, 1994, 371, 578–586 CrossRef CAS PubMed .
  55. L. Chen and P. B. Sigler, Cell, 1999, 99, 757–768 CrossRef CAS PubMed .
  56. A. L. Horwich and W. A. Fenton, Q. Rev. Biophys., 2009, 42, 83–116 CrossRef CAS PubMed .
  57. V. Albanèse, A. Y.-W. Yam, J. Baughman, C. Parnot and J. Frydman, Cell, 2006, 124, 75–88 CrossRef PubMed .
  58. R. A. Quinlan and R. J. Ellis, Philos. Trans. R. Soc., B, 2013, 368, 20130091 CrossRef PubMed .
  59. M. K. Hospenthal, T. R. D. Costa and G. Waksman, Nat. Rev. Microbiol., 2017, 15, 365–379 CrossRef CAS PubMed .
  60. L. Craig, K. T. Forest and B. Maier, Nat. Rev. Microbiol., 2019, 17, 429–440 CrossRef CAS PubMed .
  61. F. G. Sauer, K. Fütterer, J. S. Pinkner, K. W. Dodson, S. J. Hultgren and G. Waksman, Science, 1999, 285, 1058–1061 CrossRef CAS PubMed .
  62. A. Busch and G. Waksman, Philos. Trans. R. Soc., B, 2012, 367, 1112–1122 CrossRef CAS PubMed .
  63. A. Busch, G. Phan and G. Waksman, Philos. Trans. R. Soc., A, 2015, 373, 20130153 CrossRef PubMed .
  64. G. T. Werneburg, N. S. Henderson, E. B. Portnoy, S. Sarowar, S. J. Hultgren, H. Li and D. G. Thanassi, Nat. Struct. Mol. Biol., 2015, 22, 540–546 CrossRef CAS PubMed .
  65. M. Hayer-Hartl and F. U. Hartl, Trends Biochem. Sci., 2020, 45, 748–763 CrossRef CAS PubMed .
  66. A. Bracher, S. M. Whitney, F. U. Hartl and M. Hayer-Hartl, Annu. Rev. Plant Biol., 2017, 68, 29–60 CrossRef CAS PubMed .
  67. S. Saschenbrecker, A. Bracher, K. V. Rao, B. V. Rao, F. U. Hartl and M. Hayer-Hartl, Cell, 2007, 129, 1189–1200 CrossRef CAS PubMed .
  68. C. Liu, A. L. Young, A. Starling-Windhof, A. Bracher, S. Saschenbrecker, B. V. Rao, K. V. Rao, O. Berninghausen, T. Mielke, F. U. Hartl, R. Beckmann and M. Hayer-Hartl, Nature, 2010, 463, 197–202 CrossRef CAS PubMed .
  69. L.-Y. Xia, Y.-L. Jiang, W.-W. Kong, H. Sun, W.-F. Li, Y. Chen and C.-Z. Zhou, Nat. Plants, 2020, 6, 708–717 CrossRef CAS PubMed .
  70. F. Huang, W.-W. Kong, Y. Sun, T. Chen, G. F. Dykes, Y.-L. Jiang and L.-N. Liu, Proc. Natl. Acad. Sci. U. S. A., 2020, 117, 17418–17428 CrossRef CAS PubMed .
  71. R. L. Thompson, L. Lassaletta, P. K. Patra, C. Wilson, K. C. Wells, A. Gressent, E. N. Koffi, M. P. Chipperfield, W. Winiwarter, E. A. Davidson, H. Tian and J. G. Canadell, Nat. Clim. Change, 2019, 9, 993–998 CrossRef CAS .
  72. A. Pomowski, W. G. Zumft, P. M. H. Kroneck and O. Einsle, Nature, 2011, 477, 234–237 CrossRef CAS PubMed .
  73. C. Müller, L. Zhang, S. Zipfel, A. Topitsch, M. Lutz, J. Eckert, B. Prasser, M. Chami, W. Lü, J. Du and O. Einsle, Nature, 2022, 608, 626–631 CrossRef PubMed .
  74. K. Luger, A. W. Mäder, R. K. Richmond, D. F. Sargent and T. J. Richmond, Nature, 1997, 389, 251 CrossRef CAS PubMed .
  75. F. Song, P. Chen, D. Sun, M. Wang, L. Dong, D. Liang, R.-M. Xu, P. Zhu and G. Li, Science, 2014, 344, 376–380 CrossRef CAS PubMed .
  76. R. A. Laskey, B. M. Honda, A. D. Mills and J. T. Finch, Nature, 1978, 275, 416–420 CrossRef CAS PubMed .
  77. Z. A. Gurard-Levin, J.-P. Quivy and G. Almouzni, Annu. Rev. Biochem., 2014, 83, 487–517 CrossRef CAS PubMed .
  78. C. M. Hammond, C. B. Strømme, H. Huang, D. J. Patel and A. Groth, Nat. Rev. Mol. Cell Biol., 2017, 18, 141–158 CrossRef CAS PubMed .
  79. P. Grover, J. S. Asa and E. I. Campos, Annu. Rev. Genet., 2018, 52, 109–130 CrossRef CAS PubMed .
  80. H. Tagami, D. Ray-Gallet, G. Almouzni and Y. Nakatani, Cell, 2004, 116, 51–61 CrossRef CAS PubMed .
  81. D. Ray-Gallet, J.-P. Quivy, C. Scamps, E. M.-D. Martini, M. Lipinski and G. Almouzni, Mol. Cell, 2002, 9, 1091–1100 CrossRef CAS PubMed .
  82. H. M. Schnell, R. M. Walsh, S. Rawson, M. Kaur, M. K. Bhanu, G. Tian, M. A. Prado, A. Guerra-Moreno, J. A. Paulo, S. P. Gygi, J. Roelofs, D. Finley and J. Hanna, Nat. Struct. Mol. Biol., 2021, 28, 418–425 CrossRef CAS PubMed .
  83. S. Hashemolhosseini, Y. D. Stierhof, I. Hindennach and U. Henning, J. Bacteriol., 1996, 178, 6258–6265 CrossRef CAS PubMed .
  84. S. Galan Bartual, C. Garcia-Doval, J. Alonso, G. Schoehn and M. J. van Raaij, Protein Expression Purif., 2010, 70, 116–121 CrossRef CAS PubMed .
  85. J. Xu, R. W. Hendrix and R. L. Duda, J. Mol. Biol., 2014, 426, 1004–1018 CrossRef CAS PubMed .
  86. A. C. McShan, C. A. Devlin, S. A. Overall, J. Park, J. S. Toor, D. Moschidi, D. Flores-Solis, H. Choi, S. Tripathi, E. Procko and N. G. Sgourakis, Proc. Natl. Acad. Sci. U. S. A., 2019, 116, 25602–25613 CrossRef CAS PubMed .
  87. C. Thomas and R. Tampé, Curr. Opin. Immunol., 2021, 70, 48–56 CrossRef CAS PubMed .
  88. J. M. Barral, A. H. Hutagalung, A. Brinker, F. U. Hartl and H. F. Epstein, Science, 2002, 295, 669–671 CrossRef CAS PubMed .
  89. D. Hellerschmied, A. Lehner, N. Franicevic, R. Arnese, C. Johnson, A. Vogel, A. Meinhart, R. Kurzbauer, L. Deszcz, L. Gazda, M. Geeves and T. Clausen, Nat. Commun., 2019, 10, 4781 CrossRef PubMed .
  90. C. L. Hagan, T. J. Silhavy and D. Kahne, Annu. Rev. Biochem., 2011, 80, 189–210 CrossRef CAS PubMed .
  91. L. Han, J. Zheng, Y. Wang, X. Yang, Y. Liu, C. Sun, B. Cao, H. Zhou, D. Ni, J. Lou, Y. Zhao and Y. Huang, Nat. Struct. Mol. Biol., 2016, 23, 192–196 CrossRef CAS PubMed .
  92. S. Murata, H. Yashiroda and K. Tanaka, Nat. Rev. Mol. Cell Biol., 2009, 10, 104–115 CrossRef CAS PubMed .
  93. A. Rousseau and A. Bertolotti, Nat. Rev. Mol. Cell Biol., 2018, 19, 697–712 CrossRef CAS PubMed .
  94. R. A. Dick, K. K. Zadrozny, C. Xu, F. K. M. Schur, T. D. Lyddon, C. L. Ricana, J. M. Wagner, J. R. Perilla, B. K. Ganser-Pornillos, M. C. Johnson, O. Pornillos and V. M. Vogt, Nature, 2018, 560, 509–512 CrossRef CAS PubMed .
  95. R. A. Dick, D. L. Mallery, V. M. Vogt and L. C. James, Viruses, 2018, 10, 640 CrossRef CAS PubMed .
  96. D. J. Battle, M. Kasim, J. Yong, F. Lotti, C.-K. Lau, J. Mouaikel, Z. Zhang, K. Han, L. Wan and G. Dreyfuss, Cold Spring Harbor Symp. Quant. Biol., 2006, 71, 313–320 CrossRef CAS PubMed .
  97. I. Iost, T. Bizebard and M. Dreyfus, Biochim. Biophys. Acta, Gene Regul. Mech., 2013, 1829, 866–877 CrossRef CAS PubMed .
  98. S. A. Woodson, S. Panja and A. Santiago-Frangos, Microbiol. Spectrum, 2018, 6, 6.4.21 CrossRef PubMed .
  99. L. Smalinskaitė, M. K. Kim, A. J. O. Lewis, R. J. Keenan and R. S. Hegde, Nature, 2022, 611, 161–166 CrossRef PubMed .
  100. Z. Ji, H. Li, D. Peterle, J. A. Paulo, S. B. Ficarro, T. E. Wales, J. A. Marto, S. P. Gygi, J. R. Engen and T. A. Rapoport, Mol. Cell, 2022, 82, 570–584.e8 CrossRef CAS PubMed .
  101. L. Yan and Z. Chen, Trends Biochem. Sci., 2020, 45, 217–227 CrossRef CAS PubMed .
  102. I. M. Nodelman and G. D. Bowman, Annu. Rev. Biophys., 2021, 50, 73–93 CrossRef CAS PubMed .
  103. G. D. Bowman and S. Deindl, Science, 2019, 366, 35–36 CrossRef CAS PubMed .
  104. G. Mizuguchi, X. Shen, J. Landry, W.-H. Wu, S. Sen and C. Wu, Science, 2004, 303, 343–348 CrossRef CAS PubMed .
  105. M. Papamichos-Chronakis, S. Watanabe, O. J. Rando and C. L. Peterson, Cell, 2011, 144, 200–213 CrossRef CAS PubMed .
  106. O. Willhoft, M. Ghoneim, C.-L. Lin, E. Y. D. Chua, M. Wilkinson, Y. Chaban, R. Ayala, E. A. McCormack, L. Ocloo, D. S. Rueda and D. B. Wigley, Science, 2018, 362, eaat7716 CrossRef PubMed .
  107. S. Eustermann, K. Schall, D. Kostrewa, K. Lakomek, M. Strauss, M. Moldt and K.-P. Hopfner, Nature, 2018, 556, 386–390 CrossRef CAS PubMed .
  108. R. Ayala, O. Willhoft, R. J. Aramayo, M. Wilkinson, E. A. McCormack, L. Ocloo, D. B. Wigley and X. Zhang, Nature, 2018, 556, 391–395 CrossRef CAS PubMed .
  109. E. del Val, W. Nasser, H. Abaibou and S. Reverchon, Biochem. Soc. Trans., 2019, 47, 1511–1531 CrossRef CAS PubMed .
  110. H. Yang, C. Zhou, A. Dhar and N. P. Pavletich, Nature, 2020, 586, 801–806 CrossRef CAS PubMed .
  111. G. Orphanides, G. LeRoy, C.-H. Chang, D. S. Luse and D. Reinberg, Cell, 1998, 92, 105–116 CrossRef CAS PubMed .
  112. P. Chen, L. Dong, M. Hu, Y.-Z. Wang, X. Xiao, Z. Zhao, J. Yan, P.-Y. Wang, D. Reinberg, M. Li, W. Li and G. Li, Mol. Cell, 2018, 71, 284–293.e4 CrossRef CAS PubMed .
  113. W. Li, P. Chen, J. Yu, L. Dong, D. Liang, J. Feng, J. Yan, P.-Y. Wang, Q. Li, Z. Zhang, M. Li and G. Li, Mol. Cell, 2016, 64, 120–133 CrossRef CAS PubMed .
  114. Y. Liu, K. Zhou, N. Zhang, H. Wei, Y. Z. Tan, Z. Zhang, B. Carragher, C. S. Potter, S. D’Arcy and K. Luger, Nature, 2020, 577, 426–431 CrossRef CAS PubMed .
  115. L. Farnung, M. Ochmann, M. Engeholm and P. Cramer, Nat. Struct. Mol. Biol., 2021, 28, 382–387 CrossRef CAS PubMed .
  116. H. Ehara, T. Kujirai, M. Shirouzu, H. Kurumizaka and S. Sekine, Science, 2022, 377, eabp9466 CrossRef CAS PubMed .
  117. J.-M. Lehn, Proc. Natl. Acad. Sci. U. S. A., 2002, 99, 4763–4768 CrossRef CAS PubMed .
  118. H. N. Motlagh, J. O. Wrabl, J. Li and V. J. Hilser, Nature, 2014, 508, 331–339 CrossRef CAS PubMed .
  119. R. J. Ellis, Trends Biochem. Sci., 2001, 26, 597–604 CrossRef CAS PubMed .
  120. R. J. Ellis, Curr. Opin. Struct. Biol., 2001, 11, 114–119 CrossRef CAS PubMed .
  121. T. Hirose, K. Ninomiya, S. Nakagawa and T. Yamazaki, Nat. Rev. Mol. Cell Biol., 2023, 24, 288–304 CrossRef CAS PubMed .
  122. P. C. Bressloff and J. M. Newby, Rev. Mod. Phys., 2013, 85, 135–196 CrossRef CAS .
  123. E. C. Rodrigues, J. Grawenhoff, S. J. Baumann, N. Lorenzon and S. P. Maurer, Front. Integr. Neurosci., 2021, 15, 692948 CrossRef CAS PubMed .
  124. E. Luchinat, M. Cremonini and L. Banci, Chem. Rev., 2022, 122, 9267–9306 CrossRef CAS PubMed .
  125. F.-X. Theillet, Chem. Rev., 2022, 122, 9497–9570 CrossRef CAS PubMed .
  126. C. J. Bustamante, Y. R. Chemla, S. Liu and M. D. Wang, Nat. Rev. Methods Primers, 2021, 1, 1–29 CrossRef .
  127. Q. Xin, P. Li, Y. He, C. Shi, Y. Qiao, X. Bian, J. Su, R. Qiao, X. Zhou and J. Zhong, Anal. Methods, 2017, 9, 5720–5730 RSC .
  128. Y.-Z. Wang, C. Liu, J. Zhao, J. Yu, A. Luo, X. Xiao, S.-X. Dou, L. Ma, P.-Y. Wang, M. Li, G. Li, J. Yan, P. Chen and W. Li, Nucleic Acids Res., 2022, 50, 833–846 CrossRef CAS PubMed .
  129. X. Xiao, C. Liu, Y. Pei, Y.-Z. Wang, J. Kong, K. Lu, L. Ma, S.-X. Dou, P.-Y. Wang, G. Li, P. Chen and W. Li, J. Am. Chem. Soc., 2020, 142, 3340–3345 CrossRef CAS PubMed .
  130. A. Luo, J. Kong, J. Chen, X. Xiao, J. Lan, X. Li, C. Liu, P.-Y. Wang, G. Li, W. Li and P. Chen, Nat. Commun., 2023, 14, 741 CrossRef CAS PubMed .
  131. J. B. Stiller, R. Otten, D. Häussinger, P. S. Rieder, D. L. Theobald and D. Kern, Nature, 2022, 603, 528–535 CrossRef CAS PubMed .
  132. H. Fang, Y. Sun, X. Wang, M. Sharma, Z. Chen, X. Cao, M. Utz and Z. Tian, Sci. China: Chem., 2018, 61, 1460–1464 CrossRef CAS .
  133. Y. Sun, H. Fang, X. Lin, X. Wang, G. Chen, X. Wang, Z. Tian, L. Yang, M. Utz and X. Cao, CCS Chem., 2021, 4, 557–565 CrossRef .
  134. G. Chen, P. Shi, L. Zeng, L. Feng, X. Wang, X. Lin, Y. Sun, H. Fang, X. Cao, X. Wang, L. Yang and Z. Tian, Chem. Sci., 2022, 13, 7796–7804 RSC .
  135. S. Wu, B. Tutuncuoglu, K. Yan, H. Brown, Y. Zhang, D. Tan, M. Gamalinda, Y. Yuan, Z. Li, J. Jakovljevic, C. Ma, J. Lei, M.-Q. Dong, J. L. Woolford and N. Gao, Nature, 2016, 534, 133–137 CrossRef CAS PubMed .
  136. Z. Fu, S. Kaledhonkar, A. Borg, M. Sun, B. Chen, R. A. Grassucci, M. Ehrenberg and J. Frank, Structure, 2016, 24, 2092–2101 CrossRef CAS PubMed .
  137. B. Chen, S. Kaledhonkar, M. Sun, B. Shen, Z. Lu, D. Barnard, T.-M. Lu, R. L. Gonzalez and J. Frank, Structure, 2015, 23, 1097–1105 CrossRef CAS PubMed .
  138. V. P. Dandey, W. C. Budell, H. Wei, D. Bobe, K. Maruthi, M. Kopylov, E. T. Eng, P. A. Kahn, J. E. Hinshaw, N. Kundu, C. M. Nimigean, C. Fan, N. Sukomon, S. A. Darst, R. M. Saecker, J. Chen, B. Malone, C. S. Potter and B. Carragher, Nat. Methods, 2020, 17, 897–900 CrossRef CAS PubMed .
  139. N. Schwierz, C. V. Frost, P. L. Geissler and M. Zacharias, J. Am. Chem. Soc., 2016, 138, 527–539 CrossRef CAS PubMed .
  140. H. Qu, T. Tong, Z.-C. Lei, P. Shi, L. Yang, X. Cao, Y. Gao, Z. Hou, X. Xu and Z.-Q. Tian, Sci. Sin.: Chim., 2023, 53, 145–173 Search PubMed .
  141. M. L. Klein and W. Shinoda, Science, 2008, 321, 798–800 CrossRef CAS PubMed .
  142. S. Sevim, A. Sorrenti, C. Franco, S. Furukawa, S. Pané, A. J. deMello and J. Puigmartí-Luis, Chem. Soc. Rev., 2018, 47, 3788–3803 RSC .
  143. J. Sun, Y. Li, F. Yan, C. Liu, Y. Sang, F. Tian, Q. Feng, P. Duan, L. Zhang, X. Shi, B. Ding and M. Liu, Nat. Commun., 2018, 9, 2599 CrossRef PubMed .
  144. A. Arango-Restrepo, D. Barragán and J. M. Rubi, Phys. Chem. Chem. Phys., 2019, 21, 17475–17493 RSC .
  145. G. Ragazzon and L. J. Prins, Nat. Nanotechnol., 2018, 13, 882–889 CrossRef CAS PubMed .
  146. L. A. Burns, Á. V. Mayagoitia, B. G. Sumpter and C. D. Sherrill, J. Chem. Phys., 2011, 134, 084107 CrossRef PubMed .
  147. Z. Liu, Z. Chen, J. Xi and X. Xu, Natl. Sci. Rev., 2020, 7, 1036–1045 CrossRef CAS PubMed .
  148. J. Zhang, Y.-K. Lei, Y. I. Yang and Y. Q. Gao, J. Chem. Phys., 2020, 153, 174115 CrossRef CAS PubMed .
  149. H. I. Ingólfsson, C. Arnarez, X. Periole and S. J. Marrink, J. Cell Sci., 2016, 129, 257–268 Search PubMed .
  150. A. Slepoy, A. P. Thompson and S. J. Plimpton, J. Chem. Phys., 2008, 128, 205101 CrossRef PubMed .
  151. I. G. Kevrekidis and G. Samaey, Annu. Rev. Phys. Chem., 2009, 60, 321–344 CrossRef CAS PubMed .
  152. J. Kaiser, M. Castellano, D. Gnandt and T. Koslowski, J. Comput. Chem., 2020, 41, 1105–1115 CrossRef CAS PubMed .
  153. P. Yin, H. M. T. Choi, C. R. Calvert and N. A. Pierce, Nature, 2008, 451, 318–322 CrossRef CAS PubMed .
  154. J. P. Sadowski, C. R. Calvert, D. Y. Zhang, N. A. Pierce and P. Yin, ACS Nano, 2014, 8, 3251–3259 CrossRef CAS PubMed .
  155. Y. Zhao, F. Chen, Q. Li, L. Wang and C. Fan, Chem. Rev., 2015, 115, 12491–12545 CrossRef CAS PubMed .
  156. R. M. Dirks and N. A. Pierce, Proc. Natl. Acad. Sci. U. S. A., 2004, 101, 15275–15278 CrossRef CAS PubMed .
  157. S. Venkataraman, R. M. Dirks, P. W. K. Rothemund, E. Winfree and N. A. Pierce, Nat. Nanotechnol., 2007, 2, 490–494 CrossRef PubMed .
  158. Z. Ge, M. Lin, P. Wang, H. Pei, J. Yan, J. Shi, Q. Huang, D. He, C. Fan and X. Zuo, Anal. Chem., 2014, 86, 2124–2130 CrossRef CAS PubMed .
  159. J. Wang, J. Chao, H. Liu, S. Su, L. Wang, W. Huang, I. Willner and C. Fan, Angew. Chem., Int. Ed., 2017, 56, 2171–2175 CrossRef CAS PubMed .
  160. D. Ye, M. Li, T. Zhai, P. Song, L. Song, H. Wang, X. Mao, F. Wang, X. Zhang, Z. Ge, J. Shi, L. Wang, C. Fan, Q. Li and X. Zuo, Nat. Protoc., 2020, 15, 2163–2185 CrossRef CAS PubMed .
  161. J. Chao, J. Wang, F. Wang, X. Ouyang, E. Kopperger, H. Liu, Q. Li, J. Shi, L. Wang, J. Hu, L. Wang, W. Huang, F. C. Simmel and C. Fan, Nat. Mater., 2019, 18, 273–279 CrossRef CAS PubMed .
  162. D. Y. Zhang and E. Winfree, Nucleic Acids Res., 2010, 38, 4182–4197 CrossRef CAS PubMed .
  163. T. Song and H. Liang, J. Am. Chem. Soc., 2012, 134, 10803–10806 CrossRef CAS PubMed .
  164. D. Y. Zhang, R. F. Hariadi, H. M. T. Choi and E. Winfree, Nat. Commun., 2013, 4, 1965 CrossRef PubMed .
  165. X. Zhou, D. Yao, W. Hua, N. Huang, X. Chen, L. Li, M. He, Y. Zhang, Y. Guo, S. Xiao, F. Bian and H. Liang, Proc. Natl. Acad. Sci. U. S. A., 2020, 117, 5617–5623 CrossRef CAS PubMed .
  166. H. Zhao, X. Xiu, M. Li, S. Dai, M. Gou, L. Tao, X. Zuo, C. Fan, Z. Tian and P. Song, Angew. Chem., Int. Ed., 2023, 62, e202214450 CrossRef CAS PubMed .
  167. Y. Jiao, Y. Qiu, L. Zhang, W.-G. Liu, H. Mao, H. Chen, Y. Feng, K. Cai, D. Shen, B. Song, X.-Y. Chen, X. Li, X. Zhao, R. M. Young, C. L. Stern, M. R. Wasielewski, R. D. Astumian, W. A. Goddard and J. F. Stoddart, Nature, 2022, 603, 265–270 CrossRef CAS PubMed .
  168. Y. Jiao and J. F. Stoddart, Tetrahedron, 2022, 126, 133065 CrossRef CAS .
  169. L. Zhao, L. Jiang, Y. Han, Z. Xian, J. Huang and Y. Yan, Soft Matter, 2013, 9, 7710–7717 RSC .
  170. Y. Wang, Y. Sun, P. Shi, M. M. Sartin, X. Lin, P. Zhang, H. Fang, P. Peng, Z. Tian and X. Cao, Chem. Sci., 2019, 10, 8076–8082 RSC .
  171. W. Zhi, Z. Pu, C. Ma, K. Liu, X. Wang, J. Huang, Y. Xiao and Y. Yan, ACS Nano, 2021, 15, 19621–19628 CrossRef CAS PubMed .
  172. Z.-C. Lei, X. Yin, X. Wang, G. Ke, X. Cao, C. Fan, C. J. Yang, H. Liang and Z.-Q. Tian, Giant, 2020, 1, 100008 CrossRef .
  173. F.-H. Ma, C. Li, Y. Liu and L. Shi, Adv. Mater., 2020, 32, 1805945 CrossRef CAS PubMed .
  174. T. Nishimura and K. Akiyoshi, Bioconjugate Chem., 2020, 31, 1259–1267 CrossRef CAS PubMed .
  175. L. Xie, H. Jiang, D. Li, M. Liu, Y. Ding, Y. Liu, X. Li, X. Li, H. Zhang, Z. Hou, Y. Luo, L. Chi, X. Qiu and W. Xu, ACS Nano, 2020, 14, 10680–10687 CrossRef CAS PubMed .
  176. C. Li, Z. Xu, Y. Zhang, J. Li, N. Xue, R. Li, M. Zhong, T. Wu, Y. Wang, N. Li, Z. Shen, S. Hou, R. Berndt, Y. Wang and S. Gao, Natl. Sci. Rev., 2023, 7, nwad088 CrossRef PubMed .
  177. P. W. Anderson, Science, 1972, 177, 393–396 CrossRef CAS PubMed .
  178. S. Strogatz, S. Walker, J. M. Yeomans, C. Tarnita, E. Arcaute, M. De Domenico, O. Artime and K.-I. Goh, Nat. Rev. Phys., 2022, 4, 508–510 CrossRef .
  179. J.-M. Lehn, Angew. Chem., Int. Ed., 2013, 52, 2836–2850 CrossRef CAS PubMed .
  180. R. F. Ludlow and S. Otto, Chem. Soc. Rev., 2007, 37, 101–108 RSC .
  181. G. Ashkenasy, T. M. Hermans, S. Otto and A. F. Taylor, Chem. Soc. Rev., 2017, 46, 2543–2554 RSC .
  182. E. Watson, L. S. Yilmaz and A. J. M. Walhout, Annu. Rev. Genet., 2015, 49, 553–575 CrossRef CAS PubMed .
  183. J. Meeuwissen and J. N. H. Reek, Nat. Chem., 2010, 2, 615–621 CrossRef CAS PubMed .
  184. M. Raynal, P. Ballester, A. Vidal-Ferran and P. W. N. M. van Leeuwen, Chem. Soc. Rev., 2014, 43, 1660–1733 RSC .
  185. M. Raynal, P. Ballester, A. Vidal-Ferran and P. W. N. M. van Leeuwen, Chem. Soc. Rev., 2014, 43, 1734–1787 RSC .
  186. W. R. Wikoff, L. Liljas, R. L. Duda, H. Tsuruta, R. W. Hendrix and J. E. Johnson, Science, 2000, 289, 2129–2133 CrossRef CAS PubMed .
  187. B. Liu, Y. Qiu, E. C. Goonetilleke and X. Huang, MRS Bull., 2022, 47, 958–966 CrossRef .
  188. H. Jung, R. Covino, A. Arjun, C. Leitold, C. Dellago, P. G. Bolhuis and G. Hummer, Nat. Comput. Sci., 2023, 3, 334–345 CrossRef PubMed .
  189. A. W. Senior, R. Evans, J. Jumper, J. Kirkpatrick, L. Sifre, T. Green, C. Qin, A. Žídek, A. W. R. Nelson, A. Bridgland, H. Penedones, S. Petersen, K. Simonyan, S. Crossan, P. Kohli, D. T. Jones, D. Silver, K. Kavukcuoglu and D. Hassabis, Nature, 2020, 577, 706–710 CrossRef CAS PubMed .
  190. J. Jumper, R. Evans, A. Pritzel, T. Green, M. Figurnov, O. Ronneberger, K. Tunyasuvunakool, R. Bates, A. Žídek, A. Potapenko, A. Bridgland, C. Meyer, S. A. A. Kohl, A. J. Ballard, A. Cowie, B. Romera-Paredes, S. Nikolov, R. Jain, J. Adler, T. Back, S. Petersen, D. Reiman, E. Clancy, M. Zielinski, M. Steinegger, M. Pacholska, T. Berghammer, S. Bodenstein, D. Silver, O. Vinyals, A. W. Senior, K. Kavukcuoglu, P. Kohli and D. Hassabis, Nature, 2021, 596, 583–589 CrossRef CAS PubMed .
  191. B. I. M. Wicky, L. F. Milles, A. Courbet, R. J. Ragotte, J. Dauparas, E. Kinfu, S. Tipps, R. D. Kibler, M. Baek, F. DiMaio, X. Li, L. Carter, A. Kang, H. Nguyen, A. K. Bera and D. Baker, Science, 2022, 378, 56–61 CrossRef CAS PubMed .
  192. R. Evans, M. O’Neill, A. Pritzel, N. Antropova, A. Senior, T. Green, A. Žídek, R. Bates, S. Blackwell, J. Yim, O. Ronneberger, S. Bodenstein, M. Zielinski, A. Bridgland, A. Potapenko, A. Cowie, K. Tunyasuvunakool, R. Jain, E. Clancy, P. Kohli, J. Jumper and D. Hassabis, BioRxiv, 2022, preprint,  DOI:10.1101/2021.10.04.463034.
  193. Q. Cong, I. Anishchenko, S. Ovchinnikov and D. Baker, Science, 2019, 365, 185–189 CrossRef CAS PubMed .
  194. P. Bryant, G. Pozzati and A. Elofsson, Nat. Commun., 2022, 13, 1265 CrossRef CAS PubMed .
  195. C. Xu and S. A. Jackson, Genome Biol., 2019, 20, 76 CrossRef PubMed .
  196. D. Chicco, BioData Min., 2017, 10, 35 CrossRef PubMed .
  197. K. Sun, W. Xie, L. Chen, W. Domcke and M. F. Gelin, J. Chem. Phys., 2020, 153, 174111 CrossRef CAS PubMed .
  198. F. Noé, A. Tkatchenko, K.-R. Müller and C. Clementi, Annu. Rev. Phys. Chem., 2020, 71, 361–390 CrossRef PubMed .
  199. S. Doerr, M. Majewski, A. Pérez, A. Krämer, C. Clementi, F. Noe, T. Giorgino and G. De Fabritiis, J. Chem. Theory Comput., 2021, 17, 2355–2363 CrossRef CAS PubMed .
  200. S. Bubeck, V. Chandrasekaran, R. Eldan, J. Gehrke, E. Horvitz, E. Kamar, P. Lee, Y. T. Lee, Y. Li, S. Lundberg, H. Nori, H. Palangi, M. T. Ribeiro and Y. Zhang, arXiv, 2023, preprint, arXiv.2303.12712,  DOI:10.48550/arXiv.2303.12712.
  201. Z. Zheng, O. Zhang, C. Borgs, J. T. Chayes and O. M. Yaghi, J. Am. Chem. Soc., 2023, 145, 18048–18062 CrossRef CAS PubMed .
  202. A. Madani, B. Krause, E. R. Greene, S. Subramanian, B. P. Mohr, J. M. Holton, J. L. Olmos, C. Xiong, Z. Z. Sun, R. Socher, J. S. Fraser and N. Naik, Nat. Biotechnol., 2023, 1–8 Search PubMed .
  203. B. Burger, P. M. Maffettone, V. V. Gusev, C. M. Aitchison, Y. Bai, X. Wang, X. Li, B. M. Alston, B. Li, R. Clowes, N. Rankin, B. Harris, R. S. Sprick and A. I. Cooper, Nature, 2020, 583, 237–241 CrossRef CAS PubMed .
  204. M. Abolhasani and E. Kumacheva, Nat. Synth., 2023, 2, 483–492 CrossRef .
  205. J. Liu and J. E. Hein, Nat. Synth., 2023, 2, 464–466 CrossRef .
  206. M. Thomas and R. Schwartz, BMC Syst. Biol., 2018, 12, 65 CrossRef PubMed .
  207. B. Shahriari, K. Swersky, Z. Wang, R. P. Adams and N. de Freitas, Proc. IEEE, 2016, 104, 148–175 Search PubMed .
  208. N. Krasnogor, G. Terrazas, D. A. Pelta and G. Ochoa, in Artificial Evolution, ed. E.-G. Talbi, P. Liardet, P. Collet, E. Lutton and M. Schoenauer, Springer, Berlin, Heidelberg, 2006, pp. 179–188 Search PubMed .

Footnotes

This review is dedicated to the memory of Prof. Haojun Liang.
Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d3cs00634d
§ These authors contributed equally.

This journal is © The Royal Society of Chemistry 2024
Click here to see how this site uses Cookies. View our privacy policy here.