You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: structure/symmetry.md
+61-45Lines changed: 61 additions & 45 deletions
Original file line number
Diff line number
Diff line change
@@ -3,65 +3,61 @@ Protein Symmetry using BioJava
3
3
4
4
BioJava can be used to detect, analyze, and visualize **symmetry** and
5
5
**pseudo-symmetry** in the **quaternary** (biological assembly) and tertiary
6
-
(**internal**) structural levels.
6
+
(**internal**) structural levels of proteins.
7
7
8
8
## Quaternary Symmetry
9
9
10
-
The **quaternary symmetry** of a structure defines the relations between
11
-
its individual chains or groups of chains. For a more extensive explanation
12
-
about symmetery visit the [PDB help page](http://www.rcsb.org/pdb/staticHelp.do?p=help/viewers/jmol_symmetry_view.html).
10
+
The **quaternary symmetry** of a structure defines the relation and arrangement of the individual chains or groups of chains that are part of a biological assembly.
11
+
For a more exhaustive explanation about protein quaternary symmetery and the different types visit the [PDB help page](http://www.rcsb.org/pdb/staticHelp.do?p=help/viewers/jmol_symmetry_view.html).
13
12
14
-
In the **quaternary symmetry** detection problem, we are given a set of chains
15
-
with its `Atom` coordinates and we are asked to find the higest overall symmetry that
16
-
relates them. The solution is divided into the following steps:
13
+
In the **quaternary symmetry** detection problem, we are given a set of chains (subunits) that are part of a biological assembly as input, defined by their atomic coordinates, and we are required to find the higest overall symmetry group that
14
+
relates them as ouptut.
15
+
The solution is divided into the following steps:
17
16
18
17
1. First, we need to identify the chains that are identical (or similar
19
-
in the pseudo-symmetry case). For that, we perform a pairwise alignment of all
20
-
chains and determine**clusters of identical chains**.
21
-
2. Next, we reduce the each chains to a single point, its**centroid** (center of mass).
22
-
3.After that, we try different **symmetry relations** to superimpose the chain centroids
23
-
and obtain their RMSD.
24
-
4.At last, based on the parameters (cutoffs), we determine the **overall symmetry** of the
18
+
in the pseudo-symmetry case). For that purpose, we perform a pairwise alignment of all
19
+
chains and identify**clusters of identical or similar subunits**.
20
+
2. Next, we reduce each of the polypeptide chains to a single point, their**centroid** (center of mass).
21
+
3.Afterwards, we try different **symmetry operations** using a grid search to superimpose the chain centroids
22
+
and score them using the RMSD.
23
+
4.Finally, based on the parameters (cutoffs), we determine the **overall symmetry** of the
25
24
structure, with the symmetry relations obtained in the previous step.
26
25
5. In case of asymmetric structure, we discard combinatorially a number of chains and try
27
-
to detect any **local symmetries** present.
26
+
to detect any **local symmetries** present (symmetry that does not involve all subunits of the biological assembly).
28
27
29
28
The **quaternary symmetry** detection algorithm is implemented in the biojava class
See also the [demo](https://github.com/biojava/biojava/blob/885600670be75b7f6bc5216bff52a93f43fff09e/biojava-structure/src/main/java/demo/DemoSymmetry.java#L37-L59) provided in **BioJava** for a real case working example.
It returns a MultipleAlignment, see the explanation of the model in [Data Models](alignment-data-model.md),
126
-
that describes the internal subunits multiple alignment. In case of no symmetry detected, the
120
+
It returns a `MultipleAlignment` object, see the explanation of the model in [Data Models](alignment-data-model.md),
121
+
that describes the similarity of the internal repeats. In case of no symmetry detected, the
127
122
returned alignment represents the optimal self-alignment produced by the first step of the **CE-Symm**
128
123
algorithm.
129
124
@@ -202,22 +197,43 @@ The subunit display highlights the differences and similarities between the symm
202
197
related subunits of the chain, and helps the user to identify conseved and divergent
203
198
regions, with the help of the *Sequence Alignment Panel*.
204
199
205
-
## Combined Global Symmetry
200
+
## Quaternary + Internal Overall Symmetry
206
201
207
-
Finally, the internal and quaternary symmetries can be combined to obtain the global
202
+
Finally, the internal and quaternary symmetries can be merged to obtain the
208
203
overall combined symmetry. As we have seen before, the protein 1VYM is a DNA-clamp that
209
-
has three chains relates by C3 symmetry. Each chain is internally C2 symmetric, and each
210
-
part of the C2 internal symmetry is C2 symmetric, so a case of **hierarchical symmetry**
211
-
(C2 + C2). Once we have divided the whole structure into its asymmetric parts, we can
212
-
analyze the global symmetry that related each one of them. The interesting result is that
213
-
in some cases, the internal symmetry **multiplies** the point group of the quaternary symmetry.
214
-
What seemed a C3 + C2 + C2 is combined into a D6 overall symmetry, as we can see in the figure
215
-
below:
204
+
has three chains arranged in a C3 symmetry.
205
+
Each chain is internally fourfold symmetric with two levels of symmetry. We can analyze the overall symmetry of the structure by considering together the C3 quaternary symmetry and the fourfold internal symmetry.
206
+
In this case, the internal symmetry **augments** the point group of the quaternary symmetry to a D6 overall symmetry, as we can see in the figure below:
216
207
217
208

218
209
219
-
These results can give hints about the function and evolution of proteins and biological
220
-
structures.
210
+
An example of how to toggle the **combined symmetry** (quaternary + internal symmetries) programatically is shown below:
211
+
212
+
```java
213
+
// First download the structure in the biological assembly form
See also the [test](https://github.com/biocryst/biojava/blob/df22da37a86a0dba3fb35bee7e17300d402ab469/biojava-integrationtest/src/test/java/org/biojava/nbio/structure/test/symmetry/TestQuatSymmetryDetectorExamples.java#L167-L192) provided in **BioJava** for a real case working example.
0 commit comments