|
| 1 | +# Finding contacts between atoms in a protein: contact maps |
| 2 | + |
| 3 | +Contacts are a useful tool to analyse protein structures. They simplify the 3-Dimensional view of the structures into a 2-Dimensional set of contacts between its atoms or its residues. The representation of the contacts in a matrix is known as the contact map. Many protein structure analysis and prediction efforts are done by using contacts. For instance they can be useful for: |
| 4 | + |
| 5 | ++ development of structural alignment algorithms [Holm 1993][] [Caprara 2004][] |
| 6 | ++ automatic domain identification [Alexandrov 2003][] [Emmert-Streib 2007][] |
| 7 | ++ structural modelling by extraction of contact-based empirical potentials [Benkert 2008][] |
| 8 | ++ structure prediction via contact prediction from sequence information [Jones 2012][] |
| 9 | + |
| 10 | +## Getting the contact map of a protein chain |
| 11 | + |
| 12 | +This code snippet will produce the set of contacts between all C alpha atoms for chain A of PDB entry [1SMT](http://www.rcsb.org/pdb/explore.do?structureId=1SMT): |
| 13 | + |
| 14 | +```java |
| 15 | + AtomCache cache = new AtomCache(); |
| 16 | + StructureIO.setAtomCache(cache); |
| 17 | + |
| 18 | + Structure structure = StructureIO.getStructure("1SMT"); |
| 19 | + |
| 20 | + Chain chain = structure.getChainByPDB("A"); |
| 21 | + |
| 22 | + // we want contacts between Calpha atoms only |
| 23 | + String[] atoms = {" CA "}; |
| 24 | + // the distance cutoff we use is 8A |
| 25 | + AtomContactSet contacts = StructureTools.getAtomsInContact(chain, atoms, 8.0); |
| 26 | + |
| 27 | + System.out.println("Total number of CA-CA contacts: "+contacts.size()); |
| 28 | + |
| 29 | + |
| 30 | +``` |
| 31 | + |
| 32 | +The algorithm to find the contacts uses geometric hashing without need to calculate a full distance matrix, thus it scales nicely. |
| 33 | + |
| 34 | +## Getting the contacts between two protein chains |
| 35 | + |
| 36 | +One can also find the contacting atoms between two protein chains. For instance the following code finds the contacts between the first 2 chains of PDB entry [1SMT](http://www.rcsb.org/pdb/explore.do?structureId=1SMT): |
| 37 | + |
| 38 | +```java |
| 39 | + AtomCache cache = new AtomCache(); |
| 40 | + StructureIO.setAtomCache(cache); |
| 41 | + |
| 42 | + Structure structure = StructureIO.getStructure("1SMT"); |
| 43 | + |
| 44 | + AtomContactSet contacts = |
| 45 | + StructureTools.getAtomsInContact(structure.getChain(0), structure.getChain(1), 5, false); |
| 46 | + |
| 47 | + System.out.println("Total number of atom contacts: "+contacts.size()); |
| 48 | + |
| 49 | + // the list of atom contacts can be reduced to a list of contacts between groups: |
| 50 | + GroupContactSet groupContacts = new GroupContactSet(contacts); |
| 51 | +``` |
| 52 | + |
| 53 | + |
| 54 | +See [DemoContacts](https://github.com/biojava/biojava/blob/master/biojava3-structure/src/main/java/demo/DemoContacts.java) for a fully working demo of the examples above. |
| 55 | + |
| 56 | + |
| 57 | + |
| 58 | +[Holm 1993]: http://www.biomedcentral.com/pubmed/8377180 |
| 59 | +[Caprara 2004]: http://www.biomedcentral.com/pubmed/15072687 |
| 60 | +[Alexandrov 2003]: http://www.biomedcentral.com/pubmed/12584135 |
| 61 | +[Emmert-Streib 2007]: http://www.biomedcentral.com/pubmed/17608939 |
| 62 | +[Benkert 2008]: http://www.biomedcentral.com/pubmed/17932912 |
| 63 | +[Jones 2012]: http://www.ncbi.nlm.nih.gov/pubmed/22101153 |
0 commit comments