CCL Home Page
Up Directory CCL arms.readme
................ SHORT DOC .............................................

ARMS: Spatial Alignment with the RMS (Root Mean Square) method.

Reference of the RMS numerical algorithm:
[1] M. Petitjean, On the Root Mean Square Quantitative Chirality and
    Quantitative Symmetry Measures, J.Math.Phys. 1999,40[9],4587-4595
    (see appendix of ref. [1])

Reference of the SDM algorithm:
[2] M. Petitjean, Interactive Maximal Common 3D Substructure Searching
    with the Combined SDM/RMS Algorithm, Comput.Chem. 1998,22[6],463-465
 
Author email: petitjean@itodys.jussieu.fr

ARMS reads the cartesian coordinates of two molecules, then optimally
rotates and translates the molecule 2 onto the molecule 1.
A pairwise correspondence between two equal length ranges of atoms
is input. The maximal common 3D motif is computed after alignment,
with the SDM algorithm.

The two input molecules should be concatenated into a single file
prior execution.


Input data and parameters:
-------------------------

INPUT  FORMAT:
  CAS : Reserved for internal purposes
  HIN : Hyperchem-type files
  MDL : Cambridge Crystallographic Model files
  ML2 : SYBYL Mol2 files
  PDB : Protein Data Bank or Nucleic Acid Data Bank files
        (only HEADER, ATOM, ENDMDL and END records are recognized)
  BIO : Biosym (MSI) files
  ISU : Reserved for internal purposes

INPUT  MOLEC FILE NAME: name of the input file containing both molecules

OUTPUT MOLEC FILE NAME: name of the output file containing the optimally
  rotated and translated molecule 2

IMOL1: sequential position number of molecule 1 in the input molecules file

IMOL2: sequential position number of molecule 2 in the input molecules file

IAT1: first atom in molecule 1 to be paired

IAT2: first atom in molecule 2 to be paired

LENGTH: number of atom pairs.
  Atoms IAT1 to IAT1-1+LENGTH are paired with atoms IAT2 to IAT2-1+LENGTH

CUT-OFF DIST:
  This parameter does NOT affect the results. It saves space and time.
  As a rule of thumb, this value should be roughly near a bondlength.
  E.g. about 1.5 to 2 for small inorganic molecules, 0.9 to 1.2 for full
  proteins, 4 to 5 for C-alpha protein backbones).

TRANSL: enter 'N' or 'n' to disable translations.
        Any other input enables translations


Output results:
--------------

The r.m.s. between the input pairs, and the optimal rotation and translation.
The size of the common 3D motif, followed by the recomputed
pairwise correspondence between the atoms in the common motif.

The new coordinates of the optimally rotated and translated molecule 2.


Remarks:
-------

The number of atoms is currently limited to 15000 for each molecule.
The source has to be recompiled to read larger molecules.

Many applications of the method involve protein backbones only,
although most PDB files contain other atoms. Building a PDB file
containing only alpha carbons involve the following steps:
- extract alpha carbons with editor and put them in a separate file.
  For unix/linux systems, it is possible to use the command:
  grep "  CA  " full_pdb_file > backbone_only_pdb_file
- edit the new file and add the adequate HEADER and END records,
  respectively at the beginning and the end of each protein backbone.
- concatenate protein backbones files in a single file.

The initial pairwise correspondence is set by the user.
An automatic search of the best input pairwise correspondence without
sequential numbering assumption is possible with the CSR freeware,
also available from the author (see ref. [2]).

The output optimal translation is splitted in two parts: one before
rotation (to be substracted), and one after rotation (to be added).

The generated file containing the output moved molecule 2 is empty for
CAS, MDL and BIO formats, and the message "EERCO2 = 1" is displayed.

................ END SHORT DOC .........................................
Modified: Sat Jan 4 06:14:14 2003 GMT
Page accessed 6180 times since Sun Jan 19 01:44:22 2003 GMT