Expo Help
Expo Help
Table of contents
2 / 126
EXPO2014
General Information
EXPO2014
The program EXPO2014 is the evolution of EXPO2009. EXPO2014 performs all
the steps of the ab-initio structure solution by powder data: indexing, space group
determination, estimation of the reflection integrated intensities, structure solution by
Direct/Patterson methods or/and by direct space/hybrid approaches, model refinement by
Rietveld technique.
The program has been designed to:
· require a minimal information as input
· work automatically
· reduce the user intervention and facilitate the interaction by means of a user-friendly
graphic interface.
Program specification
X-ray, synchrotron or neutron diffraction data can be used. There are no limits on the
number of counts in the pattern. The maximum number of reflections is 3000.
· Direct Methods for phasing the reflections. The electron density map is calculated and
then optimized.
· Direct Space Methods for generating trial structure models (compatible with the
expected molecular geometry) and selecting that one providing the best agreement
with the experimental data.
3 / 126
EXPO2014
Rietveld Refinement
The structure model is refined by Rietveld method.
Files generated by EXPO2014
EXPO2014 creates the following files (in the work directory):
- a direct access file to preserve data and results (name.bin);
- an ascii file containing the Miller indices (hkl) and integrated intensities of the generated
reflections (name.hkl);
- an ascii file to store peak positions if the indexing process has been carried out
(name.pea);
- output file containing information about all the steps of structure determination (name.out).
Authors
Created with the Personal Edition of HelpNDoc: Produce electronic books easily
EXPO2014 installation
EXPO2014 is built utilizing OpenGL for three-dimensional molecular visualization and the
GTK for the graphic user interface. All binary packages are pre-compiled by using the Intel
4 / 126
EXPO2014
Fortran Composer XE 2013 and the GNU C++ compiler. Expo2014 runs on the most
popular operating systems: Windows, Apple's Mac OS and some Linux distributions.
On Windows platforms
Both 32- and 64-bit versions of EXPO2014 are available for Windows. They were tested
on Windows 7, Vista, and XP.
To get the 32-bit or 64-bit version of EXPO2014 for Windows, download the file expo-
020513_install.exe or expo-020513-x64_install.exe from
http://www.ba.ic.cnr.it/content/expo2013
Note that the 32-bit version of EXPO2014 can also be used on 64-bit Windows. To install
the software, double click on the downloaded file and follow the instructions.
The program can be launched by double click on the icon of EXPO2014
Mac OS X
The Mac OS X version is contained in a file .dmg with a disc image format.
To do cleaning:
· Close any Finder Windows
· Eject the disk image (not the .dmg file).
· Delete the .dmg file by dragging it to the trash.
5 / 126
EXPO2014
On Linux platforms
Installing from binary package
Debian and Red Hat RPM packages are available via the download page.
· Install the .deb package typing from the terminal
sudo dpkg -i expo_1.13.05_ubuntu12.10_amd64.deb
2. Type the expo command. Read the section command line usage for more details.
6 / 126
EXPO2014
On a Windows PC to run EXPO2014 simply typing expo, the environmental variable PATH
must be set to include absolute path of the directory where EXPO2014 is placed.
On Linux, if the program has been installed from source files (file expo_1.13.05.tar.gz)
create a symbolic link with 'ln -s' of EXPO2014 in a directory where environment variable
path is set, or declare the path with the shell command `export
7 / 126
EXPO2014
Running examples
Test structures can be found in the folder examples. On Windows platform this folder is
typically located in C:\Users\username\share\expo\examples and on linux
operating system in /home/username/share/examples. To run the program select
the test structures by means of the menu File > Load Examples
Three test structures are supplied and described in the section Tutorials.
Contact
Created with the Personal Edition of HelpNDoc: Free help authoring tool
Tutorials
Tutorial 1 - Crystal structure determination of Cimetidine compound
The first tutorial is supplied for carrying out the full pathway of the ab-initio structure solution
process (from indexing up to the structure solution) in case of cimetidine.
The only required information by EXPO2014 is the diffraction data and the chemical
formula (C10H16N6S).
The examples directory contains the necessary files:
· cime.exp: input file for EXPO2014
· cime.dat: a synchrotron X-ray diffraction data file containing the initial 2-theta, the 2-
theta step and the final 2-theta values (in the first line) and the experimental counts (in
the rest of lines).
8 / 126
EXPO2014
%structure cime
%job cimetidine (C10H16N6S) -- Synchrotron data
%data
pattern cime.pow
wavelength 1.52904
synchrotron
%ntreor
%continue
It contains the minimal required information (name of data file, wavelength and type of
radiation used to collect the data if different from X-rays) and commands for the ab-initio
structure solution. This file can be visualized and eventually modified by the 'Edit' button.
More information about the creation of input file can be found in the section Preparing input
file.
Clicking on OK the powder diffraction data will be displayed.
9 / 126
EXPO2014
Selecting Next, an automatic peak search procedure will be applied and the d-values
corresponding to the selected experimental peaks will be automatically supplied to the
indexing procedure. At the end of this process EXPO2014 will provide the indexing results
by graphic interface:
Click on OK to accept the selected plausible cell parameters and to move to the next step:
the space group determination.
The space group determination step, as well as Direct Methods, requires the unit cell
content information. It can be easily derived by taking into account the structure formula and
the cell volume (Vcell) provided by the window above or by the output file of EXPO2014.
The cell content is given by Z multiplied by the structure formula. You can calculate Z as the
ratio between Vcell and the molecular volume (Vmol): Z = Vcell/Vmol.
Vmol can be calculated by one of the two following alternative ways:
1. Using the 18 Å3 rule: counting all the non-hydrogen atoms in the chemical formula and
10 / 126
EXPO2014
Click on OK in order to perform the space group determination and then on Next button to
go on continuously. At the end of the procedure a list of the Extinction Groups, compatible
with the lattice symmetry and ranked according to a probability value, is given:
the most plausibile Extinction Group P 1 21/a 1 is automatically selected and by clicking
on OK the space group P 21/a, corresponding to this extinction group, will be chosen.
After the determination of the space group, EXPO2014 starts with the extraction step
which is carried out in the selected space group. Click on Next button continuously. At the
end of the process a list of integrated intensity values, extracted from the experimental
powder pattern, is obtained and automatically supplied to Direct Methods for structure
solution.
The final result will be visualized on JAV Molecular Viewer.
11 / 126
EXPO2014
Some errors in the assignment of carbon or nitrogen atomic species can be easily
recognized and corrected by using the option 'Modify' from JAV menu.
The aim of this example is to guide the user in case of failure of a default run of
EXPO2014.
%Structure merca
%Job 2-Mercaptobenzoic acid
%Data
Cell 7.885 5.976 14.949 90.0 100.48 90
SpaceGroup p 21/c
Content C 28 O 8 S 4 H 24
Pattern merca.xy
Wavelength 1.54056
%continue
In this example it is assumed that the cell and space group have been already determined
12 / 126
EXPO2014
c) Clicking on GO
At the end of the procedure, EXPO2014 will rank the phasing trials (and the corresponding
structure models) according to the increasing RF agreement factor values.
The best solution (corresponding to the lowest RF value) will be selected and displayed.
More information about the strategies to apply when Direct Methods fail are described in
the next section of this manual.
The purpose of this tutorial is to guide the user through the crystal structure solution by
direct space methods of paracetamol (form I polymorph) compound.
13 / 126
EXPO2014
Open EXPO2014, select Load Examples and load the input file 'paracetamol.exp'.
Clicking on Next button, the file paracetamol.mol, containing a 3D molecular description of
paracetamol, will be loaded and displayed and the following window will appear:
Select Execute; the Simulated Annealing procedure will be applied performing 10 runs
whose corresponding final structure models can be explored by the button Solutions
References
Cimetidine - Cernik, R.J., Cheetham, A.K., Prout, C.K., Watkin, D.J., Wilkinson, A.P.,
Willis, B.T.M. (1991). J. Appl. Cryst. 24, 222-226.
Paracetamol (form I polymorph) - Florence, A. J., Shankland, N., Shankland, K., David,
W. I. F, Pidcock, E., Xu, X., Johnston, A., Kennedy, A. R., Cox, P. J., Evans, J. S. O.,
Steele, G., Cosgrove, S. D. & Frampton, C. S. (2005). J. Appl. Cryst. 38, 249-259.
14 / 126
EXPO2014
To run EXPO2014 you need of an input file (*.exp). Two possibilities are available:
1) to create a new input file; 2) to use an already existing input file.
Possibility n.1): creation of an input file by selecting, via graphic interface, the creation of a
new project by the menu "File">"New".
1. You must specify the structure name. This name will be used to create the name of
output file.
2. Load the profile counts file. The following powder diffraction data file can be imported:
ASCII XY files (*.xy), CIF powder files (*.rtv, *.cif), GSAS files (*.qda), CCDC Mercury
15 / 126
EXPO2014
xye (*.xye). Use the extension *.pow to indicate an ASCII xy file or an ASCII file with
2theta min, 2theta step, 2theta max in the first row and counts in free format in the
subsequent lines.
3. Select the 'Range' button if you want to change the 2theta min and/or the 2theta max.
4. Input the wavelength and the type of radiation source (X rays source is the default
choice).
5. To carry out the space group determination it is necessary to activate the check button
'Find Space Group'.
6. For indexing, it is necessary to click on button 'Cell Determination by N-TREOR'.
7. You must provide the content of unit cell. This information is important for the space
group determination and for the structure solution by Direct Methods. A valid molecular
formula for the cell content must meet the following criteria:
o Elements are represented by valid atomic symbols (IUPAC-nomenclature) each
optionally followed by a number.
o It is possible to write the formula in lower-case or upper-case but please note
that 'Sb' will be interpreted as one antimony but 'SB' will be one antimony and
one boron. Spaces are also permitted (eg. 'S B', one antimony and one boron).
We suggest to write the two-letter elements with the second letter in lower-case.
o Valid operators are: ( ) [ ] { }
Examples of valid formula: [C3H5(OH)3]4, C 28 O 8 S 4 h 24
If your formula doesn't follow the required criteria, the input string will appear written in
red.
When you press the button 'Save' (9) an EXPO input file (*.exp) will be created in the same
folder of powder data file and automatically loaded by the program for the indexing. The
button 'Open' (10) is used to open (and possibly modify with a text editor) an already
existing input file.
An already existing *.exp input file can be loaded from menu 'File' > 'Load & Go'.
EXPO2014 recognizes some formats of diffraction data that should be ASCII text files:
1. Files with extension .xy or .xye containing 2theta in the first column and the
intensities in the second one. The format of counts can vary. # could be used for comment
16 / 126
EXPO2014
line.
#This is a comment
4.700 4769.000
4.714 4860.000
4.729 4794.000
4.743 4857.000
4.758 4837.000
4.772 4891.000
4.787 4788.000
4.801 4886.000
4.816 4720.000
Additional column after the second will be ignored. Optionally, the wavelength can be
included in the file as the first line.
2. Files type with extension .dat containing 2theta minimum, 2theta step, 2theta
maximum in the first line. Counts are in the successive lines separated by blanks. The
number of counts for line and the format of counts can vary.
3. Additional readable ASCII file formats are: GSAS ( *.gda ), CIF ( *.rtv, *.cif )
If the file has the extension .pow or the extension is unknown, the program tries to read
according to the format of .xy or .dat file
Created with the Personal Edition of HelpNDoc: Generate Kindle eBooks with ease
Indexing by EXPO2014
Indexing by EXPO2014
In powder diffraction the first necessary step for solving crystal structure is the indexing, i.e.
finding the unit cell parameters and reconstructing the three dimensional lattice from the
one-dimensional information of the powder diffraction pattern. The indexing is performed
by EXPO2014 via the program N-TREOR09 (Altomare et al., 2009), the evolution of the N-
TREOR software (Altomare et al., 2000). In addition EXPO2014 provides a graphical
interface for two popular and freely available indexing program: DICVOL (Boultif & Louër,
2004), McMaille (Le Bail, 2004).
17 / 126
EXPO2014
The simplest method to perform indexing by EXPO2014 is input a powder diffraction data
file by the menu File > Import Diffraction Pattern. Select the appropriate radiation type
and the used radiation wavelength.
In section format of diffraction data the possible formats of the powder diffraction data file
required by EXPO2014 are described. As example you can input the diffraction data
merca.xy which is located in the directory examples and supply wavelength value
1.54056 Å.
Then select the indexing program from Pattern > Indexing. If you choose N-TREOR09 the
dialogue box shown in the following figure opens.
18 / 126
EXPO2014
The user can modify the default values of the maximum cell volume, the maximum cell axial
lengths, the maximum number of allowed impurity lines, the minimum de Wolff figure of
merit (M20). If known you can enter the measured density with the corresponding error.
Molecular weight is not used unless density is not zero.
If you tick DICVOL06 in the frame 'Indexing Programs' you can also activate the interface
for indexing by DICVOL06. In this case EXPO2014 creates a DICVOL06 input file and
automatically will run the program. On Windows operating system the executable file
dicvol06.exe is tipically located in the folder C:\Program Files\Expo2014\bin
and corresponds to the program freely available at http://www.ccp14.ac.uk/ccp/web-
mirrors/dicvol/ . On Linux OS dicvol06.exe is located in expo_install_dir\bin
where expo_install_dir denotes the directory where you have installed the package.
Under Linux OS, the Windows Emulator Wine is required to run DICVOL06. Wine is
available in the software center of the most common linux distributions. You should read
general explanation and suggestions about indexing in the DICVOL06 documentation
available at http://cds.dl.ac.uk/cds/peakquest/dicvol06.html.
19 / 126
EXPO2014
Clicking on Pattern > Indexing > and selecting the indexing program EXPO2014
automatically performs a peak search if peaks have not been already detected. If you think
that the automatically located peaks are correct click on button Run. On the contrary, if you
need to improve the selection of peaks press 'Cancel' and read carefully the next section.
2.By choosing the tool bar button to add peaks by left click of mouse and remove
20 / 126
EXPO2014
4.Use the button Pattern > Peak List to access to a list of values associated to each
peak: 2q, d, peak intensity calculated removing the contribution of background, full width at
half maximum (FWHM). This window can be used to edit the 2q or d values and to delete
peaks. If you click on a row in the list, the corresponding peaks will be marked with tick on
the pattern.
5.Use Pattern > Modify Peaks > Load Peaks / Save peaks to load or save peaks as
2theta or d values.
21 / 126
EXPO2014
During the peak search remember that the number of peaks must be not too large; and the
indexing problem can be generally solved with the 25 first peaks (even very weak peaks
included). Peaks at low angle not clearly resolved and with an appreciable peak width
should be not included to avoid large errors in peak position.
In case of failure try to improve the indexing by different attempts, gradually eliminating
suspicious Bragg peaks (e.g., the weakest peaks or peaks on a shoulder) and check if it
results in a better indexing solution. The failure could be due to the presence of impurity
peaks and/or to the absence of one or more useful peaks. Sometimes could be very useful
to reduce the intensity threshold when weaker peaks are missed. On the other hand if you
suspect the presence of an impurity and if the impurity phase is a minority phase in the
specimen, it can be useful to reduce the selection to the most intense peaks, e.g. 16-20
peaks may be sufficient. Indexing program as N-TREOR09 and DICVOL06 are able to
successfully index even if some peaks are omitted but they can fail if a lot of dubious peaks
are included.
Submit the selected peaks to a indexing program by click on Run in the window 'Indexing
Settings', accessible by Pattern > Indexing
22 / 126
EXPO2014
Sometimes more plausible cells are suggested, they are ranked according to a figure of
merit for each indexing program. When the quality of data is not good, uncertainty in the
indexing procedure is usually manifested by the generation of more than one solution. If
several unit cells are possible, the different solutions should be carefully examined, in
particular if their figures of merit are not encouraging. Clicking on each suggested cell of
the list, the tick marks at the bottom of the experimental pattern indicating the calculated
positions of the Bragg reflections, will be displayed. The tick marks must correspond to
peaks in the pattern. When no experimental peaks are associated to the tick mark, the
corresponding reflection is probably systematically absent. On the other side, if a
diffraction peak in the pattern is not associated to any tick, the peak can be referred to an
impurity phase or the determined cell might be wrong.
The quality and the reliability of indexing, therefore, is routinely characterized by means of
various numerical figures of merit. The figures of merit reported by EXPO2014 are M20,
FOMnew only for N-TREOR09 and McM20 only for McMaille. Consult the output file of the
specific program (Pattern > Indexing > Output) to additional figures of merit (e.g. F20) for
each solution. The more reliable indexing should yield the higher value of figure of merit. In
case of M20, the rule suggested by de Wolff is the following one: if M20 > 10 and the
number of unindexed lines, among the first twenty ones, is not larger than 2 the indexing
process is reliable and substantially correct; exceptions to this rule can be possible. In
case of high quality data (e.g., synchrotron data) the M20 of the correct cell is usually much
larger than 10; in case of laboratory data the correct solution can be characterized by M20
value not so larger than 10.
The correct solution is usually characterize by a high figure of merit and all diffraction peaks
(or most of them) should be indexed.
If a program fails in indexing a powder pattern, it is possible that another program can
succeed. Similar results obtained by different programs strengthen the confidence in the
correctness of the cell. Because N-TREOR09 and DICVOL06 run usually much faster than
McMaille, it is better to apply them first.
N-TREOR09 includes some features in the default strategy: an automatic correction of the
2theta zero-shift is made; it may successfully index powder diffraction patterns containing a
few impurity peaks; starting the search in the cubic system, if an acceptable solution is not
found the program explores all the rest of crystal systems up to triclinic and stops as soon
as a plausible solution has been found and therefore, indexing in the lower symmetry unit
cells will not be performed.
In case of DICVOL06 and McMaille these features are not included in the default strategy
and should be enabled by user in the graphical interface when a preliminary indexing fails.
The search of triclinic cell can take long time depending on the indexing program.
Once the cell parameters calculation step has been carried out, you may perform the
space group determination step by selecting the cell parameters and by clicking on the OK
button. The following dialog window will appear.
23 / 126
EXPO2014
Before clicking OK, in this window you must specify the unit cell content. In the case of the
2-Mercaptobenzoic acid structure the cell volume suggests that the cell can accommodate
4 molecules of C7H6O2S (Z=4) and the cell content may be written as (C7H6O2S)4. For
cimetidine compound the cell content is (C10H16N6S)4
During the editing of the cell content, the program calculates and displays a rough estimate
of the
1. Content Volume: an approximate value of the cell volume calculated as summation of
the average atomic volumes associated to the supplied atoms. Its value is expected to be
close the cell volume calculated directly by cell dimensions and reported in the window
below the cell parameters.
2. Volume per Atom: ratio between the cell volume and number of non-hydrogen atoms. Its
value is expected to be close to 18 Å3 ( 18 Å3 rule) but sensible deviation from this ideal
value can be observed due to the presence of H-bonding or large void space (e.g. zeolites)
in the crystal structure. A reasonable range is 15-22 Å3.
3. Density: is the density calculated by the mass of supplied atoms and the cell volume (g/
cm3). Keep in mind that the density of molecular compound is relatively low and lying to the
range 0.9-2.0 g/cm3.
If more cells are available at the end of indexing they are saved and always accessible
from Pattern> Indexing> Cell List
An alternative way to perform indexing by EXPO2014 is to edit a new input file or modify
an existing input file with a text editor and save it with the extension .exp.
An example of input file for indexing of powder pattern of cimetidine is available in the
directory examples (file cime.exp). Its content is the following one:
%structure cime
%job cimetidine (C10H16N6S) -- Synchrotron data
%initialize
%data
pattern cime.dat
wavelength 1.52904
synchrotron
%ntreor
%continue
The command that has to be introduced in the input file for carrying out the indexing
24 / 126
EXPO2014
Once the input file has been created, load the .exp input file from the menu File > Load &
Go.
Use the button in the toolbar to carry out the steps of the indexing procedure.
To execute the indexing procedure from command line, type the name of powder diffraction
data file and add the option -index=indexing program. You can specify the
program(s) to use for indexing with n for N-TREOR09, d for DICVOL06 and m for McMaille
after the argument -index=.
expo merca.xy -index=ndm
the indexing of powder data merca.xy will be performed by using N-TREOR09 (n),
DICVOL06 (d), McMaille (m).
Add -wavel=wavelength to define the wavelength of radiation source. If -wavel is
omitted the default wavelength of Cu K-alpha 1 ( 1.54056 Å) is used.
25 / 126
EXPO2014
Output of indexing
The output file contains details about indexing. Use Info > View output file to
access to the output file or open the file structure_name.out with a text editor or click
on the button from the main menu.
Additional output files are created by the specific indexing programs and are accessible
through the menu Pattern > Indexing > Output
26 / 126
EXPO2014
pattern cime.dat
synchr
wave 1.52904
extefile cime.pea
%ntreor
%continue
cime.pea is an ASCII file containing the observed 'd' values associated to the selected
peaks. In this case peak search and peak refinement will be skipped. When the directive
extefile is omitted the program saves the d values associated to the peaks in a file
with extension .pea before carrying out the indexing process.
2-theta
This directive is used combined with the extefile directive to supply to the indexing
program N-TREOR09 an external file containing 2theta values of the peak positions.
search 2theta-min 2theta-max peak_number threshold
This directive is used to specify the conditions of peak search. The user has to indicate
four parameters: the search range of the peaks in 2theta (2theta-min and 2theta-
max), the number of peaks (peak_number), the intensity threshold (threshold).
Negative or zero values are ignored, it is not necessary to specify all values.
E.g.
To search the 25 most intense peaks in the range 5-80 add the following line after %data
command:
search 5 80 25
%structure diltia
%data
pattern pd_0029.xye
wavelength 1.54056
extefile diltia.pea
%ntreor
CEM = 50,
27 / 126
EXPO2014
Contact
For suggestions and bugs contact annagrazia.moliterni@ic.cnr.it.
References
Altomare, A., Campi, G., Cuocci, C., Eriksson, L., Giacovazzo, C., Moliterni, A., Rizzi, R. &
Werner, P.-E. (2009). Advances in powder diffraction pattern indexing: N-TREOR09. J.
Appl. Cryst. 42, 768-775.
Altomare, A., Giacovazzo, C., Guagliardi, A., Moliterni, A. G. G., Rizzi, R. & Werner, P.-E.
(2000). New techniques for indexing: N-TREOR in EXPO. J. Appl. Cryst. 33, 1180-1186.
Boultif, A., Louër, D., (2004). Powder pattern indexing with the dichotomy method. J. Appl.
Cryst. 37, 724-731.
Le Bail, A., (2004). Monte Carlo indexing with McMaille. Powder Diffraction 19, 249-254.
De Wolff, P. M.(1968). A simplified criterion for the reliability of a powder pattern indexing.
J. Appl. Cryst. 1, 108-113
Werner, P.-E., Eriksson, L., & Westdahl, M. (1985). TREOR, a semi-exhaustive trial-and-
error powder indexing program for all symmetries. J. Appl. Cryst. 18, 367-370.
Created with the Personal Edition of HelpNDoc: Free PDF documentation generator
28 / 126
EXPO2014
You must specify the structure name, the profile counts filename and its format, the type of
radiation source. Click on button 'Find Space Group', fill the frame 'Cell Parameters' and
'Cell Content'. The following picture is an example of input file for the crystal structure
determination in case of cimetidine structure from the indexed pattern.
Once the button 'Save' has been selected, an input file cime.exp will be created and
automatically loaded by the program for the space group determination.
%structure cime
%job structure cime
29 / 126
EXPO2014
%init
%data
pattern cime.pow
filetype counts
wave 1.52904
findspace
cell 10.69808 18.81786 6.82267 90.00000 111.28911
90.00000
cont h 64 c 40 n 24 s 4
%continue
The directive findspace enables the space group determination procedure. It is a
directive of the command %data. The unit cell is assumed to be determined by indexing of
the experimental pattern. To know the meaning of the other lines of the input file, see the
chapter "command and their use" of this manual.
Alternatively you can edit a new input file or modify an exiting input file by a text editor and
save it with extension '.exp'. Once a time the input file has been created, load the exp input
file from menu 'File' > 'Load & Go'.
30 / 126
EXPO2014
The window contains the extinction symbols ranked according to their calculated
probability. When the user clicks on one of them, the corresponding list of compatible
space groups is supplied. The frequency of the selected space group in the Cambridge
Structural Database (CSD) is reported in terms of number of occurrences in the CSD and
percentage. This information can be useful when different space groups are associated to
an extinction group.
Press the button to access to a list of reflections that, according to the selected
extinction symbol, should be systematically absent. For each reflection the following
specifications are given: the order number of reflection, the Miller index, the symmetry
operator responsible for the extinction and the corresponding extinction condition, the
probability for the extinction, the type of reflection (single or in overlap).
By default the reflections are ordered according to the extinction condition but you can
change the order selecting a different ordering criterion by clicking on the title column. The
selected reflection on a list will be marked on the pattern.
The reflections that are expected to be absent according to the selected extinction symbol
are marked by a vertical bars situated below the observed pattern, just above the set of
bars defining the reflection positions. They have different colors depending on whether the
reflection is single (or belongs to a whole extinct cluster) or in overlapping. By default, a
black vertical bar marks single systematically absent reflections, while the rest of the
absent reflections (those in overlap) are marked with a purple bar. This graphical support
helps the user to discard unreliable extinction symbols even if estimated with large
probability.
Press the 'OK' button to accept the selected space group and continue with the structure
solution process.
31 / 126
EXPO2014
The output file contains details about the space group determination procedure.
Use 'Info' > 'View output file' to access to the output file or open the file structure_name.out
with a text editor. At the end of the procedure the new input file, called
structure_name1.exp (e.g. cime1.exp) will be created. This file contains the selected
space group and can be loaded for skip the space group determination and to perform the
structure solution with Direct Methods.
Contact
For suggestions and bugs contact angela.altomare@ic.cnr.it,
annagrazia.moliterni@ic.cnr.it, rosanna.rizzi@ic.cnr.it, corrado.cuocci@ic.cnr.it.
References
Altomare, A., Caliandro, R., Camalli, M., Cuocci, C., da Silva, I., Giacovazzo, C., Moliterni,
A. G. G. & Spagna, R. (2004). Space-group determination from powder diffraction data: a
probabilistic approach. J. Appl. Cryst. 37, 957-966.
Altomare, A., Camalli, M., Cuocci, C., da Silva, I. Giacovazzo, C., Moliterni, A. G. G &
Rizzi, R. (2005). Space group determination: improvements in EXPO2004. J. Appl.
Cryst. 38, 760-767.
Altomare, A., Camalli, M., Cuocci, C., Giacovazzo, C., Moliterni, A. G. G. and Rizzi, R.
(2007). Advances
in space-group determination from powder diffraction data. J. Appl. Cryst. 40, 743-748.
Created with the Personal Edition of HelpNDoc: Generate Kindle eBooks with ease
32 / 126
EXPO2014
You must specify the structure name, the profile count filename and its format, the type of
radiation source. It is supposed that the cell parameters and the space group have been
determined before so fill the frame 'Cell Parameters' and 'Space Group'. Activate the
check button 'Structure Solution' if you intend to perform the crystal structure solution
process with Direct Methods. Otherwise use the button 'Open' to import an existing input
exp, for example you could load the file cime.exp already existing in the directory
'examples'. The following picture is an example of the input file for crystal structure
determination of the cimetidine compound.
33 / 126
EXPO2014
When you press the button 'Save' an input file cime.exp will be created and automatically
loaded by the program for the structure solution process by DM.
%structure cime
%job structure cime
%init
%data
pattern cime.pow
content s 4 c 40 n 24 h 64
wave 1.52904
cell 10.6986 18.8181 6.8246 90.000 111.284 90.000
space p 21/n
sync
filetype counts
%continue
You can known the meaning of lines of the input file by looking at the chapter "command
and their use" of this manual.
Alternatively you can edit a new input file or modify an exiting input file with a text editor and
save it with extension '.exp'. Once a time the input file has been created, load the exp input
file from menu 'File' > 'Load & Go'.
34 / 126
EXPO2014
Press the button in the toolbar and the following steps of the structure solution
process will be automatically performed:
1) Extraction of the integrated intensities from the powder diffraction pattern;
2) The integrated intensities are normalized via Wilson method (Wilson,1942);
3) Triplet invariants are calculated and their phases are estimated via P10 formula
(Cascarano et al., 1984);
4) Phasing process: more plausible sets of phases are generated whose reliability is
assessed via the CFOM figure of merit (Cascarano et al., 1992);
5) Twenty sets of phases with the largest CFOM values are stored and ranked in
decreasing CFOM order. The first set of phases is automatically selected and used for
computing an electron density map via an E-Fourier synthesis (E-map): the most intense
peaks in the E-map are located and chemically interpreted. The obtained structure model
is optimized and completed via the automatic structure model optimization.
More information about the structure solution process by Direct Methods are in the chapter
'The steps of structure solution procedure'.
35 / 126
EXPO2014
Explore trials
In a typical Direct Methods procedure more sets of phases (trials) are generated and
ranked according to the combined CFOM figure of merit (estimating the quality of the trial)
and only the best trial is used to calculate a Fourier map. Because of the unavoidable
errors on the phases, the figure of merit can fail to evaluate the best trial, so it is strictly
suggested to develop the other trials by menu 'Solve' > 'Explore trials'.
The following window will be opened (the procedure has been applied to 2-
Mercaptobenzoic acid compound, whose input file is in the directory 'examples').
and the the CFOM value can be read. The highest CFOM value could not correspond to the
correct solution while subsequent different trials may be successful. This is particularly true
when several nearly equivalent CFOM's are available for different trials as in the picture. If
the trial has been already developed (done='yes'), you can read the RF figure of merit value
that quantifies the agreement between the structure factors extracted from the experimental
pattern and calculated by the model.
Check the button in the first column to decide which trial will be developed and press the
36 / 126
EXPO2014
button If you check the button 'Select all new trials', all the trial not already
explored will be automatically selected and developed.
You can choose three different actions for the structure model optimization (see later):
1)RBM: Resolution Bias Modification is the default action for organic and metal organic
compounds.
2)Fourier: classical Fourier refinement, default choice for inorganic compounds.
3)E-map: the first electron density map calculated directly after the phasing process by
Direct Methods.
4)COVMAP: the covariance principle based completion.
At the end of the procedure, all the trials will appear ordered according to the RF and the
user can also view the model corresponding to each trial by mouse selection.
Alternatively to the graphical option, the use of the command %alltrials in the input file
automatically activate the 'Explore trials' approach. An example of input file is here
reported:
%Structure merca
%Initialize
%Job merca
%Data
Cell 7.885 5.976 14.949 90.0 100.48 90
SpaceGroup p 21/c
Content c 28 O 8 S 4 h 24
Pattern pd_0002.xy
Wavelength 1.54056
%extra
%normal
%invar
%phase
%alltrials
37 / 126
EXPO2014
The structure model obtained at the end of the default strategy, generally is a partially
correct representation of the true model: some atoms are in correct positions but other are
incorrectly positioned or completely false. In this situation the user can try to improve the
quality of the Fourier map, as obtained at the end of Direct Methods, by using different
optimization strategies implemented in the EXPO2014 program:
1. suitably weighted least squares (wLSQ) (Altomare et al., 2006), able to compensate
the low accuracy of the intensities of strongly overlapped reflections. The procedure
is automatically applied in case of inorganic compounds.
2. the resolution bias correction algorithm (RBM) (Altomare et al., 2008a, 2008b;
2009, 2010a, 2010b)for reducing, in the electron density map, the errors caused by
the limited experimental resolution: peak broadening, peak shift, intensity distorsion.
The correction has been developed in direct space (it represents the default choice
in case of organic and metal organic compounds), in reciprocal space and in both
direct and reciprocal space;
3. the procedure of electron density modification based on the concept of covariance
between points of the map (COVMAP) (Altomare et al., 2012) The procedure
executes the following strategy: the structural model provided by the RBM procedure
is submitted to COVMAP approach which suitably modifies the electron density
map and successively submits the improved model to wLSQ analysis. The resulting
new model is again modified by COVMAP and cyclically processed in the other two
steps. The number of cycles depends on the structure complexityThe structure
model obtained at the end of the default strategy can be partly correct with some
atoms in correct position but other atoms are incorrectly positioned or completely
false. In this situation the user can try to optimize del model by using different
optimization strategies.
It is possible to graphically choice different RBM procedures for model optimization by the
menu 'Resolution Bias Modification (RBM)', via 'Refine' button from the upper EXPO2014
menu.
The new method RAMM (Altomare et al., 2013) has been developed and implemented in
the EXPO2014 computing program for improving the ab initio crystal structure solution
process. When the available information consists of only the experimental powder
diffraction pattern and the chemical formula of the compound under study, the structure
solution classical approach follows two main steps: 1) phasing by Direct Methods (or by
Patterson methods) in order to obtain a structure model (this last is usually uncompleted
and/or approximate); 2) improving the model by structure optimization techniques. The
alternative RAMM approach skips step 1) and supplies a fully random model to step 2).
38 / 126
EXPO2014
Such model is then submitted to important structure optimization tools present in EXPO:
wLSQ , RBM and COVMAP which are able to lead to correct structure. RAMM is based on
a cyclic process generating several random models which are then optimized. The process
stops automatically when recognizes the correct structure.
Click 'RAMM' on the menu 'Solve' to activate this alternative strategy for structure solution
at the end of the default run of EXPO2014. The following pictures are related to 2-
Mercaptobenzoic acid whose input file is in the directory 'examples'.
The procedure can take from some minutes to several hours depending on the complexity
of problem (number of atoms, quality of data, data resolution). If more than one plausibile
solution is found they will be ordered according to the R-structure factor in an interactive
list.
The selected model will be visualized on the molecular viewer. Press 'OK' to accept the
selected model.
When Direct Methods procedure fails, the RAMM approach can be a very useful tool to find
the correct solution.
The use of the command %randomsolve in the input file automatically activate the
'Explore trials' approach. An example of input file is here reported:
%Structure merca
%Initialize
%Job merca
%Data
Cell 7.885 5.976 14.949 90.0 100.48 90
SpaceGroup p 21/c
Content c 28 O 8 S 4 h 24
39 / 126
EXPO2014
Pattern pd_0002.xy
Wavelength 1.54056
%randomsolve
Suggestions
The number of options in the program is quite large. The optional strategies regard the
improvement of both the decomposition process for carrying out more reliable reflection
intensities and the Direct Methods performances. We give some other secondary
strategies.
· Magic integers procedure is an effective and simple way for starting a multi-solution
process.
A random approach may be alternatively chosen in the phasing process (directive
'RANDOM' in the PHASE module).
· Second representation formula for triplet invariants proved much more powerful than
first representation formula therefore it is used as a default for estimating triplets. In
case of failure, the user can choose P3 formula (directive 'COCHRAN' in the
INVARIANT module) and/or conveniently increase the number of symbols in the
phasing step (directive 'SYMBOLS' followed by the number of symbols to be used in
the 'PHASE' module).
· A check for possible pseudotranslational symmetry is always made by EXPO2014: if
PSEUDO keyword is not activated the information is given but not used in the
normalization routine and in the phasing process. The user can decide to use it
(directive 'PSEUDO' in the NORMAL module).
40 / 126
EXPO2014
· the value of NREF (number of reflections actively used in the phasing process) is fixed
by EXPO2014. For some special structures the ratio "number of active triplets/NREF"
is too small (less than 10). Larger values of NREF may improve the phasing
procedure (directive 'NREFLECTIONS' followed by the number of reflections in the
NORMAL module).
· High (or low) resolution reflections may occasionally play a too important role in the
first steps of convergence and divergence process. Fixing a thermal factor lower (or
larger) than that automatically provided by the normalization routine may successfully
change convergence and divergence process (directive 'BFACTOR' followed by the
isotropic thermal factor in the NORMAL module).
· An alternative space group should be carefully considered (directive 'SPACEGROUP'
in the DATA module).
· Sometimes the structure model is partial. Successive combined Fourier map
calculations (Fo-Fc, 2Fo-Fc or Fo) and Rietveld refinements may be conveniently
selected by graphical options.
Contact
For suggestions and bugs contact:
angela.altomare@ic.cnr.it
annagrazia.moliterni@ic.cnr.it
rosanna.rizzi@ic.cnr.it
corrado.cuocci@ic.cnr.it
References
Altomare, A., Cuocci, C., Giacovazzo, C., Moliterni, A.G.G., Rizzi, R. (2006.) Powder
diffraction: the new
automatic least-squares Fourier recycling procedure in EXPO2005. J. Appl. Cryst. 39
(2006) 558-562.
Altomare, A., Cuocci, C., Giacovazzo, C., Kamel, G. S., Moliterni, A. & Rizzi, R. (2008a).
Minimally
resolution biased electron-density maps. Acta Cryst. A64, 326-336.
Altomare, A., Cuocci, C., Giacovazzo, C., Moliterni, A. & Rizzi, R. (2008b). Correcting
resolution bias in electron density maps of organic molecules derived by direct methods
from powder data. J. Appl. Cryst. 41, 592-599.
Altomare, A., Cuocci, C., Giacovazzo, C., Moliterni, A., Rizzi, R. (2009) Correcting
electron-density
resolution bias in reciprocal space. Acta Cryst. A65, 183-189.
Altomare, A., Cuocci, C., Giacovazzo, C., Moliterni, A., Rizzi, R. (2010a) The dual-space
resolution bias
correction algorithm: applications to powder data. J. Appl. Cryst. 43, 798-804.
Altomare, A., Cuocci, C., Giacovazzo, C., Moliterni, A., Rizzi, R. The dual-space resolution
bias
correction in EXPO2010. Z. Kristallogr. 225, 548-551.
A. Altomare, C. Cuocci, C. Giacovazzo, A. Moliterni and R. Rizzi (2012). Covariance and
41 / 126
EXPO2014
Using the command %continue, the program runs in default conditions carrying out the
commands
%extraction
%normal
%invar
%phase
%fourier
%menu
%end
and each command corresponds to a step in the structure solution with Direct Methods. All
the following steps are described.
42 / 126
EXPO2014
(1)
s = scale factor
mh = reflection multiplicity
Fh = structure factor
P = profile function
A = asymmetry function
background intensity at the i-th
ybi =
step.
N, the polynomial degree, is automatically defined by the program as the number which
provides the best fit of the background interpolation points.
is the associated to the minimum intensity in the range in which the background is
estimated.
From the pattern a standard peak is automatically selected. The standard peak fitting is
performed to provide the starting values of the variables to refine in the least squares
procedure.
The full pattern is automatically divided into intervals. The refined parameters derived by
the fitting of each interval will be taken into account for the final full pattern fitting.
The following profile shape functions are available for user choice:
Pearson
VII
43 / 126
EXPO2014
Gaussian
Lorentzian
P
s
e
u
d
o
-
V
o
i
g
t
where
44 / 126
EXPO2014
otherwise
The peak shape, the asymmetry, the cell and the background parameters are all refined by
the program. To overcome the tendency of the Le Bail algorithm to equipartition the
intensities of strongly overlapping reflections, a new approach was introduced. It is based
on the random variation of the reflection intensities belonging to a given cluster and on the
subsequent selection of the good fit feasible trials.
After a first default run of EXTRA, it is possible to run it again starting from the refined
variables in the previous run. In this case the preliminary standard peak procedure and the
interval by interval extraction procedure are skipped. The related command is %
RECYCLE.
Directives in extraction command
The following directives must be added after the command '% extraction' in the
input file to activate some specific features of the procedure
ALL
To activate the procedure of selection of the profile function type (among Pearson VII,
Gaussian, Lorentzian, Pseudo-Voigt) from the best fit of each interval.
DAMPING x
x is the damping factor used in the least squares routine (default value 0.1). (It is convenient
to use a small x value).
FAST
This directive is used if the graphic interface has been requested by the "window"
command. In this case, the user graphic interaction in a default run is avoided.
FIRST
To activate the procedure of selection of the profile function type (among Pearson VII,
Gaussian, Lorentzian, Pseudo-Voigt) from the best fit of the first interval .
FRECYCLE filename.ext
If the "filename" is available from a previous run of EXPO2014 (or EXTRA), or from any
source containing hkl and the corresponding F-squared values, Le Bail algorithm is started
again for obtaining new estimates.
GAUSSIAN
45 / 126
EXPO2014
INTERVAL n
n is the number of intervals in which the powder pattern is divided (max 8). n is only given if
the user wants to change the automatic division of the pattern.
LORENTZIAN
LOWB
To reduce the background by a constant factor equal to the minimum value observed count.
NCYCLE n
NOBA
NOCELL
NOSINGLE
This directive is used if the preliminary fitting of the standard peak (automatically selected
by the program) is omitted. In this case a preliminary fitting of the first interval is carried out.
OPTI
POLYNOMIAL k
k is the background polynomial degree (max 10). k is given if the user wants to use a fixed
degree in each interval and not the degree automatically selected by the program.
PVOIGT
RANDOM p
To activate the random procedure in the integrated intensity extraction step. p is the
coefficient for the reflection overlapping definition. Default value = 0.5
SMOO
To activate the procedure for calculating the background by following (Brückner, 2000). In
this case the background is not refined.
STANDARD
46 / 126
EXPO2014
To activate the procedure of selection of the profile function type (among Pearson VII,
Gaussian, Lorentzian, Pseudo-Voigt) from the best fit of the standard peak.
SLOW
This directive can be used if the graphic interface has been requested by the "window"
command. In this case the program stands by after some relevant calculations and waits
for the user graphic interaction to go on.
VERBOSE
This directive is used to have a more detailed print in the output file. In this case, the Rp
and Rwp values and the values of the refined variables are printed interval by interval.
WIDTH x
x is the factor which, multiplied by the FWHM of each peak, gives the total peak width. (In
this case the automatic calculation of x is not taken into account).
ZEROPROFILE x
x is the 2q shift value supplied by the user (to be substracted from the 2q observed
positions). In this case the automatic calculation of the 2q shift using the preliminary fitting
of the standard peak is skipped by the program).
EXAMPLES
Example 1
A Pseudo-Voigt function will be used and the background in each interval will be described by a constant.
The counts file format is not the default one and the graphic window is suppressed.
%structure mes
%job MES - data from home diffractometer
%initialize
%data
pattern mes.pow
cell 8.588 9.931 11.105 90.0 93.754 90.0
content C 24 N 4 O 20 S 4 H 52
spacegroup p 21/c
wavelength 1.5406
filetype double
%extraction
pvoigt
polynomial 0
%end
Example 2
The starting values of the integrated intensities in the extraction step are supplied by the user in the file
"yono.ext" and the number of least-square cycles is set to 40.
%structure yono
%job YONO - Synchrotron data
%initialize
%data
pattern yono.pow
cell 9.385 16.394 3.627 90.000 101.067 90.000
47 / 126
EXPO2014
content Y 8 O 26 N 2 H 18
spacegroup p 21
wavelength 1.2323
synchrotron
%extraction
fstarting yono.ext
ncycle 40
%end
Example 3
The preliminary fitting of the standard peak is skipped and the starting 2q shift value is supplied by the user
(the program refines it).
%window
%structure agpz
%job AGPZ - data from home diffractometer
%init
%data
pattern agpz.pow
cell 6.526 20.059 6.464 90.000 90.000 90.000
spacegroup p b c a
content Ag 8 N 16 C 24 H 24
wavelength 1.54056
%extraction
nosing
zeroprof 0.02
%continue
Example 4
Neutron data and a deuterium atom in the cell content are used. Fast graphic interaction for the integrated
intensities extraction process is activated.
%structure baco
%job BACO - Neutron data
%initialize
%data
neutron
isotope H 0.67
pattern baco.pow
cell 10.0632 7.9330 6.8487 90.0 122.3423 90.0
spacegroup C 2/m
content Ba 4 C 8 O 20 H 8
wavelength 1.909
%extraction
fast
%continue
References
Le Bail, A., Duroy, H., Fourquet, J.L. (1988). Ab-initio structure determination of LiSbWO6
by X-ray
powder diffraction. Math. Res. Bull. 23, 447-452.
Created with the Personal Edition of HelpNDoc: Free HTML Help documentation generator
48 / 126
EXPO2014
49 / 126
EXPO2014
Example 1
Use of the pseudotranslational symmetry (if it is present) as prior information for a new extraction process is
inhibited.
%structure agpz
%job AGPZ - data from home diffractometer
%initialize
%data
pattern agpz.pow
cell 6.526 20.059 6.464 90.000 90.000 90.000
spacegroup p b c a
content Ag 8 N 16 C 24 H 24
wavelength 1.54056
filetype xy
%extraction
%normal
nopseudo
%continue
References
Cascarano G., Giacovazzo C. & Luic' M. (1988a). Acta Cryst. A44, 176-183.
Cascarano G., Giacovazzo C. & Luic' M. (1988b). Acta Cryst. A44, 183-188
Fan H., Yao J. & Qian J. (1988). Acta Cryst. A44, 688-691.
Altomare A., Burla M.C., Cascarano G., Giacovazzo C., Guagliardi A., Moliterni, A.G.G. &
Polidori, G. (1996). J. Appl. Cryst. 29, 341-345.
Altomare A., Cascarano G., Giacovazzo C., Guagliardi A., Burla M.C., Polidori G. &
Camalli M. (1994). J. Appl. Cryst. 27, 435 - 436.
Created with the Personal Edition of HelpNDoc: What is a Help Authoring tool?
50 / 126
EXPO2014
of triplets (PSI-E triplets) based on two strong and one intermediate reflections (just below
the threshold of strong reflections) are calculated and used in the FOURIER/LEAST-
SQUARES module in order to extend phase information (Altomare et al. 1991).
Negative quartets are generated by combining the psi-zero triplets in pairs, and those with
cross-magnitudes smaller than a given threshold are estimated by means of their first
representation, as described by Giacovazzo (1976). These quartets are used to provide an
important contribution to the CPHASE FOM.
Active triplets may be estimated according to Cochran's (1955) P3 distribution: the
concentration parameter of the von Mises distribution is then:
(
3)
Triplets can also be estimated according to their second representation (i.e. P10 formula,
as described by Cascarano et al., 1984). The concentration parameter of the new von
Mises distribution is given by
G = C (1 + q) (4)
GMIN x
Positive triplets with G < x are not actively used.
Default value x=0.6 (in any case 0.2 < x < / = 1.0 is accepted range of value.
COCHRAN
51 / 126
EXPO2014
CORRECTION x
If a PSEUDOTRANSLATION procedure is on, a correction factor depending on x is
applied to the probabilistic argument of weak-weak-weak triplets when they are present.
The default value is calculated by program.
x = -1 eliminates weak-weak-weak triplets,
x = 1 corresponds to the largest confidence in them.
EMAX x
Maximum value of E for the weak reflections for the psi-zero search (up to a maximum of
NRPSIZERO reflections). x > 0.3 not accepted.
Minimum value for the normalized structure factors in the sigma2 search. x < 1.0 not
accepted.
Default = minimum value of E for the strongest reflections selected by normalization
routine.
ESTIMATION
To run the procedure which estimates the |E|'s for all the reflections and use them as a prior
information in a new extraction process.
The procedure will not be applied if PSEUDOTRANSLATIONAL information has been
recycled.
NPSI0
To not calculate PSI-0 triplets (Default uses them).
NRPSIZERO n
The number of weakest reflections for the psi-zero search.
Default = number computed by normalization routine.
The value will substitute the default one in the direct access file.
NRTRIPLETS n
The number of strongest reflections for the sigma2 search. ( Up to Emin = 1.0 )
Default = number computed by normalization routine (nstrong).
The value will substitute the default one in the direct access file.
Value less than nstrong/2 is not accepted.
NLARGE n
To supply the number of largest reflections for the psi-zero search ( default = number of
strongest reflections used for sigma2 search).
nlarge > nrtriplets is not accepted.
NQUARTETS
To not calculate negative quartets (Default uses them).
NUMK n
The number of free-vectors used by P10 formula.
Default value is provided by EXPO according to the structural complexity:
n = min ( natom / msym + 30, 70 )
where natom is the number of atoms in the cell and msym is the number of symmetry
operators of the space group.
52 / 126
EXPO2014
YPSIE
To calculate PSI-E triplets (Default doesn't use them).
EXAMPLE
Example 1
Use of the triplets estimation as prior information for a new extraction process.
%structure mes
%job MES - data from home diffractometer
%initialize
%data
pattern mes.pow
cell 8.588 9.931 11.105 90.0 93.754 90.0
content C 24 N 4 O 20 S 4 H 52
spacegroup p 21/c
wavelength 1.5406
filetype double
%extraction
%normal
nopseudo
%invariants
estimation
%continue
References
Altomare A., Cascarano G., Giacovazzo C. & Viterbo D. (1991). Acta Cryst. A47, 744-748.
Carrozzini B., Giacovazzo C., Guagliardi A., Rizzi R., Burla M.C. & Polidori G. (1997). J.
Appl. Cryst. 30, 92-97
Cascarano G., Giacovazzo C., Burla M.C., Nunzi A. & Polidori G. (1984). Acta Cryst. A40,
389-394.
Cascarano, G., Giacovazzo, C., Guagliardi, A. & Steadman, N. (1991). Acta Cryst. A47,
480-484.
Cochran W.(1955). Acta Cryst. 8, 473-478.
Giacovazzo C. (1976). Acta Cryst. A32, 958-966.
Created with the Personal Edition of HelpNDoc: Free Qt Help documentation generator
In the EXPO2014 program the most reliable one-phase s.s. are treated as known
phases. Besides triplets, also the most reliable negative quartets and two-phase s.s. may
be actively used.
Each relationship is used with its proper weight: the concentration parameter of the first
representation for quartets and two-phase s.s., and C or G for triplets.
- CONVERGENCE/DIVERGENCE PROCEDURE -
The convergence procedure (Germain et al. 1970) is a convenient way of defining an
optimum starting set of phases to be expanded by the tangent formula or by any other
algorithm.
When the P10 formula is used, as a default, a special convergence process is devised
which chooses the starting set according to
53 / 126
EXPO2014
(
5
)
(
6
)
I1 and I0 represent modified Bessel functions of order one and zero respectively. The
summation in (5) is over all relationships defining the reflection h. If P3 formula is used the
default choice is
(7)
Once the starting set has been defined, a good pathway for phase expansion is
determined by a divergence procedure. In the divergence map, starting from the reflections
in the starting set, each new reflection is linked to the preceding ones with the highest value
of < a >.
- PHASE EXTENSION AND REFINEMENT -
The starting set defined by the preceding step is usually formed by the origin (and
enantiomorph) fixing reflections, a few one phase s.s. and a number of other phases which
may be obtained:
a) by magic integer permutation (White et al. 1975; Main, 1978),
b) by a random approach (Baggio et al. 1978; Burla et al. 1992).
The option a) is the default, b) runs if the directive RANDOM is used. In this last case a
large number (depending on the available computer time) of trials can be requested.
Directives SYMBOLS, SPECIALS, MAXTRIAL may be used to change default values.
Phase expansion and refinement are carried out by means of a tangent formula using
triplets, negative quartets, psi-zero triplets and the most reliable two-phase structure
seminvariants. In the weighting scheme the experimental distributions of the alpha
parameters are forced to match with the theoretical ones (Burla et al. 1987).
For each phase set, several FOM's are computed using all invariants and seminvariants
estimated by means of the representation method. Their meaning and an optimized way of
combining all the computed FOM's to give a highly selective combined figure of merit
(CFOM) is described in the papers by Cascarano et al. (1987, 1992).
All FOM's, as well as the combined CFOM, are expected to be equal to 1.0 for correct
solutions. CFOM larger than 0.5 can be considered encouraging.
If pseudotranslational symmetry is present then CFOM > 0.3 may characterize the correct
solution.
54 / 126
EXPO2014
ENANTIOMORPH n
Code of the reflection chosen by the user to fix the enantiomorph (by default the
enantiomorph is chosen by the program ).
LIST n
The number of reflections at the top of the divergence map for which a list of the map is
obtained. Default = no printout of the map.
MAXTRIALS n
Maximum number of trials when random approach is used. (Default 100). Only if RANDOM
directive is on.
MINFOM x n
x: the program automatically stops when a solution is found with CFOM > x . Default value =
1.00.
n: is the maximum number of phase sets (with the largest combined figure of merit -
CFOM) to retain (default value = 10).
NOREJECT
Codes and phases of the reflections chosen by the user to fix the origin. The total number
of reflections (max 3) needed to fix the origin must be given (by default the origin is chosen
by the program).
PERMUTE n(i)
Codes of the permuted reflections chosen by the user (i.e. included in the number of
required symbols, max 12).
Codes, phases and weights of reflections with known phase (max 200).
If one card is not sufficient, the directive PHASE must be repeated in the other card(s).
RANDOM n
n: is the number of random phases. If n is omitted, it is equal to one half of the strong
reflections (nstrong).
55 / 126
EXPO2014
SEED n
Random generation seed. n must be an integer large value and must be odd.
SPECIALS n
The number of reflections with restricted phase to be permuted (i.e. included in the number
of required symbols, max 12).
SYMBOLS n
TABLE
To print the table of alpha values (at the beginning of PHASE procedure).
TEST
To restart the random number generator using the default seed. Only if RANDOM directive
is on.
TRIAL n
YNEG
To actively use negative triplets in the phasing process. (Default does not use them).
YNGQ
To actively use negative quartets in the phasing process. (Default does not use them).
EXAMPLE
Example 1
The program restarts from the PHASE procedure.
A random approach is used and the best 100 sets of phases, over 250 trials, are retained in the direct
access file. No Fourier is required.
%structure nizr
%job nizr structure
%init
%data
pattern nizr.pow
cell 12.388 8.9269 8.8449 90.0 90.550 90.0
cont zr 8 p 12 o 48 ni 4
56 / 126
EXPO2014
space p 21/n
wave 1.0302
filetype double
synch
%extraction
%normal
%invar
%phase
random
maxtrials 250
minfom 1.0 100
%end
References
Baggio R., Woolfson M.M., Declerq J.P. & Germain G. (1978). Acta Cryst. A34, 883-892.
Burla M.C., Cascarano G., Giacovazzo C., Nunzi A. & Polidori G. (1987). Acta Cryst. A43,
370-374.
Burla M.C., Cascarano G. & Giacovazzo C. (1992). Acta Cryst. A48, 906-912.
Cascarano G., Giacovazzo C. & Viterbo D. (1987). Acta Cryst. A43, 22-29.
Cascarano G., Giacovazzo C. & Guagliardi A. (1992). Acta Cryst. A48, 859-865.Germain
G., Main P. & Woolfson M.M. (1970). Acta Cryst. B26, 274-285.
Main P. (1978). Acta Cryst. A34, 31-38.
White P.S. & Woolfson M.M. (1975). Acta Cryst. A31, 53-56.
Created with the Personal Edition of HelpNDoc: Free EPub producer
57 / 126
EXPO2014
(8)
(9)
(10)
An alternative approach, based on real space techniques (Altomare et al. 2002) originally
used for phase extension and refinement of proteins (Giacovazzo & Siliqi, 1997; Burla et
al. 2000), has been introduced in EXPO and modified to improve the phasing process
from powder data. It combines an Electron Density Modification procedure with Fourier
map calculations.
Directives in fourier command
The following directives must be added after the command '% fourier' in the input
file to activate some specific features of the procedure
COORDINATION El dmin dmax n
The minimum and maximum value of the bond distance are assigned to element El. n is the
coordination type.
This directive is used to enable a direct space procedure consisting of an Electron Density
Modification (EDM) process followed by successive Fourier maps using (2Fobs -Fcalc)
coefficients. Except for the Fourier directive 'set', all the rest of Fourier recycling directives
are not valid for EDM procedure.
2EOEC
This directive enables to use (2Eobs-Ecalc) as Fourier coefficients in the Fourier step (if
EDM procedure is applied).
EOBS
This directive enables to use the observed normalised structure factors (Eobs) as Fourier
coefficient in the Fourier step (if EDM procedure is applied).
FRAGMENT string
58 / 126
EXPO2014
Used to supply a known fragment to be completed. String is the name of the file in which,
for each atom, are stored: Element X, Y, Z, isotropic thermal factor.
FWHM D
This directive is used in order to change the value of D that is the parameter for the
condition 2q1 - 2q2 < D*FWHM.
Default value is approximately 0.9 depending on the ratio between the number of
observations and the number of parameters to refine.
GRID x
The grid on which the Fourier-map is evaluated will have a spacing of approximately x
angstroms. The default spacing is computed by the program (1/3 Å).
INTEGRATION x n
x is the radius of integration in the peak search routine. (Default value 0.5). n may be 1 to
use the gaussian fitting of the fourier peak in the map or 2 to use the peak average density.
(Default value 2).
LAYX
This directive can be used only if MAP directive is given. The Fourier-map will be printed in
section of constant x.
LAYY
This directive can be used only if MAP directive is given. The Fourier-map will be printed in
section of constant y.
LAYZ
This directive can be used only if MAP directive is given. The Fourier-map will be printed in
section of constant z.
LEVEL n
All numerical values in the E-map greater than n will be underlined with **** in the lineprinter
output to facilitate contouring.
The default is 100. The map is automatically scaled between ± 999 (approximately) at the
grid points at which it is calculated.
Note, however, that it is printed between ± 99. (This directive can be used only if
associated to MAP directive).
LIMITS x1 x2 x3
The E-map will be printed from 0 to x1 along x, from 0 to x2 along y, from 0 to x3 along z.
(This directive can be used only if associated to MAP directive).
MAP
If MAP is specified, the E-map will be printed on the lineprinter. The printing of the E-map
is rarely used.
59 / 126
EXPO2014
The output is in sections of constant y with x across the page and z running down it on a
grid of about three points per angstrom.
The maximum number of grid points across the page is 36 ( i.e. about 12 angstroms in
length). If more are needed, a new page is started.
PEAKS n
The number of peaks to be searched for in the map. The default is the number of peaks
which
å(occ(i)*m) = 1.3 * n
where occ is the crystallographic site occupancy factor, m is the number of symmetry
operators, n is the number of non-hydrogen atoms in the unit cell.
PSE
To actively use PSI-E triplets in these procedures (Default doesn't use them). (This
directive can be used only if associated to YPSIE directive of INVARIANTS routine).
RADIUS El x
The value x of the radius used to compute the connectivity is assigned to element El.
RECYCLE n
Used to stop the recycling procedure after cycle number n. If n is equal to zero only the E-
map will be calculated.
SET n
The serial number of the set from which to calculate the E-map. If 'SET n' is not specified,
the set with the highest combined figure of merit will be used, and then if FOURIER routine
is re-run, the set with the next highest combined FOM will be used.
SOGF x
The percentage of the largest |F| value reflections actively used in the Fourier recycling
routine.
The reflections with the smallest |F| values corresponding to the rejected percentage have
a zero weight value in the refinement process. (Default value 100.0).
SOGI x
The percentage of the most intense groups of overlapping reflections actively used in the
Fourier recycling routine. (Default value 91.0).
STOPF x
This directive enables to stop the Fourier step at the cycle number x (if EDM procedure is
applied). The default value of x is 10 (the maximum allowed value).
THRE x
60 / 126
EXPO2014
x is the threshold for the E value of the reflections actively used in the Electron Density
Modification step. The default value is 0.8.
THRM x
x is the fraction of the Fourier map points actively used in the Electron Density Modification
step. The default value is 0.025.
THRW x
x is the threshold of the Bessel weight of the reflections actively used in the Fourier step (if
EDM procedure is applied). The default value is 0.5.
EXAMPLES
Example 1
Only the E-map corresponding to the set number 7 and the complete FOURIER/LEAST-SQUARES
procedure for set number 10 are requested by the user. The binary file "vfi.bin" must exist.
%structure nbpo
%fourier
set 7
recyc 0
%fourier
set 10
%continue
DLIGHT d tol
tol is the tolerance around the ideal distances. Default value = 0.5.
The program uses only one value of 'd' for all types of light atoms.
DWEIGHT d tol
61 / 126
EXPO2014
tol defines the tolerance around the ideal distances. Default value = 0.5.
The program uses only one value of 'd' for all types of heavy atoms.
COORD xi
x specifies the expected number of heavy atoms surrounding each i-th heavy atom (the
coordination type) (i = 1,2).
EXAMPLE
Example 1
Use of the label change procedure to relabel the peaks of an electron density map obtained by applying the
FOURIER procedure. The binary file "vfi.bin" must exist.
% window
% structure vfi
% changelab
label 3 al 3 p
coord 4 4
% continue
The following directives must be added after the command '% polyhedra' in the
input file to activate some specific features of the polyhedra procedure
62 / 126
EXPO2014
EXAMPLES
Example 1
The POLPO1 procedure is applied to locate the light atoms around each specified heavy atoms. The crox.fra
file contains the cations coordinates.
The binary file 'crox.bin' must exist.
% window
%structure crox
%job CROX - home diffractometer data
%polyhedra
fragment crox.fra
octahedron 1 1.92 0.2 0.2
tetrahedron 2 1.75 0.2 0.2
tetrahedron 3 1.75 0.2 0.2
tetrahedron 4 1.75 0.2 0.2
%continue
Example 2
The POLPO2 procedure is applied to locate one or more missing cations and surronding anions. The
sapo.fra file contains the located cations coordinates.
The binary file 'sapo.bin' must exist.
% window
%structure sapo
%job SAPO - data from home diffractometer
%polyhedra
fragment sapo.fra
tetrahedron Si 1 1.6 0.2 0.2
tetrahedron Si 2 1.6 0.2 0.2
missing 1 2 Si tetrahedron 1.6 0.2 0.2
63 / 126
EXPO2014
The following directives must be added after the command '% patterson' in the
input file to activate some specific features of the patterson procedure.
INVERSE
To activate the procedure of calculating the |F| values from an inverted suitably modified
Patterson map to use as starting point for the Le Bail algorithm in the extraction routine.
MAP, LAYX, LAYY, LAYZ, PEAKS, LIMIT, GRID: See FOURIER directives.
NPAT n
The number of cycles of successive inversions of the Patterson map.( Default values: 6 for
X-ray data, 5 for neutron data
References
Altomare A., Cascarano G., Giacovazzo C. & Viterbo D. (1991). Acta Cryst. A47, 744-748.
Main P. (1978). Acta Cryst. A34, 31-38.
Main P., Fiske S.J., Hull S.E., Lessinger L., Germain G., Declercq J.P. & Woolfson M.M.
(1980) - MULTAN80, a system of computer programs for the automatic solution of crystal
structures from x-ray diffraction data - Univ. of York, England.
Ten Eick L.F. (1977). Acta Cryst. A33, 977-979.
Created with the Personal Edition of HelpNDoc: Easy EBook and documentation generator
64 / 126
EXPO2014
been applied to solve the structures of organic, inorganic and organometallic materials.
Different but similar procedures can be realized: grid search, Monte Carlo, Simulated
Annealing (SA), parallel tempering, genetic algorithm. Each method involves the
generation of a random sequence of trial structures starting from an appropriate 3D model
and moving it until a good match between the calculated and the observed pattern is found.
The information about chemical knowledge of molecules is actively used to reduce the
number of parameters to be varied: bond distances and angles are usually known and kept
fixed while only the torsion angles are varied during the procedure.
Two Direct Space algorithms are now available in the EXPO2014 program, for
crystal structure solution.
They are:
1) a classical Simulated Annealing (SA) approach;
2) the Hybrid Big Bang Big Crunch (HBB-BC) algorithm (Altomare et al., 2013). It results
from an appropriate combination of BB-BC approach with SA and relies on one of the
evolutionary theory of the universe consisting of two successive phases: 1) the Big Bang,
corresponding to an energy dissipation procedure for creating a completely random initial
population; 2) the Big Crunch, corresponding to a contraction procedure for converging to
a global optimum point.
The procedure is supported by the same graphic interface of Simulated Annealing and is
automatically applied by the program for compounds described by less than 6 internal
degrees of freedom (torsion angles) and constituted by less than 3 fragments. In all other
cases the classical SA approach is used.
To run EXPO2014 for structure solution by Direct Space approaches, first you need to
create the input file. It is supposed that the cell parameters and the space group have been
determined before so, fill the frame 'Cell Parameters' and 'Space Group'. Activate the
check button 'Structure Solution', select 'Simulated Annealing' and import the starting
model from 'fragment filename'. The following picture is an example for the crystal structure
determination of the paracetamol molecule. The profile counts file (paracetamol.xy) is in the
directory 'examples'.
65 / 126
EXPO2014
When you press the button 'Save' an input file paracetamol.exp (shown below) will be
created and automatically loaded by the program for the structure solution process by
Simulated Annealing.
%Structure paracetamol
%Initialize
%Job paracetamol
%Data
Cell 7.100 9.380 11.708 90.0 97.42 90.0
SpaceGroup p 21/n
Pattern paracetamol.xy
Wavelength 1.54056
%fragment paracetamol.mol
%sannel
Otherwise you can import an existing input file by the button 'Open' on the 'New Project'
dialog window or selecting 'File' > 'Load & GO' in the main menu. For example you
could load the file paracetamol.exp already existing in the example folder.
66 / 126
EXPO2014
model for Simulated Annealing in MDL Molfile format (*.mol). This command can be
repeated several times to import more than one fragment (see the section Usage of
command %fragment). Some other file types can be imported in the same way:
MOPAC file (*.mop), Tripos Sybyl file (*.mol2,*.ml2), C.I.F. file (*.cif), Protein Data
Bank file (*.pdb), Fenske-Hall Z-matrix (*.zmt), EXPO fragment or Free Fractional
Format (*.frac), XYZ format (*.xyz), Tripos SYBYL (*.mol2, *.ml2), Shelx File (*.ins). A
complete list of the supported input file can be find in section Import of this document.
Instead of command %fragment name_starting_model.ext, with ext=mol,
frac, cif etc, you can import the starting structural model directly by the graphic
interface from the menu File > Import.
Modify > Add fragments to add fragment to an existing partial structural model.
Instead of command %sannel, you can access directly to the graphic interface from menu
"Solve" > "Simulated Annealing".
Press the button in the dialog window to run the Simulated Annealing
procedure.
Consider the following two strategies on the preparing starting model for direct-space
solution.
1) The user is strongly advised to check for similar molecules in the Cambridge Structure
Database (CSD) (organics & organometallics) or in the Inorganic Crystal Structure
67 / 126
EXPO2014
68 / 126
EXPO2014
or HBB-BC algorithm:
Cost function: 2 cost functions can be selected: R weighted profile (default cost
function), R-Bragg intensities.
Resolution: defines the maximum resolution used by Simulated Annealing procedure.
Random seed: selects the value determining the sequence of random numbers used
from the algorithm. When is set to 0 the random seed will be calculated by the system
clock.
Nr. of runs: select the number of Simulated Annealing runs. At the beginning of each
run a new value of random seed is calculated.
Starting temperature: selects the starting temperature. Check on 'automatic' and the
program automatically will find the starting temperature at the beginning of the
procedure.
Number of cycles: the number of moves for each step of temperature is Np*N*20
69 / 126
EXPO2014
where Np is the number of refined parameters and N is a number set by the user in the
entry box. Choosing 'automatic' the program automatically will determine the value of N
by taking into account the external and internal DOFs and the flexibility of the molecule.
This number can be modified also by directive niter (see above).
Temperature reduction factor: the reduction factor applied to the temperature at
each step in the annealing schedule. The default value is 0.90. Increasing this value,
the chance to find the global minimum can be improved even if a longer execution time
will be taken by the procedure.
Solutions: browse the best solutions saved at the end of each run.
The second page External DOF contains information about the external degrees of
freedom (DOFs):
Select fragment: selects the fragment and visualizes the corresponding structure
information.
Atoms: list of the atoms in the selected fragment.
Parameter for fragment: list of the external DOFs for the selected fragment. Check
the parameters to refine, enter the lower and upper bounds of the parameters.
The third page Internal DOF contains information about the internal degrees of
freedom (DOFS):
70 / 126
EXPO2014
Internal DOFs: list of the torsions associated to each refinable internal DOF. Check
the parameters to refine, enter the lower and upper bounds of parameters.
Press the button 'Atomic parameters' to access to the following dialog window:
Thermal parameters: check the buttons on column 'B[iso]' to refine the thermal
parameters of atoms. Click on label 'B[iso]' to select all checks in the column. If the
thermal parameters are not indicated in the imported fragment file, the program
assigns the default values of B=3.0 for non-hydrogen atoms and B=6.0 for hydrogen
atoms. These default values can be changed editing the new values in the
column. The refinement of the thermal parameters is discouraged because usually
doesn't improve the results.
X Y Z: check the buttons on column 'Shift on xyz' to refine the atom positions, in this
71 / 126
EXPO2014
case the fragment is not rigid but the atom positions are shifted respect to the
barycentre. Click on label 'Shift on syz' to select all checks in the column. The
positions refinement (x,y,z) is generally discouraged because can considerably
increase the time to reach the global minimum.
Maximum shift on position: enter the value of the maximum shift of the atomic
positions. 0.020 is the suggested value. Increasing this value the explored parameter
space becomes wider so increasing the probability of falling into a local minimum.
Dynamical Occupancy Correction (DOC): automatic detection of atoms in special
position and atoms that share the same position. If this option is not active all refined
atoms are considered in general position (default). DOC will be applied only on the
atoms selected in the column D.O.C., select all atoms if you don't know exactly which
atoms will fall into special positions. DOC slows down the computation time so it
should be avoid if non special position or shared atoms are expected
Pattern: Enter the direction of the preferred orientation and check 'G Factor'. The
March-Dollase correction will be applied and the magnitude of the preferred
orientation will be optimized.
A visual match between observed and calculated powder pattern is plotted when
Simulated Annealing is running, the progress of structure solution is monitored and
the user can examine:
1) the graph of the minimum values of the cost function (CF) vs. the number of moves;
3) the crystal packing by using the button on the JAV viewer.
During the Direct Space runs, three buttons are active on the toolbar of the main window:
opens the JAV viewer for crystal structure visualization.
skips the current run.
72 / 126
EXPO2014
At the end of each run the structure coordinates are saved in CIF file with name
created by project name with suffix _best1,_best2, ... (e.g. paracetamol_best1.cif is
the best solution, paracetamol_best2.cif is the second best solution, etc.). The order
number in the name represents the position in the list of structures ordered according
to the cost function.
When Direct Space approach fails
When Direct Space approaches fail:
1) The quality of data is not good and the diffraction pattern is not suitable for the
extraction of integrated intensities. In this case can be convenient to perform the
optimization by using the cost function R weighted profile.
2) The starting model is incorrect: bond distances and angles are not entirely
accurate, the number of building blocks is wrong. Improve your model with Cambridge
Structural Database (CSD) or building packages (Avogadro, ChemSketch,
ChemDraw, ...), check for atom in the output file (about 15-20 Ang/atom).
3) The assumption about thermal factors is invalid. Check thermal factors from similar
structures.
4) Space group and cell parameters are not correct. Additionally, in many cases it
may be necessary to carry out a series of independent calculations to test different
potential space groups and/or unit cell choices.
5) The default conditions, for complex structure, cannot be appropriate. Increase the
number of moves and/or runs, try with slower temperature reduction.
The directives in Simulated Annealing
Usually you don't need to read this paragraph unless you are interested to run
Simulated Annealing without interaction with graphic interface. In this case use the
command %sannel to run Simulated Annealing from the input file (*.exp). Use command
%automatic to skip the interaction with the graphical interface and the program will
perform the Simulated Annealing using the default values. An example of input file with
command %automatic and %sannel is the following:
%automatic
%Structure paracetamol
%Job paracetamol
%Data
73 / 126
EXPO2014
To modify the default values of SA, some directives can be used after the command %
sannel.
An example of application of directive niter and nrun to increase the percentage of
success in case of structure solution of the largely flexible molecule verapamil
hydrochloride
%Structure verapamil
%Job Verapamil hydrochloride (C27H39N2O4.Cl)
%Data
Cell 7.089 10.593 19.207 100.11 93.75 101.56
SpaceGroup p -1
Pattern verapamil_hydrochloride.xy
Wavelength 1.54056
%fragment verapamil_hydrochloride.mol
%sannel
niter 2000
nrun 20
The following directives must be added after the command %sannel in the input
file to activate some specific features of the Simulated Annealing procedure
Abbreviations for directives name at least 4 character are permissible, i.e. "cost 2" instead
of "cost_function 2".
nrun n
To modify the number of Simulated Annealing runs. The default is 10.
niter n
To modify the number of moves for each temperature step. In a default run the number of
moves is automatically calculated.
resmax val
To define the maximum resolution used by Simulated Annealing. The default value is 2.0 Å.
temper temp
To modify the initial temperature. In a default run the initial temperature is automatically
calculated.
tfactor val
Determines the temperature reduction factor. val ranges between 0 and 1 and its default
value is 0.90.
rotate_around_axis Ax1 Ax2 At1 At2 At3 ...
or
rotate_around_axis Ax1 Ax2 At1 At2 At3 ... theta
74 / 126
EXPO2014
Rotate atoms At1, At2, At3, ... around a rotation axis defined by atoms Ax1 and Ax2.
theta is an optional value (degrees) used to specify the limits of rotation angle from -theta
to +theta.
When rotate directive is used the specified rotation will be included in the panel 'Internal
DOF' of the graphical interface.
E.g.
rotate C1 C4 C5 C6 C7 C8 C9 C10
the phenyl ring C5-C6-C7-C8-C9-C10 will be rigidly rotated around the axis C1-C4
bump
Automatic generation of anti-bumping restraints extended to all C, N, O and S atoms
res atom1 atom2 target_dist tol weight
Apply restraints between 2 atoms. atom1 and atom2 are the labels of atoms. The other
parameters are optional and can be omitted.
target_dist is the ideal distance between the pair of atoms, when omitted the
distance is automatically deduced by using an internal table of distances.
tol is a permitted tolerance, when omitted the default value is 0.2
weight is a user supplied weight. A default weight is specified as 100.
75 / 126
EXPO2014
shift_atom val
or
shift_atom atom1 atom2 atom3 …. val
To optimize the atomic parameters by applying shifts (up to val) on the atoms with respect
to the centre of gravity of the fragment. The default val is 0.5. Add atom1, atom2,
atom3,… to refine only some specific atoms.
cost_function n
To choose the cost function: 1 for Rw-profile, 2 for RF, 3 for RI. The default choice is 1.
randomize n
To randomize the internal and external DOFs and the atomic parameters (if refined). n is an
optional parameters used as seed of random generator.
rotat n1 n2
To specify the internal DOF using the input file.
doc
or
doc atom1 atom2 atom3 ...
Activate the dynamical occupancy correction (DOC). By default DOC is applied to all the
atoms unless you specify a list of atoms.
E.g.
doc Ni1
DOC will be applied only to the atom Ni1.
po H K L
Activate the March-Dollase preferred orientation correction. H K L are three integer
numbers necessary to specify the direction of the preferred orientation
%Structure creatinem
%Job Creatine monohydrate (C4H9N3O2.H2O)
%Data
Cell 12.506 5.046 12.169 90 108.88 90
SpaceGroup P 21/c
Pattern creatinem.xy
Wavelength 1.54056
%fragment creatine.cif
%fragment H2O.sdf
76 / 126
EXPO2014
%sannel
The command %fragment was use to import the cif file of the creatine molecule
(creatine.cif) and the sdf file of the water molecule (H2O.sdf). The cell parameters and
space group reported in the creatine.cif were ignored and the creatine molecule was
imported in the cell and space group indicated by the user in the input file by directive
Cell and SpaceGroup.
Input file for structure solution of S-Ibuprofen containing two molecules of Ibuprofen in the
asymmetric unit (Z'=2)
Contact
References
Altomare, A., Corriero, N., Cuocci, C., Moliterni, A., Rizzi, R. (2013). J. Appl. Cryst., 46,
779-787.
Created with the Personal Edition of HelpNDoc: Single source CHM, PDF, DOC and HTML Help creation
77 / 126
EXPO2014
of the model is fixed, while its orientation and internal conformation have to be found by SA
runs.
The first page 'SA conditions' contains the general settings of the algorithm:
External model filename: the name of the imported starting structural model must be
78 / 126
EXPO2014
specified. Some file types can be imported: C.I.F. file (*.cif), Expo fragment or Free
Fractional Format (*.fra), XYZ format (*.xyz), MOPAC file (*.mop), MDL MolFile (*.mol),
Protein Data Bank file (*.pdb), Tripos SYBYL (*.mol2, *.ml2), Fenske-Hall Z-Matrix (*.zmt)
and Shelx File (*.ins).
The second page 'Fourier map condition' contains the information about the Fourier map
at which the imported starting model will be associated:
Use Direct Methods map: if selected, the program will use the atomic positions of the
Fourier map obtained at the end of the current run of EXPO2014.
Map filename: it is activated only if 'Use Direct Methods map' is de-selected. The filename
of the external Fourier map (in term of atomic positions) must be supply. Some file types
can be imported: C.I.F. file (*.cif), Expo fragment or Free Fractional Format (*.fra), etc.
79 / 126
EXPO2014
Number of peaks of Fourier Map: the number of atomic positions selected from the
map. If 'automatic' is selected, the program will use the 40% of the electron density peaks
(those with the largest intensity) to associate in turn to each model atom. This number can
be reduced by the user if 'automatic' is de-selected.
The third page 'Internal DOF' contains the information about the internal degrees of
freedom (DOFs):
Torsion: List of torsions associated to each refinable internal DOF. The order number of
each atom involved in the torsion angle to vary are also shown. The internal DOFs can be
or not refined (check Refine).
Lower-Upper: range of variation of the torsion angle. 180 means that the allowed range is
(x-180, x+180), where x is the value of the torsion angle in the initial model.
A visual match between the observed and calculated powder diffraction pattern is plotted
when the procedure is running. When Direct Methods combined with Simulated Annealing
procedure are running the progress of structure solution is monitored and the user can
examine:
1) the graph of the minimum value of the cost function (CF) vs. the number of moves;
2) the match between the observed and calculated pattern.
During the Direct Methods combined with Simulated Annealing procedure runs two buttons
on the toolbar are activated:
opens the JAV viewer for crystal structure visualization.
procedure.
80 / 126
EXPO2014
Contact
References
A. Altomare, R. Caliandro, C. Cuocci, C. Giacovazzo, A. G. G. Moliterni, R. Rizzi and C.
Platteau. 'Direct methods and simulated annealing: a hybrid approach for powder
diffraction data'. J. Appl. Cryst. (2008). 41, 56-61.
Created with the Personal Edition of HelpNDoc: Full-featured multi-format Help generator
Rietveld Refinement
The refined variables include peak shape parameters, unit cell dimensions, coordinates of
atoms in the structure model, background coefficients, scale factor, isotropic displacement
parameters. Atomic displacement parameters can be refined individually or in a group of
atoms with the same atomic type or the same environment. Correction for the preferred
orientation can be achieved by the March-Dollase approach. The background is described
using empirical functions: classical polynomial function, Chebyshev polynomial function,
cosine Fourier series. The background can also modeled by mouse-click selection of
81 / 126
EXPO2014
pattern points. The available peak shape functions are: Pseudo-Voigt, Pearson VII,
Thompson-Cox-Hastings pseudo-Voigt.
The non-linear least squares are carried out by employing the damped Gauss-Newton
method. The refinement convergence condition is reached when the increments on
parameters become smaller with respect to their standard deviations or when the relative
gradient of χ2 minimized function is less then a tolerance value. Tolerance value and the
maximum number of cycles could be suitably modified by the user.
The Le Bail technique can be adopted to perform a full pattern decomposition prior to
Rietveld refinement in order to determine starting values of parameters (background, peak
shape, zero shift and unit cell dimensions) to refine. This strategy is suggested especially if
the available structure model is not completed (McCusker et al., 1999) or when the starting
model is too different from the target model.
The refinement can be carried out by following two alternative approaches: 1) the user can
decide the refinement strategy via graphic interface; 2) an automatic refinement schedule
can be applied. In the automatic mode, groups of parameters are refined according to a
fixed sequence as established in the Rietveld guidelines (Young, 1995). In the last step of
refinement all parameters are refined simultaneously.
The knowledge of molecular geometry can be exploited in the refinement in form of
restraints on bond distances, angles and planes. To simplify the setting of restraints, the
program is able to extract from the connectivity of the initial model the possible restraints
providing a list of current and target values. The user can select the restraints to be
included in the refinement procedure and eventually modify the target value.
Some figures of merit are available in order to follow the progress and evaluate the quality
of refinement: the weighted profile R-factor and the unweighted profile R-factor calculated
for the full pattern (Rwp, Rp); the corresponding figures background-subtracted (Rwp', Rp');
the statistically expected R value (Rexp); the goodness-of-fit (Rwp/Rexp); the Durbin-Watson
d-statistic. R values similar to those used in case of single crystal data are also available:
RF and the Bragg-intensity R (Rb)which use the structure factor moduli extracted from the
experimental profile. In addition, graphical tools are available for visualizing: 1) the
observed, calculated and difference pattern and the cumulative χ2 value; 2) the evolution of
the structure model during the refinement in real time.
82 / 126
EXPO2014
where:
N is the total number of counts in the pattern and P is the number of refined parameters.
To run EXPO2014 for Rietveld refinement, first you need to create the input file. This is an
example of the typical input file:
%Structure paracetamol
%Initialize
%Job paracetamol
%Data
Cell 7.100 9.380 11.708 90.0 97.42 90.0
SpaceGroup p 21/n
Pattern paracetamol.xy
Wavelength 1.54056
%fragment paracetamol.cif
%rietveld
It is supposed that the cell parameters and the space group have been determined
before and a structure model is available and imported by using the command %
fragment followed by the file name containing the structure. %rietveld is a
command to access to the Rietveld Refinement graphical interface.
83 / 126
EXPO2014
Alternatively, when a structure model is available, you can access directly to the
graphic interface from menu "Refine" > "Rietveld"
Constraints
Constraints are defined as exact mathematical relationships between least squares
parameters and can be used to reduce the number of parameters.
Constraints are often employed to control site occupancies when disorder is present in
one or more sites. It is not uncommon in minerals to find an atomic site that can be
occupied by either of two element types. For example, if three atoms A, B, and C occupy
the same site then, assuming the full total occupancy of the site, the following relationship
(or constraint) results: occA + occB + occC = 1. Thus, only two of the fractional occupancies
should be refined, for example, occA and occB are free least squares variables but occC =
1 - occA - occB. Atomic displacement parameters (ADP) are made to shift synchronously.
EXPO2014 automatically identifies atoms that occupy the same site and impose the
constrains on site occupancies and on ADPs.
84 / 126
EXPO2014
Other constraints may be imposed by the user. Clicking on the button 'Constraints' in the
'Crystal Structure' section is possible to access to a dialogue window to define group of
ADPs that can have the same shift.
Contraints are applied to H atoms by using the riding model. This technique is used to
move H atoms synchronously with the C atoms to which they are bonded, thereby
preserving the bond length and direction. The isotropic displacement parameters Uiso of
the hydrogens are constrained to be 1.2 times of that of the heavy atom to which they are
attached. The factor 1.2 can be changed by graphic interface.
Contact
For suggestions and bugs contact:
angela.altomare@ic.cnr.it
annagrazia.moliterni@ic.cnr.it
rosanna.rizzi@ic.cnr.it
85 / 126
EXPO2014
corrado.cuocci@ic.cnr.it
References
Rietveld, H.M.: A profile refinement method for nuclear and magnetic structures. J. Appl.
Cryst. 2 (1969) 65–71.
McCusker, L.B.; Von Dreele, R.B.; Cox, D.E.; Louër, D.; Scardi, P.: Rietveld refinement
guidelines. J. Appl. Cryst. (1999). 32, 36-50
The Rietveld Method. Edited by R. A. Young, Oxford University Press, Oxford (1995)
Created with the Personal Edition of HelpNDoc: Free help authoring environment
86 / 126
EXPO2014
· first routine used must be DATA, if it has not been used in a previous run;
· NORMAL must be used if reflections file has been supplied or EXTRACTION step
has been performed;
· FOURIER routine cannot run before PHASE routine (unless a fragment information is
supplied by the user; in this case only DATA, EXTRACTION and FOURIER
commands are needed).
The minimal information needed by EXPO2014 for a full run (from indexing to structure
solution) is constituted by (see Example 1):
87 / 126
EXPO2014
· space group symbol (not necessary if the Space Group Determination procedure is
applied by using "findspace" directive)
· cell content
· diffraction pattern (or reflections if an external reflection file is provided by any profile
fitting program)
· wavelength
· type of radiation (if neutron or synchrotron data have been used; home diffractometer
data is the default).
Add '%EXPORT filename.xxx' in the input file to export files. Filename is the name of file to
export and xxx define the type of file. Use xxx=cif, xyz, frac to export structure model
respectively in cif file, xyz cartesian coordinates, fractional coordinates; use xxx=hkl to
export file with hkl, extracted structure factors, calculated structure factors, phases from the
obtained model.
In the following example, the command export before '%data' is used to export hkl and cif
file and it will be processed only at the end of structure solution process with Direct
Methods.
%structure cime
%job structure cime
%init
%export cime.hkl
%export cime.cif
%data
pattern cime.pow
cont s 4 c 40 n 24 h 64
wave 1.52904
cell 10.6986 18.8181 6.8246 90.000 111.284 90.000
space p 21/n
sync
%continue
%Structure paracetamol
%Job paracetamol
%Initialize%Initialize
%export paracetamol.hkl
%export paracetamol.cif
%Data
Cell 7.100 9.380 11.708 90.0 97.42 90.0
88 / 126
EXPO2014
SpaceGroup p 21/n
Content c 32 o 8 n 4 h 36
Pattern paracetamol.pow
Wavelength 1.54056
%fragment paracetamol.mol
%sannel
%continue
Created with the Personal Edition of HelpNDoc: Write EPub books for the iPad
EXPO module
LIST module
It is the software interface between EXPO2013 and the direct access file on which data
and results are stored.
DATA module
This routine reads the basic crystallographic informations like cell parameters, space
group symbol (it may be also found by the program, using "findspace" directive),
wavelength, unit cell content and the counts (or reflections). It includes a modified version of
the subroutine SYMM Burzlaff & Hountas (1982). Symmetry operators are directly derived
from the space group symbol.
An external file (containing counts or reflections) is supplied by the user. If an external
reflection file is supplied (avoiding the extraction step), diffraction data are checked in
order to find out equivalent reflections or systematically absent reflections (which are then
excluded from the data set).
N-TREOR module
This routine performs the indexing step (a peak search procedure is automatically carried
out by the program before the indexing step). If available, an external peak positions file
can be supplied to the program and the automatic peak search procedure will be skipped
89 / 126
EXPO2014
by EXPO2014. The indexing procedure retains the basic search algorithm of TREOR90,
(Werner et al., 1985) and has been strengthened in the last version of N-TREOR09
(Altomare et al., 2009).
EXTRACTION module
In this module diffraction integrated intensities are extracted from the powder diffraction
pattern.
To calculate the integrated intensities, the program uses the Le Bail algorithm (Le Bail,
Duroy & Fourquet, 1988) and combines it with a least squares procedure minimizing the
quantity:
NORMAL module
In this module diffraction intensities are normalized using the Wilson method
(Wilson, 1942). Statistical analysis of the intensities is made in order to suggest the
presence or absence of the inversion centre, to identify the possible presence and type of
pseudotranslational symmetry (Cascarano et al., 1988 a,b; Fan, Yao & Qian, 1988) and to
detect preferred orientation effects (Altomare et al., 1994; Altomare et al., 1996). Possible
deviations (of displacive type) from ideal pseudotranslational symmetry are also detected.
All the above information is used as prior information in the integrated intensities extraction
process, unless the directive NOPSEUDO is given to the program.
When some additional prior information, besides positivity and atomicity of electron
density, is available, then a suitable renormalization of structure factors is made.
INVARIANTS module
90 / 126
EXPO2014
PHASE module
In the EXPO2014 program the most reliable one-phase s.s. are treated as known
phases. Besides triplets, also the most reliable negative quartets and two-phase s.s. may
be actively used.
Each relationship is used with its proper weight: the concentration parameter of the first
representation for quartets and two-phase s.s., and C or G for triplets.
- CONVERGENCE/DIVERGENCE PROCEDURE -
Once the starting set has been defined, a good pathway for phase expansion is
determined by a divergence procedure. In the divergence map, starting from the reflections
in the starting set, each new reflection is linked to the preceding ones with the highest value
of < a >.
The starting set defined by the preceding step is usually formed by the origin (and
enantiomorph) fixing reflections, a few one phase s.s. and a number of other phases which
may be obtained:
a) by magic integer permutation (White & Woolfson, 1975; Main, 1978),
b) by a random approach (Baggio et al., 1978; Burla et al., 1992).
The option a) is the default, b) runs if the directive RANDOM is used. In this last case a
large number (depending on the available computer time) of trials can be requested.
FOURIER/LEAST-SQUARES module
The sets of phases generated by the tangent routine are first expanded through psi-E
relationships and then passed to the fast fourier transform routine written by L.F.Ten Eick
(1977) and subsequently modified by the MULTAN team (Main et al., 1980).
DIRECT module
91 / 126
EXPO2014
In this module some "direct-space" approaches for solving crystal structures from powder
diffraction data, can be used. The basic idea is to actively use structural and geometrical
information, available a priori, during the structure solution process.
LABELLING procedure
The atomic species assigned to the peaks of an electron density map obtained at the end
of a direct method procedure, can be modified using chemical information about the
structure. The procedure (Altomare et al., 2002) requires: 1) the prior information on the
number and coordination of the heavy atoms (the procedure can only handle tetrahedral
and octohedral coordinations); 2) the range of the typical distances between heavy atoms;
3) the range of the typical distances between heavy and light atoms.
POLPO procedure
Using the "%polyhedra" command, two procedures can be activated. Both of them use a
Monte Carlo approach and exploit the experimental information about the heavy - atom
connectivity. The first one (POLPO1) (Altomare et al., 2000) is suggested when all the
heavy atoms are rightly positioned and labelled. It is able to correctly locate the light atoms.
The second one (POLPO2) (Giacovazzo et al., 2002) is able to position one or more
missing cations and surrounding anions.
PATTERSON module
References
Altomare A., Burla M.C., Cascarano G., Giacovazzo C., Guagliardi A., Moliterni, A.G.G. &
Polidori, G. (1996). J. Appl. Cryst. 29, 341-345.
Altomare A., Cascarano G., Giacovazzo C., Guagliardi A., Burla M.C., Polidori G. &
Camalli M. (1994). J. Appl. Cryst. 27, 435 - 436.
Altomare A., Cascarano G., Giacovazzo C. & Viterbo D. (1991). Acta Cryst. A47, 744-748.
Altomare, A., Campi, G., Cuocci, C., Eriksson, L., Giacovazzo, C., Moliterni, A., Rizzi, R. &
Werner, P.-E. (2009). J. Appl. Cryst. 42, 768-775.35,182-184.
Altomare A., Giacovazzo C., Guagliardi A. Moliterni A.G.G., Rizzi R. (2000). J. Appl. Cryst.
33, 1305-1310.
Altomare A., Giacovazzo C., Ianigro M., Moliterni A.G.G., Rizzi R. (2002). J. Appl. Cryst.
35, 21-27.
92 / 126
EXPO2014
Baggio R., Woolfson M.M., Declerq J.P. & Germain G. (1978). Acta Cryst. A34, 883-892.
Burla M.C., Cascarano G. & Giacovazzo C. (1992). Acta Cryst. A48, 906-912
Burzlaff H. & Hountas A. (1982). J. Appl. Cryst. 15, 464-467.
Cascarano G., Giacovazzo C. & Luic' M. (1988a). Acta Cryst. A44, 176-183.
Cascarano G., Giacovazzo C. & Luic' M. (1988b). Acta Cryst. A44, 183-188
Fan H., Yao J. & Qian J. (1988). Acta Cryst. A44, 688-691.
Germain G., Main P. & Woolfson M.M. (1970). Acta Cryst. B26, 274-285.
Giacovazzo C. (1976). Acta Cryst. A32, 958-966.
Giacovazzo C., Altomare A., Cuocci C., Moliterni A.G.G., Rizzi R. (2002). J. Appl. Cryst.
35, 422-429
Le Bail A., Duroy H. & Fourquet J.L. (1988). Math. Res. Bull. 23, 447-452.
Main P. (1978). Acta Cryst. A34, 31-38.
Main P., Fiske S.J., Hull S.E., Lessinger L., Germain G., Declercq J.P. & Woolfson M.M.
(1980) - MULTAN80, a system of computer programs for the automatic solution of crystal
structures from x-ray diffraction data - Univ. of York, England.
Ten Eick L.F. (1977). Acta Cryst. A33, 977-979.
Werner P.-E., Eriksson L. & Westdahl M. (1985). J.Appl.Cryst. 18, 367-370.
White P.S. & Woolfson M.M. (1975). Acta Cryst. A31, 53-56.
Wilson, A. J. C. (1942). Determination of Absolute from Relative X-Ray Intensity Data.
Nature, Load.
150, 152.
Created with the Personal Edition of HelpNDoc: Produce electronic books easily
Preparation of data
This routine reads the basic crystallographic information like cell parameters, space
group symbol (it may be also found by the program, using "findspace" directive),
wavelength, unit cell content and the counts (or reflections). It includes a modified version of
the subroutine SYMM Burzlaff & Hountas (1982). Symmetry operators are directly derived
from the space group symbol.
An external file (containing counts or reflections) is supplied by the user. If an external
reflection file is supplied (avoiding the extraction step),diffraction data are checked in order
to find out equivalent reflections or systematically absent reflections (which are then
excluded from the data set).
ALPHA
CELL a b c a b g
93 / 126
EXPO2014
Unit cell contents. Eli is the chemical symbol of atomic species i, ni is the corresponding
number of atoms in the unit cell (to a maximum of 8 atomic species). For each chemical
element up to Cf (Z=98) X-ray scattering factor constants are stored, together with
information on the atomic number and weight, covalent and Van-der-Waals radii, etc. in a
file (see notes on implementation). Neutron scattering factors are coded in the program for
elements up to Pu (Z=94).
EQUIPORTION
EXTEFILE string
This directive is used to supply an external file containing 'd' or '2q' values of peak
positions. In the latter case the additional '2-THETA' directive has to be used.
If this directive is used the program will skip the automatic "peaksearch " procedure.
FILETYPE string
This directive is used to specify the format of input file containing the diffraction data. string
defines the type of file and can be 'counts', 'xy', 'double', 'gsas'.
If the directive FILETYPE string is absent or string is 'counts' the first three number at the
beginning of the counts file must be the 2theta minimum, 2theta step and 2theta maximum.
The following lines must be counts. The number of counts for row or the format of counts
can vary.
#Example 1: use, in the input file, the directive 'FILETYPE counts' or nothing
7.00 0.020 100.00
3017 2925 3056 2983 3008 2953 2875 2976 2886 2928
2973 2831 2960 2821 2854 2900 2857 2798 2876 2798
2819 2865 2830 2735 2771 2813 2806 2704 2823 2878
2793 2795 2756 2722 2705 2736 2802 2657 2811 2736
2753 2722 2780 2734 2795 2741 2812 2808 2713 2746
2696 2747 2799 2876 2741 2921 2776 2966 3017 3082
3058 3193 3147 3309 3319 3404 3490 3554 3684 3845
4015 4069 4293 4450 4657 4905 5201 5587 5796 6172
94 / 126
EXPO2014
6646 7225 7834 8918 9767 10641 11891 13411 15091 17902
....................................................................................................................
#Example 2: use, in the input file, directive 'FILETYPE counts' or nothing
6 0.01 55.98
412.98
404.51
412.40
405.57
403.30
405.94
404.89
389.39
396.55
408.00
403.24
385.44
396.46
393.20
390.52
401.58
.............................................................................................................
If string is 'xy' the counts file contains 2theta in the first column and the intensities in the
second one. The format of counts can vary.
95 / 126
EXPO2014
If string is 'double' the first line in the counts file contains 2theta minimum, 2theta step,
2theta maximum. The rest of file contains intensities and standard deviations on separated
lines.
In the previous cases comment records can appear at the beginning of data file if the first
character is '#' or '!' or '>'. Examples in directory examples.
'FILETYPE gsas' can be used to read counts files in GSAS standard format with keyword
TYPE='STD' and TYPE='ESD'.
FINDSPACE
This directive is used in order to find a possible space group when the cell parameters are
known.
FWHM x
This directive supplies the value of x for the overlapping calculation. Two reflections are
assumed to be in overlapping if 2q1 -2q2 < x * FWHM. Default is x = 0.1.
HISTOGRAM n
96 / 126
EXPO2014
n is the number of bars in the histogram for the alpha2 stripping procedure (only if ALPHA
directive is used).
ISOTOPE El sf
To modify the value of the neutron scattering factor for element El using the value sf. (It must
be used if NEUTRON directive is given).
MOREPEAKS
This directive is used together with the peaksearch directive in order to add more low
intensity peaks to the set of peaks already detected via the peaksearch directive.
NEUTRON
This directive is used to specify that neutron radiation has been used.
NOREFINE
This directive is used to skip the cluster refinement step in the peak search procedure.
PATTERN filename
Filename is the name of the profile counts file (up to 40 characters). It is used to change the
default. [Default is name.pow where name is defined using the command %STRUCTURE].
Only intensities included in the range thmin-thmax will be considered; thmin and thmax are
the minimum and maximum 2q value of the powder pattern respectively.
REF2 string
This directive is used to supply an external reflection file. EXPO reads directly the output
provided by any profile fitting program. HKL, FWHM and F**2 must be supplied [default
format is (3i4,2f15.4)]. If a comment line is at the beginning of the reflection file, the first
character must be '>'. If this directive is used, the directives 'pattern' and 'range' must not
be introduced.
SHIFT sx sy sz
Origin shift components if desired by the user. They must be integral multiples of 1/24.
SPACEGROUP string
String is the symbol of the space group, according to International Tables (1974). Blanks
97 / 126
EXPO2014
are necessary among the terms constituting the space group symbol (see examples at the
end of this manual).
SYNCHROTRON
This directive is used to specify that synchrotron radiation has been used.
STEP step_value
You can use this directive to specify that your data were collected at fixed step. This
directive is activated when the data file contains two columns (2theta values and
intensities) and you are using the directive 'FILETYPE xy'. If you use the directive STEP,
the first column is ignored and the program automatically recreates the 2theta values by
using step_value for the step and the first 2theta value in the data file.
THCORR x
x is the Dq0 value that is used to correct by zeropoint shift the set of 2q peak positions.
The shift value will be egual to 2(n-1)Dq0, where n is the sequential number of the N-TREOR
trial.
If this directive is not introduced the Dq0 default value is the experimental 2q step.
2-THETA
This directive is used combined with the EXTEFILE directive to supply to the indexing
program N-TREOR09 an external file containing 2q values of peak positions.
WAVELENGTH w
References
Burzlaff H. & Hountas A. (1982). J. Appl. Cryst. 15, 464-467.
Created with the Personal Edition of HelpNDoc: Free help authoring environment
Examples
98 / 126
EXPO2014
The following example shows the default use of EXPO2014 in case of indexed pattern.
Most of the structures can be solved in this way. Diffraction data are in the file crox.pow.
The content of the file crox.exp is
%structure crox
%job CROX- data from home diffractometer
%initialize
%data
pattern crox.pow
cell 5.447 6.5576 12.1147 106.382 95.715 77.970
content Cr 8 O 21
wavelength 1.3922
spacegroup p -1
%continue
99 / 126
EXPO2014
Example 2
Example 3
Data are from neutron source.
%structure cfcl
%job CFCL - data from neutron source
%initialize
100 / 126
EXPO2014
%data
pattern cfcl.pow
cell 10.168 14.964 5.100 90.000 90.000 90.000
content C 8 F 16 Cl 16
spacegroup f d d 2
wavelength 1.595
neutron
%continue
Example 4
A Pseudo-Voigt function will be used and the background in each interval will be described
by a constant. The counts file format is not the default one and the graphic window is
suppressed.
%structure mes
%job MES - data from home diffractometer
%initialize
%data
pattern mes.pow
cell 8.588 9.931 11.105 90.0 93.754 90.0
content C 24 N 4 O 20 S 4 H 52
spacegroup p 21/c
wavelength 1.5406
%extraction
pvoigt
polynomial 0
%end
Example 5
The starting values of the integrated intensities in the extraction step are supplied by the
user in the file "yono.ext" and the number of least-square cycles is set to 40.
%structure yono
%job YONO - Synchrotron data
%initialize
%data
pattern yono.pow
cell 9.385 16.394 3.627 90.000 101.067 90.000
content Y 8 O 26 N 2 H 18
101 / 126
EXPO2014
spacegroup p 21
wavelength 1.2323
synchrotron
%extraction
frecycle yono.ext
ncycle 40
%end
Example 6
The preliminary fitting of the standard peak is skipped and the starting 2q shift value is
supplied by the user (the program refines it).
%structure agpz
%job AGPZ - data from home diffractometer
%init
%data
pattern agpz.pow
cell 6.526 20.059 6.464 90.000 90.000 90.000
spacegroup p b c a
content Ag 8 N 16 C 24 H 24
wavelength 1.54056
%extraction
nosing
zeroprof 0.02
%continue
Example 7
Neutron data and a deuterium atom in the cell content are used. Fast graphic interaction for
the integrated intensities extraction process is activated.
%structure baco
%job BACO - Neutron data
%initialize
%data
neutron
isotope H 0.67
pattern baco.pow
cell 10.0632 7.9330 6.8487 90.0 122.3423 90.0
102 / 126
EXPO2014
spacegroup C 2/m
content Ba 4 C 8 O 20 H 8
wavelength 1.909
%extraction
fast
%continue
Example 8
A random approach is used and the best 100 sets of phases, over 250 trials, are retained
in the direct access file. No Fourier is required.
%structure nizr
%phase
random
maxtrials 250
minfom 1.0 100
%end
Example 9
Only the E-map corresponding to the set number 7 and the complete FOURIER/LEAST-
SQUARES procedure for set number 10 are requested by the user.
%structure nbpo
%fourier
set 7
recyc 0
%fourier
set 10
%continue
Example 10
Example 11
Use of the pseudotranslational symmetry (if it is present) as prior information for a new
extraction process is inhibited. Use of nopse directive after command %normal.
103 / 126
EXPO2014
%structure agpz
%job AGPZ - data from home diffractometer
%initialize
%data
pattern agpz.pow
cell 6.526 20.059 6.464 90.000 90.000 90.000
spacegroup p b c a
content Ag 8 N 16 C 24 H 24
wavelength 1.54056
%extraction
%normal
nopseudo
%continue
Example 12
Use of the triplets estimation as prior information for a new extraction process.
%structure mes
%job MES - data from home diffractometer
%initialize
%data
pattern mes.pow
cell 8.588 9.931 11.105 90.0 93.754 90.0
content C 24 N 4 O 20 S 4 H 52
spacegroup p 21/c
wavelength 1.5406
%extraction
%normal
nopseudo
%invariants
estimation
%continue
Example 13
Use of the Patterson map inversion as prior information for a new extraction process.
%structure lasi
%job lasi - Neutron data
%initialize
%data
pattern lasi.pow
cell 5.4059 8.7934 14.2754 90.000 112.731 90.00
content La 8 Si 8 O 8
spacegroup p 21/c
wavelength 2.3400
neutron
%extraction
104 / 126
EXPO2014
%normal
nopse
%patterson
inverse
%continue
Example 14
Example 15
Example 16
If a set of d (or 2 q) values is already available, the automatic peak search procedure is
skipped:
%structure mes
%job MES - data from home diffractometer
%initialize
%data
pattern mes.pow
wavelength 1.5406
2-theta
extefile mes.pea
%ntreor
105 / 126
EXPO2014
The N-TREOR09 directives 'VOL = 6000' and 'CEM = 40' activate a search for a cell
having a maximum volume of 6000 Å3 and maximum axis value of 40 Å.
Example 17
Use of the change label procedure to relabel the peaks of an electron density map
obtained by applying the FOURIER procedure. The binary file "vfi.bin" must exist.
%structure vfi
%job VFI - Synchrotron data
%changelab
label 3 al 3 p
coord 4 4
dwei 3.1 0.5
dlig 1.9 0.5
%continue
Example 18
The POLPO1 procedure is applied to locate the light atoms around each specified heavy
atoms. The crox.fra file contains the cations coordinates.
The binary file 'crox.bin' must exist.
%structure crox
%job CROX - home diffractometer data
%polyhedra
fragment crox.fra
octahedron 1 1.92 0.2 0.2
tetrahedron 2 1.75 0.2 0.2
tetrahedron 3 1.75 0.2 0.2
tetrahedron 4 1.75 0.2 0.2
%continue
Example 19
The POLPO2 procedure is applied to locate one or more missing cations and surronding
anions. The sapo.fra file contains the located cations coordinates.
The binary file 'sapo.bin' must exist.
%structure sapo
%job SAPO - data from home diffractometer
%polyhedra
fragment sapo.fra
tetrahedron Si 1 1.6 0.2 0.2
tetrahedron Si 2 1.6 0.2 0.2
missing 1 2 Si tetrahedron 1.6 0.2 0.2
106 / 126
EXPO2014
Example 20
Table 1
107 / 126
EXPO2014
h = 2n & k= 2n &
70) l = 2n & h+k = 2n 71) h+k = 2n & h+l = 2n 72)
l= 2n
Created with the Personal Edition of HelpNDoc: Produce online help for Qt applications
108 / 126
EXPO2014
Created with the Personal Edition of HelpNDoc: Single source CHM, PDF, DOC and HTML Help creation
How to
Many possibilities are available to interact with the program: through the toolbar, the menu
and using keyboard/mouse.
Created with the Personal Edition of HelpNDoc: Single source CHM, PDF, DOC and HTML Help creation
File
The sub-menu File is shown on the left. It allows:
109 / 126
EXPO2014
Import
EXPO2014 can also read the coordinates of the final molecular geometry in the output files
generated by well known quantum chemistry programs:
110 / 126
EXPO2014
Isolated atoms and some common chemical geometries (tetrahedron, octahedron, square
plane) can also imported. They are useful building blocks in structure solution by global
optimization.
Export
Information about the currently displayed crystal structure can be in following text files:
· Crystallographic Information File (*.cif) http://www.iucr.org/resources/cif
· Input/Output files of SHELXL (*.ins,*.res) http://shelx.uni-ac.gwdg.de/SHELX/
· Free Form Fractional (*.fra, *.frac) http://openbabel.org/wiki/Free_Form_Fractional
· Protein Data Bata Bank (*.pdb) http://www.wwpdb.org/
· POV-Ray input file (*.pov) input file for the freeware program POV-Ray. http://
www.povray.org/download/
· MDL Molfile (*.mol) http://accelrys.com/products/informatics/cheminformatics/ctfile-
formats/no-fee.php
· Fenske-Hall Z-matrix (*.zmt), Z-matrix file created by Open Babel
· MDL Molfile (*.mol) http://accelrys.com/products/informatics/cheminformatics/ctfile-
formats/no-fee.php
· XYZ Cartesian Format (*.xyz) http://en.wikipedia.org/wiki/XYZ_file_format
· MOPAC File Formats with cartesian coordinates. http://openmopac.net/manual/
index.html
EXPO2014 create input files for well known quantum chemistry programs. Input files for
GAMESS and NWCHEM contain keywords for geometry optimization by density functional
111 / 126
EXPO2014
theory (DFT) calculations to perform using the B3LYP functional and a standard 6-31G*
basis set. This approach is useful to generate molecular model with accurate angles and
bond lengths, suitable as a starting model for structure solution by real space methods
Screenshot
Images of crystal structure displayed in the graphic area can be saved in files of different
following formats:
· Bitmap File (*.bmp)
· ICO file (*.ico)
· jpeg file (*.jpg)
· Portable Network Graphics file (*.png)
· Tagged Image File (*.tif)
Exit
This is used to exit the program.
Created with the Personal Edition of HelpNDoc: Generate Kindle eBooks with ease
Modify
The sub-menu Modify is shown on the left. It allows:
112 / 126
EXPO2014
Undo/Redo
Delete
Selection
Delete
Bonds
Delete All
Type
Change
Species
Rename
Renumber
Sort by
Species
Edit
structure
Add
fragments
Created with the Personal Edition of HelpNDoc: Full-featured Kindle eBooks generator
Select
The sub-menu Select is shown on the left. It allows:
Selection Mode
Zoom Mode
Manipulation Mode
Invert selection
Select none
Select Type
View
The sub-menu View is shown on the left. It allows:
113 / 126
EXPO2014
Preferences
Symmetry
Zoom
Rotate
Label All Atoms
Label except
Hydrogens
Label Type
Show/Hide
View Plane
Polyhedra
Polyhedra
Properties
Contacts
Preferences
In order to get a better view of a model it is possible to tune the radius of the spheres
114 / 126
EXPO2014
representing the atoms and the radius of the cylinders representing the bonds. The atom
colors can be modified.
Polyhedra Properties
This window provides some tools to set the properties of the polyhedra; it is possible to
visualize or to hide some of them, it is possible to change the colors and the transparency,
to show or to hide the edges etc.
115 / 126
EXPO2014
View plane
Created with the Personal Edition of HelpNDoc: Write EPub books for the iPad
Tools
The sub-menu Tools is shown on the left. It allows:
116 / 126
EXPO2014
Geometry Optimization
· To optimize the molecular geometry in the gas phase to obtain accurate starting
molecular structure suitable for structure solution by real space methods. In this stage
different levels of theory are applied: molecular mechanics, semi-empirical methods,
Hartree-Fock methods, density functional theory (DFT).
· To minimize the energy of crystal structure and provide reasonable bond lengths and
angles to input into the Rietveld refinement as chemical restraints. Calculations are
performed using plane wave DFT with dispersion correction (DFT-D).
· Because it is difficult to accurately locate the positions of H atoms from the XRPD data,
energy optimization provides the most reasonable approach to compute optimal
positions for the H atoms as the final step, with unit cell and non-H atomic positions
fixed to those established by Rietveld refinement.
· To clarify an ambiguity concerning the orientation of functional group that could not be
distinguished on the basis of the XRPD data alone.
· Geometry optimization with DFT-D approach in the solid state has been applied to
refine the crystal structure when Rietveld refinement yielded inaccurate molecular
geometry, providing results whose accuracy is comparable to that of single-crystal
refinement.
117 / 126
EXPO2014
EXPO2014 provides crystallographers with tools to perform theoretical calculations for the
geometry optimization. The first geometry optimization should be done with a faster level of
theory, such as molecular mechanics or a semiempirical method. Once a geometry close
to the correct geometry has been obtained with this lower level of theory, it is used as the
starting geometry for a second optimization at the final, more accurate level of theory (e.g.,
DFT).
In molecular mechanics, the energy of a compound consists of the sum of simple classical
equations and the molecule is described as a collection of balls (corresponding to the
atoms) held together by springs (corresponding to the bonds). The molecular mechanics
model clearly does not use wave function and electrons are not explicitly included.
The total potential energy is typically taken to be the sum of the bond stretching energy Estr ,
the bending energy Ebend , the twisting (or torsion) energy Etor , and the energy of interaction
between non-bonded atoms Enon-bond. The last contribution includes van der Waals, steric,
and electrostatic interactions between atoms not chemically bound.
Etot = Estr + Ebend + Etor + Enon-bond
The equations for the potential energy terms contain parameters and the specified set of
equations and parameters is called the force field. The constants in this equation are
obtained either from spectroscopic data or ab initio calculations. A set of equations with
their associated constants is called a force field. Many good force fields have already been
developed and they differ in the number of terms in the energy expression, the complexity
of those terms, and the way in which the constants were obtained.
To optimize the geometry and find the more stable conformation, each of the forces fi
118 / 126
EXPO2014
fi = - ¶Etot/¶qi
· The Merck molecular force field (MMFF) can be used for organic molecules and
biomolecules [1]
· The Universal force field (UFF) is a full periodic table force field. This is widely used for
system containing inorganic elements [2]
In EXPO2014 the support for molecular mechanics is provided using the class
OBForceField from the library Open Babel [3]
Clicking on Tools > Optimize Geometry > Optimize MMFF94 force field will be applied
and in case of failure (e.g. inorganic elements) UFF will be executed. Use Modify> Undo
Optimize Geometry to restore the initial molecular geometry.
There are two general approaches to solving the Schrödinger equation of a molecular
system: semi-empirical and ab initio methods. The semi-empirical methods assume an
approximate Hamiltonian operator and the calculations are further simplified by
approximating integrals with various experimental data such as ionization energies,
electronic spectral transition energies, and bond energies. On the other hand, ab initio
methods use a “correct” Hamiltonian operator, which includes kinetic energy of the
electrons, attractions between electrons and nuclei, and repulsions between electrons and
those between nuclei, to calculate all integrals without making use of any experimental data
other than the values of the fundamental constants. An example of these methods is the
self-consistent field (SCF) method first introduced by D. R. Hartree and V. Fock in the
1920s.
MOPAC (Molecular Orbital PACkage) is one of the most widely used semi-empirical
software packages and is designed for a wide range of functionality. We tested version
2012 and 2016. MOPAC is not distributed with EXPO, so a copy of the software needs to
be separately downloaded. For more information or to obtain a copy, please visit http://
www.openmopac.net/MOPAC2016.html, the software is currently free for academic use.
An interface of MOPAC is provided in EXPO and allows users to perform a gas-phase
geometry optimization. This approach is useful to generate molecular model with accurate
angles and bond lengths, suitable as a starting model for structure solution by real space
methods. The advantage of semi-empirical calculations is that they are much faster than
119 / 126
EXPO2014
Title: define the title of the calculations, e.g., the name of molecule. It is reported in the
second line of the MOPAC input file
Method: Different semi-empirical Hamiltonians are available and are used in the
electronic part of the calculation to obtain molecular orbitals and its derivative with respect
to molecular geometry. Not all chemical elements are supported by the selected
Hamiltonian. For more information about the elements available for the specified method
see the geometry section in MOPAC manual: http://openmopac.net/manual/
Charge: Write the charge of studied system
Multiplicity: Allow the user to specify the number of singly occupied orbitals. Singlet is the
default, and specify a closed shell that includes the most of organic molecules; doublet
specify one singly occupied orbital (e.g. organic radicals); triplet specifies two singly
occupied orbitals; and so forth. If the multiplicity is unknown, you have to try all.
Format : Geometries can be specified within MOPAC using the Cartesian coordinate
definition or the Gaussian Z-matrix format. The position of molecule in the cell unit will be
lost if the geometry is defined in Z-matrix format.
Mopac Program: EXPO2014 automatically locates the position where MOPAC exists, if
MOPAC is installed in a standard directory (C:/Program Files/MOPAC/ on Windows, /opt/
mopac/ on Linux) or the environmental variable PATH includes the installation directory,
otherwise the MOPAC executable must be specified directly by the user.
If you click on Submit, MOPAC will be launched and at the end of the geometry
optimization, the new geometry will be displayed and an output file, called
structure_name_mopac.out, will be produced in the working directory. The output file can
be viewed from Tools> Optimize Geometry> View Mopac Output. You can read
previously generated MOPAC output files and display the molecular geometry from File >
120 / 126
EXPO2014
Import Structure and selecting 'MOPAC Output File (*.out)' in the file extension menu.
Before performing the calculation, add the missing hydrogen. You can use the option
Tools>Add Hydrogens. Wrong starting geometry, missing hydrogens or wrong
multiplicity are possible causes of an incorrect result. In this situation use Modify> Undo
Optimize Geometry to restore the initial geometry, then read the output file to find a
possible cause of error.
Additional calculations can be performed by clicking the preview button and changing the
input file. For example, it may be particularly important to calculate the bond order: add the
keyword BONDS in the first line and bond order between all pairs of atoms will be printed
in the output file.
Another approach closely related to the ab initio methods that has gained increasing
prominence in recent years is the DFT. This method bypasses the determination of the
wavefunction ψ. Instead, it determines the molecular electronic probability density ρ directly
and then calculates the energy of the system from ρ.
EXPO2014 creates input files for well known quantum chemistry programs, clicking on File
> Import Structure. Input files for GAMESS [4] and NWCHEM [5] contain keywords for
geometry optimization by density functional theory (DFT) calculations to perform using the
B3LYP functional and a standard 6-31G* basis set.
In addition Tools > Optimize Crystal Structure by DFT-D can create an NWCHEM input
file for the optimization of crystal structure by plane wave density function theory with
Grimme dispersion correction. In order to obtain a good accuracy on the interatomic
distances, you should do a convergence test with respect the cutoff energy and a
convergence study associated with the sampling of the Brillouin zone.
References
[1] Thomas A. Halgren, J. Comput. Chem., 17, 490-519 (1996)
[2] J. Am. Chem. Soc. 1992, Vol. 114, No. 25, 10024-10035
[4] http://www.msg.ameslab.gov/gamess/
[5] http://www.nwchem-sw.org/index.php/Main_Page
Created with the Personal Edition of HelpNDoc: Easy to use tool to create HTML Help files and Help web sites
Info
The sub-menu Info is shown on the left. It allows:
121 / 126
EXPO2014
Structure
List of Atoms
122 / 126
EXPO2014
List of Bonds
123 / 126
EXPO2014
Bar Chart
124 / 126
EXPO2014
The tab Bar Chart contains the histograms of the fourier intensities of the atoms.
List of Contacts
125 / 126
EXPO2014
Geometry
The value of the distances, angles and torsion angles among the atoms can be calculated.
Created with the Personal Edition of HelpNDoc: Free help authoring environment
126 / 126