Click here to go to the corresponding page for the latest version of DIALS
Multi-crystal symmetry analysis and scaling with DIALS¶
Introduction¶
Recent additions to DIALS and xia2 have enabled multi-crystal analysis to be
performed following integration. These tools are particularly relevant
for analysis of many partial-datasets, which may be the only practical way of
performing data collections for certain crystals. After integration, the
space group symmetry can be investigated by testing for the presence of symmetry
operations relating the integrated intensities of groups of reflections - the
program to perform this is analysis is dials.symmetry
(with algorithms
similar to those of the program Pointless).
Another thing to consider is that for certain space groups (polar space groups),
there is an inherent ambiguity in the way that the diffraction pattern can be
indexed. In order to combine multiple datasets from these space groups, one must
reindex all data to a consistent setting, which can be done with the program
dials.cosym
(see Gildea and Winter for details).
Finally, the data must be scaled, to correct for experimental effects such as
differences in crystal size/illuminated volume and radiation damage - this can
be done with the program dials.scale
(with algorithms similar to those
of the program Aimless). After the data has been scaled, choices
can then be made about applying a resolution limit to exclude certain regions
of the data which may be negatively affected by radiation damage.
In this tutorial, we shall investigate a multi-crystal dataset collected on
the VMXi beamline, Diamond’s automated facility for data collection from
crystallisation experiments in-situ. The dataset consists of four repeats of
a 60-degree rotation measurement on a crystal of Proteinase K, taken at different
locations on the crystal. We shall start with the integrated reflections and
experiments files generated by running the automated processing software
xia2
with pipeline=dials
.
Have a look at the Processing in Detail tutorial if you
want to know more about the different processing steps up to this point.
Note
To obtain the data for this tutorial you can run
dials.data get vmxi_proteinase_k_sweeps
. If you are at Diamond
Light Source on BAG training then the data are already available.
After typing module load bagtraining
you’ll be moved to a working
folder, with the data already located in the tutorial-data/ccp4/integrated_files
subdirectory. The processing in this tutorial will produce quite a few files,
so it’s recommended to make an move to new directory:
mkdir multi_crystal
cd multi_crystal
xia2.multiplex¶
The easiest way to run these tools for a multi-dataset analysis is through the
program xia2.multiplex
.
This runs several DIALS programs, including the programs described above, while
producing useful plots and output files.
To run xia2.multiplex
, we must provide the path to the input integrated files from
dials.integrate
:
xia2.multiplex experiments_0.expt experiments_1.expt experiments_2.expt experiments_3.expt reflections_0.refl reflections_1.refl reflections_2.refl reflections_3.refl
DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
input {
experiments = experiments_0.expt
experiments = experiments_1.expt
experiments = experiments_2.expt
experiments = experiments_3.expt
reflections = reflections_0.refl
reflections = reflections_1.refl
reflections = reflections_2.refl
reflections = reflections_3.refl
}
Selecting 4 experiments with profile-fitted reflections
Selecting 4 experiments with refined reflections
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
0 singletons:
Point group a b c alpha beta gamma
1 cluster:
Cluster_id N_xtals Med_a Med_b Med_c Med_alpha Med_beta Med_gamma Delta(deg)
4 in P422.
cluster_1 4 68.36 (0.01 ) 68.36 (0.01 ) 103.95(0.02 ) 90.00 (0.00) 90.00 (0.00) 90.00 (0.00)
P 4/m m m (No. 123) 68.36 68.36 103.95 90.00 90.00 90.00 0.0
Standard deviations are in brackets.
Each cluster:
Input lattice count, with integration Bravais setting space group.
Cluster median with Niggli cell parameters (std dev in brackets).
Highest possible metric symmetry and unit cell using LePage (J Appl Cryst 1982, 15:255) method, maximum delta 3deg.
Using all data sets for subsequent analysis
Laue group determined by dials.cosym: P 4 2 2
Resolution limit: 1.80 (cc_half > 0.3)
Overall merging statistics:
+--------------------+--------------+------------------+-------------------+
| | Overall | Low resolution | High resolution |
|--------------------+--------------+------------------+-------------------|
| Resolution (Å) | 68.36 - 1.80 | 68.41 - 4.89 | 1.83 - 1.80 |
| Observations | 212605 | 20714 | 133 |
| Unique reflections | 20140 | 1334 | 127 |
| Multiplicity | 10.6 | 15.5 | 1.0 |
| Completeness | 85.42% | 100.00% | 11.07% |
| Mean I/σ(I) | 28.8 | 55.6 | 2.7 |
| Rmerge | 0.057 | 0.049 | 0.397 |
| Rmeas | 0.059 | 0.050 | 0.562 |
| Rpim | 0.016 | 0.013 | 0.397 |
| CC½ | 0.999 | 0.999 | -0.379 |
+--------------------+--------------+------------------+-------------------+
Resolution shells:
+------------------+----------+-------------+----------------+----------------+----------+---------------+----------+---------+--------+--------+---------+
| Resolution (Å) | N(obs) | N(unique) | Multiplicity | Completeness | Mean I | Mean I/σ(I) | Rmerge | Rmeas | Rpim | CC½ | CCano |
|------------------+----------+-------------+----------------+----------------+----------+---------------+----------+---------+--------+--------+---------|
| 68.41 - 4.89 | 20714 | 1334 | 15.53 | 100 | 262.9 | 55.6 | 0.049 | 0.05 | 0.013 | 0.999* | -0.257 |
| 4.89 - 3.88 | 20458 | 1244 | 16.45 | 100 | 397.1 | 59.3 | 0.046 | 0.048 | 0.012 | 0.999* | -0.214 |
| 3.88 - 3.39 | 19966 | 1208 | 16.53 | 100 | 304.4 | 54.9 | 0.049 | 0.051 | 0.012 | 0.999* | -0.056 |
| 3.39 - 3.08 | 20074 | 1189 | 16.88 | 100 | 214.3 | 49.5 | 0.054 | 0.056 | 0.013 | 0.999* | -0.063 |
| 3.08 - 2.86 | 20270 | 1185 | 17.11 | 100 | 148.6 | 43.1 | 0.06 | 0.062 | 0.015 | 0.999* | 0.006 |
| 2.86 - 2.69 | 20467 | 1191 | 17.18 | 100 | 120.1 | 40.1 | 0.066 | 0.068 | 0.016 | 0.998* | -0.086 |
| 2.69 - 2.55 | 20171 | 1172 | 17.21 | 100 | 94.2 | 34.3 | 0.075 | 0.078 | 0.019 | 0.998* | -0.001 |
| 2.55 - 2.44 | 16109 | 1167 | 13.8 | 100 | 84.9 | 28.8 | 0.078 | 0.081 | 0.022 | 0.997* | -0.077 |
| 2.44 - 2.35 | 12118 | 1162 | 10.43 | 99.91 | 73.8 | 24 | 0.082 | 0.087 | 0.027 | 0.996* | -0.001 |
| 2.35 - 2.27 | 10137 | 1180 | 8.59 | 99.92 | 72.5 | 21.4 | 0.085 | 0.09 | 0.03 | 0.995* | 0.092* |
| 2.27 - 2.20 | 8323 | 1150 | 7.24 | 99.65 | 62.7 | 17.6 | 0.087 | 0.094 | 0.035 | 0.995* | -0.129 |
| 2.20 - 2.13 | 6830 | 1165 | 5.86 | 99.06 | 56.7 | 14.3 | 0.091 | 0.1 | 0.04 | 0.993* | 0.002 |
| 2.13 - 2.08 | 5300 | 1116 | 4.75 | 97.04 | 49.4 | 11.5 | 0.096 | 0.107 | 0.047 | 0.991* | -0.126 |
| 2.08 - 2.03 | 4021 | 1056 | 3.81 | 91.27 | 43.7 | 9.6 | 0.101 | 0.115 | 0.055 | 0.986* | -0.041 |
| 2.03 - 1.98 | 3076 | 1001 | 3.07 | 86.14 | 36.9 | 7.6 | 0.109 | 0.13 | 0.068 | 0.981* | 0.030 |
| 1.98 - 1.94 | 2060 | 896 | 2.3 | 77.85 | 35 | 6.3 | 0.103 | 0.128 | 0.074 | 0.975* | -0.204 |
| 1.94 - 1.90 | 1163 | 700 | 1.66 | 61.03 | 30 | 4.8 | 0.12 | 0.156 | 0.098 | 0.958* | 0.000 |
| 1.90 - 1.86 | 751 | 531 | 1.41 | 46.21 | 26.1 | 3.9 | 0.113 | 0.151 | 0.1 | 0.967* | 0.000 |
| 1.86 - 1.83 | 464 | 366 | 1.27 | 31.88 | 23.8 | 3.4 | 0.116 | 0.161 | 0.111 | 0.955* | 0.000 |
| 1.83 - 1.80 | 133 | 127 | 1.05 | 11.07 | 20.6 | 2.7 | 0.397 | 0.562 | 0.397 | -0.379 | 0.000 |
+------------------+----------+-------------+----------------+----------------+----------+---------------+----------+---------+--------+--------+---------+
Space group determined by dials.symmetry: P 41 21 2
Intensity correlation clustering summary:
========= ============== ========== ======== ============== ==============
Cluster No. datasets Datasets Height Multiplicity Completeness
========= ============== ========== ======== ============== ==============
1 2 0 3 0.0024 3.4 0.68
2 2 1 2 0.0038 5.7 0.79
3 4 0 1 2 3 0.0045 8.2 0.83
========= ============== ========== ======== ============== ==============
Cos(angle) clustering summary:
========= ============== ========== ======== ============== ==============
Cluster No. datasets Datasets Height Multiplicity Completeness
========= ============== ========== ======== ============== ==============
1 2 1 2 1e-05 5.7 0.79
2 3 1 2 3 0.0093 5.7 0.79
3 4 0 1 2 3 0.032 8.2 0.83
========= ============== ========== ======== ============== ==============
This runs dials.cosym
to analyse the Laue symmetry and reindex all datasets
consistently, scales the data with dials.scale
,
calculates a resolution limit with dials.resolutionizer
and reruns
dials.scale
with the determined resolution cutoff. The
final dataset is exported to an unmerged mtz and a
HTML report
is generated. The easiest way to see the results is to open the
HTML report
in your browser of choice e.g.:
firefox xia2.multiplex.html
Provided is a summary of the merging statistics as well as several plots, please explore these for a few minutes now! This dataset results in good merging statistics, however if you navigate to the “Analysis by batch” tab in “All data”, you will see that the fourth dataset has poorer statistics compared to the others. Let’s repeat the processing manually to explore the different steps and address this issue.
Manual reprocessing¶
The first step is Laue/Patterson group analysis using dials.cosym:
dials.cosym experiments_0.expt experiments_1.expt experiments_2.expt experiments_3.expt reflections_0.refl reflections_1.refl reflections_2.refl reflections_3.refl
Scoring all possible sub-groups
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
| Patterson group | | Likelihood | NetZcc | Zcc+ | Zcc- | delta | Reindex operator |
|-------------------+-----+--------------+----------+--------+--------+---------+--------------------|
| P 4/m m m | *** | 1 | 9.92 | 9.92 | 0 | 0 | -a,b,-c |
| P m m m | | 0 | 0 | 9.93 | 9.92 | 0 | -a,b,-c |
| C m m m | | 0 | 0 | 9.92 | 9.92 | 0 | a-b,a+b,c |
| P 4/m | | 0 | -0.01 | 9.92 | 9.93 | 0 | -a,b,-c |
| C 1 2/m 1 | | 0 | 0.01 | 9.93 | 9.92 | 0 | a-b,a+b,c |
| P 1 2/m 1 | | 0 | 0.01 | 9.93 | 9.92 | 0 | -a,b,-c |
| P 1 2/m 1 | | 0 | -0 | 9.92 | 9.92 | 0 | -b,-a,-c |
| P 1 2/m 1 | | 0 | -0 | 9.92 | 9.93 | 0 | -a,-c,-b |
| C 1 2/m 1 | | 0 | -0.01 | 9.92 | 9.93 | 0 | a+b,-a+b,c |
| P -1 | | 0 | -9.92 | 0 | 9.92 | 0 | -a,b,-c |
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
Best solution: P 4/m m m
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
Reindex operator: -a,b,-c
Laue group probability: 1.000
Laue group confidence: 1.000
Space groups:
P 4 2 2
[0, 1, 2, 3]
Reindexing operators:
x,y,z
[0, 1, 2, 3]
DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
input {
experiments = experiments_0.expt
experiments = experiments_1.expt
experiments = experiments_2.expt
experiments = experiments_3.expt
reflections = reflections_0.refl
reflections = reflections_1.refl
reflections = reflections_2.refl
reflections = reflections_3.refl
}
Hierarchical clustering of unit cells
Using Andrews-Bernstein distance from Andrews & Bernstein J Appl Cryst 47:346 (2014)
Distances have been calculated
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
0 singletons:
Point group a b c alpha beta gamma
1 cluster:
Cluster_id N_xtals Med_a Med_b Med_c Med_alpha Med_beta Med_gamma Delta(deg)
4 in P422.
cluster_1 4 68.36 (0.01 ) 68.36 (0.01 ) 103.95(0.02 ) 90.00 (0.00) 90.00 (0.00) 90.00 (0.00)
P 4/m m m (No. 123) 68.36 68.36 103.95 90.00 90.00 90.00 0.0
Standard deviations are in brackets.
Each cluster:
Input lattice count, with integration Bravais setting space group.
Cluster median with Niggli cell parameters (std dev in brackets).
Highest possible metric symmetry and unit cell using LePage (J Appl Cryst 1982, 15:255) method, maximum delta 3deg.
Filtering reflections for dataset 0
Read 76079 predicted reflections
Selected 54367 reflections integrated by profile and summation methods
Combined 1127 partial reflections with other partial reflections
Removed 491 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 12 intensity.prf.value reflections with I/Sig(I) < -5
Filtering reflections for dataset 1
Read 75607 predicted reflections
Selected 54845 reflections integrated by profile and summation methods
Combined 1284 partial reflections with other partial reflections
Removed 554 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 10 intensity.prf.value reflections with I/Sig(I) < -5
Filtering reflections for dataset 2
Read 77983 predicted reflections
Selected 54461 reflections integrated by profile and summation methods
Combined 1404 partial reflections with other partial reflections
Removed 541 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 7 intensity.prf.value reflections with I/Sig(I) < -5
Filtering reflections for dataset 3
Read 76468 predicted reflections
Selected 53877 reflections integrated by profile and summation methods
Combined 1062 partial reflections with other partial reflections
Removed 514 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 3 intensity.prf.value reflections with I/Sig(I) < -5
Patterson group: P 4/m m m
--------------------------------------------------------------------------------
Normalising intensities for dataset 1
ML estimate of overall B value:
13.46 A**2
ML estimate of -log of scale factor:
-3.04
--------------------------------------------------------------------------------
Normalising intensities for dataset 2
ML estimate of overall B value:
11.06 A**2
ML estimate of -log of scale factor:
-3.50
--------------------------------------------------------------------------------
Normalising intensities for dataset 3
ML estimate of overall B value:
11.45 A**2
ML estimate of -log of scale factor:
-2.96
--------------------------------------------------------------------------------
Normalising intensities for dataset 4
ML estimate of overall B value:
12.14 A**2
ML estimate of -log of scale factor:
-2.67
--------------------------------------------------------------------------------
Estimation of resolution for Laue group analysis
Resolution estimate from <I>/<σ(I)> > 4.0 : 2.10
Resolution estimate from CC½ > 0.60: 1.80
High resolution limit set to: 1.80
Selecting 188689 reflections with d > 1.80
================================================================================
Automatic determination of number of dimensions for analysis
+--------------+--------------+
| Dimensions | Functional |
|--------------+--------------|
| 1 | 14.5345 |
| 2 | 15.6716 |
| 3 | 14.608 |
| 4 | 15.0478 |
| 5 | 15.2777 |
| 6 | 14.7404 |
| 7 | 14.8183 |
| 8 | 14.8374 |
+--------------+--------------+
Best number of dimensions: 6
Using 6 dimensions for analysis
Principal component analysis:
Explained variance: 0.0022, 0.0016, 0.0013, 0.0011, 0.00094, 7.3e-05
Explained variance ratio: 0.31, 0.22, 0.18, 0.15, 0.13, 0.01
Scoring individual symmetry elements
+--------------+--------+------+-----+-----------------+
| likelihood | Z-CC | CC | | Operator |
|--------------+--------+------+-----+-----------------|
| 0.947 | 9.93 | 0.99 | *** | 2 |(1, 1, 0) |
| 0.946 | 9.92 | 0.99 | *** | 2 |(-1, 1, 0) |
| 0.947 | 9.93 | 0.99 | *** | 2 |(0, 1, 0) |
| 0.946 | 9.91 | 0.99 | *** | 4^-1 |(0, 0, 1) |
| 0.947 | 9.93 | 0.99 | *** | 4 |(0, 0, 1) |
| 0.946 | 9.92 | 0.99 | *** | 2 |(0, 0, 1) |
| 0.946 | 9.92 | 0.99 | *** | 2 |(1, 0, 0) |
+--------------+--------+------+-----+-----------------+
Scoring all possible sub-groups
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
| Patterson group | | Likelihood | NetZcc | Zcc+ | Zcc- | delta | Reindex operator |
|-------------------+-----+--------------+----------+--------+--------+---------+--------------------|
| P 4/m m m | *** | 1 | 9.92 | 9.92 | 0 | 0 | -a,b,-c |
| P m m m | | 0 | 0 | 9.93 | 9.92 | 0 | -a,b,-c |
| C m m m | | 0 | 0 | 9.92 | 9.92 | 0 | a-b,a+b,c |
| P 4/m | | 0 | -0.01 | 9.92 | 9.93 | 0 | -a,b,-c |
| C 1 2/m 1 | | 0 | 0.01 | 9.93 | 9.92 | 0 | a-b,a+b,c |
| P 1 2/m 1 | | 0 | 0.01 | 9.93 | 9.92 | 0 | -a,b,-c |
| P 1 2/m 1 | | 0 | -0 | 9.92 | 9.92 | 0 | -b,-a,-c |
| P 1 2/m 1 | | 0 | -0 | 9.92 | 9.93 | 0 | -a,-c,-b |
| C 1 2/m 1 | | 0 | -0.01 | 9.92 | 9.93 | 0 | a+b,-a+b,c |
| P -1 | | 0 | -9.92 | 0 | 9.92 | 0 | -a,b,-c |
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
Best solution: P 4/m m m
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
Reindex operator: -a,b,-c
Laue group probability: 1.000
Laue group confidence: 1.000
Space groups:
P 4 2 2
[0, 1, 2, 3]
Reindexing operators:
x,y,z
[0, 1, 2, 3]
Writing html report to: dials.cosym.html
Writing json to: dials.cosym.json
Saving reindexed experiments to symmetrized.expt
Saving reindexed reflections to symmetrized.refl
As you can see, the \(P\,4/m\,m\,m\) Patterson group is found with the highest confidence. For the corresponding space group, the mirror symmetries are removed to give \(P\,4\,2\,2\), as the chiral nature of macromolecules means we have a restricted choice of space groups. In this example, all datasets were indexed consistently, but this is not the case in general.
Next, the data can be scaled:
dials.scale symmetrized.expt symmetrized.refl
From the merging statistics it is clear that the data quality is good out to the furthest resolution (\(CC_{1/2} > 0.3\)), which can be confirmed by a resolution analysis:
dials.resolutionizer scaled.expt scaled.refl
Resolution cc_half: 1.80
The following parameters have been modified:
input {
experiments = scaled.expt
reflections = scaled.refl
}
DIALS 2.2.9-g061426e04-release
Detected existence of a multi-dataset reflection table
containing 4 datasets.
Read 76079 predicted reflections
Selected 53256 scaled reflections
Combined 68 partial reflections with other partial reflections
Read 75607 predicted reflections
Selected 53551 scaled reflections
Combined 77 partial reflections with other partial reflections
Read 77983 predicted reflections
Selected 53063 scaled reflections
Combined 106 partial reflections with other partial reflections
Read 76468 predicted reflections
Selected 52749 scaled reflections
Combined 69 partial reflections with other partial reflections
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Resolution cc_half: 1.80
Resolution I/sig: 1.80
Resolution Mn(I/sig): 1.80
If the resolution limit was lower than the extent of the data, scaling would be rerun with a new resolution limit, for example:
dials.scale scaled.expt scaled.refl d_min=1.80
DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
cut_data {
d_min = 1.8
}
input {
experiments = scaled.expt
reflections = scaled.refl
}
Checking for the existence of a reflection table
containing multiple datasets
Detected existence of a multi-dataset reflection table
containing 4 datasets.
Found 4 reflection tables & 4 experiments in total.
Dataset ids are: 0,1,2,3
Space group being used during scaling is P 4 2 2
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Scaling models have been initialised for all experiments.
================================================================================
The experiment id for this dataset is 0.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22812/76079 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13656
criterion: excluded for scaling, reflections: 22812
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
The experiment id for this dataset is 1.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22043/75607 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13186
criterion: excluded for scaling, reflections: 22041
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
The experiment id for this dataset is 2.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 24879/77983 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 15473
criterion: excluded for scaling, reflections: 24878
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
The experiment id for this dataset is 3.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 23621/76468 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13968
criterion: excluded for scaling, reflections: 23620
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
Configuring a MultiScaler to handle the individual Scalers.
Determining symmetry equivalent reflections across datasets.
Using quasi-random reflection selection. Selecting from 16683 symmetry groups
with <I/sI> > 1.0 (197826 reflections)). Selection target of 717.07 reflections
from each dataset, with a total number between 11473.09 and 13767.71.
Summary of cross-dataset reflection groups chosen (1331 groups, 13804 reflections):
+---------------+------------+----------+-----+-----+-----+-----+
| d-range | n_groups | n_refl | 0 | 1 | 2 | 3 |
|---------------+------------+----------+-----+-----+-----+-----|
| 68.428 - 8.11 | 42 | 689 | 105 | 203 | 178 | 203 |
| 8.11 - 5.755 | 31 | 724 | 71 | 242 | 197 | 214 |
| 5.755 - 4.704 | 33 | 813 | 72 | 271 | 226 | 244 |
| 4.704 - 4.076 | 33 | 896 | 70 | 298 | 256 | 272 |
| 4.076 - 3.647 | 36 | 957 | 72 | 313 | 276 | 296 |
| 3.647 - 3.33 | 31 | 875 | 70 | 294 | 242 | 269 |
| 3.33 - 3.084 | 32 | 831 | 70 | 271 | 228 | 262 |
| 3.084 - 2.885 | 30 | 784 | 71 | 256 | 212 | 245 |
| 2.885 - 2.72 | 35 | 834 | 72 | 275 | 218 | 269 |
| 2.72 - 2.581 | 35 | 792 | 71 | 247 | 212 | 262 |
| 2.581 - 2.461 | 30 | 648 | 71 | 215 | 206 | 156 |
| 2.461 - 2.356 | 33 | 501 | 99 | 167 | 163 | 72 |
| 2.356 - 2.264 | 43 | 543 | 127 | 176 | 169 | 71 |
| 2.264 - 2.182 | 53 | 563 | 144 | 188 | 159 | 72 |
| 2.182 - 2.108 | 65 | 528 | 142 | 166 | 105 | 115 |
| 2.108 - 2.041 | 135 | 764 | 186 | 194 | 140 | 244 |
| 2.041 - 1.98 | 140 | 696 | 175 | 220 | 140 | 161 |
| 1.98 - 1.924 | 184 | 661 | 140 | 210 | 171 | 140 |
| 1.924 - 1.873 | 222 | 520 | 106 | 141 | 141 | 132 |
| 1.873 - 1.826 | 88 | 185 | 29 | 56 | 65 | 35 |
+---------------+------------+----------+-----+-----+-----+-----+
Summary of reflections chosen for minimisation from each dataset (52765 total):
+--------------+------------------+----------------------+----------------------+--------------------+
| Dataset id | reflections | randomly selected | randomly selected | combined number |
| | connected to | reflections | reflections | of reflections |
| | other datasets | within dataset | across datasets | |
|--------------+------------------+----------------------+----------------------+--------------------|
| 0 | 1963 | 4530 | 6335 | 11832 |
| 1 | 4403 | 4890 | 6406 | 13956 |
| 2 | 3704 | 4763 | 6508 | 13534 |
| 3 | 3734 | 4752 | 6379 | 13443 |
| total | 13804 | 18935 | 25628 | 52765 |
+--------------+------------------+----------------------+----------------------+--------------------+
Completed configuration of MultiScaler.
================================================================================
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer.
Time taken for refinement 5.53
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52718 | 1.8181 |
| 1 | 52718 | 1.8131 |
| 2 | 52718 | 1.8024 |
| 3 | 52718 | 1.7984 |
| 4 | 52718 | 1.7942 |
| 5 | 52718 | 1.7909 |
| 6 | 52718 | 1.7895 |
| 7 | 52718 | 1.7884 |
| 8 | 52718 | 1.7868 |
| 9 | 52718 | 1.7861 |
| 10 | 52718 | 1.7853 |
| 11 | 52718 | 1.7846 |
| 12 | 52718 | 1.7839 |
| 13 | 52718 | 1.7833 |
| 14 | 52718 | 1.7826 |
| 15 | 52718 | 1.7821 |
| 16 | 52718 | 1.7811 |
| 17 | 52718 | 1.7803 |
| 18 | 52718 | 1.7796 |
| 19 | 52718 | 1.779 |
| 20 | 52718 | 1.7787 |
| 21 | 52718 | 1.7782 |
| 22 | 52718 | 1.7768 |
| 23 | 52718 | 1.7763 |
| 24 | 52718 | 1.7756 |
| 25 | 52718 | 1.7747 |
| 26 | 52718 | 1.7746 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True
================================================================================
Scale factors determined during minimisation have now been
applied to all datasets.
Combined outlier rejection has been performed across multiple datasets,
2718 outliers have been identified.
Performing multi-dataset profile/summation intensity optimisation.
+-----------------+---------+---------+
| Combination | CC1/2 | Rmeas |
|-----------------+---------+---------|
| prf only | 0.99922 | 0.05458 |
| sum only | 0.9989 | 0.06199 |
| Imid = 856.68 | 0.99567 | 0.17552 |
| Imid = 11484.08 | 0.94959 | 0.29629 |
| Imid = 1148.41 | 0.99415 | 0.20362 |
| Imid = 114.84 | 0.99881 | 0.07134 |
+-----------------+---------+---------+
Profile intensities determined to be best for scaling.
Combined outlier rejection has been performed across multiple datasets,
2718 outliers have been identified.
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer.
Time taken for refinement 2.57
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52036 | 1.3209 |
| 1 | 52036 | 1.3195 |
| 2 | 52036 | 1.317 |
| 3 | 52036 | 1.3163 |
| 4 | 52036 | 1.3156 |
| 5 | 52036 | 1.3149 |
| 6 | 52036 | 1.3146 |
| 7 | 52036 | 1.3143 |
| 8 | 52036 | 1.3141 |
| 9 | 52036 | 1.3139 |
| 10 | 52036 | 1.3138 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True
================================================================================
Scale factors determined during minimisation have now been
applied to all datasets.
Combined outlier rejection has been performed across multiple datasets,
2663 outliers have been identified.
Performing a round of error model refinement.
Error model details:
Type: basic
Parameters: a = 0.95970, b = 0.04706
Error model formula: σ'² = a²(σ² + (bI)²)
estimated I/sigma asymptotic limit: 22.144
Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>) | n_refl | Uncorrected variance | Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10255.11 - 1823.56 | 1430 | 20.001 | 0.863 |
| 1823.56 - 1394.56 | 1430 | 14.371 | 0.977 |
| 1394.56 - 1175.87 | 1430 | 12.422 | 1.031 |
| 1175.87 - 924.15 | 2693 | 11.077 | 1.132 |
| 924.15 - 506.34 | 12232 | 7.606 | 1.221 |
| 506.34 - 277.42 | 20962 | 4.771 | 1.291 |
| 277.42 - 152.00 | 28611 | 3.017 | 1.296 |
| 152.00 - 83.28 | 31211 | 1.926 | 1.173 |
| 83.28 - 45.63 | 26310 | 1.395 | 1.056 |
| 45.63 - 24.99 | 16739 | 0.999 | 0.877 |
+--------------------------+----------+------------------------+----------------------+
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.
Time taken for refinement 11.18
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52048 | 0.89327 |
| 1 | 52048 | 0.8886 |
| 2 | 52048 | 0.88833 |
| 3 | 52048 | 0.88821 |
| 4 | 52048 | 0.88817 |
+--------+--------+----------+
RMSD no longer decreasing
================================================================================
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.
Time taken for refinement 4.37
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52048 | 0.88817 |
| 1 | 52048 | 0.88817 |
+--------+--------+----------+
RMSD no longer decreasing
================================================================================
Calculating error estimates of inverse scale factors.
Scale factors determined during minimisation have now been
applied to all datasets.
Combined outlier rejection has been performed across multiple datasets,
165 outliers have been identified.
Performing a round of error model refinement.
Error model details:
Type: basic
Parameters: a = 0.96121, b = 0.04702
Error model formula: σ'² = a²(σ² + (bI)²)
estimated I/sigma asymptotic limit: 22.125
Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>) | n_refl | Uncorrected variance | Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10316.01 - 1819.10 | 1430 | 19.463 | 0.837 |
| 1819.10 - 1390.90 | 1430 | 14.515 | 0.984 |
| 1390.90 - 1174.71 | 1430 | 12.975 | 1.066 |
| 1174.71 - 927.44 | 2649 | 11.192 | 1.142 |
| 927.44 - 507.84 | 12198 | 7.677 | 1.219 |
| 507.84 - 278.08 | 20951 | 4.779 | 1.286 |
| 278.08 - 152.27 | 28607 | 3.02 | 1.292 |
| 152.27 - 83.38 | 31265 | 1.934 | 1.17 |
| 83.38 - 45.66 | 26335 | 1.396 | 1.053 |
| 45.66 - 24.99 | 16765 | 1.008 | 0.879 |
+--------------------------+----------+------------------------+----------------------+
The reflection table variances have been adjusted to account for the
uncertainty in the scaling models for all datasets
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Total time taken: 32.1469s
================================================================================
Warning: Over half (51.19%) of model parameters have signficant
uncertainty (sigma/abs(parameter) > 0.5), which could indicate a
poorly-determined scaling problem or overparameterisation.
Summary of dataset partialities
+------------------+----------+
| Partiality (p) | n_refl |
|------------------+----------|
| all reflections | 306137 |
| p > 0.99 | 244580 |
| 0.5 < p < 0.99 | 4670 |
| 0.01 < p < 0.5 | 10499 |
| p < 0.01 | 46388 |
+------------------+----------+
Reflections below a partiality_cutoff of 0.4 are not considered for any
part of the scaling analysis or for the reporting of merging statistics.
Additionally, if applicable, only reflections with a min_partiality > 0.95
were considered for use when refining the scaling model.
----------Merging statistics----------
Resolution: 68.36 - 1.80
Observations: 212617
Unique reflections: 20140
Redundancy: 10.6
Completeness: 85.42%
Mean intensity: 131.1
Mean I/sigma(I): 29.2
R-merge: 0.056
R-meas: 0.058
R-pim: 0.015
Statistics by resolution bin:
d_max d_min #obs #uniq mult. %comp <I> <I/sI> r_mrg r_meas r_pim cc1/2 cc_ano
68.41 4.89 20721 1334 15.53 100.00 274.0 56.6 0.047 0.049 0.012 0.999* -0.015
4.89 3.88 20460 1244 16.45 100.00 414.7 60.3 0.046 0.047 0.011 0.999* -0.168
3.88 3.39 19968 1208 16.53 100.00 318.3 55.9 0.049 0.050 0.012 0.999* -0.046
3.39 3.08 20074 1189 16.88 100.00 224.3 50.3 0.054 0.055 0.013 0.999* -0.044
3.08 2.86 20271 1185 17.11 100.00 155.7 43.8 0.059 0.061 0.015 0.999* -0.000
2.86 2.69 20468 1191 17.19 100.00 125.9 40.7 0.065 0.067 0.016 0.998* -0.090
2.69 2.55 20171 1172 17.21 100.00 98.9 34.8 0.074 0.076 0.018 0.998* 0.015
2.55 2.44 16112 1167 13.81 100.00 89.2 29.3 0.077 0.080 0.021 0.998* -0.074
2.44 2.35 12116 1162 10.43 99.91 77.7 24.4 0.079 0.084 0.026 0.996* 0.008
2.35 2.27 10135 1180 8.59 99.92 76.4 21.7 0.082 0.087 0.029 0.995* 0.110*
2.27 2.20 8323 1150 7.24 99.65 66.2 17.8 0.084 0.091 0.033 0.995* -0.132
2.20 2.13 6830 1165 5.86 99.06 59.8 14.6 0.087 0.095 0.038 0.994* -0.016
2.13 2.08 5300 1116 4.75 97.04 52.1 11.7 0.091 0.102 0.045 0.992* -0.141
2.08 2.03 4021 1056 3.81 91.27 46.2 9.7 0.096 0.110 0.052 0.988* -0.057
2.03 1.98 3076 1001 3.07 86.14 39.1 7.7 0.102 0.122 0.064 0.984* 0.136
1.98 1.94 2060 896 2.30 77.85 36.9 6.4 0.095 0.118 0.069 0.979* -0.041
1.94 1.90 1163 700 1.66 61.03 31.7 4.8 0.115 0.149 0.094 0.963* 0.000
1.90 1.86 751 531 1.41 46.21 27.5 3.9 0.109 0.147 0.098 0.971* 0.000
1.86 1.83 464 366 1.27 31.88 25.1 3.4 0.100 0.139 0.096 0.963* 0.000
1.83 1.80 133 127 1.05 11.07 21.7 2.7 0.381 0.539 0.381 -0.331 0.000
68.36 1.80 212617 20140 10.56 85.42 131.1 29.2 0.056 0.058 0.015 0.999* -0.056
----------Resolution cutoff estimates----------
resolution of all data : 1.800
based on CC(1/2) >= 0.33 : 1.831
based on mean(I/sigma) >= 2.0 : 1.831
based on R-merge < 0.5 : 1.831
based on R-meas < 0.5 : 1.831
based on completeness >= 90% : 2.028
based on completeness >= 50% : 1.901
NOTE: we recommend using all data out to the CC(1/2) limit
for refinement.
Writing html report to: scaling.html
================================================================================
Saving the experiments to scaled.expt
Saving the scaled reflections to scaled.refl
See dials.github.io/dials_scale_user_guide.html for more info on scaling options
For exploring the scaling results, a wide variety of scaling and merging plots
can be found in the scaling.html
report generated by dials.scale
.
Almost there¶
As mentioned previously, the fourth dataset is giving significantly higher
R-merge values and much lower I/sigma.
Therefore the question one must ask is if it is better to exclude this dataset.
We can get some useful information about the agreement between datasets by
running the program dials.compute_delta_cchalf
. This program implements
a version of the algorithms described in Assmann et al. :
dials.compute_delta_cchalf scaled.refl scaled.expt
# Datasets: 4
# Groups: 4
# Reflections: 208216
# Unique reflections: 20132
CC 1/2 mean: 99.458
CC 1/2 excluding group 0: 99.463
CC 1/2 excluding group 1: 99.451
CC 1/2 excluding group 2: 99.395
CC 1/2 excluding group 3: 99.383
Dataset: 0, Delta CC 1/2: -0.005
Dataset: 1, Delta CC 1/2: 0.007
Dataset: 2, Delta CC 1/2: 0.063
Dataset: 3, Delta CC 1/2: 0.076
mean delta_cc_half 0.03535378806998335
stddev delta_cc_half 0.034706720651087486
cutoff value: -0.1034730945343666
Read 306137 predicted reflections
Selected 212617 scaled reflections
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Resolution bins
0: 68.363, 5.675
1: 5.675, 4.020
2: 4.020, 3.284
3: 3.284, 2.845
4: 2.845, 2.545
5: 2.545, 2.324
6: 2.324, 2.151
7: 2.151, 2.013
8: 2.013, 1.898
9: 1.898, 1.800
Summary of input data:
# Datasets: 4
# Groups: 4
# Reflections: 208216
# Unique reflections: 20132
CC 1/2 mean: 99.458
CC 1/2 excluding group 0: 99.463
CC 1/2 excluding group 1: 99.451
CC 1/2 excluding group 2: 99.395
CC 1/2 excluding group 3: 99.383
Dataset: 0, Delta CC 1/2: -0.005
Dataset: 1, Delta CC 1/2: 0.007
Dataset: 2, Delta CC 1/2: 0.063
Dataset: 3, Delta CC 1/2: 0.076
mean delta_cc_half 0.03535378806998335
stddev delta_cc_half 0.034706720651087486
cutoff value: -0.1034730945343666
Writing table to delta_cchalf.dat
Saving 306137 reflections to filtered.refl
Saving the experiments to filtered.expt
Writing html report to: compute_delta_cchalf.html
It looks like we could get a significantly better \(CC_{1/2}\) by excluding the final dataset - it has a negative \(\Delta CC_{1/2}\). But how bad is too bad that it warrants exclusion? Unfortunately this is a difficult question to answer and it may be the case that one would need to refine several structures with different data excluded to properly address this question. If we had many datasets and only a small fraction had a very large negative \(\Delta CC_{1/2}\) then one could argue that these measurements are not drawn from the same population as the rest of the data and should be excluded.
To see the effect of removing the last dataset (dataset ‘3’), we can rerun
dials.scale
(note that this will overwrite the previous scaled files):
dials.scale scaled.expt scaled.refl d_min=1.80
DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
cut_data {
d_min = 1.8
}
input {
experiments = scaled.expt
reflections = scaled.refl
}
Checking for the existence of a reflection table
containing multiple datasets
Detected existence of a multi-dataset reflection table
containing 4 datasets.
Found 4 reflection tables & 4 experiments in total.
Dataset ids are: 0,1,2,3
Space group being used during scaling is P 4 2 2
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Scaling models have been initialised for all experiments.
================================================================================
The experiment id for this dataset is 0.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22812/76079 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13656
criterion: excluded for scaling, reflections: 22812
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
The experiment id for this dataset is 1.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22043/75607 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13186
criterion: excluded for scaling, reflections: 22041
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
The experiment id for this dataset is 2.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 24879/77983 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 15473
criterion: excluded for scaling, reflections: 24878
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
The experiment id for this dataset is 3.
The scaling model type being applied is physical.
Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 23621/76468 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13968
criterion: excluded for scaling, reflections: 23620
The following corrections will be applied to this dataset:
+--------------+----------------+
| correction | n_parameters |
|--------------+----------------|
| scale | 10 |
| decay | 8 |
| absorption | 24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities
Completed preprocessing and initialisation for this dataset.
================================================================================
Configuring a MultiScaler to handle the individual Scalers.
Determining symmetry equivalent reflections across datasets.
Using quasi-random reflection selection. Selecting from 16683 symmetry groups
with <I/sI> > 1.0 (197826 reflections)). Selection target of 717.07 reflections
from each dataset, with a total number between 11473.09 and 13767.71.
Summary of cross-dataset reflection groups chosen (1331 groups, 13804 reflections):
+---------------+------------+----------+-----+-----+-----+-----+
| d-range | n_groups | n_refl | 0 | 1 | 2 | 3 |
|---------------+------------+----------+-----+-----+-----+-----|
| 68.428 - 8.11 | 42 | 689 | 105 | 203 | 178 | 203 |
| 8.11 - 5.755 | 31 | 724 | 71 | 242 | 197 | 214 |
| 5.755 - 4.704 | 33 | 813 | 72 | 271 | 226 | 244 |
| 4.704 - 4.076 | 33 | 896 | 70 | 298 | 256 | 272 |
| 4.076 - 3.647 | 36 | 957 | 72 | 313 | 276 | 296 |
| 3.647 - 3.33 | 31 | 875 | 70 | 294 | 242 | 269 |
| 3.33 - 3.084 | 32 | 831 | 70 | 271 | 228 | 262 |
| 3.084 - 2.885 | 30 | 784 | 71 | 256 | 212 | 245 |
| 2.885 - 2.72 | 35 | 834 | 72 | 275 | 218 | 269 |
| 2.72 - 2.581 | 35 | 792 | 71 | 247 | 212 | 262 |
| 2.581 - 2.461 | 30 | 648 | 71 | 215 | 206 | 156 |
| 2.461 - 2.356 | 33 | 501 | 99 | 167 | 163 | 72 |
| 2.356 - 2.264 | 43 | 543 | 127 | 176 | 169 | 71 |
| 2.264 - 2.182 | 53 | 563 | 144 | 188 | 159 | 72 |
| 2.182 - 2.108 | 65 | 528 | 142 | 166 | 105 | 115 |
| 2.108 - 2.041 | 135 | 764 | 186 | 194 | 140 | 244 |
| 2.041 - 1.98 | 140 | 696 | 175 | 220 | 140 | 161 |
| 1.98 - 1.924 | 184 | 661 | 140 | 210 | 171 | 140 |
| 1.924 - 1.873 | 222 | 520 | 106 | 141 | 141 | 132 |
| 1.873 - 1.826 | 88 | 185 | 29 | 56 | 65 | 35 |
+---------------+------------+----------+-----+-----+-----+-----+
Summary of reflections chosen for minimisation from each dataset (52765 total):
+--------------+------------------+----------------------+----------------------+--------------------+
| Dataset id | reflections | randomly selected | randomly selected | combined number |
| | connected to | reflections | reflections | of reflections |
| | other datasets | within dataset | across datasets | |
|--------------+------------------+----------------------+----------------------+--------------------|
| 0 | 1963 | 4530 | 6335 | 11832 |
| 1 | 4403 | 4890 | 6406 | 13956 |
| 2 | 3704 | 4763 | 6508 | 13534 |
| 3 | 3734 | 4752 | 6379 | 13443 |
| total | 13804 | 18935 | 25628 | 52765 |
+--------------+------------------+----------------------+----------------------+--------------------+
Completed configuration of MultiScaler.
================================================================================
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer.
Time taken for refinement 5.54
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52719 | 1.78 |
| 1 | 52719 | 1.7766 |
| 2 | 52719 | 1.7716 |
| 3 | 52719 | 1.7689 |
| 4 | 52719 | 1.7666 |
| 5 | 52719 | 1.7651 |
| 6 | 52719 | 1.7641 |
| 7 | 52719 | 1.7633 |
| 8 | 52719 | 1.763 |
| 9 | 52719 | 1.7625 |
| 10 | 52719 | 1.7623 |
| 11 | 52719 | 1.7621 |
| 12 | 52719 | 1.7617 |
| 13 | 52719 | 1.7615 |
| 14 | 52719 | 1.7612 |
| 15 | 52719 | 1.7609 |
| 16 | 52719 | 1.7607 |
| 17 | 52719 | 1.7602 |
| 18 | 52719 | 1.7597 |
| 19 | 52719 | 1.7594 |
| 20 | 52719 | 1.7591 |
| 21 | 52719 | 1.7588 |
| 22 | 52719 | 1.7584 |
| 23 | 52719 | 1.7579 |
| 24 | 52719 | 1.7576 |
| 25 | 52719 | 1.7572 |
| 26 | 52719 | 1.757 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True
================================================================================
Scale factors determined during minimisation have now been
applied to all datasets.
Combined outlier rejection has been performed across multiple datasets,
2719 outliers have been identified.
Performing multi-dataset profile/summation intensity optimisation.
+-----------------+---------+---------+
| Combination | CC1/2 | Rmeas |
|-----------------+---------+---------|
| prf only | 0.99924 | 0.05463 |
| sum only | 0.99896 | 0.06206 |
| Imid = 856.55 | 0.99578 | 0.17488 |
| Imid = 11484.08 | 0.94353 | 0.29657 |
| Imid = 1148.41 | 0.99443 | 0.20296 |
| Imid = 114.84 | 0.99875 | 0.07129 |
+-----------------+---------+---------+
Profile intensities determined to be best for scaling.
Combined outlier rejection has been performed across multiple datasets,
2719 outliers have been identified.
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer.
Time taken for refinement 2.21
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52037 | 1.3217 |
| 1 | 52037 | 1.3202 |
| 2 | 52037 | 1.3179 |
| 3 | 52037 | 1.3165 |
| 4 | 52037 | 1.3156 |
| 5 | 52037 | 1.3152 |
| 6 | 52037 | 1.3148 |
| 7 | 52037 | 1.3146 |
| 8 | 52037 | 1.3145 |
| 9 | 52037 | 1.3144 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True
================================================================================
Scale factors determined during minimisation have now been
applied to all datasets.
Combined outlier rejection has been performed across multiple datasets,
2634 outliers have been identified.
Performing a round of error model refinement.
Error model details:
Type: basic
Parameters: a = 0.95808, b = 0.04714
Error model formula: σ'² = a²(σ² + (bI)²)
estimated I/sigma asymptotic limit: 22.144
Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>) | n_refl | Uncorrected variance | Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10239.19 - 1825.33 | 1430 | 19.924 | 0.853 |
| 1825.33 - 1395.21 | 1430 | 14.519 | 0.999 |
| 1395.21 - 1176.43 | 1430 | 12.339 | 1.033 |
| 1176.43 - 923.30 | 2718 | 11.023 | 1.129 |
| 923.30 - 505.95 | 12215 | 7.525 | 1.215 |
| 505.95 - 277.25 | 20953 | 4.748 | 1.286 |
| 277.25 - 151.93 | 28586 | 3.005 | 1.297 |
| 151.93 - 83.26 | 31227 | 1.922 | 1.172 |
| 83.26 - 45.62 | 26306 | 1.393 | 1.057 |
| 45.62 - 24.99 | 16730 | 1 | 0.88 |
+--------------------------+----------+------------------------+----------------------+
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.
Time taken for refinement 8.92
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52056 | 0.89461 |
| 1 | 52056 | 0.88928 |
| 2 | 52056 | 0.8891 |
| 3 | 52056 | 0.88906 |
+--------+--------+----------+
RMSD no longer decreasing
================================================================================
Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.
Time taken for refinement 4.37
Refinement steps:
+--------+--------+----------+
| Step | Nref | RMSD_I |
| | | (a.u) |
|--------+--------+----------|
| 0 | 52056 | 0.88906 |
| 1 | 52056 | 0.88905 |
+--------+--------+----------+
RMSD no longer decreasing
================================================================================
Calculating error estimates of inverse scale factors.
Scale factors determined during minimisation have now been
applied to all datasets.
Combined outlier rejection has been performed across multiple datasets,
163 outliers have been identified.
Performing a round of error model refinement.
Error model details:
Type: basic
Parameters: a = 0.96198, b = 0.04693
Error model formula: σ'² = a²(σ² + (bI)²)
estimated I/sigma asymptotic limit: 22.151
Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>) | n_refl | Uncorrected variance | Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10313.12 - 1820.17 | 1430 | 19.757 | 0.852 |
| 1820.17 - 1391.24 | 1430 | 14.419 | 0.979 |
| 1391.24 - 1174.70 | 1430 | 12.698 | 1.053 |
| 1174.70 - 927.28 | 2656 | 11.072 | 1.13 |
| 927.28 - 507.77 | 12186 | 7.621 | 1.218 |
| 507.77 - 278.05 | 20946 | 4.771 | 1.288 |
| 278.05 - 152.26 | 28602 | 3.018 | 1.293 |
| 152.26 - 83.37 | 31267 | 1.935 | 1.171 |
| 83.37 - 45.66 | 26333 | 1.396 | 1.052 |
| 45.66 - 24.99 | 16769 | 1.008 | 0.878 |
+--------------------------+----------+------------------------+----------------------+
The reflection table variances have been adjusted to account for the
uncertainty in the scaling models for all datasets
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Total time taken: 29.5517s
================================================================================
Warning: Over half (51.19%) of model parameters have signficant
uncertainty (sigma/abs(parameter) > 0.5), which could indicate a
poorly-determined scaling problem or overparameterisation.
Summary of dataset partialities
+------------------+----------+
| Partiality (p) | n_refl |
|------------------+----------|
| all reflections | 306137 |
| p > 0.99 | 244580 |
| 0.5 < p < 0.99 | 4670 |
| 0.01 < p < 0.5 | 10499 |
| p < 0.01 | 46388 |
+------------------+----------+
Reflections below a partiality_cutoff of 0.4 are not considered for any
part of the scaling analysis or for the reporting of merging statistics.
Additionally, if applicable, only reflections with a min_partiality > 0.95
were considered for use when refining the scaling model.
----------Merging statistics----------
Resolution: 68.36 - 1.80
Observations: 212619
Unique reflections: 20140
Redundancy: 10.6
Completeness: 85.42%
Mean intensity: 131.7
Mean I/sigma(I): 29.2
R-merge: 0.056
R-meas: 0.058
R-pim: 0.015
Statistics by resolution bin:
d_max d_min #obs #uniq mult. %comp <I> <I/sI> r_mrg r_meas r_pim cc1/2 cc_ano
68.41 4.89 20723 1334 15.53 100.00 275.1 56.6 0.047 0.049 0.012 0.999* -0.014
4.89 3.88 20459 1244 16.45 100.00 416.4 60.3 0.046 0.047 0.011 0.999* -0.183
3.88 3.39 19969 1208 16.53 100.00 319.6 55.9 0.049 0.050 0.012 0.999* -0.043
3.39 3.08 20075 1189 16.88 100.00 225.2 50.4 0.054 0.055 0.013 0.999* -0.043
3.08 2.86 20269 1185 17.10 100.00 156.3 43.8 0.059 0.061 0.015 0.999* -0.013
2.86 2.69 20468 1191 17.19 100.00 126.4 40.7 0.065 0.067 0.016 0.998* -0.089
2.69 2.55 20172 1172 17.21 100.00 99.3 34.8 0.074 0.076 0.018 0.998* 0.019
2.55 2.44 16112 1167 13.81 100.00 89.5 29.3 0.077 0.080 0.021 0.998* -0.074
2.44 2.35 12116 1162 10.43 99.91 77.9 24.4 0.079 0.084 0.026 0.996* 0.008
2.35 2.27 10135 1180 8.59 99.92 76.7 21.7 0.082 0.087 0.029 0.995* 0.111*
2.27 2.20 8323 1150 7.24 99.65 66.4 17.8 0.084 0.091 0.033 0.995* -0.132
2.20 2.13 6830 1165 5.86 99.06 60.1 14.6 0.087 0.095 0.038 0.994* -0.017
2.13 2.08 5300 1116 4.75 97.04 52.3 11.7 0.091 0.102 0.045 0.992* -0.143
2.08 2.03 4021 1056 3.81 91.27 46.3 9.7 0.096 0.110 0.052 0.988* -0.056
2.03 1.98 3076 1001 3.07 86.14 39.2 7.7 0.102 0.122 0.064 0.984* 0.131
1.98 1.94 2060 896 2.30 77.85 37.1 6.4 0.095 0.118 0.069 0.979* -0.033
1.94 1.90 1163 700 1.66 61.03 31.8 4.8 0.115 0.149 0.094 0.963* 0.000
1.90 1.86 751 531 1.41 46.21 27.6 3.9 0.109 0.147 0.098 0.971* 0.000
1.86 1.83 464 366 1.27 31.88 25.2 3.4 0.100 0.139 0.096 0.964* 0.000
1.83 1.80 133 127 1.05 11.07 21.8 2.7 0.382 0.540 0.382 -0.330 0.000
68.36 1.80 212619 20140 10.56 85.42 131.7 29.2 0.056 0.058 0.015 0.999* -0.040
----------Resolution cutoff estimates----------
resolution of all data : 1.800
based on CC(1/2) >= 0.33 : 1.831
based on mean(I/sigma) >= 2.0 : 1.831
based on R-merge < 0.5 : 1.831
based on R-meas < 0.5 : 1.831
based on completeness >= 90% : 2.028
based on completeness >= 50% : 1.901
NOTE: we recommend using all data out to the CC(1/2) limit
for refinement.
Writing html report to: scaling.html
================================================================================
Saving the experiments to scaled.expt
Saving the scaled reflections to scaled.refl
See dials.github.io/dials_scale_user_guide.html for more info on scaling options
The overall merging statistics look significantly improved and therefore one would probably proceed with the first three datasets:
Resolution: 68.40 - 1.78 > 68.40 - 1.79
Observations: 222563 > 166095
Unique reflections: 16534 > 16285
Redundancy: 13.5 > 10.2
Completeness: 68.18% > 67.56%
Mean intensity: 45.3 > 46.0
Mean I/sigma(I): 25.0 > 26.1
R-merge: 0.132 > 0.059
R-meas: 0.136 > 0.062
R-pim: 0.033 > 0.017
We could have also excluded a subset of images, for example using the option
exclude_images=3:301:600
to exclude the last 300 images of dataset 3.
This option could be used to exclude the end of a dataset that was showing
sigificant radiation damage, or if the crystal had moved out of the beam part-way
through the measurement.
It is also worth checking the assigned space group using dials.symmetry
.
In dials.cosym
, only the Laue/Patterson group was tested to determine a space
group of \(P\,4\,2\,2\). However, a number of other MX space groups are possible for the
Laue group (due to the possibility of screw-axes), such as \(P\,4\,2_1\,2\),
\(P\,4_1\,2\,2\) etc. The screw-axes tests are performed by dials.symmetry
, and we can disable the
Laue group testing as we are already confident about this:
dials.symmetry scaled.expt scaled.refl laue_group=None
Read 76079 predicted reflections
Selected 53251 scaled reflections
Combined 67 partial reflections with other partial reflections
Read 75607 predicted reflections
Selected 53549 scaled reflections
Combined 77 partial reflections with other partial reflections
Read 77983 predicted reflections
Selected 53055 scaled reflections
Combined 106 partial reflections with other partial reflections
Read 76468 predicted reflections
Selected 52764 scaled reflections
Combined 69 partial reflections with other partial reflections
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Resolution estimate from <I>/<σ(I)> > 4.0 : 1.83
Resolution estimate from CC½ > 0.60: 1.80
Performing systematic absence checks on scaled data
Read 306137 predicted reflections
Selected 212619 scaled reflections
Removed 1 reflections with d < 1.80
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Laue group: P 4/m m m
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
| Screw axis | Score | No. present | No. absent | <I> present | <I> absent | <I/sig> present | <I/sig> absent |
|--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------|
| 41c | 1 | 11 | 33 | 621.629 | 0.112 | 34.285 | 0.397 |
| 21a | 1 | 14 | 14 | 827.13 | 0.449 | 22.739 | 1.01 |
| 42c | 1 | 22 | 22 | 310.674 | 0.309 | 17.47 | 0.268 |
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
+---------------+---------+
| Space group | score |
|---------------+---------|
| P 4 2 2 | 0 |
| P 4 21 2 | 0 |
| P 41 2 2 | 0 |
| P 42 2 2 | 0 |
| P 41 21 2 | 1 |
| P 42 21 2 | 0 |
+---------------+---------+
Recommended space group: P 41 21 2
Space group with equivalent score (enantiomorphic pair): P 43 21 2
DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
laue_group = None
input {
experiments = scaled.expt
reflections = scaled.refl
}
Detected existence of a multi-dataset reflection table
containing 4 datasets.
================================================================================
Analysing systematic absences
Laue group: P 4/m m m
Read 76079 predicted reflections
Selected 53251 scaled reflections
Combined 67 partial reflections with other partial reflections
Read 75607 predicted reflections
Selected 53549 scaled reflections
Combined 77 partial reflections with other partial reflections
Read 77983 predicted reflections
Selected 53055 scaled reflections
Combined 106 partial reflections with other partial reflections
Read 76468 predicted reflections
Selected 52764 scaled reflections
Combined 69 partial reflections with other partial reflections
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Resolution estimate from <I>/<σ(I)> > 4.0 : 1.83
Resolution estimate from CC½ > 0.60: 1.80
Performing systematic absence checks on scaled data
Read 306137 predicted reflections
Selected 212619 scaled reflections
Removed 1 reflections with d < 1.80
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Laue group: P 4/m m m
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
| Screw axis | Score | No. present | No. absent | <I> present | <I> absent | <I/sig> present | <I/sig> absent |
|--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------|
| 41c | 1 | 11 | 33 | 621.629 | 0.112 | 34.285 | 0.397 |
| 21a | 1 | 14 | 14 | 827.13 | 0.449 | 22.739 | 1.01 |
| 42c | 1 | 22 | 22 | 310.674 | 0.309 | 17.47 | 0.268 |
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
+---------------+---------+
| Space group | score |
|---------------+---------|
| P 4 2 2 | 0 |
| P 4 21 2 | 0 |
| P 41 2 2 | 0 |
| P 42 2 2 | 0 |
| P 41 21 2 | 1 |
| P 42 21 2 | 0 |
+---------------+---------+
Recommended space group: P 41 21 2
Space group with equivalent score (enantiomorphic pair): P 43 21 2
Saving reindexed experiments to symmetrized.expt in space group P 41 21 2
Saving 306137 reindexed reflections to symmetrized.refl
By analysing the sets of reflections we expect to be present and absent, the existence of the \(4_1\) and \(2_1\) screw axes are confirmed, hence the space group is assigned as \(P\,4_1\,2_1\,2\). Note that we can do this analysis before or after scaling, as we only need to know the Laue group for scaling, however it is preferable to do this after scaling as outliers may have been removed by scaling.
Finally, we must merge the data and produce an MTZ file for downstream structure solution:
dials.merge symmetrized.expt symmetrized.refl
DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
input {
experiments = symmetrized.expt
reflections = symmetrized.refl
}
Merging scaled reflection data
Read 306137 predicted reflections
Selected 212619 scaled reflections
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Running systematic absences check
Laue group: P 4/m m m
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
| Screw axis | Score | No. present | No. absent | <I> present | <I> absent | <I/sig> present | <I/sig> absent |
|--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------|
| 41c | 1 | 11 | 33 | 621.629 | 0.112 | 34.285 | 0.397 |
| 21a | 1 | 14 | 14 | 827.13 | 0.449 | 22.739 | 1.01 |
| 42c | 1 | 22 | 22 | 310.674 | 0.309 | 17.47 | 0.268 |
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
+---------------+---------+
| Space group | score |
|---------------+---------|
| P 4 2 2 | 0 |
| P 4 21 2 | 0 |
| P 41 2 2 | 0 |
| P 42 2 2 | 0 |
| P 41 21 2 | 1 |
| P 42 21 2 | 0 |
+---------------+---------+
Recommended space group: P 41 21 2
Space group with equivalent score (enantiomorphic pair): P 43 21 2
Scaling input intensities via French-Wilson Method
Total number of rejected intensities 0
===================== Absolute scaling and Wilson analysis ====================
----------Maximum likelihood isotropic Wilson scaling----------
ML estimate of overall B value:
12.72 A**2
Estimated -log of scale factor:
-2.90
The overall B value ("Wilson B-factor", derived from the Wilson plot) gives
an isotropic approximation for the falloff of intensity as a function of
resolution. Note that this approximation may be misleading for anisotropic
data (where the crystal is poorly ordered along an axis). The Wilson B is
strongly correlated with refined atomic B-factors but these may differ by
a significant amount, especially if anisotropy is present.
----------Maximum likelihood anisotropic Wilson scaling----------
ML estimate of overall B_cart value:
11.92, 0.00, 0.00
11.92, 0.00
14.05
Equivalent representation as U_cif:
0.15, -0.00, -0.00
0.15, 0.00
0.18
Eigen analyses of B-cart:
-------------------------------------------------
| Eigenvector | Value | Vector |
-------------------------------------------------
| 1 | 14.053 | ( 0.00, 0.00, 1.00) |
| 2 | 11.920 | (-0.71, 0.71, -0.00) |
| 3 | 11.920 | ( 0.71, 0.71, -0.00) |
-------------------------------------------------
ML estimate of -log of scale factor:
-2.91
----------Anisotropy analyses----------
For the resolution shell spanning between 1.95 - 1.80 Angstrom,
the mean I/sigI is equal to 4.25. 52.9 % of these intensities have
an I/sigI > 3. When sorting these intensities by their anisotropic
correction factor and analysing the I/sigI behavior for this ordered
list, we can gauge the presence of 'anisotropy induced noise amplification'
in reciprocal space.
The quarter of Intensities *least* affected by the anisotropy correction show
<I/sigI> : 4.55e+00
Fraction of I/sigI > 3 : 5.69e-01 ( Z = 1.81 )
The quarter of Intensities *most* affected by the anisotropy correction show
<I/sigI> : 3.34e+00
Fraction of I/sigI > 3 : 4.08e-01 ( Z = 5.42 )
Z-scores are computed on the basis of a Bernoulli model assuming independence
of weak reflections with respect to anisotropy.
----------Wilson plot----------
The Wilson plot shows the falloff in intensity as a function in resolution;
this is used to calculate the overall B-factor ("Wilson B-factor") for the
data shown above. The expected plot is calculated based on analysis of
macromolecule structures in the PDB, and the distinctive appearance is due to
the non-random arrangement of atoms in the crystal. Some variation is
natural, but major deviations from the expected plot may indicate pathological
data (including ice rings, detector problems, or processing errors).
----------Mean intensity analyses----------
Inspired by: Morris et al. (2004). J. Synch. Rad.11, 56-59.
The following resolution shells are worrisome:
*** None ***
----------Possible outliers----------
Inspired by: Read, Acta Cryst. (1999). D55, 1759-1764
Acentric reflections:
None
Centric reflections:
-----------------------------------------------------------------------------------------------------
| Centric reflections |
|---------------------------------------------------------------------------------------------------|
| d_spacing | H K L | |E| | p(wilson) | p(extreme) |
|---------------------------------------------------------------------------------------------------|
| 2.628 | 26, 0, 1 | 4.31 | 1.66e-05 | 4.46e-02 |
-----------------------------------------------------------------------------------------------------
p(wilson) : 1-(erf[|E|/sqrt(2)])
p(extreme) : 1-(erf[|E|/sqrt(2)])^(n_acentrics)
p(wilson) is the probability that an E-value of the specified
value would be observed when it would selected at random from
the given data set.
p(extreme) is the probability that the largest |E| value is
larger or equal than the observed largest |E| value.
Both measures can be used for outlier detection. p(extreme)
takes into account the size of the dataset.
----------Ice ring related problems----------
The following statistics were obtained from ice-ring insensitive resolution
ranges:
mean bin z_score : 1.37
( rms deviation : 1.10 )
mean bin completeness : 0.86
( rms deviation : 0.28 )
The following table shows the Wilson plot Z-scores and completeness for
observed data in ice-ring sensitive areas. The expected relative intensity
is the theoretical intensity of crystalline ice at the given resolution.
Large z-scores and high completeness in these resolution ranges might
be a reason to re-assess your data processsing if ice rings were present.
-------------------------------------------------------------
| d_spacing | Expected rel. I | Data Z-score | Completeness |
-------------------------------------------------------------
| 3.897 | 1.000 | 2.86 | 1.00 |
| 3.669 | 0.750 | 1.17 | 1.00 |
| 3.441 | 0.530 | 3.41 | 0.99 |
| 2.671 | 0.170 | 1.10 | 1.00 |
| 2.249 | 0.390 | 1.36 | 0.97 |
| 2.072 | 0.300 | 0.99 | 0.77 |
| 1.948 | 0.040 | 1.51 | 0.43 |
| 1.918 | 0.180 | 0.14 | 0.31 |
| 1.883 | 0.030 | 0.56 | 0.24 |
-------------------------------------------------------------
Abnormalities in mean intensity or completeness at resolution ranges with a
relative ice ring intensity lower than 0.10 will be ignored.
No ice ring related problems detected.
If ice rings were present, the data does not look worse at ice ring related
d_spacings as compared to the rest of the data set.
----------Merging statistics----------
Resolution: 68.36 - 1.80
Observations: 208218
Unique reflections: 20132
Redundancy: 10.3
Completeness: 85.61%
Mean intensity: 131.8
Mean I/sigma(I): 28.5
R-merge: 0.055
R-meas: 0.058
R-pim: 0.015
Statistics by resolution bin:
d_max d_min #obs #uniq mult. %comp <I> <I/sI> r_mrg r_meas r_pim cc1/2 cc_ano
68.41 4.89 20280 1334 15.20 100.00 274.8 55.5 0.047 0.049 0.012 0.998* -0.128
4.89 3.88 19993 1244 16.07 100.00 415.9 59.1 0.045 0.047 0.011 0.999* -0.057
3.88 3.39 19494 1208 16.14 100.00 319.4 54.5 0.048 0.050 0.012 0.999* -0.018
3.39 3.08 19558 1189 16.45 100.00 225.4 48.9 0.053 0.055 0.013 0.999* -0.060
3.08 2.86 19743 1185 16.66 100.00 156.5 42.3 0.059 0.060 0.014 0.999* -0.069
2.86 2.69 19921 1191 16.73 100.00 126.6 39.2 0.064 0.066 0.016 0.998* -0.131
2.69 2.55 19693 1172 16.80 100.00 99.6 33.5 0.073 0.075 0.018 0.999* -0.001
2.55 2.44 15819 1167 13.56 100.00 90.1 28.4 0.075 0.078 0.021 0.998* -0.081
2.44 2.35 11841 1162 10.19 100.00 78.4 23.6 0.078 0.082 0.025 0.997* 0.002
2.35 2.27 9895 1180 8.39 100.00 77.4 21.0 0.078 0.083 0.028 0.997* 0.003
2.27 2.20 8269 1150 7.19 99.91 66.5 17.7 0.084 0.090 0.033 0.992* -0.137
2.20 2.13 6811 1164 5.85 98.98 60.1 14.5 0.087 0.095 0.038 0.994* -0.016
2.13 2.08 5282 1115 4.74 97.13 52.3 11.7 0.091 0.102 0.045 0.992* -0.067
2.08 2.03 4005 1056 3.79 91.43 46.3 9.7 0.095 0.110 0.053 0.987* -0.039
2.03 1.98 3058 999 3.06 85.97 39.2 7.7 0.102 0.122 0.064 0.983* 0.004
1.98 1.94 2052 895 2.29 77.89 37.1 6.4 0.095 0.118 0.069 0.979* 0.045
1.94 1.90 1161 699 1.66 60.99 31.7 4.8 0.115 0.149 0.094 0.964* 0.000
1.90 1.86 748 530 1.41 46.17 27.6 3.9 0.109 0.147 0.098 0.971* 0.000
1.86 1.83 463 365 1.27 31.82 25.2 3.4 0.100 0.139 0.096 0.964* 0.000
1.83 1.80 132 127 1.04 11.08 21.8 2.7 0.428 0.606 0.428 -0.378 0.000
68.36 1.80 208218 20132 10.34 85.61 131.8 28.5 0.055 0.058 0.015 0.999* -0.097
----------Resolution cutoff estimates----------
resolution of all data : 1.800
based on CC(1/2) >= 0.33 : 1.831
based on mean(I/sigma) >= 2.0 : 1.831
based on R-merge < 0.5 : 1.831
based on R-meas < 0.5 : 1.831
based on completeness >= 90% : 2.028
based on completeness >= 50% : 1.901
NOTE: we recommend using all data out to the CC(1/2) limit
for refinement.
Writing reflections to merged.mtz
Title: From dials.merge
Space group symbol from file: P41212
Space group number from file: 92
Space group from matrices: P 41 21 2 (No. 92)
Point group symbol from file: 422
Number of crystals: 1
Number of Miller indices: 20132
Resolution range: 68.3603 1.80033
History:
From DIALS 2.2.9-g061426e04-release, run on 2020-09-12 at 08:37:11 BST (2020-09
Crystal 1:
Name: XTAL
Project: AUTOMATIC
Id: 1
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
Number of datasets: 1
Dataset 1:
Name: NATIVE
Id: 1
Wavelength: 0.979493
Number of columns: 17
label #valid %valid min max type
H 20132 100.00% 0.00 36.00 H: index h,k,l
K 20132 100.00% 0.00 24.00 H: index h,k,l
L 20132 100.00% 0.00 56.00 H: index h,k,l
IMEAN 20132 100.00% -10.44 5402.43 J: intensity
SIGIMEAN 20132 100.00% 0.14 174.45 Q: standard deviation
I(+) 19874 98.72% -10.44 5402.43 K: I(+) or I(-)
SIGI(+) 19874 98.72% 0.14 174.45 M: standard deviation
I(-) 14163 70.35% -6.83 3534.70 K: I(+) or I(-)
SIGI(-) 14163 70.35% 0.40 77.31 M: standard deviation
N(+) 19874 98.72% 2.00 8.00 I: integer
N(-) 14163 70.35% 8.00 8.00 I: integer
F 20132 100.00% 0.34 73.21 F: amplitude
SIGF 20132 100.00% 0.04 1.51 Q: standard deviation
F(+) 19874 98.72% 0.34 73.21 G: F(+) or F(-)
SIGF(+) 19874 98.72% 0.04 1.51 L: standard deviation
F(-) 14163 70.35% 0.90 59.41 G: F(+) or F(-)
SIGF(-) 14163 70.35% 0.06 1.38 L: standard deviation
This merges the data and performs a truncation procedure, to give a merged MTZ file containing intensities and strictly-positive structure factors (Fs).