This documentation page refers to a previous release of DIALS (2.2).
Click here to go to the corresponding page for the latest version of DIALS

Multi-crystal symmetry analysis and scaling with DIALS¶

Introduction¶

Recent additions to DIALS and xia2 have enabled multi-crystal analysis to be performed following integration. These tools are particularly relevant for analysis of many partial-datasets, which may be the only practical way of performing data collections for certain crystals. After integration, the space group symmetry can be investigated by testing for the presence of symmetry operations relating the integrated intensities of groups of reflections - the program to perform this is analysis is dials.symmetry (with algorithms similar to those of the program Pointless). Another thing to consider is that for certain space groups (polar space groups), there is an inherent ambiguity in the way that the diffraction pattern can be indexed. In order to combine multiple datasets from these space groups, one must reindex all data to a consistent setting, which can be done with the program dials.cosym (see Gildea and Winter for details). Finally, the data must be scaled, to correct for experimental effects such as differences in crystal size/illuminated volume and radiation damage - this can be done with the program dials.scale (with algorithms similar to those of the program Aimless). After the data has been scaled, choices can then be made about applying a resolution limit to exclude certain regions of the data which may be negatively affected by radiation damage.

In this tutorial, we shall investigate a multi-crystal dataset collected on the VMXi beamline, Diamond’s automated facility for data collection from crystallisation experiments in-situ. The dataset consists of four repeats of a 60-degree rotation measurement on a crystal of Proteinase K, taken at different locations on the crystal. We shall start with the integrated reflections and experiments files generated by running the automated processing software xia2 with pipeline=dials. Have a look at the Processing in Detail tutorial if you want to know more about the different processing steps up to this point.

Note

To obtain the data for this tutorial you can run dials.data get vmxi_proteinase_k_sweeps. If you are at Diamond Light Source on BAG training then the data are already available. After typing module load bagtraining you’ll be moved to a working folder, with the data already located in the tutorial-data/ccp4/integrated_files subdirectory. The processing in this tutorial will produce quite a few files, so it’s recommended to make an move to new directory:

mkdir multi_crystal
cd multi_crystal

xia2.multiplex¶

The easiest way to run these tools for a multi-dataset analysis is through the program xia2.multiplex. This runs several DIALS programs, including the programs described above, while producing useful plots and output files.

To run xia2.multiplex, we must provide the path to the input integrated files from dials.integrate:

xia2.multiplex experiments_0.expt experiments_1.expt experiments_2.expt experiments_3.expt reflections_0.refl reflections_1.refl reflections_2.refl reflections_3.refl

DIALS 2.2.9-g061426e04-release
The following parameters have been modified:

input {
  experiments = experiments_0.expt
  experiments = experiments_1.expt
  experiments = experiments_2.expt
  experiments = experiments_3.expt
  reflections = reflections_0.refl
  reflections = reflections_1.refl
  reflections = reflections_2.refl
  reflections = reflections_3.refl
}

Selecting 4 experiments with profile-fitted reflections
Selecting 4 experiments with refined reflections
                       Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)

0 singletons:

Point group    a           b           c          alpha        beta         gamma       

1 cluster:

Cluster_id       N_xtals  Med_a         Med_b         Med_c         Med_alpha    Med_beta     Med_gamma   Delta(deg)
4 in P422.
cluster_1        4        68.36 (0.01 ) 68.36 (0.01 ) 103.95(0.02 ) 90.00 (0.00) 90.00 (0.00) 90.00 (0.00)
     P 4/m m m (No. 123)  68.36         68.36         103.95        90.00        90.00        90.00         0.0   


Standard deviations are in brackets.
Each cluster:
Input lattice count, with integration Bravais setting space group.
Cluster median with Niggli cell parameters (std dev in brackets).
Highest possible metric symmetry and unit cell using LePage (J Appl Cryst 1982, 15:255) method, maximum delta 3deg.
Using all data sets for subsequent analysis
Laue group determined by dials.cosym: P 4 2 2
Resolution limit: 1.80 (cc_half > 0.3)

Overall merging statistics:
+--------------------+--------------+------------------+-------------------+
|                    | Overall      | Low resolution   | High resolution   |
|--------------------+--------------+------------------+-------------------|
| Resolution (Å)     | 68.36 - 1.80 | 68.41 - 4.89     | 1.83 - 1.80       |
| Observations       | 212605       | 20714            | 133               |
| Unique reflections | 20140        | 1334             | 127               |
| Multiplicity       | 10.6         | 15.5             | 1.0               |
| Completeness       | 85.42%       | 100.00%          | 11.07%            |
| Mean I/σ(I)        | 28.8         | 55.6             | 2.7               |
| Rmerge             | 0.057        | 0.049            | 0.397             |
| Rmeas              | 0.059        | 0.050            | 0.562             |
| Rpim               | 0.016        | 0.013            | 0.397             |
| CC½                | 0.999        | 0.999            | -0.379            |
+--------------------+--------------+------------------+-------------------+

Resolution shells:
+------------------+----------+-------------+----------------+----------------+----------+---------------+----------+---------+--------+--------+---------+
| Resolution (Å)   |   N(obs) |   N(unique) |   Multiplicity |   Completeness |   Mean I |   Mean I/σ(I) |   Rmerge |   Rmeas |   Rpim | CC½    | CCano   |
|------------------+----------+-------------+----------------+----------------+----------+---------------+----------+---------+--------+--------+---------|
| 68.41 - 4.89     |    20714 |        1334 |          15.53 |         100    |    262.9 |          55.6 |    0.049 |   0.05  |  0.013 | 0.999* | -0.257  |
| 4.89 - 3.88      |    20458 |        1244 |          16.45 |         100    |    397.1 |          59.3 |    0.046 |   0.048 |  0.012 | 0.999* | -0.214  |
| 3.88 - 3.39      |    19966 |        1208 |          16.53 |         100    |    304.4 |          54.9 |    0.049 |   0.051 |  0.012 | 0.999* | -0.056  |
| 3.39 - 3.08      |    20074 |        1189 |          16.88 |         100    |    214.3 |          49.5 |    0.054 |   0.056 |  0.013 | 0.999* | -0.063  |
| 3.08 - 2.86      |    20270 |        1185 |          17.11 |         100    |    148.6 |          43.1 |    0.06  |   0.062 |  0.015 | 0.999* | 0.006   |
| 2.86 - 2.69      |    20467 |        1191 |          17.18 |         100    |    120.1 |          40.1 |    0.066 |   0.068 |  0.016 | 0.998* | -0.086  |
| 2.69 - 2.55      |    20171 |        1172 |          17.21 |         100    |     94.2 |          34.3 |    0.075 |   0.078 |  0.019 | 0.998* | -0.001  |
| 2.55 - 2.44      |    16109 |        1167 |          13.8  |         100    |     84.9 |          28.8 |    0.078 |   0.081 |  0.022 | 0.997* | -0.077  |
| 2.44 - 2.35      |    12118 |        1162 |          10.43 |          99.91 |     73.8 |          24   |    0.082 |   0.087 |  0.027 | 0.996* | -0.001  |
| 2.35 - 2.27      |    10137 |        1180 |           8.59 |          99.92 |     72.5 |          21.4 |    0.085 |   0.09  |  0.03  | 0.995* | 0.092*  |
| 2.27 - 2.20      |     8323 |        1150 |           7.24 |          99.65 |     62.7 |          17.6 |    0.087 |   0.094 |  0.035 | 0.995* | -0.129  |
| 2.20 - 2.13      |     6830 |        1165 |           5.86 |          99.06 |     56.7 |          14.3 |    0.091 |   0.1   |  0.04  | 0.993* | 0.002   |
| 2.13 - 2.08      |     5300 |        1116 |           4.75 |          97.04 |     49.4 |          11.5 |    0.096 |   0.107 |  0.047 | 0.991* | -0.126  |
| 2.08 - 2.03      |     4021 |        1056 |           3.81 |          91.27 |     43.7 |           9.6 |    0.101 |   0.115 |  0.055 | 0.986* | -0.041  |
| 2.03 - 1.98      |     3076 |        1001 |           3.07 |          86.14 |     36.9 |           7.6 |    0.109 |   0.13  |  0.068 | 0.981* | 0.030   |
| 1.98 - 1.94      |     2060 |         896 |           2.3  |          77.85 |     35   |           6.3 |    0.103 |   0.128 |  0.074 | 0.975* | -0.204  |
| 1.94 - 1.90      |     1163 |         700 |           1.66 |          61.03 |     30   |           4.8 |    0.12  |   0.156 |  0.098 | 0.958* | 0.000   |
| 1.90 - 1.86      |      751 |         531 |           1.41 |          46.21 |     26.1 |           3.9 |    0.113 |   0.151 |  0.1   | 0.967* | 0.000   |
| 1.86 - 1.83      |      464 |         366 |           1.27 |          31.88 |     23.8 |           3.4 |    0.116 |   0.161 |  0.111 | 0.955* | 0.000   |
| 1.83 - 1.80      |      133 |         127 |           1.05 |          11.07 |     20.6 |           2.7 |    0.397 |   0.562 |  0.397 | -0.379 | 0.000   |
+------------------+----------+-------------+----------------+----------------+----------+---------------+----------+---------+--------+--------+---------+
Space group determined by dials.symmetry: P 41 21 2

Intensity correlation clustering summary:
=========  ==============  ==========  ========  ==============  ==============
  Cluster    No. datasets  Datasets      Height    Multiplicity    Completeness
=========  ==============  ==========  ========  ==============  ==============
        1               2  0 3           0.0024             3.4            0.68
        2               2  1 2           0.0038             5.7            0.79
        3               4  0 1 2 3       0.0045             8.2            0.83
=========  ==============  ==========  ========  ==============  ==============

Cos(angle) clustering summary:
=========  ==============  ==========  ========  ==============  ==============
  Cluster    No. datasets  Datasets      Height    Multiplicity    Completeness
=========  ==============  ==========  ========  ==============  ==============
        1               2  1 2           1e-05              5.7            0.79
        2               3  1 2 3         0.0093             5.7            0.79
        3               4  0 1 2 3       0.032              8.2            0.83
=========  ==============  ==========  ========  ==============  ==============

This runs dials.cosym to analyse the Laue symmetry and reindex all datasets consistently, scales the data with dials.scale, calculates a resolution limit with dials.resolutionizer and reruns dials.scale with the determined resolution cutoff. The final dataset is exported to an unmerged mtz and a HTML report is generated. The easiest way to see the results is to open the HTML report in your browser of choice e.g.:

firefox xia2.multiplex.html

Provided is a summary of the merging statistics as well as several plots, please explore these for a few minutes now! This dataset results in good merging statistics, however if you navigate to the “Analysis by batch” tab in “All data”, you will see that the fourth dataset has poorer statistics compared to the others. Let’s repeat the processing manually to explore the different steps and address this issue.

Manual reprocessing¶

The first step is Laue/Patterson group analysis using dials.cosym:

dials.cosym experiments_0.expt experiments_1.expt experiments_2.expt experiments_3.expt reflections_0.refl reflections_1.refl reflections_2.refl reflections_3.refl

Scoring all possible sub-groups
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
| Patterson group   |     |   Likelihood |   NetZcc |   Zcc+ |   Zcc- |   delta | Reindex operator   |
|-------------------+-----+--------------+----------+--------+--------+---------+--------------------|
| P 4/m m m         | *** |            1 |     9.92 |   9.92 |   0    |       0 | -a,b,-c            |
| P m m m           |     |            0 |     0    |   9.93 |   9.92 |       0 | -a,b,-c            |
| C m m m           |     |            0 |     0    |   9.92 |   9.92 |       0 | a-b,a+b,c          |
| P 4/m             |     |            0 |    -0.01 |   9.92 |   9.93 |       0 | -a,b,-c            |
| C 1 2/m 1         |     |            0 |     0.01 |   9.93 |   9.92 |       0 | a-b,a+b,c          |
| P 1 2/m 1         |     |            0 |     0.01 |   9.93 |   9.92 |       0 | -a,b,-c            |
| P 1 2/m 1         |     |            0 |    -0    |   9.92 |   9.92 |       0 | -b,-a,-c           |
| P 1 2/m 1         |     |            0 |    -0    |   9.92 |   9.93 |       0 | -a,-c,-b           |
| C 1 2/m 1         |     |            0 |    -0.01 |   9.92 |   9.93 |       0 | a+b,-a+b,c         |
| P -1              |     |            0 |    -9.92 |   0    |   9.92 |       0 | -a,b,-c            |
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
Best solution: P 4/m m m
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
Reindex operator: -a,b,-c
Laue group probability: 1.000
Laue group confidence: 1.000
Space groups:
P 4 2 2
[0, 1, 2, 3]
Reindexing operators:
x,y,z
[0, 1, 2, 3]

DIALS 2.2.9-g061426e04-release
The following parameters have been modified:

input {
  experiments = experiments_0.expt
  experiments = experiments_1.expt
  experiments = experiments_2.expt
  experiments = experiments_3.expt
  reflections = reflections_0.refl
  reflections = reflections_1.refl
  reflections = reflections_2.refl
  reflections = reflections_3.refl
}

Hierarchical clustering of unit cells
Using Andrews-Bernstein distance from Andrews & Bernstein J Appl Cryst 47:346 (2014)
Distances have been calculated
                       Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)

0 singletons:

Point group    a           b           c          alpha        beta         gamma       

1 cluster:

Cluster_id       N_xtals  Med_a         Med_b         Med_c         Med_alpha    Med_beta     Med_gamma   Delta(deg)
4 in P422.
cluster_1        4        68.36 (0.01 ) 68.36 (0.01 ) 103.95(0.02 ) 90.00 (0.00) 90.00 (0.00) 90.00 (0.00)
     P 4/m m m (No. 123)  68.36         68.36         103.95        90.00        90.00        90.00         0.0   


Standard deviations are in brackets.
Each cluster:
Input lattice count, with integration Bravais setting space group.
Cluster median with Niggli cell parameters (std dev in brackets).
Highest possible metric symmetry and unit cell using LePage (J Appl Cryst 1982, 15:255) method, maximum delta 3deg.
Filtering reflections for dataset 0
Read 76079 predicted reflections
Selected 54367 reflections integrated by profile and summation methods
Combined 1127 partial reflections with other partial reflections
Removed 491 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 12 intensity.prf.value reflections with I/Sig(I) < -5
Filtering reflections for dataset 1
Read 75607 predicted reflections
Selected 54845 reflections integrated by profile and summation methods
Combined 1284 partial reflections with other partial reflections
Removed 554 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 10 intensity.prf.value reflections with I/Sig(I) < -5
Filtering reflections for dataset 2
Read 77983 predicted reflections
Selected 54461 reflections integrated by profile and summation methods
Combined 1404 partial reflections with other partial reflections
Removed 541 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 7 intensity.prf.value reflections with I/Sig(I) < -5
Filtering reflections for dataset 3
Read 76468 predicted reflections
Selected 53877 reflections integrated by profile and summation methods
Combined 1062 partial reflections with other partial reflections
Removed 514 reflections below partiality threshold
Removed 0 intensity.sum.value reflections with I/Sig(I) < -5
Removed 3 intensity.prf.value reflections with I/Sig(I) < -5
Patterson group: P 4/m m m

--------------------------------------------------------------------------------

Normalising intensities for dataset 1

ML estimate of overall B value:
   13.46 A**2
ML estimate of  -log of scale factor:
  -3.04

--------------------------------------------------------------------------------

Normalising intensities for dataset 2

ML estimate of overall B value:
   11.06 A**2
ML estimate of  -log of scale factor:
  -3.50

--------------------------------------------------------------------------------

Normalising intensities for dataset 3

ML estimate of overall B value:
   11.45 A**2
ML estimate of  -log of scale factor:
  -2.96

--------------------------------------------------------------------------------

Normalising intensities for dataset 4

ML estimate of overall B value:
   12.14 A**2
ML estimate of  -log of scale factor:
  -2.67

--------------------------------------------------------------------------------

Estimation of resolution for Laue group analysis

Resolution estimate from <I>/<σ(I)> > 4.0 : 2.10
Resolution estimate from CC½ > 0.60: 1.80
High resolution limit set to: 1.80
Selecting 188689 reflections with d > 1.80
================================================================================

Automatic determination of number of dimensions for analysis
+--------------+--------------+
|   Dimensions |   Functional |
|--------------+--------------|
|            1 |      14.5345 |
|            2 |      15.6716 |
|            3 |      14.608  |
|            4 |      15.0478 |
|            5 |      15.2777 |
|            6 |      14.7404 |
|            7 |      14.8183 |
|            8 |      14.8374 |
+--------------+--------------+
Best number of dimensions: 6
Using 6 dimensions for analysis
Principal component analysis:
Explained variance: 0.0022, 0.0016, 0.0013, 0.0011, 0.00094, 7.3e-05
Explained variance ratio: 0.31, 0.22, 0.18, 0.15, 0.13, 0.01
Scoring individual symmetry elements
+--------------+--------+------+-----+-----------------+
|   likelihood |   Z-CC |   CC |     | Operator        |
|--------------+--------+------+-----+-----------------|
|        0.947 |   9.93 | 0.99 | *** | 2 |(1, 1, 0)    |
|        0.946 |   9.92 | 0.99 | *** | 2 |(-1, 1, 0)   |
|        0.947 |   9.93 | 0.99 | *** | 2 |(0, 1, 0)    |
|        0.946 |   9.91 | 0.99 | *** | 4^-1 |(0, 0, 1) |
|        0.947 |   9.93 | 0.99 | *** | 4 |(0, 0, 1)    |
|        0.946 |   9.92 | 0.99 | *** | 2 |(0, 0, 1)    |
|        0.946 |   9.92 | 0.99 | *** | 2 |(1, 0, 0)    |
+--------------+--------+------+-----+-----------------+
Scoring all possible sub-groups
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
| Patterson group   |     |   Likelihood |   NetZcc |   Zcc+ |   Zcc- |   delta | Reindex operator   |
|-------------------+-----+--------------+----------+--------+--------+---------+--------------------|
| P 4/m m m         | *** |            1 |     9.92 |   9.92 |   0    |       0 | -a,b,-c            |
| P m m m           |     |            0 |     0    |   9.93 |   9.92 |       0 | -a,b,-c            |
| C m m m           |     |            0 |     0    |   9.92 |   9.92 |       0 | a-b,a+b,c          |
| P 4/m             |     |            0 |    -0.01 |   9.92 |   9.93 |       0 | -a,b,-c            |
| C 1 2/m 1         |     |            0 |     0.01 |   9.93 |   9.92 |       0 | a-b,a+b,c          |
| P 1 2/m 1         |     |            0 |     0.01 |   9.93 |   9.92 |       0 | -a,b,-c            |
| P 1 2/m 1         |     |            0 |    -0    |   9.92 |   9.92 |       0 | -b,-a,-c           |
| P 1 2/m 1         |     |            0 |    -0    |   9.92 |   9.93 |       0 | -a,-c,-b           |
| C 1 2/m 1         |     |            0 |    -0.01 |   9.92 |   9.93 |       0 | a+b,-a+b,c         |
| P -1              |     |            0 |    -9.92 |   0    |   9.92 |       0 | -a,b,-c            |
+-------------------+-----+--------------+----------+--------+--------+---------+--------------------+
Best solution: P 4/m m m
Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
Reindex operator: -a,b,-c
Laue group probability: 1.000
Laue group confidence: 1.000
Space groups:
P 4 2 2
[0, 1, 2, 3]
Reindexing operators:
x,y,z
[0, 1, 2, 3]
Writing html report to: dials.cosym.html
Writing json to: dials.cosym.json
Saving reindexed experiments to symmetrized.expt
Saving reindexed reflections to symmetrized.refl

As you can see, the \(P\,4/m\,m\,m\) Patterson group is found with the highest confidence. For the corresponding space group, the mirror symmetries are removed to give \(P\,4\,2\,2\), as the chiral nature of macromolecules means we have a restricted choice of space groups. In this example, all datasets were indexed consistently, but this is not the case in general.

Next, the data can be scaled:

dials.scale symmetrized.expt symmetrized.refl

From the merging statistics it is clear that the data quality is good out to the furthest resolution (\(CC_{1/2} > 0.3\)), which can be confirmed by a resolution analysis:

dials.resolutionizer scaled.expt scaled.refl

Resolution cc_half:      1.80

The following parameters have been modified:

input {
  experiments = scaled.expt
  reflections = scaled.refl
}

DIALS 2.2.9-g061426e04-release
Detected existence of a multi-dataset reflection table 
containing 4 datasets. 

Read 76079 predicted reflections
Selected 53256 scaled reflections
Combined 68 partial reflections with other partial reflections
Read 75607 predicted reflections
Selected 53551 scaled reflections
Combined 77 partial reflections with other partial reflections
Read 77983 predicted reflections
Selected 53063 scaled reflections
Combined 106 partial reflections with other partial reflections
Read 76468 predicted reflections
Selected 52749 scaled reflections
Combined 69 partial reflections with other partial reflections
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Resolution cc_half:      1.80
Resolution I/sig:        1.80
Resolution Mn(I/sig):    1.80

If the resolution limit was lower than the extent of the data, scaling would be rerun with a new resolution limit, for example:

dials.scale scaled.expt scaled.refl d_min=1.80

DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
cut_data {
  d_min = 1.8
}
input {
  experiments = scaled.expt
  reflections = scaled.refl
}

Checking for the existence of a reflection table 
containing multiple datasets 

Detected existence of a multi-dataset reflection table 
containing 4 datasets. 

Found 4 reflection tables & 4 experiments in total.

Dataset ids are: 0,1,2,3 

Space group being used during scaling is P 4 2 2
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)

Scaling models have been initialised for all experiments.

================================================================================

The experiment id for this dataset is 0.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22812/76079 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13656
criterion: excluded for scaling, reflections: 22812

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

The experiment id for this dataset is 1.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22043/75607 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13186
criterion: excluded for scaling, reflections: 22041

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

The experiment id for this dataset is 2.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 24879/77983 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 15473
criterion: excluded for scaling, reflections: 24878

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

The experiment id for this dataset is 3.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 23621/76468 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13968
criterion: excluded for scaling, reflections: 23620

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

Configuring a MultiScaler to handle the individual Scalers. 

Determining symmetry equivalent reflections across datasets.


Using quasi-random reflection selection. Selecting from 16683 symmetry groups
with <I/sI> > 1.0 (197826 reflections)). Selection target of 717.07 reflections
from each dataset, with a total number between 11473.09 and 13767.71.

Summary of cross-dataset reflection groups chosen (1331 groups, 13804 reflections):
+---------------+------------+----------+-----+-----+-----+-----+
| d-range       |   n_groups |   n_refl |   0 |   1 |   2 |   3 |
|---------------+------------+----------+-----+-----+-----+-----|
| 68.428 - 8.11 |         42 |      689 | 105 | 203 | 178 | 203 |
| 8.11 - 5.755  |         31 |      724 |  71 | 242 | 197 | 214 |
| 5.755 - 4.704 |         33 |      813 |  72 | 271 | 226 | 244 |
| 4.704 - 4.076 |         33 |      896 |  70 | 298 | 256 | 272 |
| 4.076 - 3.647 |         36 |      957 |  72 | 313 | 276 | 296 |
| 3.647 - 3.33  |         31 |      875 |  70 | 294 | 242 | 269 |
| 3.33 - 3.084  |         32 |      831 |  70 | 271 | 228 | 262 |
| 3.084 - 2.885 |         30 |      784 |  71 | 256 | 212 | 245 |
| 2.885 - 2.72  |         35 |      834 |  72 | 275 | 218 | 269 |
| 2.72 - 2.581  |         35 |      792 |  71 | 247 | 212 | 262 |
| 2.581 - 2.461 |         30 |      648 |  71 | 215 | 206 | 156 |
| 2.461 - 2.356 |         33 |      501 |  99 | 167 | 163 |  72 |
| 2.356 - 2.264 |         43 |      543 | 127 | 176 | 169 |  71 |
| 2.264 - 2.182 |         53 |      563 | 144 | 188 | 159 |  72 |
| 2.182 - 2.108 |         65 |      528 | 142 | 166 | 105 | 115 |
| 2.108 - 2.041 |        135 |      764 | 186 | 194 | 140 | 244 |
| 2.041 - 1.98  |        140 |      696 | 175 | 220 | 140 | 161 |
| 1.98 - 1.924  |        184 |      661 | 140 | 210 | 171 | 140 |
| 1.924 - 1.873 |        222 |      520 | 106 | 141 | 141 | 132 |
| 1.873 - 1.826 |         88 |      185 |  29 |  56 |  65 |  35 |
+---------------+------------+----------+-----+-----+-----+-----+
Summary of reflections chosen for minimisation from each dataset (52765 total):
+--------------+------------------+----------------------+----------------------+--------------------+
| Dataset id   |     reflections  |   randomly selected  |   randomly selected  |   combined number  |
|              |    connected to  |         reflections  |         reflections  |     of reflections |
|              |   other datasets |       within dataset |      across datasets |                    |
|--------------+------------------+----------------------+----------------------+--------------------|
| 0            |             1963 |                 4530 |                 6335 |              11832 |
| 1            |             4403 |                 4890 |                 6406 |              13956 |
| 2            |             3704 |                 4763 |                 6508 |              13534 |
| 3            |             3734 |                 4752 |                 6379 |              13443 |
| total        |            13804 |                18935 |                25628 |              52765 |
+--------------+------------------+----------------------+----------------------+--------------------+
Completed configuration of MultiScaler. 

================================================================================

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer. 

Time taken for refinement 5.53

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52718 |   1.8181 |
|      1 |  52718 |   1.8131 |
|      2 |  52718 |   1.8024 |
|      3 |  52718 |   1.7984 |
|      4 |  52718 |   1.7942 |
|      5 |  52718 |   1.7909 |
|      6 |  52718 |   1.7895 |
|      7 |  52718 |   1.7884 |
|      8 |  52718 |   1.7868 |
|      9 |  52718 |   1.7861 |
|     10 |  52718 |   1.7853 |
|     11 |  52718 |   1.7846 |
|     12 |  52718 |   1.7839 |
|     13 |  52718 |   1.7833 |
|     14 |  52718 |   1.7826 |
|     15 |  52718 |   1.7821 |
|     16 |  52718 |   1.7811 |
|     17 |  52718 |   1.7803 |
|     18 |  52718 |   1.7796 |
|     19 |  52718 |   1.779  |
|     20 |  52718 |   1.7787 |
|     21 |  52718 |   1.7782 |
|     22 |  52718 |   1.7768 |
|     23 |  52718 |   1.7763 |
|     24 |  52718 |   1.7756 |
|     25 |  52718 |   1.7747 |
|     26 |  52718 |   1.7746 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True

================================================================================

Scale factors determined during minimisation have now been
applied to all datasets.

Combined outlier rejection has been performed across multiple datasets, 
2718 outliers have been identified. 

Performing multi-dataset profile/summation intensity optimisation.
+-----------------+---------+---------+
| Combination     |   CC1/2 |   Rmeas |
|-----------------+---------+---------|
| prf only        | 0.99922 | 0.05458 |
| sum only        | 0.9989  | 0.06199 |
| Imid = 856.68   | 0.99567 | 0.17552 |
| Imid = 11484.08 | 0.94959 | 0.29629 |
| Imid = 1148.41  | 0.99415 | 0.20362 |
| Imid = 114.84   | 0.99881 | 0.07134 |
+-----------------+---------+---------+
Profile intensities determined to be best for scaling. 

Combined outlier rejection has been performed across multiple datasets, 
2718 outliers have been identified. 

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer. 

Time taken for refinement 2.57

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52036 |   1.3209 |
|      1 |  52036 |   1.3195 |
|      2 |  52036 |   1.317  |
|      3 |  52036 |   1.3163 |
|      4 |  52036 |   1.3156 |
|      5 |  52036 |   1.3149 |
|      6 |  52036 |   1.3146 |
|      7 |  52036 |   1.3143 |
|      8 |  52036 |   1.3141 |
|      9 |  52036 |   1.3139 |
|     10 |  52036 |   1.3138 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True

================================================================================

Scale factors determined during minimisation have now been
applied to all datasets.

Combined outlier rejection has been performed across multiple datasets, 
2663 outliers have been identified. 

Performing a round of error model refinement.

Error model details:
  Type: basic
  Parameters: a = 0.95970, b = 0.04706
  Error model formula: σ'² = a²(σ² + (bI)²)
  estimated I/sigma asymptotic limit: 22.144

Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>)   |   n_refl |   Uncorrected variance |   Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10255.11 - 1823.56       |     1430 |                 20.001 |                0.863 |
| 1823.56 - 1394.56        |     1430 |                 14.371 |                0.977 |
| 1394.56 - 1175.87        |     1430 |                 12.422 |                1.031 |
| 1175.87 - 924.15         |     2693 |                 11.077 |                1.132 |
| 924.15 - 506.34          |    12232 |                  7.606 |                1.221 |
| 506.34 - 277.42          |    20962 |                  4.771 |                1.291 |
| 277.42 - 152.00          |    28611 |                  3.017 |                1.296 |
| 152.00 - 83.28           |    31211 |                  1.926 |                1.173 |
| 83.28 - 45.63            |    26310 |                  1.395 |                1.056 |
| 45.63 - 24.99            |    16739 |                  0.999 |                0.877 |
+--------------------------+----------+------------------------+----------------------+

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.

Time taken for refinement 11.18

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52048 |  0.89327 |
|      1 |  52048 |  0.8886  |
|      2 |  52048 |  0.88833 |
|      3 |  52048 |  0.88821 |
|      4 |  52048 |  0.88817 |
+--------+--------+----------+
RMSD no longer decreasing

================================================================================

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.

Time taken for refinement 4.37

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52048 |  0.88817 |
|      1 |  52048 |  0.88817 |
+--------+--------+----------+
RMSD no longer decreasing

================================================================================

Calculating error estimates of inverse scale factors. 

Scale factors determined during minimisation have now been
applied to all datasets.

Combined outlier rejection has been performed across multiple datasets, 
165 outliers have been identified. 

Performing a round of error model refinement.

Error model details:
  Type: basic
  Parameters: a = 0.96121, b = 0.04702
  Error model formula: σ'² = a²(σ² + (bI)²)
  estimated I/sigma asymptotic limit: 22.125

Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>)   |   n_refl |   Uncorrected variance |   Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10316.01 - 1819.10       |     1430 |                 19.463 |                0.837 |
| 1819.10 - 1390.90        |     1430 |                 14.515 |                0.984 |
| 1390.90 - 1174.71        |     1430 |                 12.975 |                1.066 |
| 1174.71 - 927.44         |     2649 |                 11.192 |                1.142 |
| 927.44 - 507.84          |    12198 |                  7.677 |                1.219 |
| 507.84 - 278.08          |    20951 |                  4.779 |                1.286 |
| 278.08 - 152.27          |    28607 |                  3.02  |                1.292 |
| 152.27 - 83.38           |    31265 |                  1.934 |                1.17  |
| 83.38 - 45.66            |    26335 |                  1.396 |                1.053 |
| 45.66 - 24.99            |    16765 |                  1.008 |                0.879 |
+--------------------------+----------+------------------------+----------------------+


The reflection table variances have been adjusted to account for the
uncertainty in the scaling models for all datasets
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)

Total time taken: 32.1469s 

================================================================================

Warning: Over half (51.19%) of model parameters have signficant
uncertainty (sigma/abs(parameter) > 0.5), which could indicate a
poorly-determined scaling problem or overparameterisation.

Summary of dataset partialities
+------------------+----------+
| Partiality (p)   |   n_refl |
|------------------+----------|
| all reflections  |   306137 |
| p > 0.99         |   244580 |
| 0.5 < p < 0.99   |     4670 |
| 0.01 < p < 0.5   |    10499 |
| p < 0.01         |    46388 |
+------------------+----------+

Reflections below a partiality_cutoff of 0.4 are not considered for any
part of the scaling analysis or for the reporting of merging statistics.
Additionally, if applicable, only reflections with a min_partiality > 0.95
were considered for use when refining the scaling model.


                     ----------Merging statistics----------                    

Resolution: 68.36 - 1.80
Observations: 212617
Unique reflections: 20140
Redundancy: 10.6
Completeness: 85.42%
Mean intensity: 131.1
Mean I/sigma(I): 29.2
R-merge: 0.056
R-meas:  0.058
R-pim:   0.015


Statistics by resolution bin:
 d_max  d_min   #obs  #uniq   mult.  %comp       <I>  <I/sI>    r_mrg   r_meas    r_pim   cc1/2   cc_ano
 68.41   4.89  20721   1334   15.53 100.00     274.0    56.6    0.047    0.049    0.012   0.999*  -0.015
  4.89   3.88  20460   1244   16.45 100.00     414.7    60.3    0.046    0.047    0.011   0.999*  -0.168
  3.88   3.39  19968   1208   16.53 100.00     318.3    55.9    0.049    0.050    0.012   0.999*  -0.046
  3.39   3.08  20074   1189   16.88 100.00     224.3    50.3    0.054    0.055    0.013   0.999*  -0.044
  3.08   2.86  20271   1185   17.11 100.00     155.7    43.8    0.059    0.061    0.015   0.999*  -0.000
  2.86   2.69  20468   1191   17.19 100.00     125.9    40.7    0.065    0.067    0.016   0.998*  -0.090
  2.69   2.55  20171   1172   17.21 100.00      98.9    34.8    0.074    0.076    0.018   0.998*   0.015
  2.55   2.44  16112   1167   13.81 100.00      89.2    29.3    0.077    0.080    0.021   0.998*  -0.074
  2.44   2.35  12116   1162   10.43  99.91      77.7    24.4    0.079    0.084    0.026   0.996*   0.008
  2.35   2.27  10135   1180    8.59  99.92      76.4    21.7    0.082    0.087    0.029   0.995*   0.110*
  2.27   2.20   8323   1150    7.24  99.65      66.2    17.8    0.084    0.091    0.033   0.995*  -0.132
  2.20   2.13   6830   1165    5.86  99.06      59.8    14.6    0.087    0.095    0.038   0.994*  -0.016
  2.13   2.08   5300   1116    4.75  97.04      52.1    11.7    0.091    0.102    0.045   0.992*  -0.141
  2.08   2.03   4021   1056    3.81  91.27      46.2     9.7    0.096    0.110    0.052   0.988*  -0.057
  2.03   1.98   3076   1001    3.07  86.14      39.1     7.7    0.102    0.122    0.064   0.984*   0.136
  1.98   1.94   2060    896    2.30  77.85      36.9     6.4    0.095    0.118    0.069   0.979*  -0.041
  1.94   1.90   1163    700    1.66  61.03      31.7     4.8    0.115    0.149    0.094   0.963*   0.000
  1.90   1.86    751    531    1.41  46.21      27.5     3.9    0.109    0.147    0.098   0.971*   0.000
  1.86   1.83    464    366    1.27  31.88      25.1     3.4    0.100    0.139    0.096   0.963*   0.000
  1.83   1.80    133    127    1.05  11.07      21.7     2.7    0.381    0.539    0.381  -0.331   0.000
 68.36   1.80 212617  20140   10.56  85.42     131.1    29.2    0.056    0.058    0.015   0.999*  -0.056




                ----------Resolution cutoff estimates----------                

  resolution of all data          :   1.800
  based on CC(1/2) >= 0.33        :   1.831
  based on mean(I/sigma) >= 2.0   :   1.831
  based on R-merge < 0.5          :   1.831
  based on R-meas < 0.5           :   1.831
  based on completeness >= 90%    :   2.028
  based on completeness >= 50%    :   1.901

NOTE: we recommend using all data out to the CC(1/2) limit
for refinement.

Writing html report to: scaling.html

================================================================================

Saving the experiments to scaled.expt
Saving the scaled reflections to scaled.refl
See dials.github.io/dials_scale_user_guide.html for more info on scaling options

For exploring the scaling results, a wide variety of scaling and merging plots can be found in the scaling.html report generated by dials.scale.

Almost there¶

As mentioned previously, the fourth dataset is giving significantly higher R-merge values and much lower I/sigma. Therefore the question one must ask is if it is better to exclude this dataset. We can get some useful information about the agreement between datasets by running the program dials.compute_delta_cchalf. This program implements a version of the algorithms described in Assmann et al. :

dials.compute_delta_cchalf scaled.refl scaled.expt

# Datasets: 4
# Groups: 4
# Reflections: 208216
# Unique reflections: 20132
CC 1/2 mean: 99.458
CC 1/2 excluding group 0: 99.463
CC 1/2 excluding group 1: 99.451
CC 1/2 excluding group 2: 99.395
CC 1/2 excluding group 3: 99.383
Dataset: 0, Delta CC 1/2: -0.005
Dataset: 1, Delta CC 1/2: 0.007
Dataset: 2, Delta CC 1/2: 0.063
Dataset: 3, Delta CC 1/2: 0.076

mean delta_cc_half 0.03535378806998335
stddev delta_cc_half 0.034706720651087486
cutoff value: -0.1034730945343666 

Read 306137 predicted reflections
Selected 212617 scaled reflections
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Resolution bins
0: 68.363, 5.675
1: 5.675, 4.020
2: 4.020, 3.284
3: 3.284, 2.845
4: 2.845, 2.545
5: 2.545, 2.324
6: 2.324, 2.151
7: 2.151, 2.013
8: 2.013, 1.898
9: 1.898, 1.800

Summary of input data:
# Datasets: 4
# Groups: 4
# Reflections: 208216
# Unique reflections: 20132
CC 1/2 mean: 99.458
CC 1/2 excluding group 0: 99.463
CC 1/2 excluding group 1: 99.451
CC 1/2 excluding group 2: 99.395
CC 1/2 excluding group 3: 99.383
Dataset: 0, Delta CC 1/2: -0.005
Dataset: 1, Delta CC 1/2: 0.007
Dataset: 2, Delta CC 1/2: 0.063
Dataset: 3, Delta CC 1/2: 0.076

mean delta_cc_half 0.03535378806998335
stddev delta_cc_half 0.034706720651087486
cutoff value: -0.1034730945343666 

Writing table to delta_cchalf.dat
Saving 306137 reflections to filtered.refl
Saving the experiments to filtered.expt
Writing html report to: compute_delta_cchalf.html

It looks like we could get a significantly better \(CC_{1/2}\) by excluding the final dataset - it has a negative \(\Delta CC_{1/2}\). But how bad is too bad that it warrants exclusion? Unfortunately this is a difficult question to answer and it may be the case that one would need to refine several structures with different data excluded to properly address this question. If we had many datasets and only a small fraction had a very large negative \(\Delta CC_{1/2}\) then one could argue that these measurements are not drawn from the same population as the rest of the data and should be excluded.

To see the effect of removing the last dataset (dataset ‘3’), we can rerun dials.scale (note that this will overwrite the previous scaled files):

dials.scale scaled.expt scaled.refl d_min=1.80

DIALS 2.2.9-g061426e04-release
The following parameters have been modified:
cut_data {
  d_min = 1.8
}
input {
  experiments = scaled.expt
  reflections = scaled.refl
}

Checking for the existence of a reflection table 
containing multiple datasets 

Detected existence of a multi-dataset reflection table 
containing 4 datasets. 

Found 4 reflection tables & 4 experiments in total.

Dataset ids are: 0,1,2,3 

Space group being used during scaling is P 4 2 2
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)

Scaling models have been initialised for all experiments.

================================================================================

The experiment id for this dataset is 0.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22812/76079 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13656
criterion: excluded for scaling, reflections: 22812

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

The experiment id for this dataset is 1.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 22043/75607 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13186
criterion: excluded for scaling, reflections: 22041

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

The experiment id for this dataset is 2.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 24879/77983 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 15473
criterion: excluded for scaling, reflections: 24878

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

The experiment id for this dataset is 3.
The scaling model type being applied is physical. 

Applying filter of min_isigi > -5.0, partiality > 0.4
Excluding 23621/76468 reflections
Reflections passing individual criteria:
criterion: user excluded, reflections: 13968
criterion: excluded for scaling, reflections: 23620

The following corrections will be applied to this dataset: 

+--------------+----------------+
| correction   |   n_parameters |
|--------------+----------------|
| scale        |             10 |
| decay        |              8 |
| absorption   |             24 |
+--------------+----------------+
Using previously determined optimal intensity choice: profile intensities

Completed preprocessing and initialisation for this dataset.

================================================================================

Configuring a MultiScaler to handle the individual Scalers. 

Determining symmetry equivalent reflections across datasets.


Using quasi-random reflection selection. Selecting from 16683 symmetry groups
with <I/sI> > 1.0 (197826 reflections)). Selection target of 717.07 reflections
from each dataset, with a total number between 11473.09 and 13767.71.

Summary of cross-dataset reflection groups chosen (1331 groups, 13804 reflections):
+---------------+------------+----------+-----+-----+-----+-----+
| d-range       |   n_groups |   n_refl |   0 |   1 |   2 |   3 |
|---------------+------------+----------+-----+-----+-----+-----|
| 68.428 - 8.11 |         42 |      689 | 105 | 203 | 178 | 203 |
| 8.11 - 5.755  |         31 |      724 |  71 | 242 | 197 | 214 |
| 5.755 - 4.704 |         33 |      813 |  72 | 271 | 226 | 244 |
| 4.704 - 4.076 |         33 |      896 |  70 | 298 | 256 | 272 |
| 4.076 - 3.647 |         36 |      957 |  72 | 313 | 276 | 296 |
| 3.647 - 3.33  |         31 |      875 |  70 | 294 | 242 | 269 |
| 3.33 - 3.084  |         32 |      831 |  70 | 271 | 228 | 262 |
| 3.084 - 2.885 |         30 |      784 |  71 | 256 | 212 | 245 |
| 2.885 - 2.72  |         35 |      834 |  72 | 275 | 218 | 269 |
| 2.72 - 2.581  |         35 |      792 |  71 | 247 | 212 | 262 |
| 2.581 - 2.461 |         30 |      648 |  71 | 215 | 206 | 156 |
| 2.461 - 2.356 |         33 |      501 |  99 | 167 | 163 |  72 |
| 2.356 - 2.264 |         43 |      543 | 127 | 176 | 169 |  71 |
| 2.264 - 2.182 |         53 |      563 | 144 | 188 | 159 |  72 |
| 2.182 - 2.108 |         65 |      528 | 142 | 166 | 105 | 115 |
| 2.108 - 2.041 |        135 |      764 | 186 | 194 | 140 | 244 |
| 2.041 - 1.98  |        140 |      696 | 175 | 220 | 140 | 161 |
| 1.98 - 1.924  |        184 |      661 | 140 | 210 | 171 | 140 |
| 1.924 - 1.873 |        222 |      520 | 106 | 141 | 141 | 132 |
| 1.873 - 1.826 |         88 |      185 |  29 |  56 |  65 |  35 |
+---------------+------------+----------+-----+-----+-----+-----+
Summary of reflections chosen for minimisation from each dataset (52765 total):
+--------------+------------------+----------------------+----------------------+--------------------+
| Dataset id   |     reflections  |   randomly selected  |   randomly selected  |   combined number  |
|              |    connected to  |         reflections  |         reflections  |     of reflections |
|              |   other datasets |       within dataset |      across datasets |                    |
|--------------+------------------+----------------------+----------------------+--------------------|
| 0            |             1963 |                 4530 |                 6335 |              11832 |
| 1            |             4403 |                 4890 |                 6406 |              13956 |
| 2            |             3704 |                 4763 |                 6508 |              13534 |
| 3            |             3734 |                 4752 |                 6379 |              13443 |
| total        |            13804 |                18935 |                25628 |              52765 |
+--------------+------------------+----------------------+----------------------+--------------------+
Completed configuration of MultiScaler. 

================================================================================

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer. 

Time taken for refinement 5.54

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52719 |   1.78   |
|      1 |  52719 |   1.7766 |
|      2 |  52719 |   1.7716 |
|      3 |  52719 |   1.7689 |
|      4 |  52719 |   1.7666 |
|      5 |  52719 |   1.7651 |
|      6 |  52719 |   1.7641 |
|      7 |  52719 |   1.7633 |
|      8 |  52719 |   1.763  |
|      9 |  52719 |   1.7625 |
|     10 |  52719 |   1.7623 |
|     11 |  52719 |   1.7621 |
|     12 |  52719 |   1.7617 |
|     13 |  52719 |   1.7615 |
|     14 |  52719 |   1.7612 |
|     15 |  52719 |   1.7609 |
|     16 |  52719 |   1.7607 |
|     17 |  52719 |   1.7602 |
|     18 |  52719 |   1.7597 |
|     19 |  52719 |   1.7594 |
|     20 |  52719 |   1.7591 |
|     21 |  52719 |   1.7588 |
|     22 |  52719 |   1.7584 |
|     23 |  52719 |   1.7579 |
|     24 |  52719 |   1.7576 |
|     25 |  52719 |   1.7572 |
|     26 |  52719 |   1.757  |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True

================================================================================

Scale factors determined during minimisation have now been
applied to all datasets.

Combined outlier rejection has been performed across multiple datasets, 
2719 outliers have been identified. 

Performing multi-dataset profile/summation intensity optimisation.
+-----------------+---------+---------+
| Combination     |   CC1/2 |   Rmeas |
|-----------------+---------+---------|
| prf only        | 0.99924 | 0.05463 |
| sum only        | 0.99896 | 0.06206 |
| Imid = 856.55   | 0.99578 | 0.17488 |
| Imid = 11484.08 | 0.94353 | 0.29657 |
| Imid = 1148.41  | 0.99443 | 0.20296 |
| Imid = 114.84   | 0.99875 | 0.07129 |
+-----------------+---------+---------+
Profile intensities determined to be best for scaling. 

Combined outlier rejection has been performed across multiple datasets, 
2719 outliers have been identified. 

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with an LBFGS minimizer. 

Time taken for refinement 2.21

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52037 |   1.3217 |
|      1 |  52037 |   1.3202 |
|      2 |  52037 |   1.3179 |
|      3 |  52037 |   1.3165 |
|      4 |  52037 |   1.3156 |
|      5 |  52037 |   1.3152 |
|      6 |  52037 |   1.3148 |
|      7 |  52037 |   1.3146 |
|      8 |  52037 |   1.3145 |
|      9 |  52037 |   1.3144 |
+--------+--------+----------+
RMSD no longer decreasing
lbfgs minimizer stop: callback_after_step is True

================================================================================

Scale factors determined during minimisation have now been
applied to all datasets.

Combined outlier rejection has been performed across multiple datasets, 
2634 outliers have been identified. 

Performing a round of error model refinement.

Error model details:
  Type: basic
  Parameters: a = 0.95808, b = 0.04714
  Error model formula: σ'² = a²(σ² + (bI)²)
  estimated I/sigma asymptotic limit: 22.144

Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>)   |   n_refl |   Uncorrected variance |   Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10239.19 - 1825.33       |     1430 |                 19.924 |                0.853 |
| 1825.33 - 1395.21        |     1430 |                 14.519 |                0.999 |
| 1395.21 - 1176.43        |     1430 |                 12.339 |                1.033 |
| 1176.43 - 923.30         |     2718 |                 11.023 |                1.129 |
| 923.30 - 505.95          |    12215 |                  7.525 |                1.215 |
| 505.95 - 277.25          |    20953 |                  4.748 |                1.286 |
| 277.25 - 151.93          |    28586 |                  3.005 |                1.297 |
| 151.93 - 83.26           |    31227 |                  1.922 |                1.172 |
| 83.26 - 45.62            |    26306 |                  1.393 |                1.057 |
| 45.62 - 24.99            |    16730 |                  1     |                0.88  |
+--------------------------+----------+------------------------+----------------------+

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.

Time taken for refinement 8.92

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52056 |  0.89461 |
|      1 |  52056 |  0.88928 |
|      2 |  52056 |  0.8891  |
|      3 |  52056 |  0.88906 |
+--------+--------+----------+
RMSD no longer decreasing

================================================================================

Components to be refined in this cycle for all datasets: scale, decay, absorption
Performing a round of scaling with a Levenberg-Marquardt minimizer.

Time taken for refinement 4.37

Refinement steps:
+--------+--------+----------+
|   Step |   Nref |   RMSD_I |
|        |        |    (a.u) |
|--------+--------+----------|
|      0 |  52056 |  0.88906 |
|      1 |  52056 |  0.88905 |
+--------+--------+----------+
RMSD no longer decreasing

================================================================================

Calculating error estimates of inverse scale factors. 

Scale factors determined during minimisation have now been
applied to all datasets.

Combined outlier rejection has been performed across multiple datasets, 
163 outliers have been identified. 

Performing a round of error model refinement.

Error model details:
  Type: basic
  Parameters: a = 0.96198, b = 0.04693
  Error model formula: σ'² = a²(σ² + (bI)²)
  estimated I/sigma asymptotic limit: 22.151

Results of error model refinement. Uncorrected and corrected variances
of normalised intensity deviations for given intensity ranges. Variances
are expected to be ~1.0 for reliable errors (sigmas).
+--------------------------+----------+------------------------+----------------------+
| Intensity range (<Ih>)   |   n_refl |   Uncorrected variance |   Corrected variance |
|--------------------------+----------+------------------------+----------------------|
| 10313.12 - 1820.17       |     1430 |                 19.757 |                0.852 |
| 1820.17 - 1391.24        |     1430 |                 14.419 |                0.979 |
| 1391.24 - 1174.70        |     1430 |                 12.698 |                1.053 |
| 1174.70 - 927.28         |     2656 |                 11.072 |                1.13  |
| 927.28 - 507.77          |    12186 |                  7.621 |                1.218 |
| 507.77 - 278.05          |    20946 |                  4.771 |                1.288 |
| 278.05 - 152.26          |    28602 |                  3.018 |                1.293 |
| 152.26 - 83.37           |    31267 |                  1.935 |                1.171 |
| 83.37 - 45.66            |    26333 |                  1.396 |                1.052 |
| 45.66 - 24.99            |    16769 |                  1.008 |                0.878 |
+--------------------------+----------+------------------------+----------------------+


The reflection table variances have been adjusted to account for the
uncertainty in the scaling models for all datasets
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)

Total time taken: 29.5517s 

================================================================================

Warning: Over half (51.19%) of model parameters have signficant
uncertainty (sigma/abs(parameter) > 0.5), which could indicate a
poorly-determined scaling problem or overparameterisation.

Summary of dataset partialities
+------------------+----------+
| Partiality (p)   |   n_refl |
|------------------+----------|
| all reflections  |   306137 |
| p > 0.99         |   244580 |
| 0.5 < p < 0.99   |     4670 |
| 0.01 < p < 0.5   |    10499 |
| p < 0.01         |    46388 |
+------------------+----------+

Reflections below a partiality_cutoff of 0.4 are not considered for any
part of the scaling analysis or for the reporting of merging statistics.
Additionally, if applicable, only reflections with a min_partiality > 0.95
were considered for use when refining the scaling model.


                     ----------Merging statistics----------                    

Resolution: 68.36 - 1.80
Observations: 212619
Unique reflections: 20140
Redundancy: 10.6
Completeness: 85.42%
Mean intensity: 131.7
Mean I/sigma(I): 29.2
R-merge: 0.056
R-meas:  0.058
R-pim:   0.015


Statistics by resolution bin:
 d_max  d_min   #obs  #uniq   mult.  %comp       <I>  <I/sI>    r_mrg   r_meas    r_pim   cc1/2   cc_ano
 68.41   4.89  20723   1334   15.53 100.00     275.1    56.6    0.047    0.049    0.012   0.999*  -0.014
  4.89   3.88  20459   1244   16.45 100.00     416.4    60.3    0.046    0.047    0.011   0.999*  -0.183
  3.88   3.39  19969   1208   16.53 100.00     319.6    55.9    0.049    0.050    0.012   0.999*  -0.043
  3.39   3.08  20075   1189   16.88 100.00     225.2    50.4    0.054    0.055    0.013   0.999*  -0.043
  3.08   2.86  20269   1185   17.10 100.00     156.3    43.8    0.059    0.061    0.015   0.999*  -0.013
  2.86   2.69  20468   1191   17.19 100.00     126.4    40.7    0.065    0.067    0.016   0.998*  -0.089
  2.69   2.55  20172   1172   17.21 100.00      99.3    34.8    0.074    0.076    0.018   0.998*   0.019
  2.55   2.44  16112   1167   13.81 100.00      89.5    29.3    0.077    0.080    0.021   0.998*  -0.074
  2.44   2.35  12116   1162   10.43  99.91      77.9    24.4    0.079    0.084    0.026   0.996*   0.008
  2.35   2.27  10135   1180    8.59  99.92      76.7    21.7    0.082    0.087    0.029   0.995*   0.111*
  2.27   2.20   8323   1150    7.24  99.65      66.4    17.8    0.084    0.091    0.033   0.995*  -0.132
  2.20   2.13   6830   1165    5.86  99.06      60.1    14.6    0.087    0.095    0.038   0.994*  -0.017
  2.13   2.08   5300   1116    4.75  97.04      52.3    11.7    0.091    0.102    0.045   0.992*  -0.143
  2.08   2.03   4021   1056    3.81  91.27      46.3     9.7    0.096    0.110    0.052   0.988*  -0.056
  2.03   1.98   3076   1001    3.07  86.14      39.2     7.7    0.102    0.122    0.064   0.984*   0.131
  1.98   1.94   2060    896    2.30  77.85      37.1     6.4    0.095    0.118    0.069   0.979*  -0.033
  1.94   1.90   1163    700    1.66  61.03      31.8     4.8    0.115    0.149    0.094   0.963*   0.000
  1.90   1.86    751    531    1.41  46.21      27.6     3.9    0.109    0.147    0.098   0.971*   0.000
  1.86   1.83    464    366    1.27  31.88      25.2     3.4    0.100    0.139    0.096   0.964*   0.000
  1.83   1.80    133    127    1.05  11.07      21.8     2.7    0.382    0.540    0.382  -0.330   0.000
 68.36   1.80 212619  20140   10.56  85.42     131.7    29.2    0.056    0.058    0.015   0.999*  -0.040




                ----------Resolution cutoff estimates----------                

  resolution of all data          :   1.800
  based on CC(1/2) >= 0.33        :   1.831
  based on mean(I/sigma) >= 2.0   :   1.831
  based on R-merge < 0.5          :   1.831
  based on R-meas < 0.5           :   1.831
  based on completeness >= 90%    :   2.028
  based on completeness >= 50%    :   1.901

NOTE: we recommend using all data out to the CC(1/2) limit
for refinement.

Writing html report to: scaling.html

================================================================================

Saving the experiments to scaled.expt
Saving the scaled reflections to scaled.refl
See dials.github.io/dials_scale_user_guide.html for more info on scaling options

The overall merging statistics look significantly improved and therefore one would probably proceed with the first three datasets:

Resolution:         68.40 - 1.78  > 68.40 - 1.79
Observations:       222563        > 166095
Unique reflections: 16534         > 16285
Redundancy:         13.5          > 10.2
Completeness:       68.18%        > 67.56%
Mean intensity:     45.3          > 46.0
Mean I/sigma(I):    25.0          > 26.1
R-merge:            0.132         > 0.059
R-meas:             0.136         > 0.062
R-pim:              0.033         > 0.017

We could have also excluded a subset of images, for example using the option exclude_images=3:301:600 to exclude the last 300 images of dataset 3. This option could be used to exclude the end of a dataset that was showing sigificant radiation damage, or if the crystal had moved out of the beam part-way through the measurement.

It is also worth checking the assigned space group using dials.symmetry. In dials.cosym, only the Laue/Patterson group was tested to determine a space group of \(P\,4\,2\,2\). However, a number of other MX space groups are possible for the Laue group (due to the possibility of screw-axes), such as \(P\,4\,2_1\,2\), \(P\,4_1\,2\,2\) etc. The screw-axes tests are performed by dials.symmetry, and we can disable the Laue group testing as we are already confident about this:

dials.symmetry scaled.expt scaled.refl laue_group=None

Read 76079 predicted reflections
Selected 53251 scaled reflections
Combined 67 partial reflections with other partial reflections
Read 75607 predicted reflections
Selected 53549 scaled reflections
Combined 77 partial reflections with other partial reflections
Read 77983 predicted reflections
Selected 53055 scaled reflections
Combined 106 partial reflections with other partial reflections
Read 76468 predicted reflections
Selected 52764 scaled reflections
Combined 69 partial reflections with other partial reflections
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Resolution estimate from <I>/<σ(I)> > 4.0 : 1.83
Resolution estimate from CC½ > 0.60: 1.80
Performing systematic absence checks on scaled data
Read 306137 predicted reflections
Selected 212619 scaled reflections
Removed 1 reflections with d < 1.80
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Laue group: P 4/m m m
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
| Screw axis   |   Score |   No. present |   No. absent |   <I> present |   <I> absent |   <I/sig> present |   <I/sig> absent |
|--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------|
| 41c          |       1 |            11 |           33 |       621.629 |        0.112 |            34.285 |            0.397 |
| 21a          |       1 |            14 |           14 |       827.13  |        0.449 |            22.739 |            1.01  |
| 42c          |       1 |            22 |           22 |       310.674 |        0.309 |            17.47  |            0.268 |
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
+---------------+---------+
| Space group   |   score |
|---------------+---------|
| P 4 2 2       |       0 |
| P 4 21 2      |       0 |
| P 41 2 2      |       0 |
| P 42 2 2      |       0 |
| P 41 21 2     |       1 |
| P 42 21 2     |       0 |
+---------------+---------+
Recommended space group: P 41 21 2
Space group with equivalent score (enantiomorphic pair): P 43 21 2

DIALS 2.2.9-g061426e04-release
The following parameters have been modified:

laue_group = None
input {
  experiments = scaled.expt
  reflections = scaled.refl
}

Detected existence of a multi-dataset reflection table 
containing 4 datasets. 

================================================================================

Analysing systematic absences

Laue group: P 4/m m m
Read 76079 predicted reflections
Selected 53251 scaled reflections
Combined 67 partial reflections with other partial reflections
Read 75607 predicted reflections
Selected 53549 scaled reflections
Combined 77 partial reflections with other partial reflections
Read 77983 predicted reflections
Selected 53055 scaled reflections
Combined 106 partial reflections with other partial reflections
Read 76468 predicted reflections
Selected 52764 scaled reflections
Combined 69 partial reflections with other partial reflections
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Resolution estimate from <I>/<σ(I)> > 4.0 : 1.83
Resolution estimate from CC½ > 0.60: 1.80
Performing systematic absence checks on scaled data
Read 306137 predicted reflections
Selected 212619 scaled reflections
Removed 1 reflections with d < 1.80
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Laue group: P 4/m m m
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
| Screw axis   |   Score |   No. present |   No. absent |   <I> present |   <I> absent |   <I/sig> present |   <I/sig> absent |
|--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------|
| 41c          |       1 |            11 |           33 |       621.629 |        0.112 |            34.285 |            0.397 |
| 21a          |       1 |            14 |           14 |       827.13  |        0.449 |            22.739 |            1.01  |
| 42c          |       1 |            22 |           22 |       310.674 |        0.309 |            17.47  |            0.268 |
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
+---------------+---------+
| Space group   |   score |
|---------------+---------|
| P 4 2 2       |       0 |
| P 4 21 2      |       0 |
| P 41 2 2      |       0 |
| P 42 2 2      |       0 |
| P 41 21 2     |       1 |
| P 42 21 2     |       0 |
+---------------+---------+
Recommended space group: P 41 21 2
Space group with equivalent score (enantiomorphic pair): P 43 21 2
Saving reindexed experiments to symmetrized.expt in space group P 41 21 2
Saving 306137 reindexed reflections to symmetrized.refl

By analysing the sets of reflections we expect to be present and absent, the existence of the \(4_1\) and \(2_1\) screw axes are confirmed, hence the space group is assigned as \(P\,4_1\,2_1\,2\). Note that we can do this analysis before or after scaling, as we only need to know the Laue group for scaling, however it is preferable to do this after scaling as outliers may have been removed by scaling.

Finally, we must merge the data and produce an MTZ file for downstream structure solution:

dials.merge symmetrized.expt symmetrized.refl

DIALS 2.2.9-g061426e04-release
The following parameters have been modified:

input {
  experiments = symmetrized.expt
  reflections = symmetrized.refl
}


Merging scaled reflection data

Read 306137 predicted reflections
Selected 212619 scaled reflections
Combined 67 partial reflections with other partial reflections
Combined 77 partial reflections with other partial reflections
Combined 106 partial reflections with other partial reflections
Combined 69 partial reflections with other partial reflections
Removed 4082 reflections below partiality threshold
Using median unit cell across experiments : (68.3603, 68.3603, 103.953, 90, 90, 90)
Running systematic absences check
Laue group: P 4/m m m
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
| Screw axis   |   Score |   No. present |   No. absent |   <I> present |   <I> absent |   <I/sig> present |   <I/sig> absent |
|--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------|
| 41c          |       1 |            11 |           33 |       621.629 |        0.112 |            34.285 |            0.397 |
| 21a          |       1 |            14 |           14 |       827.13  |        0.449 |            22.739 |            1.01  |
| 42c          |       1 |            22 |           22 |       310.674 |        0.309 |            17.47  |            0.268 |
+--------------+---------+---------------+--------------+---------------+--------------+-------------------+------------------+
+---------------+---------+
| Space group   |   score |
|---------------+---------|
| P 4 2 2       |       0 |
| P 4 21 2      |       0 |
| P 41 2 2      |       0 |
| P 42 2 2      |       0 |
| P 41 21 2     |       1 |
| P 42 21 2     |       0 |
+---------------+---------+
Recommended space group: P 41 21 2
Space group with equivalent score (enantiomorphic pair): P 43 21 2

Scaling input intensities via French-Wilson Method
Total number of rejected intensities 0

===================== Absolute scaling and Wilson analysis ====================


        ----------Maximum likelihood isotropic Wilson scaling----------

 ML estimate of overall B value:
   12.72 A**2
 Estimated -log of scale factor:
   -2.90
 The overall B value ("Wilson B-factor", derived from the Wilson plot) gives
 an isotropic approximation for the falloff of intensity as a function of
 resolution.  Note that this approximation may be misleading for anisotropic
 data (where the crystal is poorly ordered along an axis).  The Wilson B is
 strongly correlated with refined atomic B-factors but these may differ by
 a significant amount, especially if anisotropy is present.

       ----------Maximum likelihood anisotropic Wilson scaling----------

ML estimate of overall B_cart value:
  11.92,  0.00,  0.00
         11.92,  0.00
                14.05

Equivalent representation as U_cif:
   0.15, -0.00, -0.00
          0.15,  0.00
                 0.18

Eigen analyses of B-cart:
  -------------------------------------------------
  | Eigenvector | Value   | Vector                |
  -------------------------------------------------
  | 1           |  14.053 | ( 0.00,  0.00,  1.00) |
  | 2           |  11.920 | (-0.71,  0.71, -0.00) |
  | 3           |  11.920 | ( 0.71,  0.71, -0.00) |
  -------------------------------------------------
ML estimate of  -log of scale factor:
  -2.91

                    ----------Anisotropy analyses----------


 For the resolution shell spanning between 1.95 - 1.80 Angstrom,
 the mean I/sigI is equal to  4.25. 52.9 % of these intensities have
 an I/sigI > 3. When sorting these intensities by their anisotropic
 correction factor and analysing the I/sigI behavior for this ordered
 list, we can gauge the presence of 'anisotropy induced noise amplification'
 in reciprocal space.

 The quarter of Intensities *least* affected by the anisotropy correction show

    <I/sigI>                 :   4.55e+00
    Fraction of I/sigI > 3   :   5.69e-01     ( Z =     1.81 )
  The quarter of Intensities *most* affected by the anisotropy correction show

    <I/sigI>                 :   3.34e+00
    Fraction of I/sigI > 3   :   4.08e-01     ( Z =     5.42 )
 Z-scores are computed on the basis of a Bernoulli model assuming independence
 of weak reflections with respect to anisotropy.

                        ----------Wilson plot----------

 The Wilson plot shows the falloff in intensity as a function in resolution;
 this is used to calculate the overall B-factor ("Wilson B-factor") for the
 data shown above.  The expected plot is calculated based on analysis of
 macromolecule structures in the PDB, and the distinctive appearance is due to
 the non-random arrangement of atoms in the crystal.  Some variation is
 natural, but major deviations from the expected plot may indicate pathological
 data (including ice rings, detector problems, or processing errors).

                  ----------Mean intensity analyses----------

 Inspired by: Morris et al. (2004). J. Synch. Rad.11, 56-59.
 The following resolution shells are worrisome:
 *** None ***

                     ----------Possible outliers----------

  Inspired by: Read, Acta Cryst. (1999). D55, 1759-1764

Acentric reflections:
            None

Centric reflections:
  -----------------------------------------------------------------------------------------------------
  | Centric reflections                                                                               |
  |---------------------------------------------------------------------------------------------------|
  | d_spacing         | H K L             | |E|               | p(wilson)         | p(extreme)        |
  |---------------------------------------------------------------------------------------------------|
  |    2.628          |    26,    0,    1 |   4.31            |  1.66e-05         |   4.46e-02        |
  -----------------------------------------------------------------------------------------------------



 p(wilson)  : 1-(erf[|E|/sqrt(2)])
 p(extreme) : 1-(erf[|E|/sqrt(2)])^(n_acentrics)


 p(wilson) is the probability that an E-value of the specified
 value would be observed when it would selected at random from
 the given data set.
 p(extreme) is the probability that the largest |E| value is
 larger or equal than the observed largest |E| value.

 Both measures can be used for outlier detection. p(extreme)
 takes into account the size of the dataset.


                 ----------Ice ring related problems----------

 The following statistics were obtained from ice-ring insensitive resolution
 ranges:

    mean bin z_score      : 1.37
        ( rms deviation   : 1.10 )
    mean bin completeness : 0.86
        ( rms deviation   : 0.28 )

 The following table shows the Wilson plot Z-scores and completeness for
 observed data in ice-ring sensitive areas.  The expected relative intensity
 is the theoretical intensity of crystalline ice at the given resolution.
 Large z-scores and high completeness in these resolution ranges might
 be a reason to re-assess your data processsing if ice rings were present.

  -------------------------------------------------------------
  | d_spacing | Expected rel. I | Data Z-score | Completeness |
  -------------------------------------------------------------
  |     3.897 |      1.000      |    2.86      |    1.00      |
  |     3.669 |      0.750      |    1.17      |    1.00      |
  |     3.441 |      0.530      |    3.41      |    0.99      |
  |     2.671 |      0.170      |    1.10      |    1.00      |
  |     2.249 |      0.390      |    1.36      |    0.97      |
  |     2.072 |      0.300      |    0.99      |    0.77      |
  |     1.948 |      0.040      |    1.51      |    0.43      |
  |     1.918 |      0.180      |    0.14      |    0.31      |
  |     1.883 |      0.030      |    0.56      |    0.24      |
  -------------------------------------------------------------
 Abnormalities in mean intensity or completeness at resolution ranges with a
 relative ice ring intensity lower than 0.10 will be ignored.
 No ice ring related problems detected.
 If ice rings were present, the data does not look worse at ice ring related
 d_spacings as compared to the rest of the data set.


                     ----------Merging statistics----------                    

Resolution: 68.36 - 1.80
Observations: 208218
Unique reflections: 20132
Redundancy: 10.3
Completeness: 85.61%
Mean intensity: 131.8
Mean I/sigma(I): 28.5
R-merge: 0.055
R-meas:  0.058
R-pim:   0.015


Statistics by resolution bin:
 d_max  d_min   #obs  #uniq   mult.  %comp       <I>  <I/sI>    r_mrg   r_meas    r_pim   cc1/2   cc_ano
 68.41   4.89  20280   1334   15.20 100.00     274.8    55.5    0.047    0.049    0.012   0.998*  -0.128
  4.89   3.88  19993   1244   16.07 100.00     415.9    59.1    0.045    0.047    0.011   0.999*  -0.057
  3.88   3.39  19494   1208   16.14 100.00     319.4    54.5    0.048    0.050    0.012   0.999*  -0.018
  3.39   3.08  19558   1189   16.45 100.00     225.4    48.9    0.053    0.055    0.013   0.999*  -0.060
  3.08   2.86  19743   1185   16.66 100.00     156.5    42.3    0.059    0.060    0.014   0.999*  -0.069
  2.86   2.69  19921   1191   16.73 100.00     126.6    39.2    0.064    0.066    0.016   0.998*  -0.131
  2.69   2.55  19693   1172   16.80 100.00      99.6    33.5    0.073    0.075    0.018   0.999*  -0.001
  2.55   2.44  15819   1167   13.56 100.00      90.1    28.4    0.075    0.078    0.021   0.998*  -0.081
  2.44   2.35  11841   1162   10.19 100.00      78.4    23.6    0.078    0.082    0.025   0.997*   0.002
  2.35   2.27   9895   1180    8.39 100.00      77.4    21.0    0.078    0.083    0.028   0.997*   0.003
  2.27   2.20   8269   1150    7.19  99.91      66.5    17.7    0.084    0.090    0.033   0.992*  -0.137
  2.20   2.13   6811   1164    5.85  98.98      60.1    14.5    0.087    0.095    0.038   0.994*  -0.016
  2.13   2.08   5282   1115    4.74  97.13      52.3    11.7    0.091    0.102    0.045   0.992*  -0.067
  2.08   2.03   4005   1056    3.79  91.43      46.3     9.7    0.095    0.110    0.053   0.987*  -0.039
  2.03   1.98   3058    999    3.06  85.97      39.2     7.7    0.102    0.122    0.064   0.983*   0.004
  1.98   1.94   2052    895    2.29  77.89      37.1     6.4    0.095    0.118    0.069   0.979*   0.045
  1.94   1.90   1161    699    1.66  60.99      31.7     4.8    0.115    0.149    0.094   0.964*   0.000
  1.90   1.86    748    530    1.41  46.17      27.6     3.9    0.109    0.147    0.098   0.971*   0.000
  1.86   1.83    463    365    1.27  31.82      25.2     3.4    0.100    0.139    0.096   0.964*   0.000
  1.83   1.80    132    127    1.04  11.08      21.8     2.7    0.428    0.606    0.428  -0.378   0.000
 68.36   1.80 208218  20132   10.34  85.61     131.8    28.5    0.055    0.058    0.015   0.999*  -0.097




                ----------Resolution cutoff estimates----------                

  resolution of all data          :   1.800
  based on CC(1/2) >= 0.33        :   1.831
  based on mean(I/sigma) >= 2.0   :   1.831
  based on R-merge < 0.5          :   1.831
  based on R-meas < 0.5           :   1.831
  based on completeness >= 90%    :   2.028
  based on completeness >= 50%    :   1.901

NOTE: we recommend using all data out to the CC(1/2) limit
for refinement.


Writing reflections to merged.mtz
Title: From dials.merge
Space group symbol from file: P41212
Space group number from file: 92
Space group from matrices: P 41 21 2 (No. 92)
Point group symbol from file: 422
Number of crystals: 1
Number of Miller indices: 20132
Resolution range: 68.3603 1.80033
History:
  From DIALS 2.2.9-g061426e04-release, run on 2020-09-12 at 08:37:11 BST  (2020-09
Crystal 1:
  Name: XTAL
  Project: AUTOMATIC
  Id: 1
  Unit cell: (68.3603, 68.3603, 103.953, 90, 90, 90)
  Number of datasets: 1
  Dataset 1:
    Name: NATIVE
    Id: 1
    Wavelength: 0.979493
    Number of columns: 17
    label    #valid  %valid    min     max type
    H         20132 100.00%   0.00   36.00 H: index h,k,l
    K         20132 100.00%   0.00   24.00 H: index h,k,l
    L         20132 100.00%   0.00   56.00 H: index h,k,l
    IMEAN     20132 100.00% -10.44 5402.43 J: intensity
    SIGIMEAN  20132 100.00%   0.14  174.45 Q: standard deviation
    I(+)      19874  98.72% -10.44 5402.43 K: I(+) or I(-)
    SIGI(+)   19874  98.72%   0.14  174.45 M: standard deviation
    I(-)      14163  70.35%  -6.83 3534.70 K: I(+) or I(-)
    SIGI(-)   14163  70.35%   0.40   77.31 M: standard deviation
    N(+)      19874  98.72%   2.00    8.00 I: integer
    N(-)      14163  70.35%   8.00    8.00 I: integer
    F         20132 100.00%   0.34   73.21 F: amplitude
    SIGF      20132 100.00%   0.04    1.51 Q: standard deviation
    F(+)      19874  98.72%   0.34   73.21 G: F(+) or F(-)
    SIGF(+)   19874  98.72%   0.04    1.51 L: standard deviation
    F(-)      14163  70.35%   0.90   59.41 G: F(+) or F(-)
    SIGF(-)   14163  70.35%   0.06    1.38 L: standard deviation

This merges the data and performs a truncation procedure, to give a merged MTZ file containing intensities and strictly-positive structure factors (Fs).