PhParser: allow for pattern initialization #1034

bastonero · 2024-06-18T16:56:49Z

The ph.x should be parallelized by setting in the input parameters start_irr and last_irr to 0. This allows the program to exit smoothly and it further avoids to wait for the rewriting of the wavefunctions, which can be a rather long and intensive IO operation, not really suited for initialization runs.

The parser is then adjusted to account for this option, as for some reason the line having JOB DONE is not printed in such cases. A simple specialized parser is also added to store the number of q-points and their values, which can be later on used to parallelize over q-points by specifying last_q and start_q.

PS: this also avoids the creation of the aiida.EXIT file, which triggers the program to stop.

Note: this is also the recommended way from the official webpage. To report the statements in case the link will be broken in the future:

NB: The program ph.x writes on the tmp_dir/_ph0/{prefix}.phsave directory
a file for each representation of each q point. This file is called
dynmat.#iq.#irr.xml where #iq is the number of the q point and #irr
is the number of the representation. These files contain the
contribution to the dynamical matrix of the irr representation for the
iq point.

If [recover](https://www.quantum-espresso.org/Doc/INPUT_PH.html#recover)=.true. ph.x does not recalculate the
representations already saved in the tmp_dir/_ph0/{prefix}.phsave
directory.  Moreover ph.x writes on the files patterns.#iq.xml in the
tmp_dir/_ph0/{prefix}.phsave directory the displacement patterns that it
is using. If [recover](https://www.quantum-espresso.org/Doc/INPUT_PH.html#recover)=.true. ph.x does not recalculate the
displacement patterns found in the tmp_dir/_ph0/{prefix}.phsave directory.

This mechanism allows:

  1) To recover part of the ph.x calculation even if the recover file
     or files are corrupted. You just remove the _ph0/{prefix}.recover
     files from the tmp_dir directory. You can also remove all the _ph0
     files and keep only the _ph0/{prefix}.phsave directory.

  2) To split a phonon calculation into several jobs for different
     machines (or set of nodes). Each machine calculates a subset of
     the representations and saves its dynmat.#iq.#irr.xml files on
     its tmp_dir/_ph0/{prefix}.phsave directory. Then you collect all the
     dynmat.#iq.#irr.xml files in one directory and run ph.x to
     collect all the dynamical matrices and diagonalize them.

NB: To split the q points in different machines, use the input
variables start_q and last_q. To split the irreducible
representations, use the input variables [start_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#start_irr), [last_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#last_irr). Please
note that different machines will use, in general, different
displacement patterns and it is not possible to recollect partial
dynamical matrices generated with different displacement patterns.  A
calculation split into different machines will run as follows: A
preparatory run of ph.x with [start_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#start_irr)=0, [last_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#last_irr)=0 produces the sets
of displacement patterns and save them on the patterns.#iq.xml files.
These files are copied in all the tmp_dir/_ph0/{prefix}.phsave directories
of the machines where you plan to run ph.x. ph.x is run in different
machines with complementary sets of start_q, last_q, [start_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#start_irr) and
[last_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#last_irr) variables.  All the files dynmat.#iq.#irr.xml are
collected on a single tmp_dir/_ph0/{prefix}.phsave directory (remember to
collect also dynmat.#iq.0.xml).  A final run of ph.x in this
machine collects all the data contained in the files and diagonalizes
the dynamical matrices.  This is done requesting a complete dispersion
calculation without using start_q, last_q, [start_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#start_irr), or [last_irr](https://www.quantum-espresso.org/Doc/INPUT_PH.html#last_irr).
See an example in examples/GRID_example.

On parallel machines the q point and the irreps calculations can be split
automatically using the -nimage flag. See the phonon user guide for further
information.

The ph.x should be parallelized by setting in the input parameters `start_irr` and `last_irr` to 0. This allows the program to exit smoothly and it further avoids to wait for the rewriting of the wavefunctions, which can be a rather long and intensive IO operation, not really suited for initialization runs. The parser is then adjusted to account for this option, as for some reason the line having `JOB DONE` is not printed in such cases. A simple specialized parser is also added to store the number of q-points and their values, which can be later on used to parallelize over q-points by specifying `last_q` and `start_q`.

sphuber · 2024-06-20T13:06:44Z

src/aiida_quantumespresso/parsers/ph.py

+            if parameters:
+                self.out('output_parameters', orm.Dict(parameters))
+                return


In the case that parameters is empty, wouldn't that reasonably correspond to some kind of error? Or are you intentionally letting it continue the parsing in that case to find a generic error?

Yeah in principle ph.x can still throw some errors, say if something was wrong with some files etc.

sphuber · 2024-06-20T13:08:53Z

src/aiida_quantumespresso/parsers/parse_raw/ph.py

+        q_points = [list(map(float, coord)) for coord in coords]
+
+        parameters.update({'q_points': q_points})


Would it make sense perhaps to return this as an actual KpointsData instead of a Dict?

Yes, I thought about that, maybe it would. On the other hand, I was even wondering whether it's worth it to actually parse it or not. If one has a grid, one can/should use start/last_q, if you provide a KpointsData, this would return the same node basically. So, don't know. Any strong opinion? What about maybe adding some "post-process" parsing via tools?

I am not quite sure I fully understand the use case of this initialization run. But if for the common use case a user would actually want to use the parsed grid as an input for the next calculation (i.e. they are going to turn it into a KpointsData anyway) then we might as well have the parser do it here.

If, instead, the kpoints won't be used as is, but in parts and so the KpointsData would have to be transformed, then you might as well just leave it as is.

This initialization run is the proper initialization for ph.x, which avoids the use of the .EXIT file, which would still make ph.x to rewrite the wavefunctions for nothing (hence, wasting time - to give an idea, an 18 atoms system it would take ~20 min, which are wasted node hours). The key ingredient here is just to determine the number of q points, and the next runs would be parallelized not with the specific q point but using start_q and last_q instead. The parsing of the q points as either dictionary in the output parameters or as kpointsdata is just out of completeness, but not really meant to be used (at least, as I am thinking to use this initialization run). I could simply remove it at this point.

src/aiida_quantumespresso/parsers/parse_raw/ph.py

sphuber · 2024-06-23T19:25:38Z

src/aiida_quantumespresso/parsers/parse_raw/ph.py

+    if parameters['number_of_qpoints'] != len(parameters['q_points']):
+        return parameters, False
+
+    return parameters, True


I am not a big fan of communicating errors through return values, especially not if that means having to turn the return value into a tuple. Since you only really use the parameters result in case there isn't a problem, what is the problem with just raising an exception and catching that in the caller?

sphuber

Thanks @bastonero . Just the one test is failing. If you fix that, I will merge

bastonero added topic/parsers priority/important labels Jun 18, 2024

bastonero force-pushed the ph-initialization branch from 4df9fb0 to 5c396c1 Compare June 18, 2024 17:08

bastonero requested a review from sphuber June 18, 2024 17:08

sphuber reviewed Jun 20, 2024

View reviewed changes

Address review

f72e4b8

sphuber reviewed Jun 23, 2024

View reviewed changes

Address review 2

1f55493

bastonero requested a review from sphuber June 26, 2024 22:06

sphuber approved these changes Jun 27, 2024

View reviewed changes

bastonero added the pr/blocked PR is blocked by another PR that should be merged first label Jun 28, 2024

Merge branch 'main' into ph-initialization

c73a9dd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PhParser: allow for pattern initialization #1034

PhParser: allow for pattern initialization #1034

bastonero commented Jun 18, 2024

sphuber Jun 20, 2024

bastonero Jun 20, 2024

sphuber Jun 20, 2024

bastonero Jun 20, 2024

sphuber Jun 23, 2024

bastonero Jun 26, 2024

sphuber Jun 23, 2024

sphuber left a comment •

edited

Loading

		q_points = [list(map(float, coord)) for coord in coords]

		parameters.update({'q_points': q_points})

PhParser: allow for pattern initialization #1034

Are you sure you want to change the base?

PhParser: allow for pattern initialization #1034

Conversation

bastonero commented Jun 18, 2024

sphuber Jun 20, 2024

Choose a reason for hiding this comment

bastonero Jun 20, 2024

Choose a reason for hiding this comment

sphuber Jun 20, 2024

Choose a reason for hiding this comment

bastonero Jun 20, 2024

Choose a reason for hiding this comment

sphuber Jun 23, 2024

Choose a reason for hiding this comment

bastonero Jun 26, 2024

Choose a reason for hiding this comment

sphuber Jun 23, 2024

Choose a reason for hiding this comment

sphuber left a comment • edited Loading

Choose a reason for hiding this comment

sphuber left a comment •

edited

Loading