Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added methods to compute and compare DOS fingerprints #2772

Merged
merged 35 commits into from
Dec 24, 2022

Conversation

naik-aakash
Copy link
Contributor

Compute and compare fingerprints of project and total density of states.

This functionality could be really useful to compare the DOS of materials from different softwares (eg:- LOBSTER and VASP ) or versions (Eg:- VASP v5 and VASP v6)

  • Added method to compute fingerprints
  • Added method to compare two DOS fingerprints (tanimoto index and dot product)
  • Added test cases for the corresponding methods

@coveralls
Copy link

coveralls commented Dec 13, 2022

Coverage Status

Coverage: 46.362% (-0.5%) from 46.884% when pulling 8309b1b on naik-aakash:dos_fingerprinting into b566980 on materialsproject:master.

"some error in the spelling."
)

def _fp_to_dict(self, fp):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It could be static, I assume

Copy link
Contributor Author

@naik-aakash naik-aakash Dec 13, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, It could be. Will make the changes. Thanks for the feedback 😃

@naik-aakash naik-aakash requested a review from JaGeo December 13, 2022 14:38
Copy link
Member

@JaGeo JaGeo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You could replace one "finger print" with fingerprint in the documentation but otherwise it looks fine from my side.

@naik-aakash
Copy link
Contributor Author

You could replace one "finger print" with fingerprint in the documentation but otherwise it looks fine from my side.

Thanks for the review @JaGeo 😄

@naik-aakash
Copy link
Contributor Author

Hi @janosh , This PR is ready to be merge.

Copy link
Member

@janosh janosh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @naik-aakash! I left some comments.

@@ -1165,6 +1166,150 @@ def get_upper_band_edge(
upper_band_edge = energies[np.argmax(densities)]
return upper_band_edge

def get_dos_fp(self, type="summed_pdos", binning=True, min_e=None, max_e=None, nbins=256, normalize=True):
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would be good to add type annotations here.

def get_dos_fp(self, type: str = "summed_pdos", binning: bool = True, ...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the suggestions , I have made the requested changes.

pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
type (str): Specify fingerprint type needed can accept 's/p/d/f/summed_pdos/tdos'
min_e (float): The minimum mode energy to include in the fingerprint
max_e (float): The maximum mode energy to include in the fingerprint
nbins (int): Number of bins to be used in the fingerprint
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we rename this to n_bins for proper snake casing?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done 😃

@naik-aakash
Copy link
Contributor Author

Hi @janosh , all suggestions have been incorporated now. And tests seems to pass as well. 😄

Copy link
Member

@janosh janosh left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@naik-aakash Thanks, looks great! Just a few more nitpicks.

pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
pymatgen/electronic_structure/dos.py Outdated Show resolved Hide resolved
@naik-aakash
Copy link
Contributor Author

Hi @janosh , thank you for the really useful tips and suggestions. 😃

@naik-aakash
Copy link
Contributor Author

Hi @janosh , I do not know why so many tests are failing. I looked at Failed tests and they do not have anything to do with this PR. I hope it could still be merged.

@janosh
Copy link
Member

janosh commented Dec 24, 2022

@naik-aakash Sorry for the delay here. Back from traveling now. PR looks good! Made a few last changes. Failures like you said are unrelated. Ready to merge.

@janosh janosh merged commit 4847ec7 into materialsproject:master Dec 24, 2022
@naik-aakash
Copy link
Contributor Author

@naik-aakash Sorry for the delay here. Back from traveling now. PR looks good! Made a few last changes. Failures like you said are unrelated. Ready to merge.

@janosh No worries! Thank you. Happy Holidays

lbluque pushed a commit to lbluque/pymatgen that referenced this pull request Jan 5, 2023
…ct#2772)

* added dos_fingerprint and similarity index methods

* Added test cases and reformatted and cleaned code

* added binwidth ,renamed states to densities in fp obj,updated tests

* added source link

* changed get_dos_fp_similarity and fp_to_dict methods to static

* Delete Test.py, unnecessary file

* simplified dict updating, added missing type annotations

* NamedTuple return type fixed

* small clean up

* document get_dos_fp() and get_dos_fp_similarity() raise conditions in doc str

* add types for fp1,fp2 and update doc str

* update exception tests

Co-authored-by: anaik <anaik@sv2218.zit.bam.de>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
lbluque pushed a commit to lbluque/pymatgen that referenced this pull request May 23, 2023
…ct#2772)

* added dos_fingerprint and similarity index methods

* Added test cases and reformatted and cleaned code

* added binwidth ,renamed states to densities in fp obj,updated tests

* added source link

* changed get_dos_fp_similarity and fp_to_dict methods to static

* Delete Test.py, unnecessary file

* simplified dict updating, added missing type annotations

* NamedTuple return type fixed

* small clean up

* document get_dos_fp() and get_dos_fp_similarity() raise conditions in doc str

* add types for fp1,fp2 and update doc str

* update exception tests

Co-authored-by: anaik <anaik@sv2218.zit.bam.de>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
lbluque pushed a commit to lbluque/pymatgen that referenced this pull request May 25, 2023
…ct#2772)

* added dos_fingerprint and similarity index methods

* Added test cases and reformatted and cleaned code

* added binwidth ,renamed states to densities in fp obj,updated tests

* added source link

* changed get_dos_fp_similarity and fp_to_dict methods to static

* Delete Test.py, unnecessary file

* simplified dict updating, added missing type annotations

* NamedTuple return type fixed

* small clean up

* document get_dos_fp() and get_dos_fp_similarity() raise conditions in doc str

* add types for fp1,fp2 and update doc str

* update exception tests

Co-authored-by: anaik <anaik@sv2218.zit.bam.de>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
janosh added a commit that referenced this pull request May 26, 2023
* added dos_fingerprint and similarity index methods

* Added test cases and reformatted and cleaned code

* added binwidth ,renamed states to densities in fp obj,updated tests

* added source link

* changed get_dos_fp_similarity and fp_to_dict methods to static

* Delete Test.py, unnecessary file

* simplified dict updating, added missing type annotations

* NamedTuple return type fixed

* small clean up

* document get_dos_fp() and get_dos_fp_similarity() raise conditions in doc str

* add types for fp1,fp2 and update doc str

* update exception tests

Co-authored-by: anaik <anaik@sv2218.zit.bam.de>
Co-authored-by: Janosh Riebesell <janosh.riebesell@gmail.com>
@naik-aakash naik-aakash deleted the dos_fingerprinting branch January 10, 2024 12:23
@naik-aakash naik-aakash mentioned this pull request Jul 22, 2024
7 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants