Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

initial investigation into harmonizing particles and atoms #150

Open
wants to merge 21 commits into
base: cg-particle-support
Choose a base branch
from

Conversation

JFRudzinski
Copy link
Collaborator

@JFRudzinski JFRudzinski commented Dec 4, 2024

ndaelman-hu and others added 3 commits October 16, 2024 14:36

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
* Add support for comparison operators to (`Atomic`)`Cell`, similar to `<`, `>`, `<=`, `>=`
* Rewrite (in)equality operators

---------

Co-authored-by: ndaelman <[email protected]>
Co-authored-by: Bernadette Mohr <[email protected]>
Co-authored-by: ndaelman <[email protected]>
@coveralls
Copy link

coveralls commented Dec 4, 2024

Pull Request Test Coverage Report for Build 12687257448

Details

  • 163 of 217 (75.12%) changed or added relevant lines in 7 files are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage increased (+79.4%) to 79.402%

Changes Missing Coverage Covered Lines Changed/Added Lines %
src/nomad_simulations/schema_packages/utils/utils.py 7 10 70.0%
src/nomad_simulations/schema_packages/particles_state.py 20 32 62.5%
src/nomad_simulations/schema_packages/model_system.py 113 152 74.34%
Totals Coverage Status
Change from base Build 12144007686: 79.4%
Covered Lines: 2124
Relevant Lines: 2675

💛 - Coveralls


def __init__(self, m_def: 'Section' = None, m_context: 'Context' = None, **kwargs):
super().__init__(m_def, m_context, **kwargs)
self.labels = None
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is one approach to simplifying the logic in functions like the composition formula ones, but it seems maybe not the best for typing purposes (I can't make it a protected variable because we intend to use it outside of this class).

Also, the approach doesn't really work for atoms_state vs. particles_state so I am still thinking about that.


from nomad.datamodel.datamodel import EntryArchive
from structlog.stdlib import BoundLogger
if not TYPE_CHECKING:
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This seems to be a general problem, I need to ask someone

get_composition_recurs(system=system_parent, atom_labels=atom_labels)
labels = []
if system_parent.cell is not None:
if system_parent.cell[0].name == 'AtomicCell':
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is just a temporary solution to make sure the rest works.

I think now that we can use some method of an archive section called m_xpath to grab the section by name of the parent class...see get_sibling_section() in utils.py

More generally I am also considering making an intermediate parent for AtomicCell and ParticleCell to share particle-specific quantities and functionalities that we don't want in cell...

self.elemental_composition = sec_chemical_formula.m_cache.get(
'elemental_composition', []
)
if any(cell.name == 'AtomicCell' for cell in self.cell):
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This approach seems fine to me, I just generalized in case later we have some case with both atomic and particle cells (although this seems far away at this point)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I agree. Keeping it as general as possible for as long as possible will make possible extensions later easier, even if we don't have concrete ideas about them now.

@JFRudzinski
Copy link
Collaborator Author

We do also need to consider whether a straight copy of the ase Atoms class is a sustainable/robust approach. There are many things to consider. I started investigating, but there is more work to do before I have a clear idea

'The class does not have atoms_state or particles_state'
)

def get(self, key, logger=None):
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the crux of the implementation of getting the labels from either type of cell. Still not sure if there is a better way

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we do need to be able to differentiate at various points, e.g., to selectively skip certain normalizations. I've been trying to come up with a more elegant way than constantly looking for the cell type, but I'm not positive there is one at this point.

@@ -552,7 +552,16 @@ def normalize(self, archive: 'EntryArchive', logger: 'BoundLogger') -> None:
)


class AtomsState(Entity):
class State(Entity):
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Didn't end up using this after all, but I left it for now in case we need it. But will remove before merge if not

def normalize(self, archive: 'EntryArchive', logger: 'BoundLogger') -> None:
super().normalize(archive, logger)

# Set the name of the section
self.name = self.m_def.name if self.name is None else self.name


# TODO Consider changing name to BeadCell or CGBeadCell, only using "particle" for the more abstract reference to atoms or beads
Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Bernadette-Mohr how would you feel about this type of naming?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That makes a lot of sense!
Distinctive, clear naming goes a long way toward well-structured, understandable code.

@JFRudzinski
Copy link
Collaborator Author

@Bernadette-Mohr This is mostly working now (in terms of the tests for atomic cell), but I haven't adjusted the parser branch yet. I will try to do that tomorrow.

Just take a look what I did and let me know if you think it makes sense

Copy link
Collaborator

@Bernadette-Mohr Bernadette-Mohr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good as far as I can see.

Copy link
Collaborator

@Bernadette-Mohr Bernadette-Mohr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I missed this one in the first round.

src/nomad_simulations/schema_packages/model_system.py Outdated Show resolved Hide resolved
@JFRudzinski JFRudzinski force-pushed the cg-particle-support-JFR branch from 163f846 to b4df654 Compare January 9, 2025 09:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants