Creating a new HDF5 nuclear data library for UQ by Grego01-biot · Pull Request #3911 · openmc-dev/openmc

Grego01-biot · 2026-04-01T14:32:44Z

Description

The goal of this PR is to create a new HDF5 nuclear data library that includes covariance data and would be hosted on the OpenMC nuclear data libraries page. This PR covers only the HDF5 data storage, with no modifications to the C++ source code. The final goal is to be able to perform on-the-fly cross section sampling using the data stored in HDF5 format. Another draft PR will be added shortly to the openmc-dev/data repository to make it possible to generate the entire nuclear data library with MF=33 evaluations.

Python API addition

To do so, two new Python files are added to openmc/data and one existing file is modified:

mf33_njoy.py is a minimal driver that runs the ERRORR module of NJOY to produce multigroup relative covariance matrices from an ENDF-6 evaluation. The user provides an explicit energy grid (group boundaries in eV) and optionally a weight spectrum used to produce the multigroup cross sections. The output is the raw ERRORR tape33 text.
xs_covariance_njoy.py contains the data model, the ENDF text parser, and the covariance factorization. The key class is NeutronXSCovariances, which holds parsed MF=33 covariance matrices and their lower-triangular factors. It can read and write standalone HDF5 files, and read and write an MF33 sub-tree inside an existing HDF5 group (used by IncidentNeutron).
neutron.py is modified to add an mg_covariance property on IncidentNeutron and to write covariance data in export_to_hdf5() under /<nuclide>/covariance/mf33/. It can also read covariance data back in from_hdf5().

The idea is to generate multigroup covariance matrices for MF33 with the ERRORR module of NJOY, parse the output tape33, and store the covariance data together with the temperature, the type of covariance (relative or absolute), the energy grid and the reaction (MT) numbers.

HDF5 storage strategy

Two types of covariance data are distinguished:

For self-covariance blocks (MT = MT1), the matrices are symmetric positive semi-definite and represent the uncertainty of a single reaction with itself. Thanks to these properties, the matrix can be factorized so that only a lower-triangular matrix L is stored rather than the full matrix, which reduces the storage size of covariance data in HDF5 files. The factorization uses Cholesky if the matrix is numerically positive definite, and otherwise falls back to an eigendecomposition with thresholding of small/negative eigenvalues, followed by a QR pivot to obtain the desired lower-triangular form.
Cross-covariance blocks (MT ≠ MT1) are not symmetric and cannot be factorized as easily. The current solution is to store them as raw G×G matrices. Following the ENDF convention, only the upper-triangle pair is stored (the transpose gives the complementary block).
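The factorization with fallback described above can be sketched as follows. This is a minimal illustration, not the code in xs_covariance_njoy.py: the function name lower_factor and the rel_tol threshold are hypothetical, and a plain (unpivoted) QR is used in place of the QR pivot mentioned in the description.

```python
import numpy as np


def lower_factor(cov, rel_tol=1e-10):
    """Return a lower-trapezoidal L such that L @ L.T ~= cov.

    `cov` is assumed symmetric positive semi-definite. Both the function
    name and `rel_tol` are illustrative assumptions.
    """
    cov = 0.5 * (cov + cov.T)  # remove any round-off asymmetry
    try:
        # Fast path: succeeds only if cov is numerically positive definite
        return np.linalg.cholesky(cov)
    except np.linalg.LinAlgError:
        pass
    # Fallback: eigendecomposition, dropping small or negative eigenvalues
    w, v = np.linalg.eigh(cov)
    keep = w > rel_tol * max(w.max(), 0.0)
    if not keep.any():
        return np.zeros((cov.shape[0], 0))
    a = v[:, keep] * np.sqrt(w[keep])  # shape (G, r), a @ a.T ~= cov
    # QR of a.T: a.T = q @ r, hence a @ a.T = r.T @ r
    r = np.linalg.qr(a.T, mode="r")  # shape (r, G), upper trapezoidal
    return r.T  # shape (G, r), lower trapezoidal
```

For a rank-deficient block the returned factor has fewer columns than rows, which is where the (G, r) storage saving comes from.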

The assembly and eigendecomposition of the full joint covariance matrix are deferred to OpenMC's preprocessing step, where the user decides which reactions to sample jointly. Before transport, the code will assemble the full joint covariance matrix for all requested reactions and keep its eigendecomposition in memory for sampling.
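As a rough sketch of that preprocessing step, the joint matrix can be assembled from the stored blocks and reduced to a sampling factor. This is written in Python for illustration only; the function names, the dictionary keyed by (mt, mt1), and the rel_tol threshold are all assumptions and do not reflect the planned C++ implementation.

```python
import numpy as np


def assemble_joint(blocks, mts, n_groups):
    """Build the joint matrix from (G, G) blocks keyed by (mt, mt1).

    `mts` is assumed sorted, and only pairs with mt <= mt1 are looked up.
    """
    n = len(mts) * n_groups
    joint = np.zeros((n, n))
    for i, mt in enumerate(mts):
        for j, mt1 in enumerate(mts[i:], start=i):
            block = blocks.get((mt, mt1))
            if block is None:
                continue  # no data for this pair: treated as uncorrelated
            rows = slice(i * n_groups, (i + 1) * n_groups)
            cols = slice(j * n_groups, (j + 1) * n_groups)
            joint[rows, cols] = block
            if j != i:
                joint[cols, rows] = block.T  # complementary block
    return joint


def sampling_factor(joint, rel_tol=1e-10):
    """Eigendecompose the joint matrix and return A with A @ A.T ~= joint."""
    w, v = np.linalg.eigh(0.5 * (joint + joint.T))
    keep = w > rel_tol * max(w.max(), 0.0)
    return v[:, keep] * np.sqrt(w[keep])


def sample_perturbations(factor, n_samples, rng):
    """Draw correlated perturbations, one row per sample."""
    z = rng.standard_normal((factor.shape[1], n_samples))
    return (factor @ z).T
```

With relative covariances, each sampled row would be applied as a multiplicative factor 1 + delta on the corresponding group-wise cross sections.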

HDF5 layout

Covariance data is stored under /<nuclide>/covariance/mf33/. The group carries metadata as attributes: a format string, a flag indicating relative covariances, the MAT number, and the processing temperature. Two datasets sit at the top level: energy_grid_ev, holding the G+1 group boundaries in eV, and mts, listing the MT numbers that have covariance data. Below that, two sub-groups organize the actual matrices. The reactions/ sub-group stores raw covariance matrices as (G, G) datasets indexed by {mt}/{mt1}, including all cross-correlations. The factors/ sub-group stores the lower-triangular factors L as (G, r) datasets, also indexed by {mt}/{mt1}, for self-correlations only.
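To make the layout concrete, here is a small h5py sketch that builds the same tree in memory. The group and dataset names (energy_grid_ev, mts, reactions/, factors/) follow the description above, but the attribute names and all numerical values are placeholders chosen for illustration; the actual names are defined in the PR code.

```python
import h5py
import numpy as np

n_groups = 3
energy_grid = np.array([1e-5, 1.0, 1e3, 2e7])  # G+1 boundaries in eV
cov_2_2 = np.diag([0.04, 0.01, 0.09])  # toy self-covariance, MT=2
cov_2_102 = np.full((n_groups, n_groups), 0.001)  # toy cross-covariance

# In-memory file: nothing is written to disk
with h5py.File("layout_sketch.h5", "w", driver="core",
               backing_store=False) as f:
    mf33 = f.create_group("Fe56/covariance/mf33")
    # Attribute names below are assumptions, not the PR's actual names
    mf33.attrs["format"] = "mf33-sketch"
    mf33.attrs["relative"] = True
    mf33.attrs["mat"] = 2631
    mf33.attrs["temperature"] = 293.6
    mf33.create_dataset("energy_grid_ev", data=energy_grid)
    mf33.create_dataset("mts", data=np.array([2, 102]))
    # Raw (G, G) matrices, self and cross, indexed by {mt}/{mt1}
    mf33.create_dataset("reactions/2/2", data=cov_2_2)
    mf33.create_dataset("reactions/2/102", data=cov_2_102)
    # Lower-triangular factor, self-covariance only
    mf33.create_dataset("factors/2/2", data=np.linalg.cholesky(cov_2_2))
    # Collect every group and dataset path below mf33
    layout = []
    mf33.visit(layout.append)
```

Printing layout after the block lists the paths below mf33, which is a quick way to compare a generated file against the intended schema.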

This capability enables two UQ workflows within OpenMC: sensitivity analysis via the sandwich rule, and direct cross-section sampling via the Total Monte Carlo approach (similar to SANDY). It would also allow users to generate and store their own multigroup covariance data by providing a precomputed flux spectrum as input to NJOY and selecting the desired output energy grid (the lower energy bound cannot be 0 eV due to ERRORR and its resolved resonance region treatment, irespr = 1). For now, the strategy is to use the SCALE energy group structure with 252 energy groups. The data structure described here is a starting point, and feedback on the schema and storage layout is more than welcome!
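For the first workflow, the sandwich rule propagates a relative covariance matrix C through a vector of relative sensitivities S as var = S^T C S. A minimal sketch, assuming the sensitivities are already collapsed onto the same G groups as the covariance matrix (the function name is hypothetical):

```python
import numpy as np


def sandwich_rel_std(sensitivity, rel_cov):
    """Relative standard deviation of a response from the sandwich rule.

    `sensitivity` holds G relative sensitivities and `rel_cov` is the
    matching (G, G) relative covariance matrix.
    """
    s = np.asarray(sensitivity, dtype=float)
    var = s @ np.asarray(rel_cov, dtype=float) @ s
    return float(np.sqrt(max(var, 0.0)))  # clip tiny negative round-off
```

For several reactions, the same expression applies with S and C replaced by the concatenated sensitivity vector and the joint covariance matrix.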

Here is an example of the code used to generate the covariance data for Fe56 with 1500 energy groups and store it in HDF5, either as a standalone file or inside the existing Fe56.h5:

from openmc.data.xs_covariance_njoy import NeutronXSCovariances
import numpy as np

ek = np.logspace(np.log10(1e-5), np.log10(20e6), 1501)  # 1500 groups

cov = NeutronXSCovariances.from_endf(
    "n-026_Fe_056.endf",
    ek,
    njoy_exec="/path/to/njoy",
    temperature=293.6,
)

# Standalone file
cov.to_hdf5("Fe56_covariance.h5")

# Or embed in an existing OpenMC HDF5
import h5py
with h5py.File("Fe56.h5", "r+") as f:
    cov_root = f["Fe56"].require_group("covariance")
    cov.write_mf33_group(cov_root)

Note that the code needs to remain modular to account for the tolerance used in NJOY, the dilution cross section used for the multigroup covariance generation, and whether or not MF32 and MF33 are combined. For this PR, MF=31, MF=34 and MF=35 are not considered for storage in the HDF5 file.

Fixes # (issue)

Checklist

I have performed a self-review of my own code
~~I have run clang-format (version 18) on any C++ source files (if applicable)~~
I have followed the style guidelines for Python source files (if applicable)
I have made corresponding changes to the documentation (if applicable)
I have added tests that prove my fix is effective or that my feature works (if applicable)

Grego01 and others added 29 commits March 4, 2026 15:21

New implementation to load covariance data from HDF5 files

e2b0b97

Merge remote-tracking branch 'upstream/develop' into covariance_HDF5

87d171d

Merge upstream changes

aff35ae

Modify xtensor to Tensor due to PR openmc-dev#3805

832d801

First implementation of sampling cross sections in OpenMC

7a6e0bf

Fixing bugs and more efficient loading of covariance data in OpenMC through HDF5 files

7d6a7c6

Merge remote-tracking branch 'upstream/develop' into covariance_HDF5_cross_section_sampling

2a42e27

New perturb_xs function

479f3cb

Sampling on the fly working, more tests needed

b3a6287

Merge remote-tracking branch 'upstream/develop' into covariance_HDF5_cross_section_sampling

e5bf0b6

Delete openmc/data/mf33_enjoy - Copy.py:Zone.Identifier

ad02a9b

Delete openmc/data/xs_covariance_njoy - Copy.py:Zone.Identifier

27eff53

cleaning up

c5a9e60

cleaning up

446b08c

clean up

ec0cf79

delete modifications on headers

92a3177

delete modifications on source files

c9b7af8

delete all modifications on source files

ed90799

delete all modifications on C++ side

d56f436

no boolean flag needed

7cd1d2b

Missing space

ad6f17d

Fix formatting of reaction.h file

e90dc5e

Fix formatting issues in reaction.h

bcc356f

cleaning file for reviewing

e45ef34

simplified python file

8aa2e76

Merge remote-tracking branch 'upstream/develop' into covariance_storage

17fac06

Merge remote-tracking branch 'upstream/develop' into covariance_storage

7899876

new test and cleanup

e599542

Remove unrelated changes to pre-commit config and simulation.cpp

e35e3fa

Grego01-biot mentioned this pull request Apr 14, 2026

Creating a new HDF5 nuclear data library for UQ openmc-dev/data#102

Open

Grego01 added 2 commits April 15, 2026 18:31

Modification of the covariance storage depending on self and cross correlations and addition of diagnostics before storing the matrices

e2c28eb

NJOY ERRORR module not working with energy grid that start at 0 ev and adding tests to check the safeguards for self-covariance matrices

f7d58fb

Grego01-biot wants to merge 31 commits into openmc-dev:develop from Grego01-biot:covariance_storage
