mattertune.configs
- class mattertune.configs.AdamConfig(*, name='Adam', lr, eps=1e-08, betas=(0.9, 0.999), weight_decay=0.0, amsgrad=False)[source]
- Parameters:
name (Literal['Adam'])
lr (Annotated[float, Gt(gt=0)])
eps (Annotated[float, Ge(ge=0)])
betas (tuple[Annotated[float, Gt(gt=0)], Annotated[float, Gt(gt=0)]])
weight_decay (Annotated[float, Ge(ge=0)])
amsgrad (bool)
- name: Literal['Adam']
Name of the optimizer.
- lr: C.PositiveFloat
Learning rate.
- eps: C.NonNegativeFloat
Epsilon.
- betas: tuple[C.PositiveFloat, C.PositiveFloat]
Betas.
- weight_decay: C.NonNegativeFloat
Weight decay.
- amsgrad: bool
Whether to use AMSGrad variant of Adam.
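A minimal usage sketch (values are illustrative; only lr is required, per the signature above):
```python
from mattertune.configs import AdamConfig

# Only `lr` is required; the other values shown are the documented defaults.
optimizer = AdamConfig(lr=1e-4, weight_decay=0.0, amsgrad=False)
```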
- class mattertune.configs.AdamWConfig(*, name='AdamW', lr, eps=1e-08, betas=(0.9, 0.999), weight_decay=0.01, amsgrad=False)[source]
- Parameters:
name (Literal['AdamW'])
lr (Annotated[float, Gt(gt=0)])
eps (Annotated[float, Ge(ge=0)])
betas (tuple[Annotated[float, Gt(gt=0)], Annotated[float, Gt(gt=0)]])
weight_decay (Annotated[float, Ge(ge=0)])
amsgrad (bool)
- name: Literal['AdamW']
Name of the optimizer.
- lr: C.PositiveFloat
Learning rate.
- eps: C.NonNegativeFloat
Epsilon.
- betas: tuple[C.PositiveFloat, C.PositiveFloat]
Betas.
- weight_decay: C.NonNegativeFloat
Weight decay.
- amsgrad: bool
Whether to use AMSGrad variant of Adam.
- class mattertune.configs.AutoSplitDataModuleConfig(*, batch_size, num_workers='auto', pin_memory=True, dataset, train_split, validation_split='auto', shuffle=True, shuffle_seed=42)[source]
- Parameters:
batch_size (int)
num_workers (int | Literal['auto'])
pin_memory (bool)
dataset (DatasetConfig)
train_split (float)
validation_split (float | Literal['auto', 'disable'])
shuffle (bool)
shuffle_seed (int)
- dataset: DatasetConfig
The configuration for the dataset.
- train_split: float
The proportion of the dataset to include in the training split.
- validation_split: float | Literal['auto', 'disable']
The proportion of the dataset to include in the validation split.
If set to “auto”, the validation split will be automatically determined as the complement of the training split, i.e. validation_split = 1 - train_split.
If set to “disable”, the validation split will be disabled.
- shuffle: bool
Whether to shuffle the dataset before splitting.
- shuffle_seed: int
The seed to use for shuffling the dataset.
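A sketch of the auto-split behaviour described above, assuming an XYZ dataset; the file path and hyperparameters are placeholders:
```python
from mattertune.configs import AutoSplitDataModuleConfig, XYZDatasetConfig

# With validation_split="auto", the validation fraction is the complement of
# train_split, i.e. 1 - 0.9 = 0.1. The file path is a placeholder.
data = AutoSplitDataModuleConfig(
    batch_size=32,
    dataset=XYZDatasetConfig(src="data/train.xyz"),
    train_split=0.9,
    validation_split="auto",
    shuffle=True,
    shuffle_seed=42,
)
```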
- class mattertune.configs.CSVLoggerConfig(*, type='csv', save_dir, name='lightning_logs', version=None, prefix='', flush_logs_every_n_steps=100)[source]
- Parameters:
type (Literal['csv'])
save_dir (str)
name (str)
version (int | str | None)
prefix (str)
flush_logs_every_n_steps (int)
- type: Literal['csv']
- save_dir: str
Save directory for logs.
- name: str
Experiment name. Default: 'lightning_logs'.
- version: int | str | None
Experiment version. If not specified, automatically assigns the next available version. Default: None.
- prefix: str
String to put at the beginning of metric keys. Default: ''.
- flush_logs_every_n_steps: int
How often to flush logs to disk. Default: 100.
- class mattertune.configs.CosineAnnealingLRConfig(*, type='CosineAnnealingLR', T_max, eta_min=0, last_epoch=-1)[source]
- Parameters:
type (Literal['CosineAnnealingLR'])
T_max (int)
eta_min (float)
last_epoch (int)
- type: Literal['CosineAnnealingLR']
Type of the learning rate scheduler.
- T_max: int
Maximum number of iterations.
- eta_min: float
Minimum learning rate.
- last_epoch: int
The index of last epoch.
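As an illustration, a scheduler config like the following (values are arbitrary) can be assigned to a backbone config's lr_scheduler field:
```python
from mattertune.configs import CosineAnnealingLRConfig

# Anneal the learning rate over 100 iterations down to eta_min; values are
# illustrative.
lr_scheduler = CosineAnnealingLRConfig(T_max=100, eta_min=1e-6)
```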
- class mattertune.configs.CutoffsConfig(*, main, aeaint, qint, aint)[source]
- Parameters:
main (float)
aeaint (float)
qint (float)
aint (float)
- main: float
- aeaint: float
- qint: float
- aint: float
- class mattertune.configs.DBDatasetConfig(*, type='db', src, energy_key=None, forces_key=None, stress_key=None, preload=True)[source]
Configuration for a dataset stored in an ASE database.
- Parameters:
type (Literal['db'])
src (Database | str | Path)
energy_key (str | None)
forces_key (str | None)
stress_key (str | None)
preload (bool)
- type: Literal['db']
Discriminator for the DB dataset.
- src: Database | str | Path
Path to the ASE database file or a database object.
- energy_key: str | None
Key for the energy label in the database.
- forces_key: str | None
Key for the force label in the database.
- stress_key: str | None
Key for the stress label in the database.
- preload: bool
Whether to load all the data at once or not.
- class mattertune.configs.DataModuleBaseConfig(*, batch_size, num_workers='auto', pin_memory=True)[source]
- Parameters:
batch_size (int)
num_workers (int | Literal['auto'])
pin_memory (bool)
- batch_size: int
The batch size for the dataloaders.
- num_workers: int | Literal['auto']
The number of workers for the dataloaders.
This is the number of processes that generate batches in parallel.
If set to “auto”, the number of workers will be automatically set based on the number of available CPUs.
Set to 0 to disable parallelism.
- pin_memory: bool
Whether to pin memory in the dataloaders.
This is useful for speeding up GPU data transfer.
- class mattertune.configs.DatasetConfigBase[source]
- prepare_data()[source]
Prepare the dataset for training.
Use this to download and prepare data. Downloading and saving data with multiple processes (distributed settings) will result in corrupted data. Lightning ensures this method is called only within a single process, so you can safely add your downloading logic within this method.
- class mattertune.configs.EarlyStoppingConfig(*, monitor='val/total_loss', min_delta=0.0, patience=3, verbose=False, mode='min', strict=True, check_finite=True, stopping_threshold=None, divergence_threshold=None, check_on_train_epoch_end=None, log_rank_zero_only=False)[source]
- Parameters:
monitor (str)
min_delta (float)
patience (int)
verbose (bool)
mode (Literal['min', 'max'])
strict (bool)
check_finite (bool)
stopping_threshold (float | None)
divergence_threshold (float | None)
check_on_train_epoch_end (bool | None)
log_rank_zero_only (bool)
- monitor: str
Quantity to be monitored.
- min_delta: float
Minimum change in monitored quantity to qualify as an improvement. Changes of less than or equal to min_delta will count as no improvement. Default: 0.0.
- patience: int
Number of validation checks with no improvement after which training will be stopped. Default: 3.
- verbose: bool
Whether to print messages when improvement is found or early stopping is triggered. Default: False.
- mode: Literal['min', 'max']
One of 'min' or 'max'. In 'min' mode, training stops when the monitored quantity stops decreasing; in 'max' mode it stops when the quantity stops increasing. Default: 'min'.
- strict: bool
Whether to raise an error if the monitored metric is not found in the validation metrics. Default: True.
- check_finite: bool
Whether to stop training when the monitor becomes NaN or infinite. Default: True.
- stopping_threshold: float | None
Stop training immediately once the monitored quantity reaches this threshold. Default: None.
- divergence_threshold: float | None
Stop training as soon as the monitored quantity becomes worse than this threshold. Default: None.
- check_on_train_epoch_end: bool | None
Whether to run early stopping at the end of the training epoch. If False, the check runs at validation end. Default: None.
- log_rank_zero_only: bool
Whether to log the status of early stopping only for the rank 0 process. Default: False.
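An illustrative configuration (the monitored key and thresholds are examples, not prescribed values):
```python
from mattertune.configs import EarlyStoppingConfig

# Stop when "val/total_loss" has not improved by at least 1e-4 for 10
# consecutive validation checks; all values are examples.
early_stopping = EarlyStoppingConfig(
    monitor="val/total_loss",
    min_delta=1e-4,
    patience=10,
    mode="min",
)
```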
- class mattertune.configs.EnergyPropertyConfig(*, name='energy', dtype='float', loss, loss_coefficient=1.0, type='energy')[source]
- Parameters:
name (str)
dtype (DType)
loss (LossConfig)
loss_coefficient (float)
type (Literal['energy'])
- type: Literal['energy']
- name: str
The name of the property.
This is the key that will be used to access the property in the output of the model.
- dtype: DType
The type of the property values.
- ase_calculator_property_name()[source]
If this property can be calculated by an ASE calculator, returns the name of the property that the ASE calculator uses. Otherwise, returns None.
This should only return non-None for properties that are supported by the ASE calculator interface, i.e. 'energy', 'forces', 'stress', 'dipole', 'charges', 'magmom', and 'magmoms'.
Note that this does not refer to the new experimental custom property prediction support feature in ASE, but rather the built-in properties that ASE can calculate in the ase.calculators.calculator.Calculator class.
- class mattertune.configs.EqV2BackboneConfig(*, properties, optimizer, lr_scheduler=None, ignore_gpu_batch_transform_error=True, normalizers={}, name='eqV2', checkpoint_path, atoms_to_graph)[source]
- Parameters:
properties (Sequence[PropertyConfig])
optimizer (OptimizerConfig)
lr_scheduler (LRSchedulerConfig | None)
ignore_gpu_batch_transform_error (bool)
normalizers (Mapping[str, Sequence[NormalizerConfig]])
name (Literal['eqV2'])
checkpoint_path (Path | CachedPath)
atoms_to_graph (FAIRChemAtomsToGraphSystemConfig)
- name: Literal['eqV2']
The type of the backbone.
- checkpoint_path: Path | CE.CachedPath
The path to the checkpoint to load.
- atoms_to_graph: FAIRChemAtomsToGraphSystemConfig
Configuration for converting ASE Atoms to a graph.
- classmethod ensure_dependencies()[source]
Ensure that all dependencies are installed.
This method should raise an exception if any dependencies are missing, with a message indicating which dependencies are missing and how to install them.
- properties: Sequence[PropertyConfig]
Properties to predict.
- optimizer: OptimizerConfig
Optimizer.
- lr_scheduler: LRSchedulerConfig | None
Learning rate scheduler.
- ignore_gpu_batch_transform_error: bool
Whether to ignore data processing errors during training.
- normalizers: Mapping[str, Sequence[NormalizerConfig]]
Normalizers for the properties.
Any property can be associated with multiple normalizers. This is useful for cases where we want to normalize the same property in different ways. For example, we may want to normalize the energy by subtracting the atomic reference energies, as well as by mean and standard deviation normalization.
The normalizers are applied in the order they are defined in the list.
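A sketch of the stacked-normalizer pattern described above; the "energy" key and all numeric values are placeholders:
```python
from mattertune.configs import (
    MeanStdNormalizerConfig,
    PerAtomReferencingNormalizerConfig,
)

# Normalizers are applied in list order: first subtract per-atom reference
# energies, then apply mean/std normalization. The "energy" key and all
# numbers are placeholders; this mapping would go into a backbone config's
# `normalizers` field.
normalizers = {
    "energy": [
        PerAtomReferencingNormalizerConfig(per_atom_references={1: -13.6, 8: -432.0}),
        MeanStdNormalizerConfig(mean=0.0, std=1.0),
    ],
}
```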
- class mattertune.configs.ExponentialConfig(*, type='ExponentialLR', gamma)[source]
- Parameters:
type (Literal['ExponentialLR'])
gamma (float)
- type: Literal['ExponentialLR']
Type of the learning rate scheduler.
- gamma: float
Multiplicative factor of learning rate decay.
- class mattertune.configs.FAIRChemAtomsToGraphSystemConfig(*, radius, max_num_neighbors)[source]
Configuration for converting ASE Atoms to a graph for the FAIRChem model.
- Parameters:
radius (float)
max_num_neighbors (int)
- radius: float
The radius for edge construction.
- max_num_neighbors: int
The maximum number of neighbours each node can send messages to.
- class mattertune.configs.FinetuneModuleBaseConfig(*, properties, optimizer, lr_scheduler=None, ignore_gpu_batch_transform_error=True, normalizers={})[source]
- Parameters:
properties (Sequence[PropertyConfig])
optimizer (OptimizerConfig)
lr_scheduler (LRSchedulerConfig | None)
ignore_gpu_batch_transform_error (bool)
normalizers (Mapping[str, Sequence[NormalizerConfig]])
- properties: Sequence[PropertyConfig]
Properties to predict.
- optimizer: OptimizerConfig
Optimizer.
- lr_scheduler: LRSchedulerConfig | None
Learning rate scheduler.
- ignore_gpu_batch_transform_error: bool
Whether to ignore data processing errors during training.
- normalizers: Mapping[str, Sequence[NormalizerConfig]]
Normalizers for the properties.
Any property can be associated with multiple normalizers. This is useful for cases where we want to normalize the same property in different ways. For example, we may want to normalize the energy by subtracting the atomic reference energies, as well as by mean and standard deviation normalization.
The normalizers are applied in the order they are defined in the list.
- abstract classmethod ensure_dependencies()[source]
Ensure that all dependencies are installed.
This method should raise an exception if any dependencies are missing, with a message indicating which dependencies are missing and how to install them.
- class mattertune.configs.ForcesPropertyConfig(*, name='forces', dtype='float', loss, loss_coefficient=1.0, type='forces', conservative)[source]
- Parameters:
name (str)
dtype (DType)
loss (LossConfig)
loss_coefficient (float)
type (Literal['forces'])
conservative (bool)
- type: Literal['forces']
- name: str
The name of the property.
This is the key that will be used to access the property in the output of the model.
- dtype: DType
The type of the property values.
- conservative: bool
Whether the forces are energy conserving.
This is used by the backbone to decide the type of output head to use for this property. Conservative force predictions are computed by taking the negative gradient of the energy with respect to the atomic positions, whereas non-conservative forces may be computed by other means.
- ase_calculator_property_name()[source]
If this property can be calculated by an ASE calculator, returns the name of the property that the ASE calculator uses. Otherwise, returns None.
This should only return non-None for properties that are supported by the ASE calculator interface, i.e. 'energy', 'forces', 'stress', 'dipole', 'charges', 'magmom', and 'magmoms'.
Note that this does not refer to the new experimental custom property prediction support feature in ASE, but rather the built-in properties that ASE can calculate in the ase.calculators.calculator.Calculator class.
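An illustrative energy-plus-conservative-forces property list; loss choices and coefficients are examples only:
```python
from mattertune.configs import EnergyPropertyConfig, ForcesPropertyConfig, MAELossConfig

# Energy plus conservative forces (forces computed as the negative gradient of
# the energy). Loss choices and coefficients are illustrative only.
properties = [
    EnergyPropertyConfig(loss=MAELossConfig(), loss_coefficient=1.0),
    ForcesPropertyConfig(loss=MAELossConfig(), loss_coefficient=10.0, conservative=True),
]
```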
- class mattertune.configs.GraphPropertyConfig(*, name, dtype, loss, loss_coefficient=1.0, type='graph_property', reduction)[source]
- Parameters:
name (str)
dtype (DType)
loss (LossConfig)
loss_coefficient (float)
type (Literal['graph_property'])
reduction (Literal['mean', 'sum', 'max'])
- type: Literal['graph_property']
- reduction: Literal['mean', 'sum', 'max']
The reduction to use for the output.
- "sum": Sum the property values for all atoms in the system. This is optimal for extensive properties (e.g. energy).
- "mean": Take the mean of the property values for all atoms in the system. This is optimal for intensive properties (e.g. density).
- "max": Take the maximum of the property values for all atoms in the system. This is optimal for properties like the last phdos peak of Matbench's phonons dataset.
- ase_calculator_property_name()[source]
If this property can be calculated by an ASE calculator, returns the name of the property that the ASE calculator uses. Otherwise, returns None.
This should only return non-None for properties that are supported by the ASE calculator interface, i.e. 'energy', 'forces', 'stress', 'dipole', 'charges', 'magmom', and 'magmoms'.
Note that this does not refer to the new experimental custom property prediction support feature in ASE, but rather the built-in properties that ASE can calculate in the ase.calculators.calculator.Calculator class.
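An illustrative intensive graph property using the "mean" reduction; the property name "band_gap" and the dtype value are placeholders:
```python
from mattertune.configs import GraphPropertyConfig, MAELossConfig

# An intensive scalar target, averaged over atoms; the property name
# "band_gap" and the dtype value are placeholders.
band_gap = GraphPropertyConfig(
    name="band_gap",
    dtype="float",
    loss=MAELossConfig(),
    reduction="mean",
)
```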
- class mattertune.configs.HuberLossConfig(*, name='huber', delta=1.0, reduction='mean')[source]
- Parameters:
name (Literal['huber'])
delta (float)
reduction (Literal['mean', 'sum'])
- name: Literal['huber']
- delta: float
The threshold value for the Huber loss function.
- reduction: Literal['mean', 'sum']
How to reduce the loss values across the batch.
"mean"
: The mean of the loss values."sum"
: The sum of the loss values.
- class mattertune.configs.JMPBackboneConfig(*, properties, optimizer, lr_scheduler=None, ignore_gpu_batch_transform_error=True, normalizers={}, name='jmp', ckpt_path, graph_computer)[source]
- Parameters:
properties (Sequence[PropertyConfig])
optimizer (OptimizerConfig)
lr_scheduler (LRSchedulerConfig | None)
ignore_gpu_batch_transform_error (bool)
normalizers (Mapping[str, Sequence[NormalizerConfig]])
name (Literal['jmp'])
ckpt_path (Path | CachedPath)
graph_computer (JMPGraphComputerConfig)
- name: Literal['jmp']
The type of the backbone.
- ckpt_path: Path | CE.CachedPath
The path to the pre-trained model checkpoint.
- graph_computer: JMPGraphComputerConfig
The configuration for the graph computer.
- classmethod ensure_dependencies()[source]
Ensure that all dependencies are installed.
This method should raise an exception if any dependencies are missing, with a message indicating which dependencies are missing and how to install them.
- properties: Sequence[PropertyConfig]
Properties to predict.
- optimizer: OptimizerConfig
Optimizer.
- lr_scheduler: LRSchedulerConfig | None
Learning rate scheduler.
- ignore_gpu_batch_transform_error: bool
Whether to ignore data processing errors during training.
- normalizers: Mapping[str, Sequence[NormalizerConfig]]
Normalizers for the properties.
Any property can be associated with multiple normalizers. This is useful for cases where we want to normalize the same property in different ways. For example, we may want to normalize the energy by subtracting the atomic reference energies, as well as by mean and standard deviation normalization.
The normalizers are applied in the order they are defined in the list.
- class mattertune.configs.JMPGraphComputerConfig(*, pbc, cutoffs=CutoffsConfig(main=12.0, aeaint=12.0, qint=12.0, aint=12.0), max_neighbors=MaxNeighborsConfig(main=30, aeaint=20, qint=8, aint=1000), per_graph_radius_graph=False)[source]
- Parameters:
pbc (bool)
cutoffs (CutoffsConfig)
max_neighbors (MaxNeighborsConfig)
per_graph_radius_graph (bool)
- pbc: bool
Whether to use periodic boundary conditions.
- cutoffs: CutoffsConfig
The cutoff for the radius graph.
- max_neighbors: MaxNeighborsConfig
The maximum number of neighbors for the radius graph.
- per_graph_radius_graph: bool
Whether to compute the radius graph per graph.
- class mattertune.configs.JSONDatasetConfig(*, type='json', src, tasks)[source]
- Parameters:
type (Literal['json'])
src (str | Path)
tasks (dict[str, str])
- type: Literal['json']
Discriminator for the JSON dataset.
- src: str | Path
The path to the JSON dataset.
- tasks: dict[str, str]
Attributes in the JSON file that correspond to the tasks to be predicted.
- class mattertune.configs.L2MAELossConfig(*, name='l2_mae', reduction='mean')[source]
- Parameters:
name (Literal['l2_mae'])
reduction (Literal['mean', 'sum'])
- name: Literal['l2_mae']
- reduction: Literal['mean', 'sum']
How to reduce the loss values across the batch.
"mean"
: The mean of the loss values."sum"
: The sum of the loss values.
- class mattertune.configs.M3GNetBackboneConfig(*, properties, optimizer, lr_scheduler=None, ignore_gpu_batch_transform_error=True, normalizers={}, name='m3gnet', ckpt_path, graph_computer)[source]
- Parameters:
properties (Sequence[PropertyConfig])
optimizer (OptimizerConfig)
lr_scheduler (LRSchedulerConfig | None)
ignore_gpu_batch_transform_error (bool)
normalizers (Mapping[str, Sequence[NormalizerConfig]])
name (Literal['m3gnet'])
ckpt_path (str | Path)
graph_computer (M3GNetGraphComputerConfig)
- name: Literal['m3gnet']
The type of the backbone.
- ckpt_path: str | Path
The path to the pre-trained model checkpoint.
- graph_computer: M3GNetGraphComputerConfig
Configuration for the graph computer.
- properties: Sequence[PropertyConfig]
Properties to predict.
- optimizer: OptimizerConfig
Optimizer.
- lr_scheduler: LRSchedulerConfig | None
Learning rate scheduler.
- ignore_gpu_batch_transform_error: bool
Whether to ignore data processing errors during training.
- normalizers: Mapping[str, Sequence[NormalizerConfig]]
Normalizers for the properties.
Any property can be associated with multiple normalizers. This is useful for cases where we want to normalize the same property in different ways. For example, we may want to normalize the energy by subtracting the atomic reference energies, as well as by mean and standard deviation normalization.
The normalizers are applied in the order they are defined in the list.
- class mattertune.configs.M3GNetGraphComputerConfig(*, element_types=<factory>, cutoff=None, threebody_cutoff=None, pre_compute_line_graph=False, graph_labels=None)[source]
Configuration for initializing a MatGL Atoms2Graph converter.
- Parameters:
element_types (tuple[str, ...])
cutoff (float | None)
threebody_cutoff (float | None)
pre_compute_line_graph (bool)
graph_labels (list[int | float] | None)
- element_types: tuple[str, ...]
The element types to consider, default is all elements.
- cutoff: float | None
The cutoff distance for the neighbor list. If None, the cutoff is loaded from the checkpoint.
- threebody_cutoff: float | None
The cutoff distance for the three-body interactions. If None, the cutoff is loaded from the checkpoint.
- pre_compute_line_graph: bool
Whether to pre-compute the line graph for three-body interactions in data preparation.
- graph_labels: list[int | float] | None
The graph labels to consider, default is None.
- class mattertune.configs.MAELossConfig(*, name='mae', reduction='mean')[source]
- Parameters:
name (Literal['mae'])
reduction (Literal['mean', 'sum'])
- name: Literal['mae']
- reduction: Literal['mean', 'sum']
How to reduce the loss values across the batch.
"mean"
: The mean of the loss values."sum"
: The sum of the loss values.
- class mattertune.configs.MPDatasetConfig(*, type='mp', api, fields, query)[source]
Configuration for a dataset stored in the Materials Project database.
- Parameters:
type (Literal['mp'])
api (str)
fields (list[str])
query (dict)
- type: Literal['mp']
Discriminator for the MP dataset.
- api: str
Input API key for the Materials Project database.
- fields: list[str]
Fields to retrieve from the Materials Project database.
- query: dict
Query to filter the data from the Materials Project database.
- class mattertune.configs.MPTrajDatasetConfig(*, type='mptraj', split='train', min_num_atoms=5, max_num_atoms=None, elements=None)[source]
Configuration for the MPTraj (Materials Project trajectory) dataset.
- Parameters:
type (Literal['mptraj'])
split (Literal['train', 'val', 'test'])
min_num_atoms (int | None)
max_num_atoms (int | None)
elements (list[str] | None)
- type: Literal['mptraj']
Discriminator for the MPTraj dataset.
- split: Literal['train', 'val', 'test']
Split of the dataset to use.
- min_num_atoms: int | None
Minimum number of atoms to be considered. Drops structures with fewer atoms.
- max_num_atoms: int | None
Maximum number of atoms to be considered. Drops structures with more atoms.
- elements: list[str] | None
List of elements to be considered. Drops structures with elements not in the list. Subsets are also allowed. For example, [“Li”, “Na”] will keep structures with either Li or Na.
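An illustrative configuration combining the filters above; all filter values are examples:
```python
from mattertune.configs import MPTrajDatasetConfig

# Keep training-split structures with 5-100 atoms whose elements fall within
# ["Li", "Na"]; all filter values are examples.
dataset = MPTrajDatasetConfig(
    split="train",
    min_num_atoms=5,
    max_num_atoms=100,
    elements=["Li", "Na"],
)
```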
- class mattertune.configs.MSELossConfig(*, name='mse', reduction='mean')[source]
- Parameters:
name (Literal['mse'])
reduction (Literal['mean', 'sum'])
- name: Literal['mse']
- reduction: Literal['mean', 'sum']
How to reduce the loss values across the batch.
"mean"
: The mean of the loss values."sum"
: The sum of the loss values.
- class mattertune.configs.ManualSplitDataModuleConfig(*, batch_size, num_workers='auto', pin_memory=True, train, validation=None)[source]
- Parameters:
batch_size (int)
num_workers (int | Literal['auto'])
pin_memory (bool)
train (DatasetConfig)
validation (DatasetConfig | None)
- train: DatasetConfig
The configuration for the training data.
- validation: DatasetConfig | None
The configuration for the validation data.
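A sketch with explicit train/validation datasets; file paths are placeholders:
```python
from mattertune.configs import ManualSplitDataModuleConfig, XYZDatasetConfig

# Explicit train/validation datasets; file paths are placeholders.
data = ManualSplitDataModuleConfig(
    batch_size=16,
    train=XYZDatasetConfig(src="data/train.xyz"),
    validation=XYZDatasetConfig(src="data/val.xyz"),
)
```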
- class mattertune.configs.MatbenchDatasetConfig(*, type='matbench', task=None, property_name=None, fold_idx=0)[source]
Configuration for the Matbench dataset.
- Parameters:
type (Literal['matbench'])
task (str | None)
property_name (str | None)
fold_idx (Literal[0, 1, 2, 3, 4])
- type: Literal['matbench']
Discriminator for the Matbench dataset.
- task: str | None
The name of the Matbench task to include in the dataset.
- property_name: str | None
The property name assigned to the task. Must match the name of a property head in the model.
- fold_idx: Literal[0, 1, 2, 3, 4]
The index of the fold to be used in the dataset.
- class mattertune.configs.MatterTunerConfig(*, data, model, trainer=TrainerConfig(accelerator='auto', strategy='auto', num_nodes=1, devices='auto', precision='32-true', deterministic=None, max_epochs=None, min_epochs=None, max_steps=-1, min_steps=None, max_time=None, val_check_interval=None, check_val_every_n_epoch=1, log_every_n_steps=None, gradient_clip_val=None, gradient_clip_algorithm=None, checkpoint=None, early_stopping=None, loggers='default', additional_trainer_kwargs={}))[source]
- Parameters:
data (DataModuleConfig)
model (ModelConfig)
trainer (TrainerConfig)
- data: DataModuleConfig
The configuration for the data.
- model: ModelConfig
The configuration for the model.
- trainer: TrainerConfig
The configuration for the trainer.
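A minimal end-to-end sketch assembling data, model, and trainer configs from this page; paths, the pretrained-model identifier, and all hyperparameters are placeholders, and the ORB backbone is chosen purely for illustration:
```python
from mattertune.configs import (
    AdamWConfig,
    AutoSplitDataModuleConfig,
    EnergyPropertyConfig,
    MAELossConfig,
    MatterTunerConfig,
    ORBBackboneConfig,
    TrainerConfig,
    XYZDatasetConfig,
)

# Paths, the pretrained-model identifier, and all hyperparameters are
# placeholders; the ORB backbone is chosen purely for illustration.
config = MatterTunerConfig(
    data=AutoSplitDataModuleConfig(
        batch_size=32,
        dataset=XYZDatasetConfig(src="data/train.xyz"),
        train_split=0.9,
    ),
    model=ORBBackboneConfig(
        pretrained_model="<orb-pretrained-model-name>",
        properties=[EnergyPropertyConfig(loss=MAELossConfig())],
        optimizer=AdamWConfig(lr=1e-4),
    ),
    trainer=TrainerConfig(max_epochs=10),
)
```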
- class mattertune.configs.MaxNeighborsConfig(*, main, aeaint, qint, aint)[source]
- Parameters:
main (int)
aeaint (int)
qint (int)
aint (int)
- main: int
- aeaint: int
- qint: int
- aint: int
- class mattertune.configs.MeanStdNormalizerConfig(*, mean, std)[source]
- Parameters:
mean (float)
std (float)
- mean: float
The mean of the property values.
- std: float
The standard deviation of the property values.
- class mattertune.configs.ModelCheckpointConfig(*, dirpath=None, filename=None, monitor=None, verbose=False, save_last=None, save_top_k=1, save_weights_only=False, mode='min', auto_insert_metric_name=True, every_n_train_steps=None, train_time_interval=None, every_n_epochs=None, save_on_train_epoch_end=None, enable_version_counter=True)[source]
- Parameters:
dirpath (str | None)
filename (str | None)
monitor (str | None)
verbose (bool)
save_last (Literal[True, False, 'link'] | None)
save_top_k (int)
save_weights_only (bool)
mode (Literal['min', 'max'])
auto_insert_metric_name (bool)
every_n_train_steps (int | None)
train_time_interval (timedelta | None)
every_n_epochs (int | None)
save_on_train_epoch_end (bool | None)
enable_version_counter (bool)
- dirpath: str | None
Directory to save the model file. Default: None.
- filename: str | None
Checkpoint filename. Can contain named formatting options. Default: None.
- monitor: str | None
Quantity to monitor. Default: None.
- verbose: bool
Verbosity mode. Default: False.
- save_last: Literal[True, False, 'link'] | None
When True or "link", saves a 'last.ckpt' checkpoint when a checkpoint is saved. Default: None.
- save_top_k: int
If save_top_k=k, save the k models with the best monitored quantity. Default: 1.
- save_weights_only: bool
If True, only save model weights. Default: False.
- mode: Literal['min', 'max']
One of {'min', 'max'}. Determines whether the best checkpoint corresponds to the minimum or the maximum of the monitored quantity. Default: 'min'.
- auto_insert_metric_name: bool
Whether to automatically insert the metric name in the checkpoint filename. Default: True.
- every_n_train_steps: int | None
Number of training steps between checkpoints. Default: None.
- train_time_interval: timedelta | None
Checkpoints are monitored at the specified time interval. Default: None.
- every_n_epochs: int | None
Number of epochs between checkpoints. Default: None.
- save_on_train_epoch_end: bool | None
Whether to run checkpointing at the end of the training epoch. Default: None.
- enable_version_counter: bool
Whether to append a version to existing filenames. Default: True.
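An illustrative checkpoint configuration; the monitored key and filename pattern are examples:
```python
from mattertune.configs import ModelCheckpointConfig

# Keep the single best checkpoint by validation loss; the monitored key and
# filename pattern are examples.
checkpoint = ModelCheckpointConfig(
    monitor="val/total_loss",
    mode="min",
    save_top_k=1,
    filename="best-{epoch:02d}",
)
```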
- class mattertune.configs.MultiStepLRConfig(*, type='MultiStepLR', milestones, gamma)[source]
- Parameters:
type (Literal['MultiStepLR'])
milestones (list[int])
gamma (float)
- type: Literal['MultiStepLR']
Type of the learning rate scheduler.
- milestones: list[int]
List of epoch indices. Must be increasing.
- gamma: float
Multiplicative factor of learning rate decay.
- class mattertune.configs.NormalizerConfigBase[source]
- class mattertune.configs.OMAT24DatasetConfig(*, type='omat24', src)[source]
- Parameters:
type (Literal['omat24'])
src (Path)
- type: Literal['omat24']
Discriminator for the OMAT24 dataset.
- src: Path
The path to the OMAT24 dataset.
- class mattertune.configs.ORBBackboneConfig(*, properties, optimizer, lr_scheduler=None, ignore_gpu_batch_transform_error=True, normalizers={}, name='orb', pretrained_model, system=ORBSystemConfig(radius=10.0, max_num_neighbors=20))[source]
- Parameters:
properties (Sequence[PropertyConfig])
optimizer (OptimizerConfig)
lr_scheduler (LRSchedulerConfig | None)
ignore_gpu_batch_transform_error (bool)
normalizers (Mapping[str, Sequence[NormalizerConfig]])
name (Literal['orb'])
pretrained_model (str)
system (ORBSystemConfig)
- name: Literal['orb']
The type of the backbone.
- pretrained_model: str
The name of the pretrained model to load.
- system: ORBSystemConfig
The system configuration, controlling how to featurize a system of atoms.
- classmethod ensure_dependencies()[source]
Ensure that all dependencies are installed.
This method should raise an exception if any dependencies are missing, with a message indicating which dependencies are missing and how to install them.
- properties: Sequence[PropertyConfig]
Properties to predict.
- optimizer: OptimizerConfig
Optimizer.
- lr_scheduler: LRSchedulerConfig | None
Learning rate scheduler.
- ignore_gpu_batch_transform_error: bool
Whether to ignore data processing errors during training.
- normalizers: Mapping[str, Sequence[NormalizerConfig]]
Normalizers for the properties.
Any property can be associated with multiple normalizers. This is useful for cases where we want to normalize the same property in different ways. For example, we may want to normalize the energy by subtracting the atomic reference energies, as well as by mean and standard deviation normalization.
The normalizers are applied in the order they are defined in the list.
- class mattertune.configs.ORBSystemConfig(*, radius, max_num_neighbors)[source]
Config controlling how to featurize a system of atoms.
- Parameters:
radius (float)
max_num_neighbors (int)
- radius: float
The radius for edge construction.
- max_num_neighbors: int
The maximum number of neighbours each node can send messages to.
- class mattertune.configs.PerAtomReferencingNormalizerConfig(*, per_atom_references)[source]
- Parameters:
per_atom_references (Mapping[int, float] | Sequence[float] | Path)
- per_atom_references: Mapping[int, float] | Sequence[float] | Path
The reference values for each element.
- If a dictionary is provided, it maps atomic numbers to reference values.
- If a list is provided, it is a list of reference values indexed by atomic number.
- If a path is provided, it should point to a JSON file containing the references.
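A sketch of two of the accepted input forms; atomic numbers, reference values, and the JSON path are placeholders:
```python
from pathlib import Path

from mattertune.configs import PerAtomReferencingNormalizerConfig

# Dictionary form: atomic number -> reference value (numbers are placeholders).
by_atomic_number = PerAtomReferencingNormalizerConfig(
    per_atom_references={1: -13.6, 6: -1029.0},
)

# Path form: a JSON file containing the references (path is a placeholder).
from_json = PerAtomReferencingNormalizerConfig(
    per_atom_references=Path("references.json"),
)
```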
- class mattertune.configs.PropertyConfigBase(*, name, dtype, loss, loss_coefficient=1.0)[source]
- Parameters:
name (str)
dtype (DType)
loss (LossConfig)
loss_coefficient (float)
- name: str
The name of the property.
This is the key that will be used to access the property in the output of the model.
This is also the key that will be used to access the property in the ASE Atoms object.
- dtype: DType
The type of the property values.
- loss: LossConfig
The loss function to use when training the model on this property.
- loss_coefficient: float
The coefficient to apply to this property’s loss function when training the model.
- abstract from_ase_atoms(atoms)[source]
Extract the property value from an ASE Atoms object.
- Parameters:
atoms (Atoms)
- Return type:
int | float | ndarray | Tensor
- classmethod metric_cls()[source]
- Return type:
type[MetricBase]
- abstract ase_calculator_property_name()[source]
If this property can be calculated by an ASE calculator, returns the name of the property that the ASE calculator uses. Otherwise, returns None.
This should only return non-None for properties that are supported by the ASE calculator interface, i.e. 'energy', 'forces', 'stress', 'dipole', 'charges', 'magmom', and 'magmoms'.
Note that this does not refer to the new experimental custom property prediction support feature in ASE, but rather the built-in properties that ASE can calculate in the ase.calculators.calculator.Calculator class.
- Return type:
ASECalculatorPropertyName | None
- class mattertune.configs.RMSNormalizerConfig(*, rms)[source]
- Parameters:
rms (float)
- rms: float
The root mean square of the property values.
- class mattertune.configs.ReduceOnPlateauConfig(*, type='ReduceLROnPlateau', mode, factor, patience, threshold=0.0001, threshold_mode='rel', cooldown=0, min_lr=0, eps=1e-08)[source]
- Parameters:
type (Literal['ReduceLROnPlateau'])
mode (str)
factor (float)
patience (int)
threshold (float)
threshold_mode (str)
cooldown (int)
min_lr (float)
eps (float)
- type: Literal['ReduceLROnPlateau']
Type of the learning rate scheduler.
- mode: str
One of {“min”, “max”}. Determines when to reduce the learning rate.
- factor: float
Factor by which the learning rate will be reduced.
- patience: int
Number of epochs with no improvement after which learning rate will be reduced.
- threshold: float
Threshold for measuring the new optimum.
- threshold_mode: str
One of {“rel”, “abs”}. Determines the threshold mode.
- cooldown: int
Number of epochs to wait before resuming normal operation.
- min_lr: float
A lower bound on the learning rate.
- eps: float
Minimal decay applied to the learning rate; if the difference between the new and old learning rates is smaller than eps, the update is ignored.
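An illustrative configuration (values are examples):
```python
from mattertune.configs import ReduceOnPlateauConfig

# Halve the learning rate after 5 epochs without improvement; values are
# illustrative.
lr_scheduler = ReduceOnPlateauConfig(mode="min", factor=0.5, patience=5)
```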
- class mattertune.configs.SGDConfig(*, name='SGD', lr, momentum=0.0, weight_decay=0.0, nestrov=False)[source]
- Parameters:
name (Literal['SGD'])
lr (Annotated[float, Gt(gt=0)])
momentum (Annotated[float, Ge(ge=0)])
weight_decay (Annotated[float, Ge(ge=0)])
nestrov (bool)
- name: Literal['SGD']
Name of the optimizer.
- lr: C.PositiveFloat
Learning rate.
- momentum: C.NonNegativeFloat
Momentum.
- weight_decay: C.NonNegativeFloat
Weight decay.
- nestrov: bool
Whether to use Nesterov momentum.
- class mattertune.configs.StepLRConfig(*, type='StepLR', step_size, gamma)[source]
- Parameters:
type (Literal['StepLR'])
step_size (int)
gamma (float)
- type: Literal['StepLR']
Type of the learning rate scheduler.
- step_size: int
Period of learning rate decay.
- gamma: float
Multiplicative factor of learning rate decay.
- class mattertune.configs.StressesPropertyConfig(*, name='stresses', dtype='float', loss, loss_coefficient=1.0, type='stresses', conservative)[source]
- Parameters:
name (str)
dtype (DType)
loss (LossConfig)
loss_coefficient (float)
type (Literal['stresses'])
conservative (bool)
- type: Literal['stresses']
- name: str
The name of the property.
This is the key that will be used to access the property in the output of the model.
- dtype: DType
The type of the property values.
- conservative: bool
Similar to the conservative parameter in ForcesPropertyConfig, this parameter specifies whether the stresses should be computed in a conservative manner.
- ase_calculator_property_name()[source]
If this property can be calculated by an ASE calculator, returns the name of the property that the ASE calculator uses. Otherwise, returns None.
This should only return non-None for properties that are supported by the ASE calculator interface, i.e. 'energy', 'forces', 'stress', 'dipole', 'charges', 'magmom', and 'magmoms'.
Note that this does not refer to the new experimental custom property prediction support feature in ASE, but rather the built-in properties that ASE can calculate in the ase.calculators.calculator.Calculator class.
- class mattertune.configs.TensorBoardLoggerConfig(*, type='tensorboard', save_dir, name='lightning_logs', version=None, log_graph=False, default_hp_metric=True, prefix='', sub_dir=None, additional_params={})[source]
- Parameters:
type (Literal['tensorboard'])
save_dir (str)
name (str | None)
version (int | str | None)
log_graph (bool)
default_hp_metric (bool)
prefix (str)
sub_dir (str | None)
additional_params (dict[str, Any])
- type: Literal['tensorboard']
- save_dir: str
Save directory where TensorBoard logs will be saved.
- name: str | None
Experiment name. If an empty string, no per-experiment subdirectory is used. Default: 'lightning_logs'.
- version: int | str | None
Experiment version. If not specified, the logger auto-assigns the next available version. If a string, it is used as the run-specific subdirectory name. Default: None.
- log_graph: bool
Whether to add the computational graph to TensorBoard. Requires model.example_input_array to be defined. Default: False.
- default_hp_metric: bool
Enables a placeholder metric with key hp_metric when logging hyperparameters without a metric. Default: True.
- prefix: str
String to put at the beginning of metric keys. Default: ''.
- sub_dir: str | None
Sub-directory to group TensorBoard logs. If provided, logs are saved in /save_dir/name/version/sub_dir/. Default: None.
- additional_params: dict[str, Any]
Additional parameters passed to tensorboardX.SummaryWriter. Default: {}.
- class mattertune.configs.TrainerConfig(*, accelerator='auto', strategy='auto', num_nodes=1, devices='auto', precision='32-true', deterministic=None, max_epochs=None, min_epochs=None, max_steps=-1, min_steps=None, max_time=None, val_check_interval=None, check_val_every_n_epoch=1, log_every_n_steps=None, gradient_clip_val=None, gradient_clip_algorithm=None, checkpoint=None, early_stopping=None, loggers='default', additional_trainer_kwargs={})[source]
- Parameters:
accelerator (str)
strategy (str | Strategy)
num_nodes (int)
devices (list[int] | str | int)
precision (Literal[64, 32, 16] | Literal['transformer-engine', 'transformer-engine-float16', '16-true', '16-mixed', 'bf16-true', 'bf16-mixed', '32-true', '64-true'] | Literal['64', '32', '16', 'bf16'] | None)
deterministic (bool | Literal['warn'] | None)
max_epochs (int | None)
min_epochs (int | None)
max_steps (int)
min_steps (int | None)
max_time (str | timedelta | dict[str, int] | None)
val_check_interval (int | float | None)
check_val_every_n_epoch (int | None)
log_every_n_steps (int | None)
gradient_clip_val (int | float | None)
gradient_clip_algorithm (str | None)
checkpoint (ModelCheckpointConfig | None)
early_stopping (EarlyStoppingConfig | None)
loggers (Sequence[LoggerConfig] | Literal['default'])
additional_trainer_kwargs (dict[str, Any])
- accelerator: str
Supports passing different accelerator types (“cpu”, “gpu”, “tpu”, “ipu”, “hpu”, “mps”, “auto”) as well as custom accelerator instances.
- strategy: str | Strategy
Supports different training strategies with aliases as well as custom strategies. Default: "auto".
- num_nodes: int
Number of GPU nodes for distributed training. Default: 1.
- devices: list[int] | str | int
The devices to use. Can be set to a sequence of device indices, "all" to indicate all available devices should be used, or "auto" for automatic selection based on the chosen accelerator. Default: "auto".
- precision: _PRECISION_INPUT | None
Double precision (64, '64' or '64-true'), full precision (32, '32' or '32-true'), 16-bit mixed precision (16, '16', '16-mixed') or bfloat16 mixed precision ('bf16', 'bf16-mixed'). Can be used on CPU, GPU, TPUs, HPUs or IPUs. Default: '32-true'.
- deterministic: bool | Literal['warn'] | None
If True, sets whether PyTorch operations must use deterministic algorithms. Set to "warn" to use deterministic algorithms whenever possible, throwing warnings on operations that don't support deterministic mode. If not set, defaults to False. Default: None.
- max_epochs: int | None
Stop training once this number of epochs is reached. Disabled by default (None). If both max_epochs and max_steps are not specified, defaults to max_epochs = 1000. To enable infinite training, set max_epochs = -1.
- min_epochs: int | None
Force training for at least this many epochs. Disabled by default (None).
- max_steps: int
Stop training after this number of steps. Disabled by default (-1). If max_steps = -1 and max_epochs = None, will default to max_epochs = 1000. To enable infinite training, set max_epochs to -1.
- min_steps: int | None
Force training for at least this number of steps. Disabled by default (None).
- max_time: str | timedelta | dict[str, int] | None
Stop training after this amount of time has passed. Disabled by default (None). The time duration can be specified in the format DD:HH:MM:SS (days, hours, minutes, seconds), as a datetime.timedelta, or as a dictionary with keys that will be passed to datetime.timedelta.
- val_check_interval: int | float | None
How often to check the validation set. Pass a float in the range [0.0, 1.0] to check after a fraction of the training epoch. Pass an int to check after a fixed number of training batches. An int value can only be higher than the number of training batches when check_val_every_n_epoch=None, which validates after every N training batches across epochs or during iteration-based training. Default: 1.0.
- check_val_every_n_epoch: int | None
Perform a validation loop after every N training epochs. If None, validation will be done solely based on the number of training batches, requiring val_check_interval to be an integer value. Default: 1.
- log_every_n_steps: int | None
How often to log within steps. Default: 50.
- gradient_clip_val: int | float | None
The value at which to clip gradients. Passing gradient_clip_val=None disables gradient clipping. If using Automatic Mixed Precision (AMP), the gradients will be unscaled before clipping. Default: None.
- gradient_clip_algorithm: str | None
The gradient clipping algorithm to use. Pass gradient_clip_algorithm="value" to clip by value, and gradient_clip_algorithm="norm" to clip by norm. By default it will be set to "norm".
- checkpoint: ModelCheckpointConfig | None
The configuration for the model checkpoint.
- early_stopping: EarlyStoppingConfig | None
The configuration for early stopping.
- loggers: Sequence[LoggerConfig] | Literal['default']
The loggers to use for logging training metrics. If "default", will use the CSV logger plus the W&B logger if available. Default: "default".
- additional_trainer_kwargs: dict[str, Any]
Additional keyword arguments for the Lightning Trainer.
This is for advanced users who want to customize the Lightning Trainer, and is not recommended for beginners.
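A sketch combining the trainer options documented above with the checkpoint, early-stopping, and logger configs from this page; all values are illustrative:
```python
from mattertune.configs import (
    CSVLoggerConfig,
    EarlyStoppingConfig,
    ModelCheckpointConfig,
    TrainerConfig,
)

# All values are illustrative; the checkpoint, early-stopping, and logger
# configs are documented elsewhere on this page.
trainer = TrainerConfig(
    max_epochs=100,
    gradient_clip_val=1.0,
    checkpoint=ModelCheckpointConfig(monitor="val/total_loss", mode="min"),
    early_stopping=EarlyStoppingConfig(monitor="val/total_loss", patience=10),
    loggers=[CSVLoggerConfig(save_dir="./logs")],
)
```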
- class mattertune.configs.WandbLoggerConfig(*, type='wandb', name=None, save_dir='.', version=None, offline=False, dir=None, id=None, anonymous=None, project=None, log_model=False, prefix='', experiment=None, checkpoint_name=None, additional_init_parameters={})[source]
- Parameters:
type (Literal['wandb'])
name (str | None)
save_dir (str)
version (str | None)
offline (bool)
dir (str | None)
id (str | None)
anonymous (bool | None)
project (str | None)
log_model (Literal['all'] | bool)
prefix (str)
experiment (Any | None)
checkpoint_name (str | None)
additional_init_parameters (dict[str, Any])
- type: Literal['wandb']
- name: str | None
Display name for the run. Default: None.
- save_dir: str
Path where data is saved. Default: '.'.
- version: str | None
Sets the version, mainly used to resume a previous run. Default: None.
- offline: bool
Run offline (data can be streamed later to wandb servers). Default: False.
- dir: str | None
Same as save_dir. Default: None.
- id: str | None
Same as version. Default: None.
- anonymous: bool | None
Enables or explicitly disables anonymous logging. Default: None.
- project: str | None
The name of the project to which this run will belong. Default: None.
- log_model: Literal['all'] | bool
Whether/how to log model checkpoints as W&B artifacts. If 'all', checkpoints are logged during training. If True, checkpoints are logged at the end of training. If False, no checkpoints are logged. Default: False.
- prefix: str
A string to put at the beginning of metric keys. Default: ''.
- experiment: Any | None
WandB experiment object. Automatically set when creating a run. Default: None.
- checkpoint_name: str | None
Name of the model checkpoint artifact being logged. Default: None.
- additional_init_parameters: dict[str, Any]
Additional parameters to pass to wandb.init(). Default: {}.
- class mattertune.configs.XYZDatasetConfig(*, type='xyz', src)[source]
- Parameters:
type (Literal['xyz'])
src (str | Path)
- type: Literal['xyz']
Discriminator for the XYZ dataset.
- src: str | Path
The path to the XYZ dataset.
Modules