Workflow Type: Common Workflow Language
Frozen
Stable
AutoEncoders for Anomaly Detection tutorial using BioExcel Building Blocks (biobb)
This tutorial involves the use of a multilayer AutoEncoder (AE) for feature extraction and pattern recognition by analyzing Molecular Dynamic Simulations, step by step, using the BioExcel Building Blocks library (biobb).
Copyright & Licensing
This software has been developed in the MMB group at the BSC & IRB for the European BioExcel, funded by the European Commission (EU Horizon Europe 101093290, EU H2020 823830, EU H2020 675728).
- (c) 2015-2026 Barcelona Supercomputing Center
- (c) 2015-2026 Institute for Research in Biomedicine
Licensed under the Apache License 2.0, see the file LICENSE for details.

Click and drag the diagram to pan, double click or use the controls to zoom.
Inputs
| ID | Name | Description | Type |
|---|---|---|---|
| step1_gmx_image1_input_traj_path | Input file | Path to the GROMACS trajectory file. |
|
| step1_gmx_image1_input_top_path | Input file | Path to the GROMACS input topology file. |
|
| step1_gmx_image1_output_traj_path | Output file | Path to the output file. |
|
| step1_gmx_image1_config | Config file | Configuration file for biobb_analysis.gmx_image tool. |
|
| step2_mdfeaturizer1_input_topology_path | Input file | Path to the input topology file. |
|
| step2_mdfeaturizer1_output_dataset_pt_path | Output file | Path to the output dataset model file. |
|
| step2_mdfeaturizer1_output_stats_pt_path | Output file | Path to the output model statistics file. |
|
| step2_mdfeaturizer1_config | Config file | Configuration file for biobb_pytorch.mdfeaturizer tool. |
|
| step3_build_model_output_model_pth_path | Output file | Path to save the model in .pth format. |
|
| step3_build_model_config | Config file | Configuration file for biobb_pytorch.build_model tool. |
|
| step4_train_model_output_model_pth_path | Output file | Path to save the trained model (.pth). If omitted, the trained model is only available in memory. |
|
| step4_train_model_output_metrics_npz_path | Output file | Path save training metrics in compressed NumPy format (.npz). |
|
| step4_train_model_config | Config file | Configuration file for biobb_pytorch.train_model tool. |
|
| step5_gmx_image2_input_traj_path | Input file | Path to the GROMACS trajectory file. |
|
| step5_gmx_image2_input_top_path | Input file | Path to the GROMACS input topology file. |
|
| step5_gmx_image2_output_traj_path | Output file | Path to the output file. |
|
| step5_gmx_image2_config | Config file | Configuration file for biobb_analysis.gmx_image tool. |
|
| step6_mdfeaturizer2_input_topology_path | Input file | Path to the input topology file. |
|
| step6_mdfeaturizer2_output_dataset_pt_path | Output file | Path to the output dataset model file. |
|
| step6_mdfeaturizer2_output_stats_pt_path | Output file | Path to the output model statistics file. |
|
| step6_mdfeaturizer2_config | Config file | Configuration file for biobb_pytorch.mdfeaturizer tool. |
|
| step7_evaluate_model_output_results_npz_path | Output file | Path to the output evaluation results file (compressed NumPy archive). |
|
| step7_evaluate_model_config | Config file | Configuration file for biobb_pytorch.evaluate_model tool. |
|
| step8_make_ndx1_input_structure_path | Input file | Path to the input GRO/PDB/TPR file. |
|
| step8_make_ndx1_output_ndx_path | Output file | Path to the output index NDX file. |
|
| step8_make_ndx1_config | Config file | Configuration file for biobb_gromacs.make_ndx tool. |
|
| step9_make_ndx2_input_structure_path | Input file | Path to the input GRO/PDB/TPR file. |
|
| step9_make_ndx2_output_ndx_path | Output file | Path to the output index NDX file. |
|
| step9_make_ndx2_config | Config file | Configuration file for biobb_gromacs.make_ndx tool. |
|
| step10_gmx_rmsf1_input_structure_path | Input file | Path to the input structure file. |
|
| step10_gmx_rmsf1_input_traj_path | Input file | Path to the GROMACS trajectory file. |
|
| step10_gmx_rmsf1_output_xvg_path | Output file | Path to the XVG output file. |
|
| step10_gmx_rmsf1_config | Config file | Configuration file for biobb_analysis.gmx_rmsf tool. |
|
| step11_gmx_rmsf2_input_structure_path | Input file | Path to the input structure file. |
|
| step11_gmx_rmsf2_input_traj_path | Input file | Path to the GROMACS trajectory file. |
|
| step11_gmx_rmsf2_output_xvg_path | Output file | Path to the XVG output file. |
|
| step11_gmx_rmsf2_config | Config file | Configuration file for biobb_analysis.gmx_rmsf tool. |
|
| step12_feat2traj_output_traj_path | Output file | Path to save the trajectory in xtc/pdb/dcd format. |
|
| step12_feat2traj_output_top_path | Output file | Path to save the output topology file (pdb). Used if trajectory format requires separate topology. |
|
| step13_gmx_rmsf3_output_xvg_path | Output file | Path to the XVG output file. |
|
| step13_gmx_rmsf3_config | Config file | Configuration file for biobb_analysis.gmx_rmsf tool. |
|
| step14_make_plumed_input_reference_pdb_path | Input file | Path to reference PDB used for FIT_TO_TEMPLATE actions when Cartesian features are present. |
|
| step14_make_plumed_output_model_ptc_path | Output file | Path to the output TorchScript model file (.ptc) for PLUMED's PYTORCH_MODEL action. |
|
| step14_make_plumed_output_plumed_dat_path | Output file | Path to the output PLUMED input file. |
|
| step14_make_plumed_output_features_dat_path | Output file | Path to the output features.dat file describing the CVs to PLUMED. |
|
| step14_make_plumed_config | Config file | Configuration file for biobb_pytorch.make_plumed tool. |
|
Steps
| ID | Name | Description |
|---|---|---|
| step1_gmx_image1 | gmx_image | Wrapper of the GROMACS trjconv module for correcting periodicity (image) from a given GROMACS compatible trajectory file. |
| step2_mdfeaturizer1 | mdfeaturizer | Obtain Molecular Dynamics features for PyTorch MDAE model training (dataset and statistics). |
| step3_build_model | build_model | Build a PyTorch autoencoder model (MDAE) from input statistics and hyperparameters. |
| step4_train_model | train_model | Train a PyTorch autoencoder model (MDAE) from an input dataset and initial model, optionally saving the trained model and training metrics. |
| step5_gmx_image2 | gmx_image | Wrapper of the GROMACS trjconv module for correcting periodicity (image) from a given GROMACS compatible trajectory file. |
| step6_mdfeaturizer2 | mdfeaturizer | Obtain Molecular Dynamics features for PyTorch MDAE model training (dataset and statistics). |
| step7_evaluate_model | evaluate_model | Evaluate a trained PyTorch autoencoder model (MDAE) on a dataset, computing loss and collecting model evaluation variables. |
| step8_make_ndx1 | make_ndx | Creates a GROMACS index file (NDX) from an input selection and an input GROMACS structure file. |
| step9_make_ndx2 | make_ndx | Creates a GROMACS index file (NDX) from an input selection and an input GROMACS structure file. |
| step10_gmx_rmsf1 | gmx_rmsf | Wrapper of the GROMACS module for calculating the Root Mean Square fluctuations (RMSf) of a given GROMACS compatible trajectory. |
| step11_gmx_rmsf2 | gmx_rmsf | Wrapper of the GROMACS module for calculating the Root Mean Square fluctuations (RMSf) of a given GROMACS compatible trajectory. |
| step12_feat2traj | feat2traj | Convert reconstructed molecular dynamics features (e.g. decoded coordinates) into an MD trajectory file using topology information from the stats file and/or a topology file. |
| step13_gmx_rmsf3 | gmx_rmsf | Wrapper of the GROMACS module for calculating the Root Mean Square fluctuations (RMSf) of a given GROMACS compatible trajectory. |
| step14_make_plumed | make_plumed | Generate a PLUMED input file, a features.dat file and a TorchScript (.ptc) model from a trained MDAE model and feature statistics, suitable for biased molecular dynamics simulations. |
Outputs
| ID | Name | Description | Type |
|---|---|---|---|
| step1_gmx_image1_out1 | output_traj_path | Path to the output file. |
|
| step2_mdfeaturizer1_out1 | output_dataset_pt_path | Path to the output dataset model file. |
|
| step2_mdfeaturizer1_out2 | output_stats_pt_path | Path to the output model statistics file. |
|
| step3_build_model_out1 | output_model_pth_path | Path to save the model in .pth format. |
|
| step4_train_model_out1 | output_model_pth_path | Path to save the trained model (.pth). If omitted, the trained model is only available in memory. |
|
| step4_train_model_out2 | output_metrics_npz_path | Path save training metrics in compressed NumPy format (.npz). |
|
| step5_gmx_image2_out1 | output_traj_path | Path to the output file. |
|
| step6_mdfeaturizer2_out1 | output_dataset_pt_path | Path to the output dataset model file. |
|
| step6_mdfeaturizer2_out2 | output_stats_pt_path | Path to the output model statistics file. |
|
| step7_evaluate_model_out1 | output_results_npz_path | Path to the output evaluation results file (compressed NumPy archive). |
|
| step8_make_ndx1_out1 | output_ndx_path | Path to the output index NDX file. |
|
| step9_make_ndx2_out1 | output_ndx_path | Path to the output index NDX file. |
|
| step10_gmx_rmsf1_out1 | output_xvg_path | Path to the XVG output file. |
|
| step11_gmx_rmsf2_out1 | output_xvg_path | Path to the XVG output file. |
|
| step12_feat2traj_out1 | output_traj_path | Path to save the trajectory in xtc/pdb/dcd format. |
|
| step12_feat2traj_out2 | output_top_path | Path to save the output topology file (pdb). Used if trajectory format requires separate topology. |
|
| step13_gmx_rmsf3_out1 | output_xvg_path | Path to the XVG output file. |
|
| step14_make_plumed_out1 | output_model_ptc_path | Path to the output TorchScript model file (.ptc) for PLUMED's PYTORCH_MODEL action. |
|
| step14_make_plumed_out2 | output_plumed_dat_path | Path to the output PLUMED input file. |
|
| step14_make_plumed_out3 | output_features_dat_path | Path to the output features.dat file describing the CVs to PLUMED. |
|
Version History
Version 1 (earliest) Created 27th Mar 2026 at 09:24 by Genís Bayarri
Initial commit
Frozen
Version-1
77cf6a9
Creators and SubmitterCreators
Submitter
Discussion Channel
Citation
Bayarri, G., & Hospital, A. (2026). CWL AutoEncoders for MD Analysis tutorial. WorkflowHub. https://doi.org/10.48546/WORKFLOWHUB.WORKFLOW.2144.1
License
Activity
Views: 224 Downloads: 71
Created: 27th Mar 2026 at 09:24
TagsThis item has not yet been tagged.
AttributionsNone
View on GitHub
https://orcid.org/0000-0003-0513-0288