Tips and tricks¶
Active Learning¶
Fake MD candidates¶
DFT stage collects configurations in dir 01_md/md_candidate/*.extxyz. These are the candidates selected by the active learning algorithm.
- You can totally "fake" these candidates by: adding some configurations, removing some configurations, or even replacing all candidates with your own configurations,... just by place your
.extxyzfiles in the dir01_md/md_candidate/. The active learning algorithm will just read the configurations in thisdirand use them for the next DFT labeling stage. This is useful when you want have more custom configurations to be labeled by DFT.
Note: avoid adding duplicated custom configurations, do as follow:
- Run below cell to collect
extxyzfiles in DFT label directories. Then, add the collectedextxyzfile back to dataset for training.- Delete all DFT task directories
- Add custom MD candidate structures to
01_md/md_candidate/*.extxyz- Modify
_alff.iterto ben-1 2step,
- Relaunch
alff_trainstep to recollect interation-data for checking duplicates. (remember to quitalffbefore graph building)- Then, modify
_alff.iterto ben 1step, and relaunchalffto DFT label the new candidate structures.
- You can also ignore completely the MD runs, and just place your own configurations in the dir
01_md/md_candidate/for DFT labeling. Then, modify the ALFF iterlog_alff.iterto skip the MD stage and directly go to DFT stage (for example ignoring MD stage atiter 12, modify corresponding lines to12 1). This is useful when you have some specific configurations that you want to be labeled by DFT, but you don't want to run the MD simulations.
Effective sampling¶
Rules of thumb:
- Explicitly index
init_struct_pathsfor each structure for easier control sampling (can get them from01_md/init_paths.yml), i.e., avoid using wildcards likeinit_structs/*. - Exclude enough sampled structures from the sampling space to save computational resources (MD run).
- Add empty sampling spaces if want to ignore some iterations of sampling
md:
sampling_spaces:
- {} # ignore some iterations
- {}