Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.06-36-g478fd8a4ac98) initialized
Starting run at 08:27:40 UTC on 2022-06-20.
Successfully read inputs file ...
Castro git describe: 22.06-12-g556652b03
AMReX git describe: 22.06-36-g478fd8a4a
Microphysics git describe: 22.06-4-gef2eb86c
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
  Level 0   8 grids  262144 cells  100 % of domain
            smallest grid: 32 x 32 x 32  biggest grid: 32 x 32 x 32
CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.043225741 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.024978399 seconds
[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.048004759
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05
[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.049883845
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05
[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.062221712
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05
[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.063182637
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05
[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.064465512
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05
CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.040036153 secs.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.068467509
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.070931588
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.054183753
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.065520535
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.070001184
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.040109053 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.024740538 seconds
Ending run at 08:27:40 UTC on 2022-06-20.
Run time = 0.839776223
Run time without initialization = 0.722351438
Average number of zones advanced per microsecond: 3.629
Average number of zones advanced per microsecond per rank: 3.629
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.8398 ... 0.8398 ...
0.8398 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.2263 0.2263 0.2263 26.95% VisMF::Write(FabArray) 11 0.1668 0.1668 0.1668 19.86% MLCellLinOp::applyBC() 4379 0.08052 0.08052 0.08052 9.59% MLPoisson::Fsmooth() 3240 0.06318 0.06318 0.06318 7.52% StateData::FillBoundary(geom) 328 0.02492 0.02492 0.02492 2.97% MLCGSolver::bicgstab 81 0.024 0.024 0.024 2.86% MultiFab::Dot() 1100 0.02245 0.02245 0.02245 2.67% Castro::computeTemp() 63 0.01561 0.01561 0.01561 1.86% MultiFab::LinComb() 1566 0.0144 0.0144 0.0144 1.71% FabArray::setVal() 1135 0.01431 0.01431 0.01431 1.70% Castro::normalize_species() 62 0.01414 0.01414 0.01414 1.68% FillBoundary_nowait() 3974 0.01406 0.01406 0.01406 1.67% FabArray::ParallelCopy_nowait() 851 0.01312 0.01312 0.01312 1.56% Castro::enforce_min_density() 62 0.01229 0.01229 0.01229 1.46% MLCellLinOp::defineAuxData() 11 0.01194 0.01194 0.01194 1.42% MLPoisson::Fapply() 1128 0.01173 0.01173 0.01173 1.40% StateDataPhysBCFunct::() 41 0.01157 0.01157 0.01157 1.38% Gravity::fill_multipole_BCs() 11 0.008279 0.008279 0.008279 0.99% MLMG::addInterpCorrection() 405 0.007387 0.007387 0.007387 0.88% amrex::average_down 405 0.006786 0.006786 0.006786 0.81% MultiFab::Xpay() 578 0.006589 0.006589 0.006589 0.78% Castro::estTimeStep() 21 0.005866 0.005866 0.005866 0.70% Castro::do_advance_ctu() 10 0.005497 0.005497 0.005497 0.65% Amr::checkPoint() 3 0.004031 0.004031 0.004031 0.48% BndryData::define() 11 0.003939 0.003939 0.003939 0.47% Castro::reset_internal_energy(MultiFab) 63 0.003836 0.003836 0.003836 0.46% Castro::construct_new_gravity_source() 10 0.003239 0.003239 0.003239 0.39% Castro::construct_old_gravity_source() 10 0.002576 0.002576 0.002576 0.31% Amr::writePlotFile() 2 0.002364 0.002364 0.002364 0.28% 
Gravity::get_new_grav_vector() 11 0.001947 0.001947 0.001947 0.23% MLMG::ResNormInf() 92 0.001915 0.001915 0.001915 0.23% MultiFab::Saxpy() 20 0.001817 0.001817 0.001817 0.22% Castro::expand_state() 10 0.001732 0.001732 0.001732 0.21% Gravity::get_old_grav_vector() 10 0.00173 0.00173 0.00173 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.00167 0.00167 0.00167 0.20% MLMG::oneIter() 81 0.001655 0.001655 0.001655 0.20% Castro::reset_internal_energy(Fab) 504 0.001605 0.001605 0.001605 0.19% MLCellLinOp::setLevelBC() 11 0.001564 0.001564 0.001564 0.19% Gravity::actual_solve_with_mlmg() 11 0.001357 0.001357 0.001357 0.16% FabArray::setDomainBndry() 41 0.001336 0.001336 0.001336 0.16% FabArray::mult() 43 0.00133 0.00133 0.00133 0.16% MLCellLinOp::prepareForSolve() 11 0.001182 0.001182 0.001182 0.14% MultiFab::contains_nan() 20 0.001166 0.001166 0.001166 0.14% Castro::initData() 1 0.00115 0.00115 0.00115 0.14% MLCellLinOp::smooth() 1620 0.001098 0.001098 0.001098 0.13% MLMG::prepareForSolve() 11 0.001055 0.001055 0.001055 0.13% MLCellLinOp::compGrad() 11 0.0009253 0.0009253 0.0009253 0.11% FabArray::FillBoundary() 3974 0.0007943 0.0007943 0.0007943 0.09% Castro::enforce_speed_limit() 62 0.0007704 0.0007704 0.0007704 0.09% FabArrayBase::getCPC() 1313 0.0007523 0.0007523 0.0007523 0.09% FabArrayBase::CPC::define() 454 0.000674 0.000674 0.000674 0.08% FabArrayBase::getFB() 3974 0.0006442 0.0006442 0.0006442 0.08% Amr::InitAmr() 1 0.0004596 0.0004596 0.0004596 0.05% MLCellLinOp::apply() 1128 0.0004528 0.0004528 0.0004528 0.05% Gravity::solve_for_phi() 10 0.0004236 0.0004236 0.0004236 0.05% Gravity::update_max_rhs() 11 0.0004068 0.0004068 0.0004068 0.05% CGSolver::sxay() 1566 0.000361 0.000361 0.000361 0.04% FillPatchIterator::Initialize 41 0.00031 0.00031 0.00031 0.04% Amr::coarseTimeStep() 10 0.0003051 0.0003051 0.0003051 0.04% MLCGSolver::ParallelAllReduce 1495 0.0003016 0.0003016 0.0003016 0.04% MLCellLinOp::defineBC() 11 0.0002841 0.0002841 0.0002841 0.03% 
FabArray::ParallelCopy() 851 0.0002636 0.0002636 0.0002636 0.03% main() 1 0.0002624 0.0002624 0.0002624 0.03% MultiFab::Copy() 11 0.0002588 0.0002588 0.0002588 0.03% MultiFab::max() 11 0.0002516 0.0002516 0.0002516 0.03% MLCellLinOp::correctionResidual() 486 0.0002277 0.0002277 0.0002277 0.03% MLMG::MLRhsNormInf() 11 0.0002125 0.0002125 0.0002125 0.03% Castro::construct_new_gravity() 10 0.0002053 0.0002053 0.0002053 0.02% MLLinOp::defineGrids() 11 0.000202 0.000202 0.000202 0.02% MLMG::mgVcycle() 81 0.0001928 0.0001928 0.0001928 0.02% Amr::timeStep() 10 0.0001876 0.0001876 0.0001876 0.02% Castro::subcycle_advance_ctu() 10 0.0001824 0.0001824 0.0001824 0.02% StateData::checkPoint() 12 0.0001272 0.0001272 0.0001272 0.02% MLMG:computeResOfCorrection() 405 0.0001112 0.0001112 0.0001112 0.01% MLMG::actualBottomSolve() 81 9.568e-05 9.568e-05 9.568e-05 0.01% MLMG::mgVcycle_down::0 81 8.513e-05 8.513e-05 8.513e-05 0.01% Castro::Castro() 1 8.434e-05 8.434e-05 8.434e-05 0.01% FabArrayBase::FB::FB() 56 8.423e-05 8.423e-05 8.423e-05 0.01% Castro::initialize_advance() 10 8.153e-05 8.153e-05 8.153e-05 0.01% MLMG::mgVcycle_down::2 81 7.88e-05 7.88e-05 7.88e-05 0.01% MLMG::mgVcycle_down::1 81 7.715e-05 7.715e-05 7.715e-05 0.01% Castro::clean_state() 62 7.535e-05 7.535e-05 7.535e-05 0.01% MLMG::solve() 11 7.241e-05 7.241e-05 7.241e-05 0.01% AmrLevel::checkPoint() 3 7.092e-05 7.092e-05 7.092e-05 0.01% MLMG::mgVcycle_down::3 81 6.854e-05 6.854e-05 6.854e-05 0.01% MLMG::mgVcycle_down::4 81 6.844e-05 6.844e-05 6.844e-05 0.01% Castro::finalize_advance() 10 6.484e-05 6.484e-05 6.484e-05 0.01% Castro::initialize_do_advance() 10 6.482e-05 6.482e-05 6.482e-05 0.01% MLMG::mgVcycle_up::4 81 5.863e-05 5.863e-05 5.863e-05 0.01% Castro::advance() 10 5.624e-05 5.624e-05 5.624e-05 0.01% MLMG::mgVcycle_up::0 81 4.901e-05 4.901e-05 4.901e-05 0.01% MLMG::mgVcycle_up::1 81 4.858e-05 4.858e-05 4.858e-05 0.01% MLMG::mgVcycle_up::3 81 4.807e-05 4.807e-05 4.807e-05 0.01% MLCellLinOp::solutionResidual() 92 
4.786e-05 4.786e-05 4.786e-05 0.01% MLMG::mgVcycle_up::2 81 4.556e-05 4.556e-05 4.556e-05 0.01% Castro::swap_state_time_levels() 10 4.19e-05 4.19e-05 4.19e-05 0.00% Castro::finalize_do_advance() 10 3.724e-05 3.724e-05 3.724e-05 0.00% StateData::define() 4 3.68e-05 3.68e-05 3.68e-05 0.00% Castro::enforce_consistent_e() 1 3.517e-05 3.517e-05 3.517e-05 0.00% Castro::construct_new_source() 50 3.365e-05 3.365e-05 3.365e-05 0.00% Gravity::actual_multilevel_solve() 1 3.267e-05 3.267e-05 3.267e-05 0.00% MLMG::mgVcycle_bottom 81 3.214e-05 3.214e-05 3.214e-05 0.00% MLMG::computeResidual() 81 3.121e-05 3.121e-05 3.121e-05 0.00% Gravity::solve_phi_with_mlmg() 11 3.087e-05 3.087e-05 3.087e-05 0.00% FillPatchSingleLevel 41 2.952e-05 2.952e-05 2.952e-05 0.00% Castro::initMFs() 1 2.621e-05 2.621e-05 2.621e-05 0.00% makeSFC 55 2.608e-05 2.608e-05 2.608e-05 0.00% Amr::writeSmallPlotFile() 1 2.584e-05 2.584e-05 2.584e-05 0.00% Amr::defBaseLevel() 1 2.466e-05 2.466e-05 2.466e-05 0.00% MLLinOp::define() 11 2.424e-05 2.424e-05 2.424e-05 0.00% Castro::buildMetrics() 1 2.422e-05 2.422e-05 2.422e-05 0.00% MLPoisson::define() 11 2.397e-05 2.397e-05 2.397e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.997e-05 1.997e-05 1.997e-05 0.00% Amr::FinalizeInit() 1 1.95e-05 1.95e-05 1.95e-05 0.00% Castro::construct_old_source() 50 1.748e-05 1.748e-05 1.748e-05 0.00% Castro::do_new_sources() 10 1.723e-05 1.723e-05 1.723e-05 0.00% Castro::do_old_sources() 10 1.578e-05 1.578e-05 1.578e-05 0.00% DistributionMapping::Distribute() 56 1.499e-05 1.499e-05 1.499e-05 0.00% Gravity::swapTimeLevels() 10 1.381e-05 1.381e-05 1.381e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.379e-05 1.379e-05 1.379e-05 0.00% Castro::check_for_nan() 20 1.145e-05 1.145e-05 1.145e-05 0.00% Castro::apply_source_to_state() 20 1.112e-05 1.112e-05 1.112e-05 0.00% Castro::construct_old_gravity() 10 9.194e-06 9.194e-06 9.194e-06 0.00% MLPoisson::prepareForSolve() 11 8.542e-06 8.542e-06 8.542e-06 0.00% Castro::post_timestep() 10 
8.344e-06 8.344e-06 8.344e-06 0.00% Amr::initSubcycle() 1 7.83e-06 7.83e-06 7.83e-06 0.00% Amr::InitializeInit() 1 6.616e-06 6.616e-06 6.616e-06 0.00% MLMG::computeMLResidual() 11 6.274e-06 6.274e-06 6.274e-06 0.00% Castro::computeNewDt() 9 6.2e-06 6.2e-06 6.2e-06 0.00% AmrLevel::AmrLevel(dm) 1 6.164e-06 6.164e-06 6.164e-06 0.00% MLMG::getGradSolution() 11 5.444e-06 5.444e-06 5.444e-06 0.00% AmrLevel::checkPointPost() 3 4.895e-06 4.895e-06 4.895e-06 0.00% MLMG::buildFineMask() 11 4.751e-06 4.751e-06 4.751e-06 0.00% MLMG::MLResNormInf() 11 4.564e-06 4.564e-06 4.564e-06 0.00% Castro::create_source_corrector() 10 4.439e-06 4.439e-06 4.439e-06 0.00% Castro::retry_advance_ctu() 10 4.214e-06 4.214e-06 4.214e-06 0.00% Gravity::set_mass_offset() 11 3.554e-06 3.554e-06 3.554e-06 0.00% Castro::post_init() 1 3.456e-06 3.456e-06 3.456e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.227e-06 3.227e-06 3.227e-06 0.00% Amr::init() 1 2.771e-06 2.771e-06 2.771e-06 0.00% Castro::FluxRegCrseInit 10 2.673e-06 2.673e-06 2.673e-06 0.00% Castro::computeInitialDt() 2 2.506e-06 2.506e-06 2.506e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.08e-06 2.08e-06 2.08e-06 0.00% Castro::FluxRegFineAdd() 10 2.029e-06 2.029e-06 2.029e-06 0.00% AmrLevel::checkPointPre() 3 1.826e-06 1.826e-06 1.826e-06 0.00% Amr::initialInit() 1 1.205e-06 1.205e-06 1.205e-06 0.00% Castro::post_regrid() 1 8.81e-07 8.81e-07 8.81e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. 
Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8398 0.8398 0.8398 100.00% Amr::coarseTimeStep() 10 0.6974 0.6974 0.6974 83.04% Amr::timeStep() 10 0.6136 0.6136 0.6136 73.06% Castro::advance() 10 0.6052 0.6052 0.6052 72.07% Castro::subcycle_advance_ctu() 10 0.5931 0.5931 0.5931 70.62% Castro::do_advance_ctu() 10 0.5929 0.5929 0.5929 70.60% Gravity::solve_phi_with_mlmg() 11 0.3135 0.3135 0.3135 37.33% Gravity::actual_solve_with_mlmg() 11 0.305 0.305 0.305 36.32% Castro::construct_new_gravity() 10 0.2878 0.2878 0.2878 34.27% MLMG::solve() 11 0.2819 0.2819 0.2819 33.57% Gravity::solve_for_phi() 10 0.2725 0.2725 0.2725 32.44% MLMG::oneIter() 81 0.2673 0.2673 0.2673 31.83% MLMG::mgVcycle() 81 0.2657 0.2657 0.2657 31.64% Castro::construct_ctu_hydro_source() 10 0.2263 0.2263 0.2263 26.95% VisMF::Write(FabArray) 11 0.1668 0.1668 0.1668 19.86% MLCellLinOp::smooth() 1620 0.1358 0.1358 0.1358 16.17% Amr::checkPoint() 3 0.1235 0.1235 0.1235 14.71% AmrLevel::checkPoint() 3 0.1195 0.1195 0.1195 14.23% StateData::checkPoint() 12 0.1194 0.1194 0.1194 14.22% Amr::init() 1 0.1169 0.1169 0.1169 13.91% MLCellLinOp::applyBC() 4379 0.0961 0.0961 0.0961 11.44% MLMG::mgVcycle_bottom 81 0.08218 0.08218 0.08218 9.79% MLMG::actualBottomSolve() 81 0.08215 0.08215 0.08215 9.78% MLCGSolver::bicgstab 81 0.08132 0.08132 0.08132 9.68% MLPoisson::Fsmooth() 3240 0.06318 0.06318 0.06318 7.52% Amr::writePlotFile() 2 0.04984 0.04984 0.04984 5.93% Amr::initialInit() 1 0.04852 0.04852 0.04852 5.78% Castro::clean_state() 62 0.04762 0.04762 0.04762 5.67% Amr::FinalizeInit() 1 0.04472 0.04472 0.04472 5.32% Castro::post_init() 1 0.04341 0.04341 0.04341 5.17% FillPatchIterator::Initialize 41 0.04209 0.04209 0.04209 5.01% Gravity::multilevel_solve_for_new_phi() 1 0.04154 0.04154 0.04154 4.95% Gravity::actual_multilevel_solve() 1 0.04152 0.04152 0.04152 4.94% FillPatchSingleLevel 41 0.04044 0.04044 0.04044 4.82% StateDataPhysBCFunct::() 41 
0.03649 0.03649 0.03649 4.34% MLCellLinOp::apply() 1128 0.03628 0.03628 0.03628 4.32% MLMG::mgVcycle_down::0 81 0.03529 0.03529 0.03529 4.20% MLMG::mgVcycle_up::0 81 0.03021 0.03021 0.03021 3.60% StateData::FillBoundary(geom) 328 0.02492 0.02492 0.02492 2.97% MultiFab::Dot() 1100 0.02245 0.02245 0.02245 2.67% MLCellLinOp::correctionResidual() 486 0.02132 0.02132 0.02132 2.54% Castro::computeTemp() 63 0.02105 0.02105 0.02105 2.51% Castro::initialize_do_advance() 10 0.01913 0.01913 0.01913 2.28% MLPoisson::define() 11 0.01877 0.01877 0.01877 2.23% MLMG:computeResOfCorrection() 405 0.01837 0.01837 0.01837 2.19% MLMG::mgVcycle_down::1 81 0.01765 0.01765 0.01765 2.10% MLMG::mgVcycle_down::2 81 0.01724 0.01724 0.01724 2.05% Gravity::get_new_grav_vector() 11 0.01695 0.01695 0.01695 2.02% MLMG::mgVcycle_down::3 81 0.01634 0.01634 0.01634 1.95% FabArray::FillBoundary() 3974 0.01558 0.01558 0.01558 1.86% MLMG::mgVcycle_down::4 81 0.01554 0.01554 0.01554 1.85% Castro::construct_old_gravity() 10 0.01514 0.01514 0.01514 1.80% Gravity::get_old_grav_vector() 10 0.01513 0.01513 0.01513 1.80% FillBoundary_nowait() 3974 0.01479 0.01479 0.01479 1.76% CGSolver::sxay() 1566 0.01476 0.01476 0.01476 1.76% MultiFab::LinComb() 1566 0.0144 0.0144 0.0144 1.71% FabArray::setVal() 1135 0.01431 0.01431 0.01431 1.70% FabArray::ParallelCopy() 851 0.01418 0.01418 0.01418 1.69% Castro::normalize_species() 62 0.01414 0.01414 0.01414 1.68% FabArray::ParallelCopy_nowait() 851 0.01392 0.01392 0.01392 1.66% MLCGSolver::ParallelAllReduce 1495 0.01341 0.01341 0.01341 1.60% MLCellLinOp::defineAuxData() 11 0.01329 0.01329 0.01329 1.58% MLMG::mgVcycle_up::2 81 0.01323 0.01323 0.01323 1.57% MLMG::mgVcycle_up::1 81 0.01296 0.01296 0.01296 1.54% MLMG::addInterpCorrection() 405 0.0125 0.0125 0.0125 1.49% MLMG::mgVcycle_up::3 81 0.01249 0.01249 0.01249 1.49% MLMG::mgVcycle_up::4 81 0.01236 0.01236 0.01236 1.47% Castro::do_new_sources() 10 0.01232 0.01232 0.01232 1.47% Castro::enforce_min_density() 62 0.01229 
0.01229 0.01229 1.46% Castro::initialize_advance() 10 0.01203 0.01203 0.01203 1.43% amrex::average_down 405 0.01193 0.01193 0.01193 1.42% MLPoisson::Fapply() 1128 0.01173 0.01173 0.01173 1.40% Castro::expand_state() 10 0.01121 0.01121 0.01121 1.34% Castro::do_old_sources() 10 0.01076 0.01076 0.01076 1.28% Gravity::fill_multipole_BCs() 11 0.008279 0.008279 0.008279 0.99% Castro::post_timestep() 10 0.008149 0.008149 0.008149 0.97% MLCellLinOp::solutionResidual() 92 0.007116 0.007116 0.007116 0.85% MultiFab::Xpay() 578 0.006589 0.006589 0.006589 0.78% MLMG::computeResidual() 81 0.006128 0.006128 0.006128 0.73% Castro::estTimeStep() 21 0.005866 0.005866 0.005866 0.70% Castro::reset_internal_energy(MultiFab) 63 0.005441 0.005441 0.005441 0.65% MLMG::prepareForSolve() 11 0.005206 0.005206 0.005206 0.62% MLCellLinOp::defineBC() 11 0.005173 0.005173 0.005173 0.62% BndryData::define() 11 0.004889 0.004889 0.004889 0.58% Amr::InitializeInit() 1 0.003807 0.003807 0.003807 0.45% Amr::defBaseLevel() 1 0.003801 0.003801 0.003801 0.45% Castro::initData() 1 0.003292 0.003292 0.003292 0.39% Castro::construct_new_source() 50 0.003273 0.003273 0.003273 0.39% Castro::construct_new_gravity_source() 10 0.003239 0.003239 0.003239 0.39% Castro::computeNewDt() 9 0.00279 0.00279 0.00279 0.33% Castro::construct_old_source() 50 0.002593 0.002593 0.002593 0.31% Castro::construct_old_gravity_source() 10 0.002576 0.002576 0.002576 0.31% MLMG::ResNormInf() 92 0.001915 0.001915 0.001915 0.23% Castro::apply_source_to_state() 20 0.001828 0.001828 0.001828 0.22% MultiFab::Saxpy() 20 0.001817 0.001817 0.001817 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.00167 0.00167 0.00167 0.20% Castro::reset_internal_energy(Fab) 504 0.001605 0.001605 0.001605 0.19% MLCellLinOp::setLevelBC() 11 0.001564 0.001564 0.001564 0.19% FabArrayBase::getCPC() 1313 0.001426 0.001426 0.001426 0.17% MLMG::getGradSolution() 11 0.001415 0.001415 0.001415 0.17% MLCellLinOp::compGrad() 11 0.00141 0.00141 0.00141 0.17% 
FabArray::setDomainBndry() 41 0.001336 0.001336 0.001336 0.16% FabArray::mult() 43 0.00133 0.00133 0.00133 0.16% MLPoisson::prepareForSolve() 11 0.001191 0.001191 0.001191 0.14% MLCellLinOp::prepareForSolve() 11 0.001182 0.001182 0.001182 0.14% Castro::check_for_nan() 20 0.001177 0.001177 0.001177 0.14% MultiFab::contains_nan() 20 0.001166 0.001166 0.001166 0.14% Castro::post_regrid() 1 0.00107 0.00107 0.00107 0.13% MLMG::computeMLResidual() 11 0.001026 0.001026 0.001026 0.12% Gravity::update_max_rhs() 11 0.0008078 0.0008078 0.0008078 0.10% Castro::enforce_speed_limit() 62 0.0007704 0.0007704 0.0007704 0.09% FabArrayBase::getFB() 3974 0.0007285 0.0007285 0.0007285 0.09% Castro::computeInitialDt() 2 0.0006995 0.0006995 0.0006995 0.08% FabArrayBase::CPC::define() 454 0.000674 0.000674 0.000674 0.08% Amr::InitAmr() 1 0.0004674 0.0004674 0.0004674 0.06% Gravity::swapTimeLevels() 10 0.0004448 0.0004448 0.0004448 0.05% Castro::Castro() 1 0.0004371 0.0004371 0.0004371 0.05% MLLinOp::define() 11 0.0002817 0.0002817 0.0002817 0.03% MultiFab::Copy() 11 0.0002588 0.0002588 0.0002588 0.03% MLLinOp::defineGrids() 11 0.0002575 0.0002575 0.0002575 0.03% MLMG::MLResNormInf() 11 0.0002571 0.0002571 0.0002571 0.03% MultiFab::max() 11 0.0002516 0.0002516 0.0002516 0.03% MLMG::MLRhsNormInf() 11 0.0002125 0.0002125 0.0002125 0.03% Castro::buildMetrics() 1 0.0001617 0.0001617 0.0001617 0.02% FabArrayBase::FB::FB() 56 8.423e-05 8.423e-05 8.423e-05 0.01% Castro::finalize_advance() 10 6.954e-05 6.954e-05 6.954e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.348e-05 5.348e-05 5.348e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.296e-05 4.296e-05 4.296e-05 0.01% Castro::swap_state_time_levels() 10 4.19e-05 4.19e-05 4.19e-05 0.00% makeSFC 55 3.97e-05 3.97e-05 3.97e-05 0.00% Castro::finalize_do_advance() 10 3.724e-05 3.724e-05 3.724e-05 0.00% StateData::define() 4 3.68e-05 3.68e-05 3.68e-05 0.00% Castro::enforce_consistent_e() 1 3.517e-05 3.517e-05 3.517e-05 0.00% Castro::initMFs() 1 2.621e-05 2.621e-05 
2.621e-05 0.00%
Amr::writeSmallPlotFile() 1 2.584e-05 2.584e-05 2.584e-05 0.00%
DistributionMapping::Distribute() 56 1.499e-05 1.499e-05 1.499e-05 0.00%
Amr::initSubcycle() 1 7.83e-06 7.83e-06 7.83e-06 0.00%
AmrLevel::checkPointPost() 3 4.895e-06 4.895e-06 4.895e-06 0.00%
MLMG::buildFineMask() 11 4.751e-06 4.751e-06 4.751e-06 0.00%
DistributionMapping::SFCProcessorMapDoIt() 1 4.602e-06 4.602e-06 4.602e-06 0.00%
Castro::create_source_corrector() 10 4.439e-06 4.439e-06 4.439e-06 0.00%
Castro::retry_advance_ctu() 10 4.214e-06 4.214e-06 4.214e-06 0.00%
Gravity::set_mass_offset() 11 3.554e-06 3.554e-06 3.554e-06 0.00%
Castro::FluxRegCrseInit 10 2.673e-06 2.673e-06 2.673e-06 0.00%
MLLinOp::makeSubCommunicator() 11 2.08e-06 2.08e-06 2.08e-06 0.00%
Castro::FluxRegFineAdd() 10 2.029e-06 2.029e-06 2.029e-06 0.00%
AmrLevel::checkPointPre() 3 1.826e-06 1.826e-06 1.826e-06 0.00%
--------------------------------------------------------------------------------------------
Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]
Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.06-36-g478fd8a4ac98) finalized
Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.06-36-g478fd8a4ac98) initialized
Starting run at 08:27:41 UTC on 2022-06-20.
Successfully read inputs file ...
Castro git describe: 22.06-12-g556652b03
AMReX git describe: 22.06-36-g478fd8a4a
Microphysics git describe: 22.06-4-gef2eb86c
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.445238954
Restart time = 0.072899921 seconds.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.052997629
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.049714937
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.063005649
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.065608432
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.080200578
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.026640222 seconds
Ending run at 08:27:42 UTC on 2022-06-20.
Run time = 0.412029696
Run time without initialization = 0.338572367
Average number of zones advanced per microsecond: 3.871
Average number of zones advanced per microsecond per rank: 3.871
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.4121 ... 0.4121 ... 0.4121
--------------------------------------------------------------------------------------------
Name NCalls Excl. Min Excl. Avg Excl.
Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1109 0.1109 0.1109 26.91% VisMF::Read() 3 0.03887 0.03887 0.03887 9.43% MLCellLinOp::applyBC() 1946 0.03526 0.03526 0.03526 8.56% Amr::restart() 1 0.0298 0.0298 0.0298 7.23% MLPoisson::Fsmooth() 1440 0.02756 0.02756 0.02756 6.69% VisMF::Write(FabArray) 1 0.02524 0.02524 0.02524 6.13% StateData::FillBoundary(geom) 160 0.01186 0.01186 0.01186 2.88% MLCGSolver::bicgstab 36 0.01037 0.01037 0.01037 2.52% MultiFab::Dot() 484 0.009748 0.009748 0.009748 2.37% Castro::normalize_species() 30 0.009099 0.009099 0.009099 2.21% Castro::computeTemp() 30 0.008514 0.008514 0.008514 2.07% FabArray::setVal() 537 0.006882 0.006882 0.006882 1.67% MLCellLinOp::defineAuxData() 6 0.00638 0.00638 0.00638 1.55% FillBoundary_nowait() 1766 0.006255 0.006255 0.006255 1.52% MultiFab::LinComb() 690 0.006207 0.006207 0.006207 1.51% FabArray::ParallelCopy_nowait() 380 0.005989 0.005989 0.005989 1.45% Castro::enforce_min_density() 30 0.00593 0.00593 0.00593 1.44% MLPoisson::Fapply() 500 0.005124 0.005124 0.005124 1.24% StateDataPhysBCFunct::() 20 0.00484 0.00484 0.00484 1.17% Gravity::fill_multipole_BCs() 6 0.004479 0.004479 0.004479 1.09% MLMG::addInterpCorrection() 180 0.003266 0.003266 0.003266 0.79% amrex::average_down 180 0.002998 0.002998 0.002998 0.73% MultiFab::Xpay() 258 0.002901 0.002901 0.002901 0.70% Castro::do_advance_ctu() 5 0.002601 0.002601 0.002601 0.63% BndryData::define() 6 0.002198 0.002198 0.002198 0.53% Castro::estTimeStep() 10 0.002146 0.002146 0.002146 0.52% Castro::construct_new_gravity_source() 5 0.001775 0.001775 0.001775 0.43% Castro::reset_internal_energy(MultiFab) 30 0.001612 0.001612 0.001612 0.39% Amr::writePlotFile() 1 0.001488 0.001488 0.001488 0.36% Castro::construct_old_gravity_source() 5 0.00146 0.00146 0.00146 0.35% Castro::reset_internal_energy(Fab) 240 0.0009547 0.0009547 0.0009547 0.23% MultiFab::Saxpy() 10 0.000925 
0.000925 0.000925 0.22% Castro::construct_new_source() 25 0.0008956 0.0008956 0.0008956 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008952 0.0008952 0.0008952 0.22% Gravity::get_old_grav_vector() 5 0.0008912 0.0008912 0.0008912 0.22% Castro::expand_state() 5 0.0008812 0.0008812 0.0008812 0.21% Gravity::get_new_grav_vector() 5 0.0008684 0.0008684 0.0008684 0.21% MLMG::ResNormInf() 42 0.0008532 0.0008532 0.0008532 0.21% MLCellLinOp::setLevelBC() 6 0.0008415 0.0008415 0.0008415 0.20% MLMG::oneIter() 36 0.0007296 0.0007296 0.0007296 0.18% Gravity::actual_solve_with_mlmg() 6 0.0007054 0.0007054 0.0007054 0.17% FabArray::setDomainBndry() 20 0.0006555 0.0006555 0.0006555 0.16% FabArray::mult() 22 0.0006515 0.0006515 0.0006515 0.16% MLCellLinOp::prepareForSolve() 6 0.0006436 0.0006436 0.0006436 0.16% MultiFab::contains_nan() 10 0.000586 0.000586 0.000586 0.14% MLMG::prepareForSolve() 6 0.0005665 0.0005665 0.0005665 0.14% MLCellLinOp::compGrad() 6 0.000492 0.000492 0.000492 0.12% MLCellLinOp::smooth() 720 0.000481 0.000481 0.000481 0.12% Castro::enforce_speed_limit() 30 0.0004474 0.0004474 0.0004474 0.11% FabArrayBase::CPC::define() 244 0.0003969 0.0003969 0.0003969 0.10% Amr::InitAmr() 1 0.000394 0.000394 0.000394 0.10% FabArray::FillBoundary() 1766 0.0003751 0.0003751 0.0003751 0.09% FabArrayBase::getCPC() 632 0.0003667 0.0003667 0.0003667 0.09% main() 1 0.0002833 0.0002833 0.0002833 0.07% FabArrayBase::getFB() 1766 0.0002649 0.0002649 0.0002649 0.06% Gravity::update_max_rhs() 6 0.0002293 0.0002293 0.0002293 0.06% Castro::subcycle_advance_ctu() 5 0.0002293 0.0002293 0.0002293 0.06% Gravity::solve_for_phi() 5 0.0002136 0.0002136 0.0002136 0.05% MLCellLinOp::apply() 500 0.0001956 0.0001956 0.0001956 0.05% CGSolver::sxay() 690 0.0001763 0.0001763 0.0001763 0.04% MLCellLinOp::defineBC() 6 0.0001547 0.0001547 0.0001547 0.04% Amr::coarseTimeStep() 5 0.000154 0.000154 0.000154 0.04% FillPatchIterator::Initialize 20 0.0001422 0.0001422 0.0001422 0.03% 
MultiFab::Copy() 6 0.0001398 0.0001398 0.0001398 0.03% MultiFab::max() 6 0.0001367 0.0001367 0.0001367 0.03% MLCGSolver::ParallelAllReduce 659 0.0001295 0.0001295 0.0001295 0.03% FabArray::ParallelCopy() 380 0.0001258 0.0001258 0.0001258 0.03% Castro::create_source_corrector() 5 0.0001219 0.0001219 0.0001219 0.03% Castro::construct_new_gravity() 5 0.0001081 0.0001081 0.0001081 0.03% MLMG::MLRhsNormInf() 6 0.0001071 0.0001071 0.0001071 0.03% Amr::timeStep() 5 9.93e-05 9.93e-05 9.93e-05 0.02% MLLinOp::defineGrids() 6 9.698e-05 9.698e-05 9.698e-05 0.02% MLCellLinOp::correctionResidual() 216 9.616e-05 9.616e-05 9.616e-05 0.02% Castro::advance() 5 8.809e-05 8.809e-05 8.809e-05 0.02% MLMG::mgVcycle() 36 8.47e-05 8.47e-05 8.47e-05 0.02% AmrLevel::restart() 1 7.679e-05 7.679e-05 7.679e-05 0.02% Castro::finalize_advance() 5 6.815e-05 6.815e-05 6.815e-05 0.02% Castro::computeNewDt() 5 6.719e-05 6.719e-05 6.719e-05 0.02% StateData::restartDoit() 4 6.412e-05 6.412e-05 6.412e-05 0.02% Castro::initialize_do_advance() 5 6.169e-05 6.169e-05 6.169e-05 0.01% Castro::construct_old_source() 25 5.893e-05 5.893e-05 5.893e-05 0.01% FabArrayBase::FB::FB() 26 5.504e-05 5.504e-05 5.504e-05 0.01% Castro::clean_state() 30 5.341e-05 5.341e-05 5.341e-05 0.01% MLMG:computeResOfCorrection() 180 4.979e-05 4.979e-05 4.979e-05 0.01% MLMG::actualBottomSolve() 36 4.419e-05 4.419e-05 4.419e-05 0.01% Castro::initialize_advance() 5 3.958e-05 3.958e-05 3.958e-05 0.01% Castro::post_restart() 1 3.858e-05 3.858e-05 3.858e-05 0.01% MLMG::mgVcycle_down::0 36 3.658e-05 3.658e-05 3.658e-05 0.01% MLMG::mgVcycle_down::2 36 3.62e-05 3.62e-05 3.62e-05 0.01% MLMG::mgVcycle_down::1 36 3.607e-05 3.607e-05 3.607e-05 0.01% MLMG::solve() 6 3.497e-05 3.497e-05 3.497e-05 0.01% MLMG::mgVcycle_down::4 36 3.373e-05 3.373e-05 3.373e-05 0.01% MLMG::mgVcycle_down::3 36 3.252e-05 3.252e-05 3.252e-05 0.01% Castro::buildMetrics() 1 3.204e-05 3.204e-05 3.204e-05 0.01% Gravity::actual_multilevel_solve() 1 3.081e-05 3.081e-05 3.081e-05 
0.01% MLMG::mgVcycle_up::4 36 2.741e-05 2.741e-05 2.741e-05 0.01% Castro::swap_state_time_levels() 5 2.691e-05 2.691e-05 2.691e-05 0.01% Amr::writeSmallPlotFile() 1 2.673e-05 2.673e-05 2.673e-05 0.01% Castro::initMFs() 1 2.641e-05 2.641e-05 2.641e-05 0.01% MLCellLinOp::solutionResidual() 42 2.497e-05 2.497e-05 2.497e-05 0.01% MLMG::mgVcycle_up::3 36 2.262e-05 2.262e-05 2.262e-05 0.01% MLMG::mgVcycle_up::0 36 2.214e-05 2.214e-05 2.214e-05 0.01% MLMG::mgVcycle_up::2 36 2.198e-05 2.198e-05 2.198e-05 0.01% MLLinOp::define() 6 2.187e-05 2.187e-05 2.187e-05 0.01% Castro::finalize_do_advance() 5 2.111e-05 2.111e-05 2.111e-05 0.01% MLMG::mgVcycle_up::1 36 2.046e-05 2.046e-05 2.046e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.749e-05 1.749e-05 1.749e-05 0.00% makeSFC 30 1.528e-05 1.528e-05 1.528e-05 0.00% MLPoisson::define() 6 1.512e-05 1.512e-05 1.512e-05 0.00% MLMG::mgVcycle_bottom 36 1.458e-05 1.458e-05 1.458e-05 0.00% MLMG::computeResidual() 36 1.436e-05 1.436e-05 1.436e-05 0.00% FillPatchSingleLevel 20 1.382e-05 1.382e-05 1.382e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.338e-05 1.338e-05 1.338e-05 0.00% Castro::do_new_sources() 5 9.886e-06 9.886e-06 9.886e-06 0.00% DistributionMapping::Distribute() 31 9.48e-06 9.48e-06 9.48e-06 0.00% Amr::initSubcycle() 1 8.424e-06 8.424e-06 8.424e-06 0.00% Castro::do_old_sources() 5 8.208e-06 8.208e-06 8.208e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.473e-06 7.473e-06 7.473e-06 0.00% Castro::check_for_nan() 10 7.349e-06 7.349e-06 7.349e-06 0.00% Castro::apply_source_to_state() 10 5.421e-06 5.421e-06 5.421e-06 0.00% Castro::construct_old_gravity() 5 5.345e-06 5.345e-06 5.345e-06 0.00% Castro::post_timestep() 5 4.933e-06 4.933e-06 4.933e-06 0.00% Gravity::swapTimeLevels() 5 4.547e-06 4.547e-06 4.547e-06 0.00% MLPoisson::prepareForSolve() 6 4.478e-06 4.478e-06 4.478e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.871e-06 3.871e-06 3.871e-06 0.00% MLMG::buildFineMask() 6 2.948e-06 2.948e-06 2.948e-06 0.00% 
MLMG::computeMLResidual()  6  2.89e-06  2.89e-06  2.89e-06  0.00%
MLMG::getGradSolution()  6  2.858e-06  2.858e-06  2.858e-06  0.00%
Gravity::set_mass_offset()  6  2.372e-06  2.372e-06  2.372e-06  0.00%
MLMG::MLResNormInf()  6  2.313e-06  2.313e-06  2.313e-06  0.00%
Castro::retry_advance_ctu()  5  2.065e-06  2.065e-06  2.065e-06  0.00%
Castro::FluxRegCrseInit  5  1.677e-06  1.677e-06  1.677e-06  0.00%
Castro::FluxRegFineAdd()  5  1.271e-06  1.271e-06  1.271e-06  0.00%
AmrLevel::AmrLevel()  1  1.188e-06  1.188e-06  1.188e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.111e-06  1.111e-06  1.111e-06  0.00%
Amr::init()  1  9.37e-07  9.37e-07  9.37e-07  0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.4121  0.4121  0.4121  100.00%
Amr::coarseTimeStep()  5  0.3117  0.3117  0.3117  75.63%
Amr::timeStep()  5  0.3103  0.3103  0.3103  75.31%
Castro::advance()  5  0.3056  0.3056  0.3056  74.16%
Castro::subcycle_advance_ctu()  5  0.3005  0.3005  0.3005  72.93%
Castro::do_advance_ctu()  5  0.3003  0.3003  0.3003  72.87%
Castro::construct_new_gravity()  5  0.1449  0.1449  0.1449  35.16%
Gravity::solve_phi_with_mlmg()  6  0.1409  0.1409  0.1409  34.19%
Gravity::solve_for_phi()  5  0.1374  0.1374  0.1374  33.35%
Gravity::actual_solve_with_mlmg()  6  0.1363  0.1363  0.1363  33.08%
MLMG::solve()  6  0.1238  0.1238  0.1238  30.05%
MLMG::oneIter()  36  0.1167  0.1167  0.1167  28.33%
MLMG::mgVcycle()  36  0.116  0.116  0.116  28.15%
Castro::construct_ctu_hydro_source()  5  0.1109  0.1109  0.1109  26.91%
Amr::init()  1  0.07295  0.07295  0.07295  17.70%
Amr::restart()  1  0.07294  0.07294  0.07294  17.70%
MLCellLinOp::smooth()  720  0.05933  0.05933  0.05933  14.40%
MLCellLinOp::applyBC()  1946  0.04222  0.04222  0.04222  10.24%
AmrLevel::restart()  1  0.03907  0.03907  0.03907  9.48%
StateData::restartDoit()  4  0.03899  0.03899  0.03899  9.46%
VisMF::Read()  3  0.03887  0.03887  0.03887  9.43%
MLMG::mgVcycle_bottom  36  0.03565  0.03565  0.03565  8.65%
MLMG::actualBottomSolve()  36  0.03564  0.03564  0.03564  8.65%
MLCGSolver::bicgstab  36  0.03527  0.03527  0.03527  8.56%
MLPoisson::Fsmooth()  1440  0.02756  0.02756  0.02756  6.69%
Amr::writePlotFile()  1  0.02673  0.02673  0.02673  6.49%
Castro::clean_state()  30  0.02661  0.02661  0.02661  6.46%
VisMF::Write(FabArray)  1  0.02524  0.02524  0.02524  6.13%
FillPatchIterator::Initialize  20  0.01949  0.01949  0.01949  4.73%
FillPatchSingleLevel  20  0.01869  0.01869  0.01869  4.54%
StateDataPhysBCFunct::()  20  0.0167  0.0167  0.0167  4.05%
MLCellLinOp::apply()  500  0.01598  0.01598  0.01598  3.88%
MLMG::mgVcycle_down::0  36  0.0156  0.0156  0.0156  3.79%
MLMG::mgVcycle_up::0  36  0.01329  0.01329  0.01329  3.22%
StateData::FillBoundary(geom)  160  0.01186  0.01186  0.01186  2.88%
Castro::computeTemp()  30  0.01108  0.01108  0.01108  2.69%
Castro::initialize_do_advance()  5  0.01028  0.01028  0.01028  2.50%
MLPoisson::define()  6  0.01018  0.01018  0.01018  2.47%
MultiFab::Dot()  484  0.009748  0.009748  0.009748  2.37%
MLCellLinOp::correctionResidual()  216  0.00935  0.00935  0.00935  2.27%
Castro::normalize_species()  30  0.009099  0.009099  0.009099  2.21%
Castro::do_new_sources()  5  0.00894  0.00894  0.00894  2.17%
MLMG:computeResOfCorrection()  180  0.00806  0.00806  0.00806  1.96%
MLMG::mgVcycle_down::1  36  0.007759  0.007759  0.007759  1.88%
MLMG::mgVcycle_down::2  36  0.00753  0.00753  0.00753  1.83%
Castro::do_old_sources()  5  0.007406  0.007406  0.007406  1.80%
Gravity::get_new_grav_vector()  5  0.00739  0.00739  0.00739  1.79%
Castro::construct_old_gravity()  5  0.007378  0.007378  0.007378  1.79%
Gravity::get_old_grav_vector()  5  0.007373  0.007373  0.007373  1.79%
MLCellLinOp::defineAuxData()  6  0.007122  0.007122  0.007122  1.73%
MLMG::mgVcycle_down::3  36  0.007079  0.007079  0.007079  1.72%
FabArray::FillBoundary()  1766  0.00695  0.00695  0.00695  1.69%
FabArray::setVal()  537  0.006882  0.006882  0.006882  1.67%
MLMG::mgVcycle_down::4  36  0.006779  0.006779  0.006779  1.65%
FillBoundary_nowait()  1766  0.006575  0.006575  0.006575  1.60%
FabArray::ParallelCopy()  380  0.006489  0.006489  0.006489  1.57%
CGSolver::sxay()  690  0.006383  0.006383  0.006383  1.55%
FabArray::ParallelCopy_nowait()  380  0.006363  0.006363  0.006363  1.54%
MultiFab::LinComb()  690  0.006207  0.006207  0.006207  1.51%
Castro::enforce_min_density()  30  0.00593  0.00593  0.00593  1.44%
MLCGSolver::ParallelAllReduce  659  0.005852  0.005852  0.005852  1.42%
MLMG::mgVcycle_up::2  36  0.005734  0.005734  0.005734  1.39%
MLMG::mgVcycle_up::1  36  0.005722  0.005722  0.005722  1.39%
MLMG::addInterpCorrection()  180  0.005503  0.005503  0.005503  1.34%
MLMG::mgVcycle_up::3  36  0.005416  0.005416  0.005416  1.31%
MLMG::mgVcycle_up::4  36  0.005377  0.005377  0.005377  1.30%
Castro::expand_state()  5  0.005342  0.005342  0.005342  1.30%
amrex::average_down  180  0.005277  0.005277  0.005277  1.28%
MLPoisson::Fapply()  500  0.005124  0.005124  0.005124  1.24%
Castro::initialize_advance()  5  0.004916  0.004916  0.004916  1.19%
Castro::post_timestep()  5  0.004621  0.004621  0.004621  1.12%
Gravity::fill_multipole_BCs()  6  0.004479  0.004479  0.004479  1.09%
Castro::post_restart()  1  0.003896  0.003896  0.003896  0.95%
Gravity::multilevel_solve_for_new_phi()  1  0.003764  0.003764  0.003764  0.91%
Gravity::actual_multilevel_solve()  1  0.003746  0.003746  0.003746  0.91%
MLCellLinOp::solutionResidual()  42  0.003244  0.003244  0.003244  0.79%
MultiFab::Xpay()  258  0.002901  0.002901  0.002901  0.70%
MLCellLinOp::defineBC()  6  0.002895  0.002895  0.002895  0.70%
MLMG::prepareForSolve()  6  0.002817  0.002817  0.002817  0.68%
BndryData::define()  6  0.002741  0.002741  0.002741  0.67%
MLMG::computeResidual()  36  0.002692  0.002692  0.002692  0.65%
Castro::construct_new_source()  25  0.002671  0.002671  0.002671  0.65%
Castro::reset_internal_energy(MultiFab)  30  0.002567  0.002567  0.002567  0.62%
Castro::estTimeStep()  10  0.002146  0.002146  0.002146  0.52%
Castro::construct_new_gravity_source()  5  0.001775  0.001775  0.001775  0.43%
Castro::construct_old_source()  25  0.001519  0.001519  0.001519  0.37%
Castro::construct_old_gravity_source()  5  0.00146  0.00146  0.00146  0.35%
Castro::computeNewDt()  5  0.001197  0.001197  0.001197  0.29%
Castro::reset_internal_energy(Fab)  240  0.0009547  0.0009547  0.0009547  0.23%
Castro::apply_source_to_state()  10  0.0009304  0.0009304  0.0009304  0.23%
MultiFab::Saxpy()  10  0.000925  0.000925  0.000925  0.22%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0008952  0.0008952  0.0008952  0.22%
MLMG::ResNormInf()  42  0.0008532  0.0008532  0.0008532  0.21%
MLCellLinOp::setLevelBC()  6  0.0008415  0.0008415  0.0008415  0.20%
FabArrayBase::getCPC()  632  0.0007637  0.0007637  0.0007637  0.19%
MLMG::getGradSolution()  6  0.0007625  0.0007625  0.0007625  0.19%
MLCellLinOp::compGrad()  6  0.0007597  0.0007597  0.0007597  0.18%
FabArray::setDomainBndry()  20  0.0006555  0.0006555  0.0006555  0.16%
FabArray::mult()  22  0.0006515  0.0006515  0.0006515  0.16%
MLPoisson::prepareForSolve()  6  0.000648  0.000648  0.000648  0.16%
MLCellLinOp::prepareForSolve()  6  0.0006436  0.0006436  0.0006436  0.16%
Castro::check_for_nan()  10  0.0005934  0.0005934  0.0005934  0.14%
MultiFab::contains_nan()  10  0.000586  0.000586  0.000586  0.14%
MLMG::computeMLResidual()  6  0.0005692  0.0005692  0.0005692  0.14%
Castro::enforce_speed_limit()  30  0.0004474  0.0004474  0.0004474  0.11%
Gravity::update_max_rhs()  6  0.0004449  0.0004449  0.0004449  0.11%
Amr::InitAmr()  1  0.0004024  0.0004024  0.0004024  0.10%
FabArrayBase::CPC::define()  244  0.0003969  0.0003969  0.0003969  0.10%
FabArrayBase::getFB()  1766  0.0003199  0.0003199  0.0003199  0.08%
Gravity::swapTimeLevels()  5  0.0002311  0.0002311  0.0002311  0.06%
Castro::buildMetrics()  1  0.0001534  0.0001534  0.0001534  0.04%
MLLinOp::define()  6  0.0001506  0.0001506  0.0001506  0.04%
MultiFab::Copy()  6  0.0001398  0.0001398  0.0001398  0.03%
MultiFab::max()  6  0.0001367  0.0001367  0.0001367  0.03%
MLMG::MLResNormInf()  6  0.0001366  0.0001366  0.0001366  0.03%
MLLinOp::defineGrids()  6  0.0001287  0.0001287  0.0001287  0.03%
Castro::create_source_corrector()  5  0.0001219  0.0001219  0.0001219  0.03%
MLMG::MLRhsNormInf()  6  0.0001071  0.0001071  0.0001071  0.03%
Castro::finalize_advance()  5  7.11e-05  7.11e-05  7.11e-05  0.02%
FabArrayBase::FB::FB()  26  5.504e-05  5.504e-05  5.504e-05  0.01%
MLLinOp::makeAgglomeratedDMap  6  3.061e-05  3.061e-05  3.061e-05  0.01%
Castro::swap_state_time_levels()  5  2.691e-05  2.691e-05  2.691e-05  0.01%
Amr::writeSmallPlotFile()  1  2.673e-05  2.673e-05  2.673e-05  0.01%
Castro::initMFs()  1  2.641e-05  2.641e-05  2.641e-05  0.01%
makeSFC  30  2.314e-05  2.314e-05  2.314e-05  0.01%
Castro::finalize_do_advance()  5  2.111e-05  2.111e-05  2.111e-05  0.01%
DistributionMapping::Distribute()  31  9.48e-06  9.48e-06  9.48e-06  0.00%
Amr::initSubcycle()  1  8.424e-06  8.424e-06  8.424e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  5.494e-06  5.494e-06  5.494e-06  0.00%
MLMG::buildFineMask()  6  2.948e-06  2.948e-06  2.948e-06  0.00%
Gravity::set_mass_offset()  6  2.372e-06  2.372e-06  2.372e-06  0.00%
Castro::retry_advance_ctu()  5  2.065e-06  2.065e-06  2.065e-06  0.00%
Castro::FluxRegCrseInit  5  1.677e-06  1.677e-06  1.677e-06  0.00%
Castro::FluxRegFineAdd()  5  1.271e-06  1.271e-06  1.271e-06  0.00%
AmrLevel::AmrLevel()  1  1.188e-06  1.188e-06  1.188e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.111e-06  1.111e-06  1.111e-06  0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.06-36-g478fd8a4ac98) finalized