Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.05-37-gb78921a2d80d) initialized

Starting run at 08:26:20 UTC on 2022-05-30.
Successfully read inputs file ...

Castro git describe: 22.05-33-g9203058b8
AMReX git describe: 22.05-37-gb78921a2d
Microphysics git describe: 22.05-2-g52173caf

reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...

INITIAL GRIDS
Level 0 8 grids 262144 cells 100 % of domain
smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32

CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.042380822 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.024231492 seconds

[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.048919837
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05

[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.050983727
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05

[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.062396628
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05

[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.078878916
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05

[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.056285987
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05

CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.038902442 secs.

[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.060344029
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05

[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.049826954
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05

[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.060691931
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05

[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.061285797
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05

[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.053231152
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05

CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.038769852 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.024042623 seconds

Ending run at 08:26:21 UTC on 2022-05-30.
Run time = 0.800466145
Run time without initialization = 0.685178007
Average number of zones advanced per microsecond: 3.826
Average number of zones advanced per microsecond per rank: 3.826
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728

TinyProfiler total time across processes [min...avg...max]: 0.8005 ... 0.8005 ...
0.8005 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.1709 0.1709 0.1709 21.35% VisMF::Write(FabArray) 11 0.1616 0.1616 0.1616 20.18% MLCellLinOp::applyBC() 4379 0.0793 0.0793 0.0793 9.91% MLPoisson::Fsmooth() 3240 0.06273 0.06273 0.06273 7.84% StateDataPhysBCFunct::() 41 0.03367 0.03367 0.03367 4.21% MLCGSolver::bicgstab 81 0.0234 0.0234 0.0234 2.92% StateData::FillBoundary(geom) 328 0.02315 0.02315 0.02315 2.89% MultiFab::Dot() 1100 0.02184 0.02184 0.02184 2.73% Castro::normalize_species() 62 0.01972 0.01972 0.01972 2.46% Castro::computeTemp() 63 0.01499 0.01499 0.01499 1.87% MultiFab::LinComb() 1566 0.01403 0.01403 0.01403 1.75% FabArray::setVal() 1135 0.01398 0.01398 0.01398 1.75% FillBoundary_nowait() 3974 0.01392 0.01392 0.01392 1.74% FabArray::ParallelCopy_nowait() 851 0.01283 0.01283 0.01283 1.60% MLPoisson::Fapply() 1128 0.01146 0.01146 0.01146 1.43% Castro::enforce_min_density() 62 0.01143 0.01143 0.01143 1.43% MLCellLinOp::defineAuxData() 11 0.01126 0.01126 0.01126 1.41% Gravity::fill_multipole_BCs() 11 0.008328 0.008328 0.008328 1.04% MLMG::addInterpCorrection() 405 0.007321 0.007321 0.007321 0.91% amrex::average_down 405 0.006671 0.006671 0.006671 0.83% MultiFab::Xpay() 578 0.00641 0.00641 0.00641 0.80% Castro::estTimeStep() 21 0.006216 0.006216 0.006216 0.78% Castro::reset_internal_energy(MultiFab) 63 0.005093 0.005093 0.005093 0.64% Castro::do_advance_ctu() 10 0.004671 0.004671 0.004671 0.58% Amr::checkPoint() 3 0.00432 0.00432 0.00432 0.54% BndryData::define() 11 0.003754 0.003754 0.003754 0.47% Castro::construct_new_gravity_source() 10 0.003285 0.003285 0.003285 0.41% Castro::construct_old_gravity_source() 10 0.00265 0.00265 0.00265 0.33% Amr::writePlotFile() 2 0.00248 0.00248 0.00248 0.31% Castro::enforce_speed_limit() 62 
0.002156 0.002156 0.002156 0.27% Gravity::get_new_grav_vector() 11 0.001916 0.001916 0.001916 0.24% MLMG::ResNormInf() 92 0.001871 0.001871 0.001871 0.23% MultiFab::Saxpy() 20 0.001806 0.001806 0.001806 0.23% Gravity::get_old_grav_vector() 10 0.001727 0.001727 0.001727 0.22% Castro::expand_state() 10 0.001724 0.001724 0.001724 0.22% MLMG::oneIter() 81 0.001629 0.001629 0.001629 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001616 0.001616 0.001616 0.20% Castro::reset_internal_energy(Fab) 504 0.001559 0.001559 0.001559 0.19% MLCellLinOp::setLevelBC() 11 0.001514 0.001514 0.001514 0.19% FabArray::mult() 43 0.001329 0.001329 0.001329 0.17% Gravity::actual_solve_with_mlmg() 11 0.001319 0.001319 0.001319 0.16% FabArray::setDomainBndry() 41 0.001302 0.001302 0.001302 0.16% Castro::initData() 1 0.001236 0.001236 0.001236 0.15% MultiFab::contains_nan() 20 0.00117 0.00117 0.00117 0.15% MLCellLinOp::prepareForSolve() 11 0.001146 0.001146 0.001146 0.14% MLCellLinOp::smooth() 1620 0.001106 0.001106 0.001106 0.14% MLMG::prepareForSolve() 11 0.001024 0.001024 0.001024 0.13% MLCellLinOp::compGrad() 11 0.0009296 0.0009296 0.0009296 0.12% FabArrayBase::getCPC() 1313 0.0007686 0.0007686 0.0007686 0.10% FabArray::FillBoundary() 3974 0.0007484 0.0007484 0.0007484 0.09% FabArrayBase::CPC::define() 454 0.0006554 0.0006554 0.0006554 0.08% FabArrayBase::getFB() 3974 0.000601 0.000601 0.000601 0.08% Gravity::solve_for_phi() 10 0.0004529 0.0004529 0.0004529 0.06% Amr::InitAmr() 1 0.000447 0.000447 0.000447 0.06% MLCellLinOp::apply() 1128 0.0004365 0.0004365 0.0004365 0.05% Gravity::update_max_rhs() 11 0.0004184 0.0004184 0.0004184 0.05% CGSolver::sxay() 1566 0.0003684 0.0003684 0.0003684 0.05% Amr::coarseTimeStep() 10 0.0003279 0.0003279 0.0003279 0.04% FillPatchIterator::Initialize 41 0.0002932 0.0002932 0.0002932 0.04% MLCGSolver::ParallelAllReduce 1495 0.0002864 0.0002864 0.0002864 0.04% MLCellLinOp::defineBC() 11 0.0002779 0.0002779 0.0002779 0.03% FabArray::ParallelCopy() 851 
0.0002771 0.0002771 0.0002771 0.03% main() 1 0.0002656 0.0002656 0.0002656 0.03% MultiFab::max() 11 0.0002541 0.0002541 0.0002541 0.03% MultiFab::Copy() 11 0.0002525 0.0002525 0.0002525 0.03% MLCellLinOp::correctionResidual() 486 0.0002194 0.0002194 0.0002194 0.03% Castro::construct_new_gravity() 10 0.0002151 0.0002151 0.0002151 0.03% Amr::timeStep() 10 0.0002092 0.0002092 0.0002092 0.03% Castro::subcycle_advance_ctu() 10 0.0002049 0.0002049 0.0002049 0.03% MLMG::MLRhsNormInf() 11 0.0002012 0.0002012 0.0002012 0.03% MLMG::mgVcycle() 81 0.0001945 0.0001945 0.0001945 0.02% MLLinOp::defineGrids() 11 0.0001804 0.0001804 0.0001804 0.02% StateData::checkPoint() 12 0.0001332 0.0001332 0.0001332 0.02% MLMG:computeResOfCorrection() 405 0.0001182 0.0001182 0.0001182 0.01% MLMG::actualBottomSolve() 81 9.724e-05 9.724e-05 9.724e-05 0.01% MLMG::mgVcycle_down::0 81 9.617e-05 9.617e-05 9.617e-05 0.01% Castro::initialize_advance() 10 8.294e-05 8.294e-05 8.294e-05 0.01% FabArrayBase::FB::FB() 56 8.259e-05 8.259e-05 8.259e-05 0.01% Castro::Castro() 1 7.748e-05 7.748e-05 7.748e-05 0.01% Castro::advance() 10 7.703e-05 7.703e-05 7.703e-05 0.01% Castro::clean_state() 62 7.697e-05 7.697e-05 7.697e-05 0.01% AmrLevel::checkPoint() 3 7.548e-05 7.548e-05 7.548e-05 0.01% MLMG::mgVcycle_down::1 81 7.473e-05 7.473e-05 7.473e-05 0.01% MLMG::solve() 11 7.287e-05 7.287e-05 7.287e-05 0.01% MLMG::mgVcycle_down::2 81 7.058e-05 7.058e-05 7.058e-05 0.01% Castro::initialize_do_advance() 10 6.885e-05 6.885e-05 6.885e-05 0.01% MLMG::mgVcycle_down::3 81 6.456e-05 6.456e-05 6.456e-05 0.01% MLMG::mgVcycle_down::4 81 6.418e-05 6.418e-05 6.418e-05 0.01% Castro::finalize_advance() 10 5.779e-05 5.779e-05 5.779e-05 0.01% MLMG::mgVcycle_up::4 81 5.626e-05 5.626e-05 5.626e-05 0.01% MLMG::mgVcycle_up::0 81 5.225e-05 5.225e-05 5.225e-05 0.01% Castro::post_timestep() 10 4.897e-05 4.897e-05 4.897e-05 0.01% MLCellLinOp::solutionResidual() 92 4.802e-05 4.802e-05 4.802e-05 0.01% MLMG::mgVcycle_up::1 81 4.747e-05 4.747e-05 
4.747e-05 0.01% MLMG::mgVcycle_up::3 81 4.672e-05 4.672e-05 4.672e-05 0.01% MLMG::mgVcycle_up::2 81 4.432e-05 4.432e-05 4.432e-05 0.01% Castro::swap_state_time_levels() 10 4.099e-05 4.099e-05 4.099e-05 0.01% Castro::finalize_do_advance() 10 3.916e-05 3.916e-05 3.916e-05 0.00% StateData::define() 4 3.7e-05 3.7e-05 3.7e-05 0.00% MLMG::mgVcycle_bottom 81 3.406e-05 3.406e-05 3.406e-05 0.00% Castro::enforce_consistent_e() 1 3.23e-05 3.23e-05 3.23e-05 0.00% MLMG::computeResidual() 81 3.204e-05 3.204e-05 3.204e-05 0.00% Gravity::actual_multilevel_solve() 1 3.134e-05 3.134e-05 3.134e-05 0.00% FillPatchSingleLevel 41 2.826e-05 2.826e-05 2.826e-05 0.00% makeSFC 55 2.754e-05 2.754e-05 2.754e-05 0.00% Castro::initMFs() 1 2.625e-05 2.625e-05 2.625e-05 0.00% Amr::writeSmallPlotFile() 1 2.54e-05 2.54e-05 2.54e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.517e-05 2.517e-05 2.517e-05 0.00% MLPoisson::define() 11 2.499e-05 2.499e-05 2.499e-05 0.00% MLLinOp::define() 11 2.344e-05 2.344e-05 2.344e-05 0.00% Castro::buildMetrics() 1 2.278e-05 2.278e-05 2.278e-05 0.00% Amr::FinalizeInit() 1 2.185e-05 2.185e-05 2.185e-05 0.00% Amr::defBaseLevel() 1 2.04e-05 2.04e-05 2.04e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.852e-05 1.852e-05 1.852e-05 0.00% Castro::construct_old_source() 50 1.763e-05 1.763e-05 1.763e-05 0.00% Castro::construct_new_source() 50 1.696e-05 1.696e-05 1.696e-05 0.00% Castro::do_new_sources() 10 1.636e-05 1.636e-05 1.636e-05 0.00% Castro::do_old_sources() 10 1.571e-05 1.571e-05 1.571e-05 0.00% DistributionMapping::Distribute() 56 1.458e-05 1.458e-05 1.458e-05 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 1.351e-05 1.351e-05 1.351e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.307e-05 1.307e-05 1.307e-05 0.00% Castro::check_for_nan() 20 1.167e-05 1.167e-05 1.167e-05 0.00% Castro::apply_source_to_state() 20 1.131e-05 1.131e-05 1.131e-05 0.00% Castro::construct_old_gravity() 10 1.127e-05 1.127e-05 1.127e-05 0.00% Gravity::swapTimeLevels() 10 9.897e-06 9.897e-06 
9.897e-06 0.00% Amr::initSubcycle() 1 9.193e-06 9.193e-06 9.193e-06 0.00% Castro::post_init() 1 8.855e-06 8.855e-06 8.855e-06 0.00% MLPoisson::prepareForSolve() 11 8.395e-06 8.395e-06 8.395e-06 0.00% MLMG::computeMLResidual() 11 7.346e-06 7.346e-06 7.346e-06 0.00% Amr::InitializeInit() 1 7.133e-06 7.133e-06 7.133e-06 0.00% AmrLevel::AmrLevel(dm) 1 6.692e-06 6.692e-06 6.692e-06 0.00% MLMG::getGradSolution() 11 5.852e-06 5.852e-06 5.852e-06 0.00% AmrLevel::checkPointPost() 3 5.82e-06 5.82e-06 5.82e-06 0.00% Castro::computeNewDt() 9 5.75e-06 5.75e-06 5.75e-06 0.00% MLMG::buildFineMask() 11 4.9e-06 4.9e-06 4.9e-06 0.00% Castro::create_source_corrector() 10 4.615e-06 4.615e-06 4.615e-06 0.00% MLMG::MLResNormInf() 11 4.394e-06 4.394e-06 4.394e-06 0.00% Castro::retry_advance_ctu() 10 4.194e-06 4.194e-06 4.194e-06 0.00% Gravity::set_mass_offset() 11 4.091e-06 4.091e-06 4.091e-06 0.00% Castro::FluxRegCrseInit 10 2.981e-06 2.981e-06 2.981e-06 0.00% Castro::computeInitialDt() 2 2.696e-06 2.696e-06 2.696e-06 0.00% Castro::FluxRegFineAdd() 10 2.482e-06 2.482e-06 2.482e-06 0.00% Amr::init() 1 2.307e-06 2.307e-06 2.307e-06 0.00% AmrLevel::checkPointPre() 3 1.981e-06 1.981e-06 1.981e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.762e-06 1.762e-06 1.762e-06 0.00% Amr::initialInit() 1 1.23e-06 1.23e-06 1.23e-06 0.00% Castro::post_regrid() 1 1.127e-06 1.127e-06 1.127e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. 
Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8005 0.8005 0.8005 100.00% Amr::coarseTimeStep() 10 0.6609 0.6609 0.6609 82.56% Amr::timeStep() 10 0.5795 0.5795 0.5795 72.39% Castro::advance() 10 0.569 0.569 0.569 71.07% Castro::subcycle_advance_ctu() 10 0.556 0.556 0.556 69.46% Castro::do_advance_ctu() 10 0.5558 0.5558 0.5558 69.43% Gravity::solve_phi_with_mlmg() 11 0.3076 0.3076 0.3076 38.43% Gravity::actual_solve_with_mlmg() 11 0.2991 0.2991 0.2991 37.36% Castro::construct_new_gravity() 10 0.2825 0.2825 0.2825 35.29% MLMG::solve() 11 0.277 0.277 0.277 34.61% Gravity::solve_for_phi() 10 0.2677 0.2677 0.2677 33.44% MLMG::oneIter() 81 0.2627 0.2627 0.2627 32.82% MLMG::mgVcycle() 81 0.2611 0.2611 0.2611 32.62% Castro::construct_ctu_hydro_source() 10 0.1709 0.1709 0.1709 21.35% VisMF::Write(FabArray) 11 0.1616 0.1616 0.1616 20.18% MLCellLinOp::smooth() 1620 0.1343 0.1343 0.1343 16.78% Amr::checkPoint() 3 0.1202 0.1202 0.1202 15.01% AmrLevel::checkPoint() 3 0.1159 0.1159 0.1159 14.47% StateData::checkPoint() 12 0.1158 0.1158 0.1158 14.46% Amr::init() 1 0.1147 0.1147 0.1147 14.33% MLCellLinOp::applyBC() 4379 0.09466 0.09466 0.09466 11.83% MLMG::mgVcycle_bottom 81 0.08006 0.08006 0.08006 10.00% MLMG::actualBottomSolve() 81 0.08002 0.08002 0.08002 10.00% MLCGSolver::bicgstab 81 0.07921 0.07921 0.07921 9.89% MLPoisson::Fsmooth() 3240 0.06273 0.06273 0.06273 7.84% FillPatchIterator::Initialize 41 0.06243 0.06243 0.06243 7.80% FillPatchSingleLevel 41 0.06084 0.06084 0.06084 7.60% StateDataPhysBCFunct::() 41 0.05682 0.05682 0.05682 7.10% Castro::clean_state() 62 0.05421 0.05421 0.05421 6.77% Amr::writePlotFile() 2 0.04839 0.04839 0.04839 6.05% Amr::initialInit() 1 0.048 0.048 0.048 6.00% Amr::FinalizeInit() 1 0.04377 0.04377 0.04377 5.47% Castro::post_init() 1 0.04242 0.04242 0.04242 5.30% Castro::initialize_do_advance() 10 0.04224 0.04224 0.04224 5.28% Gravity::multilevel_solve_for_new_phi() 1 0.04047 0.04047 
0.04047 5.06% Gravity::actual_multilevel_solve() 1 0.04045 0.04045 0.04045 5.05% MLCellLinOp::apply() 1128 0.03557 0.03557 0.03557 4.44% MLMG::mgVcycle_down::0 81 0.03493 0.03493 0.03493 4.36% Castro::expand_state() 10 0.03319 0.03319 0.03319 4.15% MLMG::mgVcycle_up::0 81 0.03004 0.03004 0.03004 3.75% StateData::FillBoundary(geom) 328 0.02315 0.02315 0.02315 2.89% MultiFab::Dot() 1100 0.02184 0.02184 0.02184 2.73% Castro::computeTemp() 63 0.02164 0.02164 0.02164 2.70% MLCellLinOp::correctionResidual() 486 0.02085 0.02085 0.02085 2.60% Castro::normalize_species() 62 0.01972 0.01972 0.01972 2.46% MLMG:computeResOfCorrection() 405 0.01801 0.01801 0.01801 2.25% MLPoisson::define() 11 0.01782 0.01782 0.01782 2.23% MLMG::mgVcycle_down::1 81 0.01734 0.01734 0.01734 2.17% MLMG::mgVcycle_down::2 81 0.01691 0.01691 0.01691 2.11% Gravity::get_new_grav_vector() 11 0.01644 0.01644 0.01644 2.05% MLMG::mgVcycle_down::3 81 0.01604 0.01604 0.01604 2.00% FabArray::FillBoundary() 3974 0.01536 0.01536 0.01536 1.92% MLMG::mgVcycle_down::4 81 0.0153 0.0153 0.0153 1.91% FillBoundary_nowait() 3974 0.01461 0.01461 0.01461 1.82% CGSolver::sxay() 1566 0.0144 0.0144 0.0144 1.80% Castro::construct_old_gravity() 10 0.01425 0.01425 0.01425 1.78% Gravity::get_old_grav_vector() 10 0.01424 0.01424 0.01424 1.78% MultiFab::LinComb() 1566 0.01403 0.01403 0.01403 1.75% FabArray::setVal() 1135 0.01398 0.01398 0.01398 1.75% FabArray::ParallelCopy() 851 0.0139 0.0139 0.0139 1.74% Castro::do_new_sources() 10 0.01374 0.01374 0.01374 1.72% FabArray::ParallelCopy_nowait() 851 0.01363 0.01363 0.01363 1.70% MLMG::mgVcycle_up::2 81 0.01305 0.01305 0.01305 1.63% MLCGSolver::ParallelAllReduce 1495 0.01302 0.01302 0.01302 1.63% MLMG::mgVcycle_up::1 81 0.01282 0.01282 0.01282 1.60% Castro::initialize_advance() 10 0.01279 0.01279 0.01279 1.60% MLCellLinOp::defineAuxData() 11 0.01258 0.01258 0.01258 1.57% MLMG::mgVcycle_up::3 81 0.01232 0.01232 0.01232 1.54% MLMG::addInterpCorrection() 405 0.01228 0.01228 0.01228 
1.53% MLMG::mgVcycle_up::4 81 0.01212 0.01212 0.01212 1.51% amrex::average_down 405 0.01163 0.01163 0.01163 1.45% MLPoisson::Fapply() 1128 0.01146 0.01146 0.01146 1.43% Castro::enforce_min_density() 62 0.01143 0.01143 0.01143 1.43% Castro::do_old_sources() 10 0.01133 0.01133 0.01133 1.42% Castro::post_timestep() 10 0.01033 0.01033 0.01033 1.29% Gravity::fill_multipole_BCs() 11 0.008328 0.008328 0.008328 1.04% MLCellLinOp::solutionResidual() 92 0.007054 0.007054 0.007054 0.88% Castro::reset_internal_energy(MultiFab) 63 0.006652 0.006652 0.006652 0.83% MultiFab::Xpay() 578 0.00641 0.00641 0.00641 0.80% Castro::estTimeStep() 21 0.006216 0.006216 0.006216 0.78% MLMG::computeResidual() 81 0.006074 0.006074 0.006074 0.76% MLMG::prepareForSolve() 11 0.00503 0.00503 0.00503 0.63% MLCellLinOp::defineBC() 11 0.004952 0.004952 0.004952 0.62% BndryData::define() 11 0.004674 0.004674 0.004674 0.58% Amr::InitializeInit() 1 0.004228 0.004228 0.004228 0.53% Amr::defBaseLevel() 1 0.004221 0.004221 0.004221 0.53% Castro::initData() 1 0.003711 0.003711 0.003711 0.46% Castro::construct_new_source() 50 0.003302 0.003302 0.003302 0.41% Castro::construct_new_gravity_source() 10 0.003285 0.003285 0.003285 0.41% Castro::computeNewDt() 9 0.002843 0.002843 0.002843 0.36% Castro::construct_old_source() 50 0.002668 0.002668 0.002668 0.33% Castro::construct_old_gravity_source() 10 0.00265 0.00265 0.00265 0.33% Castro::enforce_speed_limit() 62 0.002156 0.002156 0.002156 0.27% MLMG::ResNormInf() 92 0.001871 0.001871 0.001871 0.23% Castro::apply_source_to_state() 20 0.001817 0.001817 0.001817 0.23% MultiFab::Saxpy() 20 0.001806 0.001806 0.001806 0.23% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001616 0.001616 0.001616 0.20% Castro::reset_internal_energy(Fab) 504 0.001559 0.001559 0.001559 0.19% MLCellLinOp::setLevelBC() 11 0.001514 0.001514 0.001514 0.19% FabArrayBase::getCPC() 1313 0.001424 0.001424 0.001424 0.18% MLMG::getGradSolution() 11 0.001418 0.001418 0.001418 0.18% 
MLCellLinOp::compGrad() 11 0.001412 0.001412 0.001412 0.18% FabArray::mult() 43 0.001329 0.001329 0.001329 0.17% FabArray::setDomainBndry() 41 0.001302 0.001302 0.001302 0.16% Castro::check_for_nan() 20 0.001182 0.001182 0.001182 0.15% MultiFab::contains_nan() 20 0.00117 0.00117 0.00117 0.15% MLPoisson::prepareForSolve() 11 0.001154 0.001154 0.001154 0.14% MLCellLinOp::prepareForSolve() 11 0.001146 0.001146 0.001146 0.14% Castro::post_regrid() 1 0.001098 0.001098 0.001098 0.14% MLMG::computeMLResidual() 11 0.001019 0.001019 0.001019 0.13% Gravity::update_max_rhs() 11 0.0008213 0.0008213 0.0008213 0.10% Castro::computeInitialDt() 2 0.0007196 0.0007196 0.0007196 0.09% FabArrayBase::getFB() 3974 0.0006836 0.0006836 0.0006836 0.09% FabArrayBase::CPC::define() 454 0.0006554 0.0006554 0.0006554 0.08% Amr::InitAmr() 1 0.0004562 0.0004562 0.0004562 0.06% Gravity::swapTimeLevels() 10 0.0004424 0.0004424 0.0004424 0.06% Castro::Castro() 1 0.0004309 0.0004309 0.0004309 0.05% MLLinOp::define() 11 0.0002596 0.0002596 0.0002596 0.03% MLMG::MLResNormInf() 11 0.0002581 0.0002581 0.0002581 0.03% MultiFab::max() 11 0.0002541 0.0002541 0.0002541 0.03% MultiFab::Copy() 11 0.0002525 0.0002525 0.0002525 0.03% MLLinOp::defineGrids() 11 0.0002362 0.0002362 0.0002362 0.03% MLMG::MLRhsNormInf() 11 0.0002012 0.0002012 0.0002012 0.03% Castro::buildMetrics() 1 0.0001656 0.0001656 0.0001656 0.02% FabArrayBase::FB::FB() 56 8.259e-05 8.259e-05 8.259e-05 0.01% Castro::finalize_advance() 10 6.325e-05 6.325e-05 6.325e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.395e-05 5.395e-05 5.395e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.369e-05 4.369e-05 4.369e-05 0.01% Castro::swap_state_time_levels() 10 4.099e-05 4.099e-05 4.099e-05 0.01% makeSFC 55 4.089e-05 4.089e-05 4.089e-05 0.01% Castro::finalize_do_advance() 10 3.916e-05 3.916e-05 3.916e-05 0.00% StateData::define() 4 3.7e-05 3.7e-05 3.7e-05 0.00% Castro::enforce_consistent_e() 1 3.23e-05 3.23e-05 3.23e-05 0.00% Castro::initMFs() 1 2.625e-05 2.625e-05 
2.625e-05 0.00%
Amr::writeSmallPlotFile() 1 2.54e-05 2.54e-05 2.54e-05 0.00%
DistributionMapping::SFCProcessorMapDoIt() 1 1.474e-05 1.474e-05 1.474e-05 0.00%
DistributionMapping::Distribute() 56 1.458e-05 1.458e-05 1.458e-05 0.00%
Amr::initSubcycle() 1 9.193e-06 9.193e-06 9.193e-06 0.00%
AmrLevel::checkPointPost() 3 5.82e-06 5.82e-06 5.82e-06 0.00%
MLMG::buildFineMask() 11 4.9e-06 4.9e-06 4.9e-06 0.00%
Castro::create_source_corrector() 10 4.615e-06 4.615e-06 4.615e-06 0.00%
Castro::retry_advance_ctu() 10 4.194e-06 4.194e-06 4.194e-06 0.00%
Gravity::set_mass_offset() 11 4.091e-06 4.091e-06 4.091e-06 0.00%
Castro::FluxRegCrseInit 10 2.981e-06 2.981e-06 2.981e-06 0.00%
Castro::FluxRegFineAdd() 10 2.482e-06 2.482e-06 2.482e-06 0.00%
AmrLevel::checkPointPre() 3 1.981e-06 1.981e-06 1.981e-06 0.00%
MLLinOp::makeSubCommunicator() 11 1.762e-06 1.762e-06 1.762e-06 0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
[TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
[TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.05-37-gb78921a2d80d) finalized

Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.05-37-gb78921a2d80d) initialized

Starting run at 08:26:22 UTC on 2022-05-30.
Successfully read inputs file ...

Castro git describe: 22.05-33-g9203058b8
AMReX git describe: 22.05-37-gb78921a2d
Microphysics git describe: 22.05-2-g52173caf

reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.451694095
Restart time = 0.093517545 seconds.

[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.05686853
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05

[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.05373264
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05

[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.065436289
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05

[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.065773253
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05

[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.065434477
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05

PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.025702078 seconds

Ending run at 08:26:22 UTC on 2022-05-30.
Run time = 0.427400844
Run time without initialization = 0.333344049
Average number of zones advanced per microsecond: 3.932
Average number of zones advanced per microsecond per rank: 3.932
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728

TinyProfiler total time across processes [min...avg...max]: 0.4274 ... 0.4274 ... 0.4274

--------------------------------------------------------------------------------------------
Name NCalls Excl. Min Excl. Avg Excl.
Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0868 0.0868 0.0868 20.31% Amr::restart() 1 0.04927 0.04927 0.04927 11.53% VisMF::Read() 3 0.03985 0.03985 0.03985 9.32% MLCellLinOp::applyBC() 1946 0.03812 0.03812 0.03812 8.92% MLPoisson::Fsmooth() 1440 0.02777 0.02777 0.02777 6.50% StateData::FillBoundary(geom) 160 0.0259 0.0259 0.0259 6.06% VisMF::Write(FabArray) 1 0.02434 0.02434 0.02434 5.69% MLCGSolver::bicgstab 36 0.01063 0.01063 0.01063 2.49% MultiFab::Dot() 484 0.009846 0.009846 0.009846 2.30% Castro::computeTemp() 30 0.008996 0.008996 0.008996 2.10% Castro::normalize_species() 30 0.00896 0.00896 0.00896 2.10% FabArray::setVal() 537 0.007032 0.007032 0.007032 1.65% MLCellLinOp::defineAuxData() 6 0.006644 0.006644 0.006644 1.55% FillBoundary_nowait() 1766 0.006587 0.006587 0.006587 1.54% FabArray::ParallelCopy_nowait() 380 0.006439 0.006439 0.006439 1.51% MultiFab::LinComb() 690 0.006328 0.006328 0.006328 1.48% Castro::enforce_min_density() 30 0.005948 0.005948 0.005948 1.39% MLPoisson::Fapply() 500 0.005286 0.005286 0.005286 1.24% Gravity::fill_multipole_BCs() 6 0.004663 0.004663 0.004663 1.09% MLMG::addInterpCorrection() 180 0.003431 0.003431 0.003431 0.80% StateDataPhysBCFunct::() 20 0.003429 0.003429 0.003429 0.80% Castro::estTimeStep() 10 0.003376 0.003376 0.003376 0.79% amrex::average_down 180 0.00319 0.00319 0.00319 0.75% MultiFab::Xpay() 258 0.002942 0.002942 0.002942 0.69% BndryData::define() 6 0.0022 0.0022 0.0022 0.51% Castro::do_advance_ctu() 5 0.002144 0.002144 0.002144 0.50% Castro::reset_internal_energy(MultiFab) 30 0.001965 0.001965 0.001965 0.46% Castro::construct_new_gravity_source() 5 0.001628 0.001628 0.001628 0.38% Amr::writePlotFile() 1 0.001446 0.001446 0.001446 0.34% Castro::construct_old_gravity_source() 5 0.001446 0.001446 0.001446 0.34% Castro::enforce_speed_limit() 30 0.001102 0.001102 0.001102 0.26% FabArray::setVal(val, thecmd, scomp, 
ncomp) 252 0.0009861 0.0009861 0.0009861 0.23% Castro::reset_internal_energy(Fab) 240 0.0009283 0.0009283 0.0009283 0.22% MultiFab::Saxpy() 10 0.0009223 0.0009223 0.0009223 0.22% Gravity::get_old_grav_vector() 5 0.0009048 0.0009048 0.0009048 0.21% Castro::expand_state() 5 0.0008848 0.0008848 0.0008848 0.21% Gravity::get_new_grav_vector() 5 0.0008821 0.0008821 0.0008821 0.21% MLMG::ResNormInf() 42 0.0008802 0.0008802 0.0008802 0.21% MLCellLinOp::setLevelBC() 6 0.0008297 0.0008297 0.0008297 0.19% MLMG::oneIter() 36 0.0007385 0.0007385 0.0007385 0.17% Gravity::actual_solve_with_mlmg() 6 0.0007218 0.0007218 0.0007218 0.17% MLCellLinOp::smooth() 720 0.0007051 0.0007051 0.0007051 0.16% MLCellLinOp::prepareForSolve() 6 0.0006935 0.0006935 0.0006935 0.16% FabArray::setDomainBndry() 20 0.0006628 0.0006628 0.0006628 0.16% FabArray::mult() 22 0.0006576 0.0006576 0.0006576 0.15% MLMG::prepareForSolve() 6 0.0006204 0.0006204 0.0006204 0.15% MultiFab::contains_nan() 10 0.000596 0.000596 0.000596 0.14% MLCellLinOp::compGrad() 6 0.0004862 0.0004862 0.0004862 0.11% FabArrayBase::CPC::define() 244 0.0004264 0.0004264 0.0004264 0.10% FabArrayBase::getCPC() 632 0.000419 0.000419 0.000419 0.10% FabArray::FillBoundary() 1766 0.0003995 0.0003995 0.0003995 0.09% Amr::InitAmr() 1 0.0003797 0.0003797 0.0003797 0.09% MLCellLinOp::apply() 500 0.0002766 0.0002766 0.0002766 0.06% FabArrayBase::getFB() 1766 0.0002608 0.0002608 0.0002608 0.06% Gravity::update_max_rhs() 6 0.000256 0.000256 0.000256 0.06% main() 1 0.0002482 0.0002482 0.0002482 0.06% Gravity::solve_for_phi() 5 0.0002148 0.0002148 0.0002148 0.05% CGSolver::sxay() 690 0.0001982 0.0001982 0.0001982 0.05% Castro::subcycle_advance_ctu() 5 0.0001711 0.0001711 0.0001711 0.04% Amr::coarseTimeStep() 5 0.0001642 0.0001642 0.0001642 0.04% FillPatchIterator::Initialize 20 0.0001609 0.0001609 0.0001609 0.04% MLCellLinOp::defineBC() 6 0.0001522 0.0001522 0.0001522 0.04% MultiFab::Copy() 6 0.0001456 0.0001456 0.0001456 0.03% 
FabArray::ParallelCopy() 380 0.0001432 0.0001432 0.0001432 0.03% MultiFab::max() 6 0.0001383 0.0001383 0.0001383 0.03% MLCGSolver::ParallelAllReduce 659 0.0001379 0.0001379 0.0001379 0.03% MLCellLinOp::correctionResidual() 216 0.0001281 0.0001281 0.0001281 0.03% Castro::construct_new_gravity() 5 0.0001233 0.0001233 0.0001233 0.03% MLMG::mgVcycle() 36 0.0001133 0.0001133 0.0001133 0.03% Amr::timeStep() 5 0.000112 0.000112 0.000112 0.03% MLMG::MLRhsNormInf() 6 0.0001087 0.0001087 0.0001087 0.03% MLLinOp::defineGrids() 6 9.395e-05 9.395e-05 9.395e-05 0.02% Castro::construct_new_source() 25 8.993e-05 8.993e-05 8.993e-05 0.02% AmrLevel::restart() 1 7.467e-05 7.467e-05 7.467e-05 0.02% Castro::advance() 5 7.396e-05 7.396e-05 7.396e-05 0.02% StateData::restartDoit() 4 7.262e-05 7.262e-05 7.262e-05 0.02% MLMG:computeResOfCorrection() 180 6.962e-05 6.962e-05 6.962e-05 0.02% FabArrayBase::FB::FB() 26 5.968e-05 5.968e-05 5.968e-05 0.01% MLMG::actualBottomSolve() 36 5.733e-05 5.733e-05 5.733e-05 0.01% Castro::initialize_do_advance() 5 4.746e-05 4.746e-05 4.746e-05 0.01% MLMG::mgVcycle_down::0 36 4.727e-05 4.727e-05 4.727e-05 0.01% Castro::initialize_advance() 5 4.66e-05 4.66e-05 4.66e-05 0.01% Castro::clean_state() 30 4.65e-05 4.65e-05 4.65e-05 0.01% MLMG::solve() 6 4.615e-05 4.615e-05 4.615e-05 0.01% MLMG::mgVcycle_down::1 36 4.552e-05 4.552e-05 4.552e-05 0.01% MLMG::mgVcycle_down::2 36 4.318e-05 4.318e-05 4.318e-05 0.01% MLMG::mgVcycle_down::4 36 4.23e-05 4.23e-05 4.23e-05 0.01% Castro::create_source_corrector() 5 4.185e-05 4.185e-05 4.185e-05 0.01% MLMG::mgVcycle_down::3 36 4.086e-05 4.086e-05 4.086e-05 0.01% Castro::construct_old_source() 25 3.907e-05 3.907e-05 3.907e-05 0.01% Castro::buildMetrics() 1 3.577e-05 3.577e-05 3.577e-05 0.01% Castro::post_restart() 1 3.44e-05 3.44e-05 3.44e-05 0.01% Gravity::actual_multilevel_solve() 1 3.217e-05 3.217e-05 3.217e-05 0.01% MLMG::mgVcycle_up::4 36 3.078e-05 3.078e-05 3.078e-05 0.01% Castro::initMFs() 1 2.842e-05 2.842e-05 2.842e-05 
0.01% MLCellLinOp::solutionResidual() 42 2.776e-05 2.776e-05 2.776e-05 0.01% MLMG::mgVcycle_up::0 36 2.772e-05 2.772e-05 2.772e-05 0.01% Castro::swap_state_time_levels() 5 2.757e-05 2.757e-05 2.757e-05 0.01% MLMG::mgVcycle_up::2 36 2.757e-05 2.757e-05 2.757e-05 0.01% MLMG::mgVcycle_up::3 36 2.743e-05 2.743e-05 2.743e-05 0.01% MLMG::mgVcycle_up::1 36 2.597e-05 2.597e-05 2.597e-05 0.01% Amr::writeSmallPlotFile() 1 2.568e-05 2.568e-05 2.568e-05 0.01% Castro::finalize_advance() 5 2.557e-05 2.557e-05 2.557e-05 0.01% Gravity::solve_phi_with_mlmg() 6 2.52e-05 2.52e-05 2.52e-05 0.01% MLLinOp::define() 6 2.322e-05 2.322e-05 2.322e-05 0.01% Castro::finalize_do_advance() 5 2.066e-05 2.066e-05 2.066e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.793e-05 1.793e-05 1.793e-05 0.00% FillPatchSingleLevel 20 1.689e-05 1.689e-05 1.689e-05 0.00% MLMG::mgVcycle_bottom 36 1.636e-05 1.636e-05 1.636e-05 0.00% MLPoisson::define() 6 1.633e-05 1.633e-05 1.633e-05 0.00% MLMG::computeResidual() 36 1.52e-05 1.52e-05 1.52e-05 0.00% makeSFC 30 1.506e-05 1.506e-05 1.506e-05 0.00% Castro::do_new_sources() 5 1.035e-05 1.035e-05 1.035e-05 0.00% Castro::do_old_sources() 5 9.88e-06 9.88e-06 9.88e-06 0.00% DistributionMapping::Distribute() 31 9.861e-06 9.861e-06 9.861e-06 0.00% Amr::initSubcycle() 1 9.42e-06 9.42e-06 9.42e-06 0.00% Castro::check_for_nan() 10 7.66e-06 7.66e-06 7.66e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.597e-06 7.597e-06 7.597e-06 0.00% Castro::construct_old_gravity() 5 6.887e-06 6.887e-06 6.887e-06 0.00% Castro::apply_source_to_state() 10 6.451e-06 6.451e-06 6.451e-06 0.00% Gravity::swapTimeLevels() 5 6.216e-06 6.216e-06 6.216e-06 0.00% Castro::post_timestep() 5 5.539e-06 5.539e-06 5.539e-06 0.00% MLPoisson::prepareForSolve() 6 4.853e-06 4.853e-06 4.853e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.878e-06 3.878e-06 3.878e-06 0.00% Castro::computeNewDt() 5 3.739e-06 3.739e-06 3.739e-06 0.00% MLMG::getGradSolution() 6 3.352e-06 3.352e-06 3.352e-06 0.00% 
MLMG::computeMLResidual()  6  3.073e-06  3.073e-06  3.073e-06  0.00%
MLMG::buildFineMask()  6  2.94e-06  2.94e-06  2.94e-06  0.00%
MLMG::MLResNormInf()  6  2.498e-06  2.498e-06  2.498e-06  0.00%
Castro::retry_advance_ctu()  5  2.39e-06  2.39e-06  2.39e-06  0.00%
Gravity::set_mass_offset()  6  2.262e-06  2.262e-06  2.262e-06  0.00%
Castro::FluxRegCrseInit  5  1.952e-06  1.952e-06  1.952e-06  0.00%
AmrLevel::AmrLevel()  1  1.355e-06  1.355e-06  1.355e-06  0.00%
Castro::FluxRegFineAdd()  5  1.113e-06  1.113e-06  1.113e-06  0.00%
Amr::init()  1  1.044e-06  1.044e-06  1.044e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.029e-06  1.029e-06  1.029e-06  0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.4274  0.4274  0.4274  100.00%
Amr::coarseTimeStep()  5  0.3074  0.3074  0.3074  71.92%
Amr::timeStep()  5  0.3054  0.3054  0.3054  71.46%
Castro::advance()  5  0.3002  0.3002  0.3002  70.23%
Castro::subcycle_advance_ctu()  5  0.294  0.294  0.294  68.79%
Castro::do_advance_ctu()  5  0.2938  0.2938  0.2938  68.75%
Castro::construct_new_gravity()  5  0.1561  0.1561  0.1561  36.52%
Gravity::solve_phi_with_mlmg()  6  0.1473  0.1473  0.1473  34.45%
Gravity::solve_for_phi()  5  0.1436  0.1436  0.1436  33.60%
Gravity::actual_solve_with_mlmg()  6  0.1425  0.1425  0.1425  33.33%
MLMG::solve()  6  0.1295  0.1295  0.1295  30.31%
MLMG::oneIter()  36  0.1222  0.1222  0.1222  28.58%
MLMG::mgVcycle()  36  0.1214  0.1214  0.1214  28.41%
Amr::init()  1  0.09356  0.09356  0.09356  21.89%
Amr::restart()  1  0.09356  0.09356  0.09356  21.89%
Castro::construct_ctu_hydro_source()  5  0.08679  0.08679  0.08679  20.31%
MLCellLinOp::smooth()  720  0.06216  0.06216  0.06216  14.54%
MLCellLinOp::applyBC()  1946  0.04542  0.04542  0.04542  10.63%
AmrLevel::restart()  1  0.04006  0.04006  0.04006  9.37%
StateData::restartDoit()  4  0.03998  0.03998  0.03998  9.35%
VisMF::Read()  3  0.03985  0.03985  0.03985  9.32%
MLMG::mgVcycle_bottom  36  0.03682  0.03682  0.03682  8.61%
MLMG::actualBottomSolve()  36  0.0368  0.0368  0.0368  8.61%
MLCGSolver::bicgstab  36  0.03638  0.03638  0.03638  8.51%
FillPatchIterator::Initialize  20  0.03216  0.03216  0.03216  7.52%
FillPatchSingleLevel  20  0.03134  0.03134  0.03134  7.33%
StateDataPhysBCFunct::()  20  0.02932  0.02932  0.02932  6.86%
Castro::clean_state()  30  0.02795  0.02795  0.02795  6.54%
MLPoisson::Fsmooth()  1440  0.02777  0.02777  0.02777  6.50%
StateData::FillBoundary(geom)  160  0.0259  0.0259  0.0259  6.06%
Amr::writePlotFile()  1  0.02579  0.02579  0.02579  6.03%
VisMF::Write(FabArray)  1  0.02434  0.02434  0.02434  5.69%
MLCellLinOp::apply()  500  0.01702  0.01702  0.01702  3.98%
MLMG::mgVcycle_down::0  36  0.01602  0.01602  0.01602  3.75%
MLMG::mgVcycle_up::0  36  0.01372  0.01372  0.01372  3.21%
Castro::construct_old_gravity()  5  0.01269  0.01269  0.01269  2.97%
Gravity::get_old_grav_vector()  5  0.01268  0.01268  0.01268  2.97%
Gravity::get_new_grav_vector()  5  0.01236  0.01236  0.01236  2.89%
Castro::computeTemp()  30  0.01189  0.01189  0.01189  2.78%
Castro::initialize_do_advance()  5  0.01176  0.01176  0.01176  2.75%
MLPoisson::define()  6  0.01059  0.01059  0.01059  2.48%
MultiFab::Dot()  484  0.009846  0.009846  0.009846  2.30%
MLCellLinOp::correctionResidual()  216  0.009839  0.009839  0.009839  2.30%
Castro::normalize_species()  30  0.00896  0.00896  0.00896  2.10%
MLMG:computeResOfCorrection()  180  0.008481  0.008481  0.008481  1.98%
MLMG::mgVcycle_down::1  36  0.008193  0.008193  0.008193  1.92%
MLMG::mgVcycle_down::2  36  0.007988  0.007988  0.007988  1.87%
MLMG::mgVcycle_down::3  36  0.007572  0.007572  0.007572  1.77%
MLCellLinOp::defineAuxData()  6  0.007491  0.007491  0.007491  1.75%
FabArray::FillBoundary()  1766  0.007307  0.007307  0.007307  1.71%
Castro::do_new_sources()  5  0.007294  0.007294  0.007294  1.71%
MLMG::mgVcycle_down::4  36  0.007279  0.007279  0.007279  1.70%
FabArray::setVal()  537  0.007032  0.007032  0.007032  1.65%
FabArray::ParallelCopy()  380  0.006982  0.006982  0.006982  1.63%
FillBoundary_nowait()  1766  0.006908  0.006908  0.006908  1.62%
FabArray::ParallelCopy_nowait()  380  0.006838  0.006838  0.006838  1.60%
Castro::do_old_sources()  5  0.006552  0.006552  0.006552  1.53%
CGSolver::sxay()  690  0.006526  0.006526  0.006526  1.53%
Castro::expand_state()  5  0.006356  0.006356  0.006356  1.49%
MultiFab::LinComb()  690  0.006328  0.006328  0.006328  1.48%
MLMG::mgVcycle_up::2  36  0.006121  0.006121  0.006121  1.43%
Castro::initialize_advance()  5  0.006071  0.006071  0.006071  1.42%
MLMG::mgVcycle_up::1  36  0.006034  0.006034  0.006034  1.41%
Castro::enforce_min_density()  30  0.005948  0.005948  0.005948  1.39%
MLCGSolver::ParallelAllReduce  659  0.005918  0.005918  0.005918  1.38%
MLMG::addInterpCorrection()  180  0.005914  0.005914  0.005914  1.38%
MLMG::mgVcycle_up::3  36  0.005804  0.005804  0.005804  1.36%
MLMG::mgVcycle_up::4  36  0.005768  0.005768  0.005768  1.35%
amrex::average_down  180  0.005693  0.005693  0.005693  1.33%
MLPoisson::Fapply()  500  0.005286  0.005286  0.005286  1.24%
Castro::post_timestep()  5  0.005138  0.005138  0.005138  1.20%
Gravity::fill_multipole_BCs()  6  0.004663  0.004663  0.004663  1.09%
Castro::post_restart()  1  0.004041  0.004041  0.004041  0.95%
Gravity::multilevel_solve_for_new_phi()  1  0.003906  0.003906  0.003906  0.91%
Gravity::actual_multilevel_solve()  1  0.003888  0.003888  0.003888  0.91%
MLCellLinOp::solutionResidual()  42  0.00338  0.00338  0.00338  0.79%
Castro::estTimeStep()  10  0.003376  0.003376  0.003376  0.79%
MLMG::prepareForSolve()  6  0.002946  0.002946  0.002946  0.69%
MultiFab::Xpay()  258  0.002942  0.002942  0.002942  0.69%
MLCellLinOp::defineBC()  6  0.002937  0.002937  0.002937  0.69%
Castro::reset_internal_energy(MultiFab)  30  0.002894  0.002894  0.002894  0.68%
MLMG::computeResidual()  36  0.002812  0.002812  0.002812  0.66%
BndryData::define()  6  0.002784  0.002784  0.002784  0.65%
Castro::computeNewDt()  5  0.001798  0.001798  0.001798  0.42%
Castro::construct_new_source()  25  0.001718  0.001718  0.001718  0.40%
Castro::construct_new_gravity_source()  5  0.001628  0.001628  0.001628  0.38%
Castro::construct_old_source()  25  0.001485  0.001485  0.001485  0.35%
Castro::construct_old_gravity_source()  5  0.001446  0.001446  0.001446  0.34%
Castro::enforce_speed_limit()  30  0.001102  0.001102  0.001102  0.26%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0009861  0.0009861  0.0009861  0.23%
Castro::apply_source_to_state()  10  0.0009288  0.0009288  0.0009288  0.22%
Castro::reset_internal_energy(Fab)  240  0.0009283  0.0009283  0.0009283  0.22%
MultiFab::Saxpy()  10  0.0009223  0.0009223  0.0009223  0.22%
MLMG::ResNormInf()  42  0.0008802  0.0008802  0.0008802  0.21%
FabArrayBase::getCPC()  632  0.0008454  0.0008454  0.0008454  0.20%
MLCellLinOp::setLevelBC()  6  0.0008297  0.0008297  0.0008297  0.19%
MLMG::getGradSolution()  6  0.0007733  0.0007733  0.0007733  0.18%
MLCellLinOp::compGrad()  6  0.00077  0.00077  0.00077  0.18%
MLPoisson::prepareForSolve()  6  0.0006983  0.0006983  0.0006983  0.16%
MLCellLinOp::prepareForSolve()  6  0.0006935  0.0006935  0.0006935  0.16%
FabArray::setDomainBndry()  20  0.0006628  0.0006628  0.0006628  0.16%
FabArray::mult()  22  0.0006576  0.0006576  0.0006576  0.15%
Castro::check_for_nan()  10  0.0006037  0.0006037  0.0006037  0.14%
MultiFab::contains_nan()  10  0.000596  0.000596  0.000596  0.14%
MLMG::computeMLResidual()  6  0.0005862  0.0005862  0.0005862  0.14%
Gravity::update_max_rhs()  6  0.0004749  0.0004749  0.0004749  0.11%
FabArrayBase::CPC::define()  244  0.0004264  0.0004264  0.0004264  0.10%
Amr::InitAmr()  1  0.0003892  0.0003892  0.0003892  0.09%
FabArrayBase::getFB()  1766  0.0003205  0.0003205  0.0003205  0.07%
Gravity::swapTimeLevels()  5  0.0002375  0.0002375  0.0002375  0.06%
Castro::buildMetrics()  1  0.0001612  0.0001612  0.0001612  0.04%
MLLinOp::define()  6  0.000149  0.000149  0.000149  0.03%
MultiFab::Copy()  6  0.0001456  0.0001456  0.0001456  0.03%
MLMG::MLResNormInf()  6  0.0001386  0.0001386  0.0001386  0.03%
MultiFab::max()  6  0.0001383  0.0001383  0.0001383  0.03%
MLLinOp::defineGrids()  6  0.0001258  0.0001258  0.0001258  0.03%
MLMG::MLRhsNormInf()  6  0.0001087  0.0001087  0.0001087  0.03%
FabArrayBase::FB::FB()  26  5.968e-05  5.968e-05  5.968e-05  0.01%
Castro::create_source_corrector()  5  4.185e-05  4.185e-05  4.185e-05  0.01%
MLLinOp::makeAgglomeratedDMap  6  3.084e-05  3.084e-05  3.084e-05  0.01%
Castro::finalize_advance()  5  2.863e-05  2.863e-05  2.863e-05  0.01%
Castro::initMFs()  1  2.842e-05  2.842e-05  2.842e-05  0.01%
Castro::swap_state_time_levels()  5  2.757e-05  2.757e-05  2.757e-05  0.01%
Amr::writeSmallPlotFile()  1  2.568e-05  2.568e-05  2.568e-05  0.01%
makeSFC  30  2.324e-05  2.324e-05  2.324e-05  0.01%
Castro::finalize_do_advance()  5  2.066e-05  2.066e-05  2.066e-05  0.00%
DistributionMapping::Distribute()  31  9.861e-06  9.861e-06  9.861e-06  0.00%
Amr::initSubcycle()  1  9.42e-06  9.42e-06  9.42e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  5.551e-06  5.551e-06  5.551e-06  0.00%
MLMG::buildFineMask()  6  2.94e-06  2.94e-06  2.94e-06  0.00%
Castro::retry_advance_ctu()  5  2.39e-06  2.39e-06  2.39e-06  0.00%
Gravity::set_mass_offset()  6  2.262e-06  2.262e-06  2.262e-06  0.00%
Castro::FluxRegCrseInit  5  1.952e-06  1.952e-06  1.952e-06  0.00%
AmrLevel::AmrLevel()  1  1.355e-06  1.355e-06  1.355e-06  0.00%
Castro::FluxRegFineAdd()  5  1.113e-06  1.113e-06  1.113e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.029e-06  1.029e-06  1.029e-06  0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.05-37-gb78921a2d80d) finalized