Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.07-4-gcbdc6580ee3d) initialized
Starting run at 08:38:49 UTC on 2022-07-05.
Successfully read inputs file ...

Castro git describe: 22.06-15-gd68821af9
AMReX git describe: 22.07-4-gcbdc6580e
Microphysics git describe: 22.07-5-gcfab8d9a
reading extern runtime parameters ...
3 Species:
C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
Level 0 8 grids 262144 cells 100 % of domain
smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32

CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.041065951 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.023737773 seconds

[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.048518266
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05

[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.054097321
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05

[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.047592646
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05

[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.046851302
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05

[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.074755133
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05

CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.042167405 secs.

[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.074255972
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05

[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.047442748
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05

[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.048515308
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05

[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.050861865
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05

[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.067132286
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05

CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.039399934 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.02361576 seconds

Ending run at 08:38:50 UTC on 2022-07-05.
Run time = 0.779708507
Run time without initialization = 0.665842369
Average number of zones advanced per microsecond: 3.937
Average number of zones advanced per microsecond per rank: 3.937
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728

TinyProfiler total time across processes [min...avg...max]: 0.7797 ... 0.7797 ...
0.7797 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.1871 0.1871 0.1871 23.99% VisMF::Write(FabArray) 11 0.1625 0.1625 0.1625 20.84% MLCellLinOp::applyBC() 4379 0.07714 0.07714 0.07714 9.89% MLPoisson::Fsmooth() 3240 0.06123 0.06123 0.06123 7.85% StateData::FillBoundary(geom) 328 0.02325 0.02325 0.02325 2.98% MLCGSolver::bicgstab 81 0.02316 0.02316 0.02316 2.97% MultiFab::Dot() 1100 0.0213 0.0213 0.0213 2.73% FillBoundary_nowait() 3974 0.01383 0.01383 0.01383 1.77% FabArray::setVal() 1135 0.01383 0.01383 0.01383 1.77% MultiFab::LinComb() 1566 0.01374 0.01374 0.01374 1.76% StateDataPhysBCFunct::() 41 0.01368 0.01368 0.01368 1.75% Castro::computeTemp() 63 0.01322 0.01322 0.01322 1.69% Castro::normalize_species() 62 0.01292 0.01292 0.01292 1.66% FabArray::ParallelCopy_nowait() 851 0.01244 0.01244 0.01244 1.60% MLPoisson::Fapply() 1128 0.01124 0.01124 0.01124 1.44% MLCellLinOp::defineAuxData() 11 0.01117 0.01117 0.01117 1.43% Castro::enforce_min_density() 62 0.009637 0.009637 0.009637 1.24% Gravity::fill_multipole_BCs() 11 0.008027 0.008027 0.008027 1.03% MLMG::addInterpCorrection() 405 0.007226 0.007226 0.007226 0.93% amrex::average_down 405 0.006667 0.006667 0.006667 0.86% MultiFab::Xpay() 578 0.006394 0.006394 0.006394 0.82% Castro::estTimeStep() 21 0.005284 0.005284 0.005284 0.68% Amr::checkPoint() 3 0.004992 0.004992 0.004992 0.64% Castro::do_advance_ctu() 10 0.004413 0.004413 0.004413 0.57% Castro::reset_internal_energy(MultiFab) 63 0.004275 0.004275 0.004275 0.55% BndryData::define() 11 0.003688 0.003688 0.003688 0.47% Castro::construct_new_gravity_source() 10 0.00323 0.00323 0.00323 0.41% Castro::construct_old_gravity_source() 10 0.002571 0.002571 0.002571 0.33% Amr::writePlotFile() 2 0.002513 0.002513 0.002513 0.32% 
Gravity::get_new_grav_vector() 11 0.001912 0.001912 0.001912 0.25% MLMG::ResNormInf() 92 0.001899 0.001899 0.001899 0.24% MultiFab::Saxpy() 20 0.001819 0.001819 0.001819 0.23% Castro::enforce_speed_limit() 62 0.001786 0.001786 0.001786 0.23% Castro::expand_state() 10 0.001723 0.001723 0.001723 0.22% Gravity::get_old_grav_vector() 10 0.001717 0.001717 0.001717 0.22% MLMG::oneIter() 81 0.001654 0.001654 0.001654 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001587 0.001587 0.001587 0.20% MLCellLinOp::setLevelBC() 11 0.001505 0.001505 0.001505 0.19% Castro::reset_internal_energy(Fab) 504 0.001434 0.001434 0.001434 0.18% Gravity::actual_solve_with_mlmg() 11 0.00139 0.00139 0.00139 0.18% FabArray::mult() 43 0.001314 0.001314 0.001314 0.17% FabArray::setDomainBndry() 41 0.001283 0.001283 0.001283 0.16% MLCellLinOp::smooth() 1620 0.001266 0.001266 0.001266 0.16% Castro::initData() 1 0.001229 0.001229 0.001229 0.16% MultiFab::contains_nan() 20 0.001171 0.001171 0.001171 0.15% MLCellLinOp::prepareForSolve() 11 0.001123 0.001123 0.001123 0.14% MLMG::prepareForSolve() 11 0.001053 0.001053 0.001053 0.14% MLCellLinOp::compGrad() 11 0.0009201 0.0009201 0.0009201 0.12% FabArray::FillBoundary() 3974 0.0007751 0.0007751 0.0007751 0.10% FabArrayBase::getCPC() 1313 0.0007514 0.0007514 0.0007514 0.10% FabArrayBase::CPC::define() 454 0.0006607 0.0006607 0.0006607 0.08% FabArrayBase::getFB() 3974 0.0005991 0.0005991 0.0005991 0.08% MLCellLinOp::apply() 1128 0.0005026 0.0005026 0.0005026 0.06% Amr::InitAmr() 1 0.000442 0.000442 0.000442 0.06% Gravity::solve_for_phi() 10 0.0004413 0.0004413 0.0004413 0.06% Gravity::update_max_rhs() 11 0.000426 0.000426 0.000426 0.05% CGSolver::sxay() 1566 0.0003974 0.0003974 0.0003974 0.05% Amr::coarseTimeStep() 10 0.0003251 0.0003251 0.0003251 0.04% FillPatchIterator::Initialize 41 0.0003048 0.0003048 0.0003048 0.04% MLCGSolver::ParallelAllReduce 1495 0.0002932 0.0002932 0.0002932 0.04% main() 1 0.0002885 0.0002885 0.0002885 0.04% 
MLCellLinOp::defineBC() 11 0.0002808 0.0002808 0.0002808 0.04% FabArray::ParallelCopy() 851 0.0002801 0.0002801 0.0002801 0.04% MultiFab::max() 11 0.0002626 0.0002626 0.0002626 0.03% MultiFab::Copy() 11 0.000249 0.000249 0.000249 0.03% MLCellLinOp::correctionResidual() 486 0.0002411 0.0002411 0.0002411 0.03% Castro::subcycle_advance_ctu() 10 0.0002314 0.0002314 0.0002314 0.03% Amr::timeStep() 10 0.000203 0.000203 0.000203 0.03% Castro::construct_new_gravity() 10 0.0001994 0.0001994 0.0001994 0.03% MLMG::MLRhsNormInf() 11 0.0001974 0.0001974 0.0001974 0.03% MLMG::mgVcycle() 81 0.0001921 0.0001921 0.0001921 0.02% MLLinOp::defineGrids() 11 0.0001719 0.0001719 0.0001719 0.02% StateData::checkPoint() 12 0.000134 0.000134 0.000134 0.02% MLMG:computeResOfCorrection() 405 0.0001197 0.0001197 0.0001197 0.02% MLMG::actualBottomSolve() 81 0.0001017 0.0001017 0.0001017 0.01% MLMG::mgVcycle_down::0 81 9.849e-05 9.849e-05 9.849e-05 0.01% Castro::initialize_advance() 10 9.264e-05 9.264e-05 9.264e-05 0.01% MLMG::mgVcycle_down::1 81 8.507e-05 8.507e-05 8.507e-05 0.01% MLMG::solve() 11 8.415e-05 8.415e-05 8.415e-05 0.01% MLMG::mgVcycle_down::2 81 8.404e-05 8.404e-05 8.404e-05 0.01% Castro::Castro() 1 8.354e-05 8.354e-05 8.354e-05 0.01% FabArrayBase::FB::FB() 56 8.229e-05 8.229e-05 8.229e-05 0.01% MLMG::mgVcycle_down::3 81 7.974e-05 7.974e-05 7.974e-05 0.01% MLMG::mgVcycle_down::4 81 7.911e-05 7.911e-05 7.911e-05 0.01% Castro::initialize_do_advance() 10 7.847e-05 7.847e-05 7.847e-05 0.01% Castro::clean_state() 62 7.82e-05 7.82e-05 7.82e-05 0.01% AmrLevel::checkPoint() 3 7.193e-05 7.193e-05 7.193e-05 0.01% MLMG::mgVcycle_up::4 81 6.143e-05 6.143e-05 6.143e-05 0.01% MLMG::mgVcycle_up::0 81 5.591e-05 5.591e-05 5.591e-05 0.01% MLMG::mgVcycle_up::3 81 5.569e-05 5.569e-05 5.569e-05 0.01% MLMG::mgVcycle_up::1 81 5.569e-05 5.569e-05 5.569e-05 0.01% Castro::finalize_advance() 10 5.539e-05 5.539e-05 5.539e-05 0.01% MLMG::mgVcycle_up::2 81 5.407e-05 5.407e-05 5.407e-05 0.01% 
MLCellLinOp::solutionResidual() 92 5.278e-05 5.278e-05 5.278e-05 0.01% Castro::advance() 10 4.505e-05 4.505e-05 4.505e-05 0.01% Castro::construct_new_source() 50 4.196e-05 4.196e-05 4.196e-05 0.01% Castro::swap_state_time_levels() 10 4.163e-05 4.163e-05 4.163e-05 0.01% StateData::define() 4 3.813e-05 3.813e-05 3.813e-05 0.00% MLMG::computeResidual() 81 3.681e-05 3.681e-05 3.681e-05 0.00% Castro::finalize_do_advance() 10 3.514e-05 3.514e-05 3.514e-05 0.00% Gravity::solve_phi_with_mlmg() 11 3.332e-05 3.332e-05 3.332e-05 0.00% Castro::enforce_consistent_e() 1 3.259e-05 3.259e-05 3.259e-05 0.00% Gravity::actual_multilevel_solve() 1 3.116e-05 3.116e-05 3.116e-05 0.00% MLMG::mgVcycle_bottom 81 3.015e-05 3.015e-05 3.015e-05 0.00% FillPatchSingleLevel 41 2.911e-05 2.911e-05 2.911e-05 0.00% makeSFC 55 2.609e-05 2.609e-05 2.609e-05 0.00% Castro::initMFs() 1 2.576e-05 2.576e-05 2.576e-05 0.00% Amr::writeSmallPlotFile() 1 2.545e-05 2.545e-05 2.545e-05 0.00% MLLinOp::define() 11 2.465e-05 2.465e-05 2.465e-05 0.00% Amr::defBaseLevel() 1 2.369e-05 2.369e-05 2.369e-05 0.00% Castro::buildMetrics() 1 2.32e-05 2.32e-05 2.32e-05 0.00% MLPoisson::define() 11 2.221e-05 2.221e-05 2.221e-05 0.00% Amr::FinalizeInit() 1 2.216e-05 2.216e-05 2.216e-05 0.00% Castro::construct_old_source() 50 1.872e-05 1.872e-05 1.872e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.793e-05 1.793e-05 1.793e-05 0.00% Castro::do_new_sources() 10 1.716e-05 1.716e-05 1.716e-05 0.00% Castro::do_old_sources() 10 1.579e-05 1.579e-05 1.579e-05 0.00% DistributionMapping::Distribute() 56 1.506e-05 1.506e-05 1.506e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.442e-05 1.442e-05 1.442e-05 0.00% Castro::apply_source_to_state() 20 1.127e-05 1.127e-05 1.127e-05 0.00% Castro::check_for_nan() 20 1.054e-05 1.054e-05 1.054e-05 0.00% Gravity::swapTimeLevels() 10 1.049e-05 1.049e-05 1.049e-05 0.00% Castro::construct_old_gravity() 10 9.433e-06 9.433e-06 9.433e-06 0.00% Castro::post_timestep() 10 8.766e-06 8.766e-06 8.766e-06 
0.00% Amr::initSubcycle() 1 8.507e-06 8.507e-06 8.507e-06 0.00% MLPoisson::prepareForSolve() 11 7.792e-06 7.792e-06 7.792e-06 0.00% AmrLevel::AmrLevel(dm) 1 7.509e-06 7.509e-06 7.509e-06 0.00% AmrLevel::checkPointPost() 3 7.485e-06 7.485e-06 7.485e-06 0.00% Castro::computeNewDt() 9 6.911e-06 6.911e-06 6.911e-06 0.00% MLMG::computeMLResidual() 11 6.904e-06 6.904e-06 6.904e-06 0.00% Amr::InitializeInit() 1 6.629e-06 6.629e-06 6.629e-06 0.00% Castro::retry_advance_ctu() 10 5.999e-06 5.999e-06 5.999e-06 0.00% MLMG::getGradSolution() 11 5.845e-06 5.845e-06 5.845e-06 0.00% MLMG::buildFineMask() 11 5.323e-06 5.323e-06 5.323e-06 0.00% MLMG::MLResNormInf() 11 4.626e-06 4.626e-06 4.626e-06 0.00% Castro::create_source_corrector() 10 4.08e-06 4.08e-06 4.08e-06 0.00% Gravity::set_mass_offset() 11 4.058e-06 4.058e-06 4.058e-06 0.00% Castro::post_init() 1 4.012e-06 4.012e-06 4.012e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.068e-06 3.068e-06 3.068e-06 0.00% Castro::FluxRegCrseInit 10 3.065e-06 3.065e-06 3.065e-06 0.00% Castro::computeInitialDt() 2 2.403e-06 2.403e-06 2.403e-06 0.00% Amr::init() 1 2.259e-06 2.259e-06 2.259e-06 0.00% AmrLevel::checkPointPre() 3 2.138e-06 2.138e-06 2.138e-06 0.00% Castro::FluxRegFineAdd() 10 2.048e-06 2.048e-06 2.048e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.76e-06 1.76e-06 1.76e-06 0.00% Castro::post_regrid() 1 1.383e-06 1.383e-06 1.383e-06 0.00% Amr::initialInit() 1 9.59e-07 9.59e-07 9.59e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. 
Max Max % -------------------------------------------------------------------------------------------- main() 1 0.7797 0.7797 0.7797 100.00% Amr::coarseTimeStep() 10 0.642 0.642 0.642 82.33% Amr::timeStep() 10 0.5569 0.5569 0.5569 71.43% Castro::advance() 10 0.5498 0.5498 0.5498 70.52% Castro::subcycle_advance_ctu() 10 0.5385 0.5385 0.5385 69.06% Castro::do_advance_ctu() 10 0.5382 0.5382 0.5382 69.03% Gravity::solve_phi_with_mlmg() 11 0.3021 0.3021 0.3021 38.74% Gravity::actual_solve_with_mlmg() 11 0.2938 0.2938 0.2938 37.68% Castro::construct_new_gravity() 10 0.2774 0.2774 0.2774 35.58% MLMG::solve() 11 0.2719 0.2719 0.2719 34.87% Gravity::solve_for_phi() 10 0.2619 0.2619 0.2619 33.59% MLMG::oneIter() 81 0.2576 0.2576 0.2576 33.04% MLMG::mgVcycle() 81 0.256 0.256 0.256 32.83% Castro::construct_ctu_hydro_source() 10 0.1871 0.1871 0.1871 23.99% VisMF::Write(FabArray) 11 0.1625 0.1625 0.1625 20.84% MLCellLinOp::smooth() 1620 0.1313 0.1313 0.1313 16.84% Amr::checkPoint() 3 0.1228 0.1228 0.1228 15.74% AmrLevel::checkPoint() 3 0.1178 0.1178 0.1178 15.10% StateData::checkPoint() 12 0.1177 0.1177 0.1177 15.09% Amr::init() 1 0.1133 0.1133 0.1133 14.53% MLCellLinOp::applyBC() 4379 0.09243 0.09243 0.09243 11.85% MLMG::mgVcycle_bottom 81 0.07868 0.07868 0.07868 10.09% MLMG::actualBottomSolve() 81 0.07864 0.07864 0.07864 10.09% MLCGSolver::bicgstab 81 0.07785 0.07785 0.07785 9.98% MLPoisson::Fsmooth() 3240 0.06123 0.06123 0.06123 7.85% Amr::initialInit() 1 0.04839 0.04839 0.04839 6.21% Amr::writePlotFile() 2 0.04747 0.04747 0.04747 6.09% Amr::FinalizeInit() 1 0.04423 0.04423 0.04423 5.67% Castro::post_init() 1 0.04286 0.04286 0.04286 5.50% Castro::clean_state() 62 0.04252 0.04252 0.04252 5.45% FillPatchIterator::Initialize 41 0.04247 0.04247 0.04247 5.45% FillPatchSingleLevel 41 0.04088 0.04088 0.04088 5.24% Gravity::multilevel_solve_for_new_phi() 1 0.04066 0.04066 0.04066 5.22% Gravity::actual_multilevel_solve() 1 0.04065 0.04065 0.04065 5.21% StateDataPhysBCFunct::() 41 
0.03693 0.03693 0.03693 4.74% MLCellLinOp::apply() 1128 0.03492 0.03492 0.03492 4.48% MLMG::mgVcycle_down::0 81 0.03431 0.03431 0.03431 4.40% MLMG::mgVcycle_up::0 81 0.02935 0.02935 0.02935 3.76% StateData::FillBoundary(geom) 328 0.02325 0.02325 0.02325 2.98% MultiFab::Dot() 1100 0.0213 0.0213 0.0213 2.73% MLCellLinOp::correctionResidual() 486 0.02055 0.02055 0.02055 2.64% Castro::initialize_do_advance() 10 0.01974 0.01974 0.01974 2.53% Castro::computeTemp() 63 0.01893 0.01893 0.01893 2.43% MLMG:computeResOfCorrection() 405 0.01772 0.01772 0.01772 2.27% MLPoisson::define() 11 0.01761 0.01761 0.01761 2.26% Gravity::get_new_grav_vector() 11 0.01747 0.01747 0.01747 2.24% MLMG::mgVcycle_down::1 81 0.01702 0.01702 0.01702 2.18% MLMG::mgVcycle_down::2 81 0.01661 0.01661 0.01661 2.13% MLMG::mgVcycle_down::3 81 0.01577 0.01577 0.01577 2.02% FabArray::FillBoundary() 3974 0.01529 0.01529 0.01529 1.96% MLMG::mgVcycle_down::4 81 0.01495 0.01495 0.01495 1.92% Castro::construct_old_gravity() 10 0.01472 0.01472 0.01472 1.89% Gravity::get_old_grav_vector() 10 0.01471 0.01471 0.01471 1.89% FillBoundary_nowait() 3974 0.01451 0.01451 0.01451 1.86% CGSolver::sxay() 1566 0.01414 0.01414 0.01414 1.81% FabArray::setVal() 1135 0.01383 0.01383 0.01383 1.77% MultiFab::LinComb() 1566 0.01374 0.01374 0.01374 1.76% FabArray::ParallelCopy() 851 0.01352 0.01352 0.01352 1.73% FabArray::ParallelCopy_nowait() 851 0.01324 0.01324 0.01324 1.70% Castro::normalize_species() 62 0.01292 0.01292 0.01292 1.66% MLMG::mgVcycle_up::2 81 0.01272 0.01272 0.01272 1.63% MLCGSolver::ParallelAllReduce 1495 0.01269 0.01269 0.01269 1.63% MLMG::mgVcycle_up::1 81 0.01249 0.01249 0.01249 1.60% MLCellLinOp::defineAuxData() 11 0.01246 0.01246 0.01246 1.60% MLMG::mgVcycle_up::3 81 0.01202 0.01202 0.01202 1.54% MLMG::addInterpCorrection() 405 0.012 0.012 0.012 1.54% MLMG::mgVcycle_up::4 81 0.0119 0.0119 0.0119 1.53% Castro::expand_state() 10 0.01172 0.01172 0.01172 1.50% amrex::average_down 405 0.01149 0.01149 0.01149 1.47% 
Castro::initialize_advance() 10 0.01125 0.01125 0.01125 1.44% MLPoisson::Fapply() 1128 0.01124 0.01124 0.01124 1.44% Castro::do_new_sources() 10 0.01081 0.01081 0.01081 1.39% Castro::do_old_sources() 10 0.01001 0.01001 0.01001 1.28% Castro::enforce_min_density() 62 0.009637 0.009637 0.009637 1.24% Gravity::fill_multipole_BCs() 11 0.008027 0.008027 0.008027 1.03% MLCellLinOp::solutionResidual() 92 0.007001 0.007001 0.007001 0.90% Castro::post_timestep() 10 0.006893 0.006893 0.006893 0.88% MultiFab::Xpay() 578 0.006394 0.006394 0.006394 0.82% MLMG::computeResidual() 81 0.006019 0.006019 0.006019 0.77% Castro::reset_internal_energy(MultiFab) 63 0.005709 0.005709 0.005709 0.73% Castro::estTimeStep() 21 0.005284 0.005284 0.005284 0.68% MLMG::prepareForSolve() 11 0.005003 0.005003 0.005003 0.64% MLCellLinOp::defineBC() 11 0.004876 0.004876 0.004876 0.63% BndryData::define() 11 0.004595 0.004595 0.004595 0.59% Amr::InitializeInit() 1 0.004152 0.004152 0.004152 0.53% Amr::defBaseLevel() 1 0.004145 0.004145 0.004145 0.53% Castro::initData() 1 0.003648 0.003648 0.003648 0.47% Castro::construct_new_source() 50 0.003271 0.003271 0.003271 0.42% Castro::construct_new_gravity_source() 10 0.00323 0.00323 0.00323 0.41% Castro::construct_old_source() 50 0.00259 0.00259 0.00259 0.33% Castro::computeNewDt() 9 0.002575 0.002575 0.002575 0.33% Castro::construct_old_gravity_source() 10 0.002571 0.002571 0.002571 0.33% MLMG::ResNormInf() 92 0.001899 0.001899 0.001899 0.24% Castro::apply_source_to_state() 20 0.00183 0.00183 0.00183 0.23% MultiFab::Saxpy() 20 0.001819 0.001819 0.001819 0.23% Castro::enforce_speed_limit() 62 0.001786 0.001786 0.001786 0.23% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001587 0.001587 0.001587 0.20% MLCellLinOp::setLevelBC() 11 0.001505 0.001505 0.001505 0.19% Castro::reset_internal_energy(Fab) 504 0.001434 0.001434 0.001434 0.18% FabArrayBase::getCPC() 1313 0.001412 0.001412 0.001412 0.18% MLMG::getGradSolution() 11 0.001403 0.001403 0.001403 0.18% 
MLCellLinOp::compGrad() 11 0.001397 0.001397 0.001397 0.18% FabArray::mult() 43 0.001314 0.001314 0.001314 0.17% FabArray::setDomainBndry() 41 0.001283 0.001283 0.001283 0.16% Castro::check_for_nan() 20 0.001181 0.001181 0.001181 0.15% MultiFab::contains_nan() 20 0.001171 0.001171 0.001171 0.15% Castro::post_regrid() 1 0.001169 0.001169 0.001169 0.15% MLPoisson::prepareForSolve() 11 0.001131 0.001131 0.001131 0.15% MLCellLinOp::prepareForSolve() 11 0.001123 0.001123 0.001123 0.14% MLMG::computeMLResidual() 11 0.001026 0.001026 0.001026 0.13% Gravity::update_max_rhs() 11 0.0008401 0.0008401 0.0008401 0.11% FabArrayBase::getFB() 3974 0.0006814 0.0006814 0.0006814 0.09% Castro::computeInitialDt() 2 0.0006779 0.0006779 0.0006779 0.09% FabArrayBase::CPC::define() 454 0.0006607 0.0006607 0.0006607 0.08% Amr::InitAmr() 1 0.0004505 0.0004505 0.0004505 0.06% Gravity::swapTimeLevels() 10 0.0004273 0.0004273 0.0004273 0.05% Castro::Castro() 1 0.0004239 0.0004239 0.0004239 0.05% MultiFab::max() 11 0.0002626 0.0002626 0.0002626 0.03% MLMG::MLResNormInf() 11 0.0002578 0.0002578 0.0002578 0.03% MLLinOp::define() 11 0.0002528 0.0002528 0.0002528 0.03% MultiFab::Copy() 11 0.000249 0.000249 0.000249 0.03% MLLinOp::defineGrids() 11 0.0002281 0.0002281 0.0002281 0.03% MLMG::MLRhsNormInf() 11 0.0001974 0.0001974 0.0001974 0.03% Castro::buildMetrics() 1 0.0001534 0.0001534 0.0001534 0.02% FabArrayBase::FB::FB() 56 8.229e-05 8.229e-05 8.229e-05 0.01% Castro::finalize_advance() 10 6.05e-05 6.05e-05 6.05e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.45e-05 5.45e-05 5.45e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.564e-05 4.564e-05 4.564e-05 0.01% Castro::swap_state_time_levels() 10 4.163e-05 4.163e-05 4.163e-05 0.01% makeSFC 55 4.008e-05 4.008e-05 4.008e-05 0.01% StateData::define() 4 3.813e-05 3.813e-05 3.813e-05 0.00% Castro::finalize_do_advance() 10 3.514e-05 3.514e-05 3.514e-05 0.00% Castro::enforce_consistent_e() 1 3.259e-05 3.259e-05 3.259e-05 0.00% Castro::initMFs() 1 2.576e-05 2.576e-05 
2.576e-05 0.00%
Amr::writeSmallPlotFile() 1 2.545e-05 2.545e-05 2.545e-05 0.00%
DistributionMapping::Distribute() 56 1.506e-05 1.506e-05 1.506e-05 0.00%
Amr::initSubcycle() 1 8.507e-06 8.507e-06 8.507e-06 0.00%
AmrLevel::checkPointPost() 3 7.485e-06 7.485e-06 7.485e-06 0.00%
Castro::retry_advance_ctu() 10 5.999e-06 5.999e-06 5.999e-06 0.00%
MLMG::buildFineMask() 11 5.323e-06 5.323e-06 5.323e-06 0.00%
DistributionMapping::SFCProcessorMapDoIt() 1 4.139e-06 4.139e-06 4.139e-06 0.00%
Castro::create_source_corrector() 10 4.08e-06 4.08e-06 4.08e-06 0.00%
Gravity::set_mass_offset() 11 4.058e-06 4.058e-06 4.058e-06 0.00%
Castro::FluxRegCrseInit 10 3.065e-06 3.065e-06 3.065e-06 0.00%
AmrLevel::checkPointPre() 3 2.138e-06 2.138e-06 2.138e-06 0.00%
Castro::FluxRegFineAdd() 10 2.048e-06 2.048e-06 2.048e-06 0.00%
MLLinOp::makeSubCommunicator() 11 1.76e-06 1.76e-06 1.76e-06 0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
[TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
[TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.07-4-gcbdc6580ee3d) finalized

Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.07-4-gcbdc6580ee3d) initialized
Starting run at 08:38:50 UTC on 2022-07-05.
Successfully read inputs file ...

Castro git describe: 22.06-15-gd68821af9
AMReX git describe: 22.07-4-gcbdc6580e
Microphysics git describe: 22.07-5-gcfab8d9a
reading extern runtime parameters ...
3 Species:
C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.427867407
Restart time = 0.046593533 seconds.

[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.051750998
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05

[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.048434736
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05

[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.047832244
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05

[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.060426766
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05

[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.077332139
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05

PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.025554287 seconds

Ending run at 08:38:51 UTC on 2022-07-05.
Run time = 0.358858542
Run time without initialization = 0.311728087
Average number of zones advanced per microsecond: 4.205
Average number of zones advanced per microsecond per rank: 4.205
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728

TinyProfiler total time across processes [min...avg...max]: 0.3589 ... 0.3589 ... 0.3589

--------------------------------------------------------------------------------------------
Name NCalls Excl. Min Excl. Avg Excl.
Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0901 0.0901 0.0901 25.11% VisMF::Read() 3 0.03945 0.03945 0.03945 10.99% MLCellLinOp::applyBC() 1946 0.0344 0.0344 0.0344 9.59% MLPoisson::Fsmooth() 1440 0.02703 0.02703 0.02703 7.53% VisMF::Write(FabArray) 1 0.02405 0.02405 0.02405 6.70% StateData::FillBoundary(geom) 160 0.01145 0.01145 0.01145 3.19% MLCGSolver::bicgstab 36 0.0101 0.0101 0.0101 2.82% MultiFab::Dot() 484 0.009321 0.009321 0.009321 2.60% Castro::normalize_species() 30 0.009231 0.009231 0.009231 2.57% Castro::computeTemp() 30 0.007979 0.007979 0.007979 2.22% FabArray::setVal() 537 0.006627 0.006627 0.006627 1.85% FillBoundary_nowait() 1766 0.006177 0.006177 0.006177 1.72% MLCellLinOp::defineAuxData() 6 0.006043 0.006043 0.006043 1.68% MultiFab::LinComb() 690 0.005988 0.005988 0.005988 1.67% FabArray::ParallelCopy_nowait() 380 0.005843 0.005843 0.005843 1.63% StateDataPhysBCFunct::() 20 0.005285 0.005285 0.005285 1.47% Gravity::fill_multipole_BCs() 6 0.005039 0.005039 0.005039 1.40% MLPoisson::Fapply() 500 0.004953 0.004953 0.004953 1.38% Castro::enforce_min_density() 30 0.004855 0.004855 0.004855 1.35% MLMG::addInterpCorrection() 180 0.003148 0.003148 0.003148 0.88% Amr::restart() 1 0.003023 0.003023 0.003023 0.84% amrex::average_down 180 0.00291 0.00291 0.00291 0.81% MultiFab::Xpay() 258 0.002823 0.002823 0.002823 0.79% Castro::do_advance_ctu() 5 0.002135 0.002135 0.002135 0.59% BndryData::define() 6 0.00206 0.00206 0.00206 0.57% Castro::construct_new_gravity_source() 5 0.001693 0.001693 0.001693 0.47% Castro::estTimeStep() 10 0.001693 0.001693 0.001693 0.47% Castro::reset_internal_energy(MultiFab) 30 0.001653 0.001653 0.001653 0.46% Amr::writePlotFile() 1 0.001595 0.001595 0.001595 0.44% Castro::construct_old_gravity_source() 5 0.001455 0.001455 0.001455 0.41% Castro::enforce_speed_limit() 30 0.001216 0.001216 0.001216 0.34% 
Castro::reset_internal_energy(Fab) 240 0.001109 0.001109 0.001109 0.31% Gravity::get_old_grav_vector() 5 0.0009484 0.0009484 0.0009484 0.26% MultiFab::Saxpy() 10 0.0009205 0.0009205 0.0009205 0.26% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008684 0.0008684 0.0008684 0.24% Castro::expand_state() 5 0.0008672 0.0008672 0.0008672 0.24% Gravity::get_new_grav_vector() 5 0.0008665 0.0008665 0.0008665 0.24% MLMG::ResNormInf() 42 0.0008501 0.0008501 0.0008501 0.24% MLCellLinOp::setLevelBC() 6 0.0008047 0.0008047 0.0008047 0.22% Gravity::actual_solve_with_mlmg() 6 0.0007824 0.0007824 0.0007824 0.22% MLMG::oneIter() 36 0.0007349 0.0007349 0.0007349 0.20% FabArray::mult() 22 0.0006497 0.0006497 0.0006497 0.18% FabArray::setDomainBndry() 20 0.0006364 0.0006364 0.0006364 0.18% MLCellLinOp::prepareForSolve() 6 0.0006225 0.0006225 0.0006225 0.17% MultiFab::contains_nan() 10 0.0005938 0.0005938 0.0005938 0.17% MLMG::prepareForSolve() 6 0.0005738 0.0005738 0.0005738 0.16% MLCellLinOp::smooth() 720 0.0005441 0.0005441 0.0005441 0.15% MLCellLinOp::compGrad() 6 0.0004917 0.0004917 0.0004917 0.14% FabArrayBase::CPC::define() 244 0.0003855 0.0003855 0.0003855 0.11% Amr::InitAmr() 1 0.0003808 0.0003808 0.0003808 0.11% FabArrayBase::getCPC() 632 0.0003688 0.0003688 0.0003688 0.10% FabArray::FillBoundary() 1766 0.0003527 0.0003527 0.0003527 0.10% FabArrayBase::getFB() 1766 0.000252 0.000252 0.000252 0.07% Gravity::update_max_rhs() 6 0.000241 0.000241 0.000241 0.07% main() 1 0.0002387 0.0002387 0.0002387 0.07% Castro::subcycle_advance_ctu() 5 0.0002348 0.0002348 0.0002348 0.07% MLCellLinOp::apply() 500 0.0002168 0.0002168 0.0002168 0.06% Gravity::solve_for_phi() 5 0.0002077 0.0002077 0.0002077 0.06% CGSolver::sxay() 690 0.000188 0.000188 0.000188 0.05% Amr::coarseTimeStep() 5 0.0001624 0.0001624 0.0001624 0.05% Castro::construct_new_gravity() 5 0.0001601 0.0001601 0.0001601 0.04% Castro::create_source_corrector() 5 0.000158 0.000158 0.000158 0.04% MLCellLinOp::defineBC() 6 0.000148 
0.000148 0.000148 0.04% MultiFab::max() 6 0.000138 0.000138 0.000138 0.04% MultiFab::Copy() 6 0.0001358 0.0001358 0.0001358 0.04% FillPatchIterator::Initialize 20 0.0001358 0.0001358 0.0001358 0.04% Castro::construct_new_source() 25 0.0001346 0.0001346 0.0001346 0.04% FabArray::ParallelCopy() 380 0.0001275 0.0001275 0.0001275 0.04% MLCGSolver::ParallelAllReduce 659 0.0001253 0.0001253 0.0001253 0.03% Amr::timeStep() 5 0.0001093 0.0001093 0.0001093 0.03% MLMG::MLRhsNormInf() 6 0.0001054 0.0001054 0.0001054 0.03% MLLinOp::defineGrids() 6 0.0001033 0.0001033 0.0001033 0.03% MLCellLinOp::correctionResidual() 216 0.000103 0.000103 0.000103 0.03% Castro::post_timestep() 5 9.947e-05 9.947e-05 9.947e-05 0.03% Castro::initialize_advance() 5 9.774e-05 9.774e-05 9.774e-05 0.03% Castro::initialize_do_advance() 5 9.65e-05 9.65e-05 9.65e-05 0.03% Castro::advance() 5 9.591e-05 9.591e-05 9.591e-05 0.03% Castro::construct_old_source() 25 8.875e-05 8.875e-05 8.875e-05 0.02% MLMG::mgVcycle() 36 8.301e-05 8.301e-05 8.301e-05 0.02% AmrLevel::restart() 1 7.779e-05 7.779e-05 7.779e-05 0.02% StateData::restartDoit() 4 6.971e-05 6.971e-05 6.971e-05 0.02% FabArrayBase::FB::FB() 26 6.132e-05 6.132e-05 6.132e-05 0.02% MLMG:computeResOfCorrection() 180 5.48e-05 5.48e-05 5.48e-05 0.02% Castro::construct_old_gravity() 5 4.503e-05 4.503e-05 4.503e-05 0.01% MLMG::actualBottomSolve() 36 4.38e-05 4.38e-05 4.38e-05 0.01% Castro::clean_state() 30 3.98e-05 3.98e-05 3.98e-05 0.01% MLMG::mgVcycle_down::0 36 3.804e-05 3.804e-05 3.804e-05 0.01% MLMG::solve() 6 3.683e-05 3.683e-05 3.683e-05 0.01% MLMG::mgVcycle_down::1 36 3.566e-05 3.566e-05 3.566e-05 0.01% MLMG::mgVcycle_down::2 36 3.491e-05 3.491e-05 3.491e-05 0.01% MLMG::mgVcycle_down::4 36 3.265e-05 3.265e-05 3.265e-05 0.01% Castro::buildMetrics() 1 3.191e-05 3.191e-05 3.191e-05 0.01% MLMG::mgVcycle_down::3 36 3.174e-05 3.174e-05 3.174e-05 0.01% Gravity::actual_multilevel_solve() 1 3.067e-05 3.067e-05 3.067e-05 0.01% Castro::post_restart() 1 2.994e-05 
2.994e-05  2.994e-05  0.01%
Castro::swap_state_time_levels()  5  2.782e-05  2.782e-05  2.782e-05  0.01%
Castro::initMFs()  1  2.674e-05  2.674e-05  2.674e-05  0.01%
MLMG::mgVcycle_up::4  36  2.652e-05  2.652e-05  2.652e-05  0.01%
Amr::writeSmallPlotFile()  1  2.642e-05  2.642e-05  2.642e-05  0.01%
Castro::finalize_advance()  5  2.571e-05  2.571e-05  2.571e-05  0.01%
MLLinOp::define()  6  2.539e-05  2.539e-05  2.539e-05  0.01%
MLMG::mgVcycle_up::0  36  2.361e-05  2.361e-05  2.361e-05  0.01%
MLCellLinOp::solutionResidual()  42  2.332e-05  2.332e-05  2.332e-05  0.01%
MLMG::mgVcycle_up::3  36  2.315e-05  2.315e-05  2.315e-05  0.01%
MLMG::mgVcycle_up::2  36  2.289e-05  2.289e-05  2.289e-05  0.01%
MLMG::mgVcycle_up::1  36  2.112e-05  2.112e-05  2.112e-05  0.01%
Castro::finalize_do_advance()  5  1.869e-05  1.869e-05  1.869e-05  0.01%
Gravity::multilevel_solve_for_new_phi()  1  1.756e-05  1.756e-05  1.756e-05  0.00%
MLMG::computeResidual()  36  1.593e-05  1.593e-05  1.593e-05  0.00%
MLPoisson::define()  6  1.538e-05  1.538e-05  1.538e-05  0.00%
Gravity::solve_phi_with_mlmg()  6  1.489e-05  1.489e-05  1.489e-05  0.00%
MLMG::mgVcycle_bottom  36  1.427e-05  1.427e-05  1.427e-05  0.00%
makeSFC  30  1.372e-05  1.372e-05  1.372e-05  0.00%
FillPatchSingleLevel  20  1.324e-05  1.324e-05  1.324e-05  0.00%
Castro::do_new_sources()  5  9.729e-06  9.729e-06  9.729e-06  0.00%
DistributionMapping::Distribute()  31  8.634e-06  8.634e-06  8.634e-06  0.00%
Castro::do_old_sources()  5  8.379e-06  8.379e-06  8.379e-06  0.00%
Amr::initSubcycle()  1  8.335e-06  8.335e-06  8.335e-06  0.00%
MLLinOp::makeAgglomeratedDMap  6  7.358e-06  7.358e-06  7.358e-06  0.00%
Castro::check_for_nan()  10  6.581e-06  6.581e-06  6.581e-06  0.00%
Castro::apply_source_to_state()  10  6.189e-06  6.189e-06  6.189e-06  0.00%
MLPoisson::prepareForSolve()  6  4.69e-06  4.69e-06  4.69e-06  0.00%
Gravity::swapTimeLevels()  5  4.551e-06  4.551e-06  4.551e-06  0.00%
MLMG::buildFineMask()  6  3.685e-06  3.685e-06  3.685e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.421e-06  3.421e-06  3.421e-06  0.00%
MLMG::computeMLResidual()  6  3.081e-06  3.081e-06  3.081e-06  0.00%
Castro::computeNewDt()  5  2.956e-06  2.956e-06  2.956e-06  0.00%
MLMG::getGradSolution()  6  2.918e-06  2.918e-06  2.918e-06  0.00%
MLMG::MLResNormInf()  6  2.502e-06  2.502e-06  2.502e-06  0.00%
Gravity::set_mass_offset()  6  2.188e-06  2.188e-06  2.188e-06  0.00%
Castro::retry_advance_ctu()  5  1.974e-06  1.974e-06  1.974e-06  0.00%
Castro::FluxRegCrseInit  5  1.757e-06  1.757e-06  1.757e-06  0.00%
Castro::FluxRegFineAdd()  5  1.19e-06  1.19e-06  1.19e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.144e-06  1.144e-06  1.144e-06  0.00%
Amr::init()  1  1.1e-06  1.1e-06  1.1e-06  0.00%
AmrLevel::AmrLevel()  1  9.3e-07  9.3e-07  9.3e-07  0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.3589  0.3589  0.3589  100.00%
Amr::coarseTimeStep()  5  0.2859  0.2859  0.2859  79.67%
Amr::timeStep()  5  0.2848  0.2848  0.2848  79.36%
Castro::advance()  5  0.2811  0.2811  0.2811  78.32%
Castro::subcycle_advance_ctu()  5  0.2751  0.2751  0.2751  76.65%
Castro::do_advance_ctu()  5  0.2748  0.2748  0.2748  76.58%
Castro::construct_new_gravity()  5  0.1421  0.1421  0.1421  39.59%
Gravity::solve_phi_with_mlmg()  6  0.1379  0.1379  0.1379  38.42%
Gravity::solve_for_phi()  5  0.1345  0.1345  0.1345  37.47%
Gravity::actual_solve_with_mlmg()  6  0.1327  0.1327  0.1327  36.98%
MLMG::solve()  6  0.1207  0.1207  0.1207  33.63%
MLMG::oneIter()  36  0.1138  0.1138  0.1138  31.70%
MLMG::mgVcycle()  36  0.113  0.113  0.113  31.49%
Castro::construct_ctu_hydro_source()  5  0.09012  0.09012  0.09012  25.11%
MLCellLinOp::smooth()  720  0.05817  0.05817  0.05817  16.21%
Amr::init()  1  0.04664  0.04664  0.04664  13.00%
Amr::restart()  1  0.04664  0.04664  0.04664  13.00%
MLCellLinOp::applyBC()  1946  0.04124  0.04124  0.04124  11.49%
AmrLevel::restart()  1  0.03965  0.03965  0.03965  11.05%
StateData::restartDoit()  4  0.03957  0.03957  0.03957  11.03%
VisMF::Read()  3  0.03945  0.03945  0.03945  10.99%
MLMG::mgVcycle_bottom  36  0.03446  0.03446  0.03446  9.60%
MLMG::actualBottomSolve()  36  0.03445  0.03445  0.03445  9.60%
MLCGSolver::bicgstab  36  0.0341  0.0341  0.0341  9.50%
MLPoisson::Fsmooth()  1440  0.02703  0.02703  0.02703  7.53%
Castro::clean_state()  30  0.02608  0.02608  0.02608  7.27%
Amr::writePlotFile()  1  0.02564  0.02564  0.02564  7.14%
VisMF::Write(FabArray)  1  0.02405  0.02405  0.02405  6.70%
FillPatchIterator::Initialize  20  0.01949  0.01949  0.01949  5.43%
FillPatchSingleLevel  20  0.01872  0.01872  0.01872  5.22%
StateDataPhysBCFunct::()  20  0.01674  0.01674  0.01674  4.66%
MLCellLinOp::apply()  500  0.01555  0.01555  0.01555  4.33%
MLMG::mgVcycle_down::0  36  0.01522  0.01522  0.01522  4.24%
MLMG::mgVcycle_up::0  36  0.01299  0.01299  0.01299  3.62%
Castro::initialize_do_advance()  5  0.01185  0.01185  0.01185  3.30%
StateData::FillBoundary(geom)  160  0.01145  0.01145  0.01145  3.19%
Castro::computeTemp()  30  0.01074  0.01074  0.01074  2.99%
MLPoisson::define()  6  0.00967  0.00967  0.00967  2.69%
MultiFab::Dot()  484  0.009321  0.009321  0.009321  2.60%
Castro::normalize_species()  30  0.009231  0.009231  0.009231  2.57%
MLCellLinOp::correctionResidual()  216  0.009093  0.009093  0.009093  2.53%
MLMG:computeResOfCorrection()  180  0.007846  0.007846  0.007846  2.19%
Castro::do_new_sources()  5  0.007652  0.007652  0.007652  2.13%
MLMG::mgVcycle_down::1  36  0.007548  0.007548  0.007548  2.10%
Gravity::get_new_grav_vector()  5  0.007455  0.007455  0.007455  2.08%
Castro::construct_old_gravity()  5  0.007451  0.007451  0.007451  2.08%
Gravity::get_old_grav_vector()  5  0.007406  0.007406  0.007406  2.06%
MLMG::mgVcycle_down::2  36  0.007361  0.007361  0.007361  2.05%
MLMG::mgVcycle_down::3  36  0.006934  0.006934  0.006934  1.93%
FabArray::FillBoundary()  1766  0.006843  0.006843  0.006843  1.91%
MLCellLinOp::defineAuxData()  6  0.006766  0.006766  0.006766  1.89%
MLMG::mgVcycle_down::4  36  0.006656  0.006656  0.006656  1.85%
FabArray::setVal()  537  0.006627  0.006627  0.006627  1.85%
FillBoundary_nowait()  1766  0.006491  0.006491  0.006491  1.81%
FabArray::ParallelCopy()  380  0.006347  0.006347  0.006347  1.77%
FabArray::ParallelCopy_nowait()  380  0.00622  0.00622  0.00622  1.73%
CGSolver::sxay()  690  0.006176  0.006176  0.006176  1.72%
MultiFab::LinComb()  690  0.005988  0.005988  0.005988  1.67%
Castro::initialize_advance()  5  0.005875  0.005875  0.005875  1.64%
Castro::do_old_sources()  5  0.005822  0.005822  0.005822  1.62%
MLMG::mgVcycle_up::2  36  0.005648  0.005648  0.005648  1.57%
MLCGSolver::ParallelAllReduce  659  0.005587  0.005587  0.005587  1.56%
MLMG::mgVcycle_up::1  36  0.005535  0.005535  0.005535  1.54%
MLMG::addInterpCorrection()  180  0.005318  0.005318  0.005318  1.48%
Castro::expand_state()  5  0.005316  0.005316  0.005316  1.48%
MLMG::mgVcycle_up::3  36  0.00531  0.00531  0.00531  1.48%
MLMG::mgVcycle_up::4  36  0.005282  0.005282  0.005282  1.47%
amrex::average_down  180  0.005114  0.005114  0.005114  1.42%
Gravity::fill_multipole_BCs()  6  0.005039  0.005039  0.005039  1.40%
MLPoisson::Fapply()  500  0.004953  0.004953  0.004953  1.38%
Castro::enforce_min_density()  30  0.004855  0.004855  0.004855  1.35%
Castro::post_restart()  1  0.003787  0.003787  0.003787  1.06%
Gravity::multilevel_solve_for_new_phi()  1  0.003657  0.003657  0.003657  1.02%
Gravity::actual_multilevel_solve()  1  0.003639  0.003639  0.003639  1.01%
Castro::post_timestep()  5  0.003621  0.003621  0.003621  1.01%
MLCellLinOp::solutionResidual()  42  0.00319  0.00319  0.00319  0.89%
MultiFab::Xpay()  258  0.002823  0.002823  0.002823  0.79%
Castro::reset_internal_energy(MultiFab)  30  0.002762  0.002762  0.002762  0.77%
MLCellLinOp::defineBC()  6  0.002731  0.002731  0.002731  0.76%
MLMG::prepareForSolve()  6  0.002729  0.002729  0.002729  0.76%
MLMG::computeResidual()  36  0.002648  0.002648  0.002648  0.74%
BndryData::define()  6  0.002583  0.002583  0.002583  0.72%
Castro::construct_new_source()  25  0.001828  0.001828  0.001828  0.51%
Castro::construct_new_gravity_source()  5  0.001693  0.001693  0.001693  0.47%
Castro::estTimeStep()  10  0.001693  0.001693  0.001693  0.47%
Castro::construct_old_source()  25  0.001544  0.001544  0.001544  0.43%
Castro::construct_old_gravity_source()  5  0.001455  0.001455  0.001455  0.41%
Castro::enforce_speed_limit()  30  0.001216  0.001216  0.001216  0.34%
Castro::reset_internal_energy(Fab)  240  0.001109  0.001109  0.001109  0.31%
Castro::computeNewDt()  5  0.000952  0.000952  0.000952  0.27%
Castro::apply_source_to_state()  10  0.0009266  0.0009266  0.0009266  0.26%
MultiFab::Saxpy()  10  0.0009205  0.0009205  0.0009205  0.26%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0008684  0.0008684  0.0008684  0.24%
MLMG::ResNormInf()  42  0.0008501  0.0008501  0.0008501  0.24%
MLCellLinOp::setLevelBC()  6  0.0008047  0.0008047  0.0008047  0.22%
MLMG::getGradSolution()  6  0.0007578  0.0007578  0.0007578  0.21%
MLCellLinOp::compGrad()  6  0.0007549  0.0007549  0.0007549  0.21%
FabArrayBase::getCPC()  632  0.0007543  0.0007543  0.0007543  0.21%
FabArray::mult()  22  0.0006497  0.0006497  0.0006497  0.18%
FabArray::setDomainBndry()  20  0.0006364  0.0006364  0.0006364  0.18%
MLPoisson::prepareForSolve()  6  0.0006272  0.0006272  0.0006272  0.17%
MLCellLinOp::prepareForSolve()  6  0.0006225  0.0006225  0.0006225  0.17%
Castro::check_for_nan()  10  0.0006004  0.0006004  0.0006004  0.17%
MultiFab::contains_nan()  10  0.0005938  0.0005938  0.0005938  0.17%
MLMG::computeMLResidual()  6  0.0005613  0.0005613  0.0005613  0.16%
Gravity::update_max_rhs()  6  0.0004557  0.0004557  0.0004557  0.13%
Amr::InitAmr()  1  0.0003892  0.0003892  0.0003892  0.11%
FabArrayBase::CPC::define()  244  0.0003855  0.0003855  0.0003855  0.11%
FabArrayBase::getFB()  1766  0.0003133  0.0003133  0.0003133  0.09%
Gravity::swapTimeLevels()  5  0.0002242  0.0002242  0.0002242  0.06%
MLLinOp::define()  6  0.0001582  0.0001582  0.0001582  0.04%
Castro::create_source_corrector()  5  0.000158  0.000158  0.000158  0.04%
Castro::buildMetrics()  1  0.0001491  0.0001491  0.0001491  0.04%
MultiFab::max()  6  0.000138  0.000138  0.000138  0.04%
MultiFab::Copy()  6  0.0001358  0.0001358  0.0001358  0.04%
MLMG::MLResNormInf()  6  0.0001344  0.0001344  0.0001344  0.04%
MLLinOp::defineGrids()  6  0.0001328  0.0001328  0.0001328  0.04%
MLMG::MLRhsNormInf()  6  0.0001054  0.0001054  0.0001054  0.03%
FabArrayBase::FB::FB()  26  6.132e-05  6.132e-05  6.132e-05  0.02%
Castro::finalize_advance()  5  2.866e-05  2.866e-05  2.866e-05  0.01%
MLLinOp::makeAgglomeratedDMap  6  2.835e-05  2.835e-05  2.835e-05  0.01%
Castro::swap_state_time_levels()  5  2.782e-05  2.782e-05  2.782e-05  0.01%
Castro::initMFs()  1  2.674e-05  2.674e-05  2.674e-05  0.01%
Amr::writeSmallPlotFile()  1  2.642e-05  2.642e-05  2.642e-05  0.01%
makeSFC  30  2.099e-05  2.099e-05  2.099e-05  0.01%
Castro::finalize_do_advance()  5  1.869e-05  1.869e-05  1.869e-05  0.01%
DistributionMapping::Distribute()  31  8.634e-06  8.634e-06  8.634e-06  0.00%
Amr::initSubcycle()  1  8.335e-06  8.335e-06  8.335e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  4.776e-06  4.776e-06  4.776e-06  0.00%
MLMG::buildFineMask()  6  3.685e-06  3.685e-06  3.685e-06  0.00%
Gravity::set_mass_offset()  6  2.188e-06  2.188e-06  2.188e-06  0.00%
Castro::retry_advance_ctu()  5  1.974e-06  1.974e-06  1.974e-06  0.00%
Castro::FluxRegCrseInit  5  1.757e-06  1.757e-06  1.757e-06  0.00%
Castro::FluxRegFineAdd()  5  1.19e-06  1.19e-06  1.19e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.144e-06  1.144e-06  1.144e-06  0.00%
AmrLevel::AmrLevel()  1  9.3e-07  9.3e-07  9.3e-07  0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4)  :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4)  :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.07-4-gcbdc6580ee3d) finalized