Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.03-33-g9907ac197518) initialized Starting run at 08:24:16 UTC on 2022-03-29. Successfully read inputs file ... Castro git describe: 22.02-19-ga92ac477d AMReX git describe: 22.03-33-g9907ac197 Microphysics git describe: 22.03-13-gf58972c7 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.038383667 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.022751884 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.1083694 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.050069756 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.049389934 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.058366531 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.059792955 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.035319307 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.050633843 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.047697719 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.047385604 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.06246141 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.065941609 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.035419661 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.02212955 seconds Ending run at 08:24:17 UTC on 2022-03-29. Run time = 0.816050611 Run time without initialization = 0.693473535 Average number of zones advanced per microsecond: 3.780 Average number of zones advanced per microsecond per rank: 3.780 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.8161 ... 0.8161 ... 0.8161 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.2125 0.2125 0.2125 26.04% VisMF::Write(FabArray) 11 0.1471 0.1471 0.1471 18.02% MLCellLinOp::applyBC() 4433 0.09139 0.09139 0.09139 11.20% MLPoisson::Fsmooth() 3280 0.05896 0.05896 0.05896 7.22% FabArray::setVal() 1144 0.02578 0.02578 0.02578 3.16% StateData::FillBoundary(geom) 328 0.02283 0.02283 0.02283 2.80% MLCGSolver::bicgstab 82 0.02134 0.02134 0.02134 2.62% MultiFab::Dot() 1114 0.02127 0.02127 0.02127 2.61% FillBoundary_nowait() 4023 0.01688 0.01688 0.01688 2.07% FabArray::ParallelCopy_nowait() 861 0.0159 0.0159 0.0159 1.95% StateDataPhysBCFunct::() 41 0.01575 0.01575 0.01575 1.93% MultiFab::LinComb() 1586 0.01213 0.01213 0.01213 1.49% Castro::computeTemp() 63 0.01203 0.01203 0.01203 1.47% Gravity::fill_multipole_BCs() 11 0.01126 0.01126 0.01126 1.38% MLPoisson::Fapply() 1142 0.01041 0.01041 0.01041 1.28% MLCellLinOp::defineAuxData() 11 0.009904 0.009904 0.009904 1.21% Castro::enforce_min_density() 62 0.007664 0.007664 0.007664 0.94% MLMG::addInterpCorrection() 410 0.006893 0.006893 0.006893 0.84% amrex::average_down 410 0.006439 0.006439 0.006439 0.79% MultiFab::Xpay() 585 0.005986 0.005986 0.005986 0.73% FabArray::setDomainBndry() 41 0.005656 0.005656 0.005656 0.69% Castro::reset_internal_energy() 63 0.005282 0.005282 0.005282 0.65% Castro::expand_state() 10 0.004768 0.004768 0.004768 0.58% Castro::estTimeStep() 21 0.004727 0.004727 0.004727 0.58% Castro::normalize_species() 62 0.004417 0.004417 0.004417 0.54% Castro::enforce_speed_limit() 62 0.004319 0.004319 0.004319 0.53% Amr::checkPoint() 3 0.003884 0.003884 0.003884 0.48% BndryData::define() 11 0.003581 0.003581 0.003581 0.44% Castro::do_advance_ctu() 10 0.003266 0.003266 0.003266 0.40% Amr::writePlotFile() 2 0.003051 0.003051 0.003051 0.37% Castro::construct_new_gravity_source() 10 0.003014 0.003014 0.003014 0.37% Gravity::get_new_grav_vector() 11 0.002725 0.002725 0.002725 0.33% Castro::construct_old_gravity_source() 10 0.00247 0.00247 0.00247 0.30% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.002288 0.002288 0.002288 0.28% MLMG::ResNormInf() 93 0.001927 0.001927 0.001927 0.24% MLCellLinOp::compGrad() 11 0.001923 0.001923 0.001923 0.24% MultiFab::Saxpy() 20 0.001802 0.001802 0.001802 0.22% Gravity::get_old_grav_vector() 10 0.001739 0.001739 0.001739 0.21% MLMG::oneIter() 82 0.001683 0.001683 0.001683 0.21% MLCellLinOp::setLevelBC() 11 0.001518 0.001518 0.001518 0.19% Gravity::actual_solve_with_mlmg() 11 0.001387 0.001387 0.001387 0.17% MLCellLinOp::prepareForSolve() 11 0.001325 0.001325 0.001325 0.16% FabArray::mult() 43 0.001321 0.001321 0.001321 0.16% Castro::initData() 1 0.001187 0.001187 0.001187 0.15% MLCellLinOp::smooth() 1640 0.001091 0.001091 0.001091 0.13% FabArray::FillBoundary() 4023 0.0008439 0.0008439 0.0008439 0.10% FabArrayBase::getCPC() 1323 0.0008339 0.0008339 0.0008339 0.10% MLMG::prepareForSolve() 11 0.0007502 0.0007502 0.0007502 0.09% FabArrayBase::CPC::define() 454 0.000721 0.000721 0.000721 0.09% FabArrayBase::getFB() 4023 0.0006843 0.0006843 0.0006843 0.08% Gravity::update_max_rhs() 11 0.0005546 0.0005546 0.0005546 0.07% MultiFab::Copy() 11 0.0005005 0.0005005 0.0005005 0.06% MLCellLinOp::apply() 1142 0.0004883 0.0004883 0.0004883 0.06% CGSolver::sxay() 1586 0.0004661 0.0004661 0.0004661 0.06% Amr::InitAmr() 1 0.0004544 0.0004544 0.0004544 0.06% Gravity::solve_for_phi() 10 0.0004398 0.0004398 0.0004398 0.05% MLMG::mgVcycle() 82 0.0003788 0.0003788 0.0003788 0.05% MLCGSolver::ParallelAllReduce 1514 0.0003264 0.0003264 0.0003264 0.04% main() 1 0.0003179 0.0003179 0.0003179 0.04% MultiFab::min() 10 0.0003121 0.0003121 0.0003121 0.04% FabArray::ParallelCopy() 861 0.0003052 0.0003052 0.0003052 0.04% Castro::construct_new_gravity() 10 0.0002595 0.0002595 0.0002595 0.03% FillPatchIterator::Initialize 41 0.0002572 0.0002572 0.0002572 0.03% MultiFab::max() 11 0.0002505 0.0002505 0.0002505 0.03% Amr::coarseTimeStep() 10 0.0002398 0.0002398 0.0002398 0.03% MLCellLinOp::correctionResidual() 492 0.0002311 0.0002311 0.0002311 0.03% Gravity::actual_multilevel_solve() 1 0.0002206 0.0002206 0.0002206 0.03% Amr::timeStep() 10 0.0002049 0.0002049 0.0002049 0.03% MLMG::MLRhsNormInf() 11 0.0001962 0.0001962 0.0001962 0.02% MLCellLinOp::defineBC() 11 0.0001859 0.0001859 0.0001859 0.02% MLLinOp::defineGrids() 11 0.0001715 0.0001715 0.0001715 0.02% Castro::subcycle_advance_ctu() 10 0.000162 0.000162 0.000162 0.02% Amr::defBaseLevel() 1 0.0001503 0.0001503 0.0001503 0.02% StateData::checkPoint() 12 0.0001364 0.0001364 0.0001364 0.02% MLMG:computeResOfCorrection() 410 0.0001331 0.0001331 0.0001331 0.02% MLMG::actualBottomSolve() 82 0.0001097 0.0001097 0.0001097 0.01% MLMG::mgVcycle_down::0 82 9.331e-05 9.331e-05 9.331e-05 0.01% FabArrayBase::FB::FB() 56 8.899e-05 8.899e-05 8.899e-05 0.01% MLMG::mgVcycle_down::1 82 8.634e-05 8.634e-05 8.634e-05 0.01% MLMG::mgVcycle_down::2 82 8.409e-05 8.409e-05 8.409e-05 0.01% MLMG::mgVcycle_down::4 82 8.201e-05 8.201e-05 8.201e-05 0.01% MLMG::solve() 11 8.172e-05 8.172e-05 8.172e-05 0.01% MLMG::mgVcycle_down::3 82 8.117e-05 8.117e-05 8.117e-05 0.01% Castro::initialize_advance() 10 7.774e-05 7.774e-05 7.774e-05 0.01% AmrLevel::checkPoint() 3 7.315e-05 7.315e-05 7.315e-05 0.01% Castro::clean_state() 62 7.185e-05 7.185e-05 7.185e-05 0.01% MLMG::mgVcycle_up::4 82 6.138e-05 6.138e-05 6.138e-05 0.01% Castro::advance() 10 6.134e-05 6.134e-05 6.134e-05 0.01% Castro::finalize_advance() 10 5.906e-05 5.906e-05 5.906e-05 0.01% Castro::initialize_do_advance() 10 5.529e-05 5.529e-05 5.529e-05 0.01% Castro::post_timestep() 10 5.387e-05 5.387e-05 5.387e-05 0.01% MLCellLinOp::solutionResidual() 93 5.332e-05 5.332e-05 5.332e-05 0.01% MLMG::mgVcycle_up::0 82 4.998e-05 4.998e-05 4.998e-05 0.01% MLMG::mgVcycle_up::3 82 4.851e-05 4.851e-05 4.851e-05 0.01% MLMG::mgVcycle_up::1 82 4.821e-05 4.821e-05 4.821e-05 0.01% MLMG::mgVcycle_up::2 82 4.665e-05 4.665e-05 4.665e-05 0.01% StateData::define() 4 4.522e-05 4.522e-05 4.522e-05 0.01% Castro::construct_new_source() 50 4.26e-05 4.26e-05 4.26e-05 0.01% Castro::swap_state_time_levels() 10 3.897e-05 3.897e-05 3.897e-05 0.00% Castro::finalize_do_advance() 10 3.773e-05 3.773e-05 3.773e-05 0.00% MLMG::computeResidual() 82 3.729e-05 3.729e-05 3.729e-05 0.00% MLMG::mgVcycle_bottom 82 3.659e-05 3.659e-05 3.659e-05 0.00% Castro::enforce_consistent_e() 1 3.296e-05 3.296e-05 3.296e-05 0.00% FillPatchSingleLevel 41 2.917e-05 2.917e-05 2.917e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.832e-05 2.832e-05 2.832e-05 0.00% MLLinOp::define() 11 2.793e-05 2.793e-05 2.793e-05 0.00% makeSFC 55 2.658e-05 2.658e-05 2.658e-05 0.00% Amr::writeSmallPlotFile() 1 2.46e-05 2.46e-05 2.46e-05 0.00% MLPoisson::define() 11 2.364e-05 2.364e-05 2.364e-05 0.00% Amr::FinalizeInit() 1 2.242e-05 2.242e-05 2.242e-05 0.00% Castro::construct_old_source() 50 2.125e-05 2.125e-05 2.125e-05 0.00% Castro::do_new_sources() 10 1.776e-05 1.776e-05 1.776e-05 0.00% DistributionMapping::Distribute() 56 1.658e-05 1.658e-05 1.658e-05 0.00% Castro::do_old_sources() 10 1.605e-05 1.605e-05 1.605e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.592e-05 1.592e-05 1.592e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.549e-05 1.549e-05 1.549e-05 0.00% Castro::apply_source_to_state() 20 1.205e-05 1.205e-05 1.205e-05 0.00% Castro::construct_old_gravity() 10 1.079e-05 1.079e-05 1.079e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.021e-05 1.021e-05 1.021e-05 0.00% Gravity::swapTimeLevels() 10 9.561e-06 9.561e-06 9.561e-06 0.00% Amr::initSubcycle() 1 9.287e-06 9.287e-06 9.287e-06 0.00% MLPoisson::prepareForSolve() 11 8.906e-06 8.906e-06 8.906e-06 0.00% MLMG::computeMLResidual() 11 6.717e-06 6.717e-06 6.717e-06 0.00% MLMG::buildFineMask() 11 6.175e-06 6.175e-06 6.175e-06 0.00% Castro::computeNewDt() 9 6.041e-06 6.041e-06 6.041e-06 0.00% Amr::InitializeInit() 1 5.87e-06 5.87e-06 5.87e-06 0.00% AmrLevel::checkPointPost() 3 5.677e-06 5.677e-06 5.677e-06 0.00% MLMG::getGradSolution() 11 5.517e-06 5.517e-06 5.517e-06 0.00% MLMG::MLResNormInf() 11 4.781e-06 4.781e-06 4.781e-06 0.00% Castro::create_source_corrector() 10 4.653e-06 4.653e-06 4.653e-06 0.00% Castro::retry_advance_ctu() 10 4.207e-06 4.207e-06 4.207e-06 0.00% Gravity::set_mass_offset() 11 4.099e-06 4.099e-06 4.099e-06 0.00% Castro::post_init() 1 3.744e-06 3.744e-06 3.744e-06 0.00% Castro::FluxRegFineAdd() 10 3.564e-06 3.564e-06 3.564e-06 0.00% Amr::init() 1 2.977e-06 2.977e-06 2.977e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.861e-06 2.861e-06 2.861e-06 0.00% Castro::computeInitialDt() 2 2.651e-06 2.651e-06 2.651e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.16e-06 2.16e-06 2.16e-06 0.00% AmrLevel::checkPointPre() 3 1.957e-06 1.957e-06 1.957e-06 0.00% Castro::post_regrid() 1 1.27e-06 1.27e-06 1.27e-06 0.00% Amr::initialInit() 1 1.043e-06 1.043e-06 1.043e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8161 0.8161 0.8161 100.00% Amr::coarseTimeStep() 10 0.6711 0.6711 0.6711 82.24% Amr::timeStep() 10 0.5975 0.5975 0.5975 73.22% Castro::advance() 10 0.5902 0.5902 0.5902 72.32% Castro::subcycle_advance_ctu() 10 0.5772 0.5772 0.5772 70.73% Castro::do_advance_ctu() 10 0.5771 0.5771 0.5771 70.71% Gravity::solve_phi_with_mlmg() 11 0.319 0.319 0.319 39.09% Gravity::actual_solve_with_mlmg() 11 0.3075 0.3075 0.3075 37.68% Castro::construct_new_gravity() 10 0.2891 0.2891 0.2891 35.42% MLMG::solve() 11 0.2853 0.2853 0.2853 34.95% Gravity::solve_for_phi() 10 0.2716 0.2716 0.2716 33.28% MLMG::oneIter() 82 0.2697 0.2697 0.2697 33.05% MLMG::mgVcycle() 82 0.268 0.268 0.268 32.84% Castro::construct_ctu_hydro_source() 10 0.2125 0.2125 0.2125 26.04% VisMF::Write(FabArray) 11 0.1471 0.1471 0.1471 18.02% MLCellLinOp::smooth() 1640 0.1416 0.1416 0.1416 17.35% Amr::init() 1 0.122 0.122 0.122 14.95% MLCellLinOp::applyBC() 4433 0.1099 0.1099 0.1099 13.47% Amr::checkPoint() 3 0.1092 0.1092 0.1092 13.39% AmrLevel::checkPoint() 3 0.1054 0.1054 0.1054 12.91% StateData::checkPoint() 12 0.1053 0.1053 0.1053 12.90% MLMG::mgVcycle_bottom 82 0.07654 0.07654 0.07654 9.38% MLMG::actualBottomSolve() 82 0.0765 0.0765 0.0765 9.37% MLCGSolver::bicgstab 82 0.07578 0.07578 0.07578 9.29% Amr::initialInit() 1 0.06072 0.06072 0.06072 7.44% MLPoisson::Fsmooth() 3280 0.05896 0.05896 0.05896 7.22% Amr::FinalizeInit() 1 0.05156 0.05156 0.05156 6.32% Castro::post_init() 1 0.05081 0.05081 0.05081 6.23% FillPatchIterator::Initialize 41 0.04866 0.04866 0.04866 5.96% Gravity::multilevel_solve_for_new_phi() 1 0.04815 0.04815 0.04815 5.90% Gravity::actual_multilevel_solve() 1 0.04813 0.04813 0.04813 5.90% Amr::writePlotFile() 2 0.045 0.045 0.045 5.51% FillPatchSingleLevel 41 0.04275 0.04275 0.04275 5.24% MLCellLinOp::apply() 1142 0.0387 0.0387 0.0387 4.74% StateDataPhysBCFunct::() 41 0.03857 0.03857 0.03857 4.73% MLMG::mgVcycle_down::0 82 0.03737 0.03737 0.03737 4.58% Castro::clean_state() 62 0.03305 0.03305 0.03305 4.05% MLMG::mgVcycle_up::0 82 0.03191 0.03191 0.03191 3.91% FabArray::setVal() 1144 0.02578 0.02578 0.02578 3.16% Castro::initialize_do_advance() 10 0.02376 0.02376 0.02376 2.91% StateData::FillBoundary(geom) 328 0.02283 0.02283 0.02283 2.80% MLCellLinOp::correctionResidual() 492 0.02185 0.02185 0.02185 2.68% MultiFab::Dot() 1114 0.02127 0.02127 0.02127 2.61% Gravity::get_new_grav_vector() 11 0.01971 0.01971 0.01971 2.42% MLMG:computeResOfCorrection() 410 0.01891 0.01891 0.01891 2.32% FabArray::FillBoundary() 4023 0.0185 0.0185 0.0185 2.27% MLMG::mgVcycle_down::1 82 0.01823 0.01823 0.01823 2.23% Castro::expand_state() 10 0.01822 0.01822 0.01822 2.23% FillBoundary_nowait() 4023 0.01766 0.01766 0.01766 2.16% MLMG::mgVcycle_down::2 82 0.01764 0.01764 0.01764 2.16% Castro::computeTemp() 63 0.01731 0.01731 0.01731 2.12% FabArray::ParallelCopy() 861 0.01707 0.01707 0.01707 2.09% MLPoisson::define() 11 0.01693 0.01693 0.01693 2.07% MLMG::mgVcycle_down::3 82 0.01682 0.01682 0.01682 2.06% FabArray::ParallelCopy_nowait() 861 0.01677 0.01677 0.01677 2.05% MLMG::mgVcycle_down::4 82 0.01603 0.01603 0.01603 1.96% Castro::construct_old_gravity() 10 0.01575 0.01575 0.01575 1.93% Gravity::get_old_grav_vector() 10 0.01574 0.01574 0.01574 1.93% MLMG::mgVcycle_up::2 82 0.01373 0.01373 0.01373 1.68% MLMG::mgVcycle_up::1 82 0.01348 0.01348 0.01348 1.65% MLMG::addInterpCorrection() 410 0.01335 0.01335 0.01335 1.64% MLMG::mgVcycle_up::3 82 0.01302 0.01302 0.01302 1.59% amrex::average_down 410 0.01291 0.01291 0.01291 1.58% MLMG::mgVcycle_up::4 82 0.01286 0.01286 0.01286 1.58% Castro::initialize_advance() 10 0.01284 0.01284 0.01284 1.57% MLCGSolver::ParallelAllReduce 1514 0.01281 0.01281 0.01281 1.57% CGSolver::sxay() 1586 0.01259 0.01259 0.01259 1.54% MultiFab::LinComb() 1586 0.01213 0.01213 0.01213 1.49% MLCellLinOp::defineAuxData() 11 0.01162 0.01162 0.01162 1.42% Gravity::fill_multipole_BCs() 11 0.01126 0.01126 0.01126 1.38% MLPoisson::Fapply() 1142 0.01041 0.01041 0.01041 1.28% Castro::do_new_sources() 10 0.01023 0.01023 0.01023 1.25% Amr::InitializeInit() 1 0.009163 0.009163 0.009163 1.12% Amr::defBaseLevel() 1 0.009158 0.009158 0.009158 1.12% Castro::do_old_sources() 10 0.007918 0.007918 0.007918 0.97% Castro::enforce_min_density() 62 0.007664 0.007664 0.007664 0.94% MLCellLinOp::solutionResidual() 93 0.007608 0.007608 0.007608 0.93% Castro::post_timestep() 10 0.007156 0.007156 0.007156 0.88% MLMG::computeResidual() 82 0.006579 0.006579 0.006579 0.81% MultiFab::Xpay() 585 0.005986 0.005986 0.005986 0.73% MLMG::prepareForSolve() 11 0.005703 0.005703 0.005703 0.70% FabArray::setDomainBndry() 41 0.005656 0.005656 0.005656 0.69% Castro::reset_internal_energy() 63 0.005282 0.005282 0.005282 0.65% MLCellLinOp::defineBC() 11 0.005027 0.005027 0.005027 0.62% BndryData::define() 11 0.004841 0.004841 0.004841 0.59% Castro::estTimeStep() 21 0.004727 0.004727 0.004727 0.58% Castro::normalize_species() 62 0.004417 0.004417 0.004417 0.54% Castro::enforce_speed_limit() 62 0.004319 0.004319 0.004319 0.53% Castro::initData() 1 0.003445 0.003445 0.003445 0.42% Castro::construct_new_source() 50 0.003056 0.003056 0.003056 0.37% Castro::construct_new_gravity_source() 10 0.003014 0.003014 0.003014 0.37% Castro::construct_old_source() 50 0.002491 0.002491 0.002491 0.31% Castro::construct_old_gravity_source() 10 0.00247 0.00247 0.00247 0.30% MLMG::getGradSolution() 11 0.002458 0.002458 0.002458 0.30% MLCellLinOp::compGrad() 11 0.002452 0.002452 0.002452 0.30% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.002288 0.002288 0.002288 0.28% Castro::computeNewDt() 9 0.002063 0.002063 0.002063 0.25% MLMG::ResNormInf() 93 0.001927 0.001927 0.001927 0.24% Castro::apply_source_to_state() 20 0.001814 0.001814 0.001814 0.22% MultiFab::Saxpy() 20 0.001802 0.001802 0.001802 0.22% FabArrayBase::getCPC() 1323 0.001555 0.001555 0.001555 0.19% MLCellLinOp::setLevelBC() 11 0.001518 0.001518 0.001518 0.19% MLPoisson::prepareForSolve() 11 0.001334 0.001334 0.001334 0.16% MLCellLinOp::prepareForSolve() 11 0.001325 0.001325 0.001325 0.16% FabArray::mult() 43 0.001321 0.001321 0.001321 0.16% Gravity::swapTimeLevels() 10 0.001085 0.001085 0.001085 0.13% MLMG::computeMLResidual() 11 0.001072 0.001072 0.001072 0.13% Gravity::update_max_rhs() 11 0.0009475 0.0009475 0.0009475 0.12% FabArrayBase::getFB() 4023 0.0007733 0.0007733 0.0007733 0.09% FabArrayBase::CPC::define() 454 0.000721 0.000721 0.000721 0.09% Castro::computeInitialDt() 2 0.0006709 0.0006709 0.0006709 0.08% Castro::post_regrid() 1 0.0005048 0.0005048 0.0005048 0.06% MultiFab::Copy() 11 0.0005005 0.0005005 0.0005005 0.06% Amr::InitAmr() 1 0.0004637 0.0004637 0.0004637 0.06% MultiFab::min() 10 0.0003121 0.0003121 0.0003121 0.04% MLLinOp::define() 11 0.0002588 0.0002588 0.0002588 0.03% MLMG::MLResNormInf() 11 0.0002542 0.0002542 0.0002542 0.03% MultiFab::max() 11 0.0002505 0.0002505 0.0002505 0.03% MLLinOp::defineGrids() 11 0.0002309 0.0002309 0.0002309 0.03% MLMG::MLRhsNormInf() 11 0.0001962 0.0001962 0.0001962 0.02% FabArrayBase::FB::FB() 56 8.899e-05 8.899e-05 8.899e-05 0.01% Castro::finalize_advance() 10 6.262e-05 6.262e-05 6.262e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.72e-05 5.72e-05 5.72e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.543e-05 5.543e-05 5.543e-05 0.01% StateData::define() 4 4.522e-05 4.522e-05 4.522e-05 0.01% makeSFC 55 4.17e-05 4.17e-05 4.17e-05 0.01% Castro::swap_state_time_levels() 10 3.897e-05 3.897e-05 3.897e-05 0.00% Castro::finalize_do_advance() 10 3.773e-05 3.773e-05 3.773e-05 0.00% Castro::enforce_consistent_e() 1 3.296e-05 3.296e-05 3.296e-05 0.00% Amr::writeSmallPlotFile() 1 2.46e-05 2.46e-05 2.46e-05 0.00% DistributionMapping::Distribute() 56 1.658e-05 1.658e-05 1.658e-05 0.00% Amr::initSubcycle() 1 9.287e-06 9.287e-06 9.287e-06 0.00% MLMG::buildFineMask() 11 6.175e-06 6.175e-06 6.175e-06 0.00% AmrLevel::checkPointPost() 3 5.677e-06 5.677e-06 5.677e-06 0.00% Castro::create_source_corrector() 10 4.653e-06 4.653e-06 4.653e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.317e-06 4.317e-06 4.317e-06 0.00% Castro::retry_advance_ctu() 10 4.207e-06 4.207e-06 4.207e-06 0.00% Gravity::set_mass_offset() 11 4.099e-06 4.099e-06 4.099e-06 0.00% Castro::FluxRegFineAdd() 10 3.564e-06 3.564e-06 3.564e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.16e-06 2.16e-06 2.16e-06 0.00% AmrLevel::checkPointPre() 3 1.957e-06 1.957e-06 1.957e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 10619 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.03-33-g9907ac197518) finalized Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.03-33-g9907ac197518) initialized Starting run at 08:24:17 UTC on 2022-03-29. Successfully read inputs file ... Castro git describe: 22.02-19-ga92ac477d AMReX git describe: 22.03-33-g9907ac197 Microphysics git describe: 22.03-13-gf58972c7 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.483810985 Restart time = 0.055007224 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.114387354 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.048299369 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.053107713 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.058279444 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.063361995 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.023665677 seconds Ending run at 08:24:18 UTC on 2022-03-29. Run time = 0.417031077 Run time without initialization = 0.361442274 Average number of zones advanced per microsecond: 3.626 Average number of zones advanced per microsecond per rank: 3.626 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.4171 ... 0.4171 ... 0.4171 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1316 0.1316 0.1316 31.56% VisMF::Read() 3 0.04345 0.04345 0.04345 10.42% MLCellLinOp::applyBC() 1946 0.03761 0.03761 0.03761 9.02% MLPoisson::Fsmooth() 1440 0.02544 0.02544 0.02544 6.10% VisMF::Write(FabArray) 1 0.02235 0.02235 0.02235 5.36% FabArray::setVal() 537 0.01633 0.01633 0.01633 3.92% StateData::FillBoundary(geom) 160 0.01128 0.01128 0.01128 2.70% MLCGSolver::bicgstab 36 0.009177 0.009177 0.009177 2.20% MultiFab::Dot() 484 0.009011 0.009011 0.009011 2.16% Castro::computeTemp() 30 0.007246 0.007246 0.007246 1.74% FillBoundary_nowait() 1766 0.007193 0.007193 0.007193 1.72% FabArray::ParallelCopy_nowait() 380 0.007048 0.007048 0.007048 1.69% StateDataPhysBCFunct::() 20 0.006889 0.006889 0.006889 1.65% Gravity::fill_multipole_BCs() 6 0.005989 0.005989 0.005989 1.44% MLCellLinOp::defineAuxData() 6 0.00538 0.00538 0.00538 1.29% MultiFab::LinComb() 690 0.005188 0.005188 0.005188 1.24% Castro::enforce_min_density() 30 0.004951 0.004951 0.004951 1.19% MLPoisson::Fapply() 500 0.00452 0.00452 0.00452 1.08% FabArray::setDomainBndry() 20 0.003992 0.003992 0.003992 0.96% Castro::expand_state() 5 0.00397 0.00397 0.00397 0.95% Castro::estTimeStep() 10 0.002995 0.002995 0.002995 0.72% MLMG::addInterpCorrection() 180 0.002985 0.002985 0.002985 0.72% Amr::restart() 1 0.002984 0.002984 0.002984 0.72% amrex::average_down 180 0.002793 0.002793 0.002793 0.67% MultiFab::Xpay() 258 0.002611 0.002611 0.002611 0.63% Castro::do_advance_ctu() 5 0.002583 0.002583 0.002583 0.62% Castro::reset_internal_energy() 30 0.002442 0.002442 0.002442 0.59% Castro::normalize_species() 30 0.002334 0.002334 0.002334 0.56% BndryData::define() 6 0.001952 0.001952 0.001952 0.47% Gravity::get_new_grav_vector() 5 0.001879 0.001879 0.001879 0.45% Castro::construct_new_gravity_source() 5 0.001578 0.001578 0.001578 0.38% MLCellLinOp::compGrad() 6 0.001413 0.001413 0.001413 0.34% Amr::writePlotFile() 1 0.001401 0.001401 0.001401 0.34% Castro::construct_old_gravity_source() 5 0.001363 0.001363 0.001363 0.33% Castro::enforce_speed_limit() 30 0.001224 0.001224 0.001224 0.29% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.001181 0.001181 0.001181 0.28% Gravity::get_old_grav_vector() 5 0.00103 0.00103 0.00103 0.25% MultiFab::Saxpy() 10 0.0009143 0.0009143 0.0009143 0.22% MLMG::ResNormInf() 42 0.0008535 0.0008535 0.0008535 0.20% MLCellLinOp::setLevelBC() 6 0.0008137 0.0008137 0.0008137 0.20% Gravity::actual_solve_with_mlmg() 6 0.0007657 0.0007657 0.0007657 0.18% MLMG::oneIter() 36 0.0007388 0.0007388 0.0007388 0.18% MLCellLinOp::prepareForSolve() 6 0.0006977 0.0006977 0.0006977 0.17% FabArray::mult() 22 0.0006487 0.0006487 0.0006487 0.16% Gravity::update_max_rhs() 6 0.0005124 0.0005124 0.0005124 0.12% MLCellLinOp::smooth() 720 0.0004813 0.0004813 0.0004813 0.12% FabArrayBase::CPC::define() 244 0.0004308 0.0004308 0.0004308 0.10% FabArrayBase::getCPC() 632 0.000427 0.000427 0.000427 0.10% MLMG::prepareForSolve() 6 0.0004264 0.0004264 0.0004264 0.10% FabArray::FillBoundary() 1766 0.0004146 0.0004146 0.0004146 0.10% Amr::InitAmr() 1 0.0003892 0.0003892 0.0003892 0.09% MultiFab::Copy() 6 0.0003696 0.0003696 0.0003696 0.09% FabArrayBase::getFB() 1766 0.0003013 0.0003013 0.0003013 0.07% main() 1 0.0002759 0.0002759 0.0002759 0.07% Castro::construct_new_gravity() 5 0.0002432 0.0002432 0.0002432 0.06% Gravity::solve_for_phi() 5 0.0002317 0.0002317 0.0002317 0.06% Gravity::actual_multilevel_solve() 1 0.0002248 0.0002248 0.0002248 0.05% MLCellLinOp::apply() 500 0.0002206 0.0002206 0.0002206 0.05% CGSolver::sxay() 690 0.0001993 0.0001993 0.0001993 0.05% MLMG::mgVcycle() 36 0.0001737 0.0001737 0.0001737 0.04% MultiFab::min() 5 0.0001617 0.0001617 0.0001617 0.04% FabArray::ParallelCopy() 380 0.0001434 0.0001434 0.0001434 0.03% MLCGSolver::ParallelAllReduce 659 0.0001382 0.0001382 0.0001382 0.03% MultiFab::max() 6 0.0001317 0.0001317 0.0001317 0.03% FillPatchIterator::Initialize 20 0.0001245 0.0001245 0.0001245 0.03% Castro::construct_new_source() 25 0.0001224 0.0001224 0.0001224 0.03% Amr::timeStep() 5 0.0001083 0.0001083 0.0001083 0.03% Amr::coarseTimeStep() 5 0.0001075 0.0001075 0.0001075 0.03% MLMG::MLRhsNormInf() 6 0.000105 0.000105 0.000105 0.03% MLCellLinOp::defineBC() 6 0.0001021 0.0001021 0.0001021 0.02% MLCellLinOp::correctionResidual() 216 9.176e-05 9.176e-05 9.176e-05 0.02% MLLinOp::defineGrids() 6 9.148e-05 9.148e-05 9.148e-05 0.02% Castro::post_timestep() 5 8.213e-05 8.213e-05 8.213e-05 0.02% Castro::advance() 5 7.978e-05 7.978e-05 7.978e-05 0.02% StateData::restartDoit() 4 7.421e-05 7.421e-05 7.421e-05 0.02% Castro::subcycle_advance_ctu() 5 7.26e-05 7.26e-05 7.26e-05 0.02% AmrLevel::restart() 1 7.234e-05 7.234e-05 7.234e-05 0.02% MLMG:computeResOfCorrection() 180 6.804e-05 6.804e-05 6.804e-05 0.02% FabArrayBase::FB::FB() 26 6.38e-05 6.38e-05 6.38e-05 0.02% MLMG::actualBottomSolve() 36 4.786e-05 4.786e-05 4.786e-05 0.01% MLMG::solve() 6 4.13e-05 4.13e-05 4.13e-05 0.01% MLMG::mgVcycle_down::0 36 4.067e-05 4.067e-05 4.067e-05 0.01% Castro::construct_old_source() 25 3.978e-05 3.978e-05 3.978e-05 0.01% Castro::create_source_corrector() 5 3.973e-05 3.973e-05 3.973e-05 0.01% MLMG::mgVcycle_down::1 36 3.89e-05 3.89e-05 3.89e-05 0.01% MLMG::mgVcycle_down::2 36 3.803e-05 3.803e-05 3.803e-05 0.01% MLMG::mgVcycle_down::4 36 3.609e-05 3.609e-05 3.609e-05 0.01% Castro::clean_state() 30 3.604e-05 3.604e-05 3.604e-05 0.01% Castro::initialize_advance() 5 3.562e-05 3.562e-05 3.562e-05 0.01% MLMG::mgVcycle_down::3 36 3.325e-05 3.325e-05 3.325e-05 0.01% MLLinOp::define() 6 2.938e-05 2.938e-05 2.938e-05 0.01% Castro::initialize_do_advance() 5 2.822e-05 2.822e-05 2.822e-05 0.01% MLMG::mgVcycle_up::4 36 2.69e-05 2.69e-05 2.69e-05 0.01% MLMG::mgVcycle_up::0 36 2.648e-05 2.648e-05 2.648e-05 0.01% Castro::post_restart() 1 2.621e-05 2.621e-05 2.621e-05 0.01% Castro::finalize_advance() 5 2.572e-05 2.572e-05 2.572e-05 0.01% Amr::writeSmallPlotFile() 1 2.557e-05 2.557e-05 2.557e-05 0.01% Castro::swap_state_time_levels() 5 2.504e-05 2.504e-05 2.504e-05 0.01% MLMG::mgVcycle_up::3 36 2.448e-05 2.448e-05 2.448e-05 0.01% MLCellLinOp::solutionResidual() 42 2.35e-05 2.35e-05 2.35e-05 0.01% MLMG::mgVcycle_up::2 36 2.342e-05 2.342e-05 2.342e-05 0.01% MLMG::mgVcycle_up::1 36 2.29e-05 2.29e-05 2.29e-05 0.01% Gravity::multilevel_solve_for_new_phi() 1 2.149e-05 2.149e-05 2.149e-05 0.01% Castro::finalize_do_advance() 5 1.958e-05 1.958e-05 1.958e-05 0.00% MLMG::computeResidual() 36 1.851e-05 1.851e-05 1.851e-05 0.00% MLMG::mgVcycle_bottom 36 1.608e-05 1.608e-05 1.608e-05 0.00% MLPoisson::define() 6 1.6e-05 1.6e-05 1.6e-05 0.00% Castro::do_new_sources() 5 1.529e-05 1.529e-05 1.529e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.492e-05 1.492e-05 1.492e-05 0.00% FillPatchSingleLevel 20 1.486e-05 1.486e-05 1.486e-05 0.00% makeSFC 30 1.422e-05 1.422e-05 1.422e-05 0.00% DistributionMapping::Distribute() 31 9.763e-06 9.763e-06 9.763e-06 0.00% Amr::initSubcycle() 1 9.187e-06 9.187e-06 9.187e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 8.474e-06 8.474e-06 8.474e-06 0.00% Castro::do_old_sources() 5 8.275e-06 8.275e-06 8.275e-06 0.00% Castro::construct_old_gravity() 5 6.205e-06 6.205e-06 6.205e-06 0.00% Castro::apply_source_to_state() 10 5.684e-06 5.684e-06 5.684e-06 0.00% MLPoisson::prepareForSolve() 6 4.547e-06 4.547e-06 4.547e-06 0.00% Gravity::swapTimeLevels() 5 4.069e-06 4.069e-06 4.069e-06 0.00% MLMG::buildFineMask() 6 3.687e-06 3.687e-06 3.687e-06 0.00% MLMG::computeMLResidual() 6 3.315e-06 3.315e-06 3.315e-06 0.00% Castro::computeNewDt() 5 3.137e-06 3.137e-06 3.137e-06 0.00% MLMG::getGradSolution() 6 2.803e-06 2.803e-06 2.803e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.778e-06 2.778e-06 2.778e-06 0.00% MLMG::MLResNormInf() 6 2.662e-06 2.662e-06 2.662e-06 0.00% Gravity::set_mass_offset() 6 2.258e-06 2.258e-06 2.258e-06 0.00% Castro::retry_advance_ctu() 5 1.833e-06 1.833e-06 1.833e-06 0.00% Castro::FluxRegFineAdd() 5 1.589e-06 1.589e-06 1.589e-06 0.00% AmrLevel::AmrLevel() 1 1.321e-06 1.321e-06 1.321e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.087e-06 1.087e-06 1.087e-06 0.00% Amr::init() 1 9.02e-07 9.02e-07 9.02e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.417 0.417 0.417 100.00% Amr::coarseTimeStep() 5 0.3375 0.3375 0.3375 80.93% Amr::timeStep() 5 0.3357 0.3357 0.3357 80.49% Castro::advance() 5 0.332 0.332 0.332 79.61% Castro::subcycle_advance_ctu() 5 0.3211 0.3211 0.3211 76.98% Castro::do_advance_ctu() 5 0.321 0.321 0.321 76.96% Castro::construct_new_gravity() 5 0.1447 0.1447 0.1447 34.70% Gravity::solve_phi_with_mlmg() 6 0.1408 0.1408 0.1408 33.77% Gravity::solve_for_phi() 5 0.1352 0.1352 0.1352 32.41% Gravity::actual_solve_with_mlmg() 6 0.1347 0.1347 0.1347 32.31% Castro::construct_ctu_hydro_source() 5 0.1316 0.1316 0.1316 31.56% MLMG::solve() 6 0.1222 0.1222 0.1222 29.31% MLMG::oneIter() 36 0.1143 0.1143 0.1143 27.41% MLMG::mgVcycle() 36 0.1136 0.1136 0.1136 27.23% MLCellLinOp::smooth() 720 0.05961 0.05961 0.05961 14.29% Amr::init() 1 0.05506 0.05506 0.05506 13.20% Amr::restart() 1 0.05506 0.05506 0.05506 13.20% MLCellLinOp::applyBC() 1946 0.04558 0.04558 0.04558 10.93% AmrLevel::restart() 1 0.04433 0.04433 0.04433 10.63% StateData::restartDoit() 4 0.04425 0.04425 0.04425 10.61% VisMF::Read() 3 0.04345 0.04345 0.04345 10.42% MLMG::mgVcycle_bottom 36 0.03255 0.03255 0.03255 7.81% MLMG::actualBottomSolve() 36 0.03254 0.03254 0.03254 7.80% MLCGSolver::bicgstab 36 0.03222 0.03222 0.03222 7.73% MLPoisson::Fsmooth() 1440 0.02544 0.02544 0.02544 6.10% FillPatchIterator::Initialize 20 0.02437 0.02437 0.02437 5.84% Amr::writePlotFile() 1 0.02375 0.02375 0.02375 5.69% VisMF::Write(FabArray) 1 0.02235 0.02235 0.02235 5.36% FillPatchSingleLevel 20 0.02025 0.02025 0.02025 4.86% Castro::clean_state() 30 0.01823 0.01823 0.01823 4.37% StateDataPhysBCFunct::() 20 0.01817 0.01817 0.01817 4.36% MLCellLinOp::apply() 500 0.01635 0.01635 0.01635 3.92% FabArray::setVal() 537 0.01633 0.01633 0.01633 3.92% MLMG::mgVcycle_down::0 36 0.01594 0.01594 0.01594 3.82% Castro::initialize_do_advance() 5 0.01489 0.01489 0.01489 3.57% MLMG::mgVcycle_up::0 36 0.01352 0.01352 0.01352 3.24% Castro::expand_state() 5 0.01199 0.01199 0.01199 2.87% StateData::FillBoundary(geom) 160 0.01128 0.01128 0.01128 2.70% Castro::initialize_advance() 5 0.01083 0.01083 0.01083 2.60% Castro::computeTemp() 30 0.009689 0.009689 0.009689 2.32% Gravity::get_new_grav_vector() 5 0.009301 0.009301 0.009301 2.23% MLCellLinOp::correctionResidual() 216 0.009227 0.009227 0.009227 2.21% MLPoisson::define() 6 0.009224 0.009224 0.009224 2.21% MultiFab::Dot() 484 0.009011 0.009011 0.009011 2.16% MLMG:computeResOfCorrection() 180 0.007995 0.007995 0.007995 1.92% FabArray::FillBoundary() 1766 0.007973 0.007973 0.007973 1.91% Castro::construct_old_gravity() 5 0.007839 0.007839 0.007839 1.88% Gravity::get_old_grav_vector() 5 0.007833 0.007833 0.007833 1.88% MLMG::mgVcycle_down::1 36 0.007739 0.007739 0.007739 1.86% FabArray::ParallelCopy() 380 0.007609 0.007609 0.007609 1.82% FillBoundary_nowait() 1766 0.007559 0.007559 0.007559 1.81% FabArray::ParallelCopy_nowait() 380 0.007465 0.007465 0.007465 1.79% MLMG::mgVcycle_down::2 36 0.007427 0.007427 0.007427 1.78% Castro::do_new_sources() 5 0.007395 0.007395 0.007395 1.77% MLMG::mgVcycle_down::3 36 0.007082 0.007082 0.007082 1.70% MLMG::mgVcycle_down::4 36 0.006761 0.006761 0.006761 1.62% Castro::post_restart() 1 0.006528 0.006528 0.006528 1.57% MLCellLinOp::defineAuxData() 6 0.0063 0.0063 0.0063 1.51% Gravity::multilevel_solve_for_new_phi() 1 0.006148 0.006148 0.006148 1.47% Gravity::actual_multilevel_solve() 1 0.006127 0.006127 0.006127 1.47% Gravity::fill_multipole_BCs() 6 0.005989 0.005989 0.005989 1.44% MLMG::mgVcycle_up::2 36 0.005763 0.005763 0.005763 1.38% MLMG::addInterpCorrection() 180 0.005761 0.005761 0.005761 1.38% MLMG::mgVcycle_up::1 36 0.005713 0.005713 0.005713 1.37% amrex::average_down 180 0.005557 0.005557 0.005557 1.33% MLMG::mgVcycle_up::3 36 0.005472 0.005472 0.005472 1.31% MLCGSolver::ParallelAllReduce 659 0.005454 0.005454 0.005454 1.31% MLMG::mgVcycle_up::4 36 0.005412 0.005412 0.005412 1.30% CGSolver::sxay() 690 0.005387 0.005387 0.005387 1.29% MultiFab::LinComb() 690 0.005188 0.005188 0.005188 1.24% Castro::enforce_min_density() 30 0.004951 0.004951 0.004951 1.19% MLPoisson::Fapply() 500 0.00452 0.00452 0.00452 1.08% Castro::do_old_sources() 5 0.004394 0.004394 0.004394 1.05% FabArray::setDomainBndry() 20 0.003992 0.003992 0.003992 0.96% Castro::post_timestep() 5 0.003578 0.003578 0.003578 0.86% MLMG::prepareForSolve() 6 0.003523 0.003523 0.003523 0.84% MLCellLinOp::solutionResidual() 42 0.003378 0.003378 0.003378 0.81% Castro::estTimeStep() 10 0.002995 0.002995 0.002995 0.72% MLMG::computeResidual() 36 0.002796 0.002796 0.002796 0.67% MLCellLinOp::defineBC() 6 0.002755 0.002755 0.002755 0.66% BndryData::define() 6 0.002653 0.002653 0.002653 0.64% MultiFab::Xpay() 258 0.002611 0.002611 0.002611 0.63% Castro::reset_internal_energy() 30 0.002442 0.002442 0.002442 0.59% Castro::normalize_species() 30 0.002334 0.002334 0.002334 0.56% Castro::computeNewDt() 5 0.001738 0.001738 0.001738 0.42% MLMG::getGradSolution() 6 0.001708 0.001708 0.001708 0.41% MLCellLinOp::compGrad() 6 0.001705 0.001705 0.001705 0.41% Castro::construct_new_source() 25 0.0017 0.0017 0.0017 0.41% Castro::construct_new_gravity_source() 5 0.001578 0.001578 0.001578 0.38% Castro::construct_old_source() 25 0.001403 0.001403 0.001403 0.34% Castro::construct_old_gravity_source() 5 0.001363 0.001363 0.001363 0.33% Castro::enforce_speed_limit() 30 0.001224 0.001224 0.001224 0.29% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.001181 0.001181 0.001181 0.28% Castro::apply_source_to_state() 10 0.00092 0.00092 0.00092 0.22% MultiFab::Saxpy() 10 0.0009143 0.0009143 0.0009143 0.22% Gravity::swapTimeLevels() 5 0.0008811 0.0008811 0.0008811 0.21% FabArrayBase::getCPC() 632 0.0008578 0.0008578 0.0008578 0.21% MLMG::ResNormInf() 42 0.0008535 0.0008535 0.0008535 0.20% MLCellLinOp::setLevelBC() 6 0.0008137 0.0008137 0.0008137 0.20% Gravity::update_max_rhs() 6 0.0007205 0.0007205 0.0007205 0.17% MLPoisson::prepareForSolve() 6 0.0007022 0.0007022 0.0007022 0.17% MLCellLinOp::prepareForSolve() 6 0.0006977 0.0006977 0.0006977 0.17% FabArray::mult() 22 0.0006487 0.0006487 0.0006487 0.16% MLMG::computeMLResidual() 6 0.0006039 0.0006039 0.0006039 0.14% FabArrayBase::CPC::define() 244 0.0004308 0.0004308 0.0004308 0.10% Amr::InitAmr() 1 0.0003984 0.0003984 0.0003984 0.10% MultiFab::Copy() 6 0.0003696 0.0003696 0.0003696 0.09% FabArrayBase::getFB() 1766 0.0003651 0.0003651 0.0003651 0.09% MultiFab::min() 5 0.0001617 0.0001617 0.0001617 0.04% MLLinOp::define() 6 0.0001525 0.0001525 0.0001525 0.04% MLMG::MLResNormInf() 6 0.0001332 0.0001332 0.0001332 0.03% MultiFab::max() 6 0.0001317 0.0001317 0.0001317 0.03% MLLinOp::defineGrids() 6 0.0001232 0.0001232 0.0001232 0.03% MLMG::MLRhsNormInf() 6 0.000105 0.000105 0.000105 0.03% FabArrayBase::FB::FB() 26 6.38e-05 6.38e-05 6.38e-05 0.02% Castro::create_source_corrector() 5 3.973e-05 3.973e-05 3.973e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 3.06e-05 3.06e-05 3.06e-05 0.01% Castro::finalize_advance() 5 2.731e-05 2.731e-05 2.731e-05 0.01% Amr::writeSmallPlotFile() 1 2.557e-05 2.557e-05 2.557e-05 0.01% Castro::swap_state_time_levels() 5 2.504e-05 2.504e-05 2.504e-05 0.01% makeSFC 30 2.212e-05 2.212e-05 2.212e-05 0.01% Castro::finalize_do_advance() 5 1.958e-05 1.958e-05 1.958e-05 0.00% DistributionMapping::Distribute() 31 9.763e-06 9.763e-06 9.763e-06 0.00% Amr::initSubcycle() 1 9.187e-06 9.187e-06 9.187e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.641e-06 4.641e-06 4.641e-06 0.00% MLMG::buildFineMask() 6 3.687e-06 3.687e-06 3.687e-06 0.00% Gravity::set_mass_offset() 6 2.258e-06 2.258e-06 2.258e-06 0.00% Castro::retry_advance_ctu() 5 1.833e-06 1.833e-06 1.833e-06 0.00% Castro::FluxRegFineAdd() 5 1.589e-06 1.589e-06 1.589e-06 0.00% AmrLevel::AmrLevel() 1 1.321e-06 1.321e-06 1.321e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.087e-06 1.087e-06 1.087e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 10619 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.03-33-g9907ac197518) finalized