Initializing AMReX (24.02-30-g2ecafcff4013)... Initializing CUDA... CUDA initialized with 1 device. AMReX (24.02-30-g2ecafcff4013) initialized Starting run at 09:30:08 UTC on 2024-02-28. Successfully read inputs file ... Castro git describe: 24.02-23-g9a07326d4 AMReX git describe: 24.02-30-g2ecafcff4 Microphysics git describe: 24.02-27-gf336cab4 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.046058396 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.025359399 seconds [Level 0 step 1] ADVANCE at time 0 with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.067568772 [STEP 1] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE at time 4.541742215e-05 with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.074042188 [STEP 2] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE at time 9.31057154e-05 with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.083285917 [STEP 3] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE at time 0.0001431784233 with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.069103647 [STEP 4] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE at time 0.0001957547666 with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.057182731 [STEP 5] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.072624627 secs. [Level 0 step 6] ADVANCE at time 0.0002509599271 with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.07047592 [STEP 6] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE at time 0.0003089253456 with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.07831333 [STEP 7] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE at time 0.000369789035 with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.0590262 [STEP 8] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE at time 0.000433695909 with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.05987786 [STEP 9] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE at time 0.0005007981265 with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.07077021 [STEP 10] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.044759416 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.059996621 seconds Ending run at 09:30:09 UTC on 2024-02-28. Run time = 0.995208111 Run time without initialization = 0.867874321 Average number of zones advanced per microsecond: 3.021 Average number of zones advanced per microsecond per rank: 3.021 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.9952 ... 0.9952 ... 0.9952 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.3038 0.3038 0.3038 30.53% VisMF::Write(FabArray) 11 0.1793 0.1793 0.1793 18.02% MLCellLinOp::applyBC() 4351 0.0824 0.0824 0.0824 8.28% Amr::writePlotFile() 2 0.0362 0.0362 0.0362 3.64% MLPoisson::Fsmooth() 3280 0.03378 0.03378 0.03378 3.39% Amr::checkPoint() 3 0.03274 0.03274 0.03274 3.29% FillBoundary_nowait() 3941 0.03069 0.03069 0.03069 3.08% StateData::FillBoundary(geom) 328 0.02745 0.02745 0.02745 2.76% amrex::Dot() 1114 0.02179 0.02179 0.02179 2.19% Castro::normalize_species() 62 0.02129 0.02129 0.02129 2.14% FabArray::norminf() 1061 0.02023 0.02023 0.02023 2.03% Castro::computeTemp() 63 0.01701 0.01701 0.01701 1.71% FabArray::ParallelCopy_nowait() 861 0.01393 0.01393 0.01393 1.40% FabArray::setVal() 1062 0.01351 0.01351 0.01351 1.36% FabArray::Saxpy() 1370 0.0132 0.0132 0.0132 1.33% Castro::enforce_min_density() 62 0.01293 0.01293 0.01293 1.30% StateDataPhysBCFunct::() 41 0.01234 0.01234 0.01234 1.24% amrex::Copy() 472 0.0109 0.0109 0.0109 1.10% MLPoisson::Fapply() 1060 0.01045 0.01045 0.01045 1.05% MLCellLinOp::defineAuxData() 11 0.01044 0.01044 0.01044 1.05% Gravity::fill_multipole_BCs() 11 0.009183 0.009183 0.009183 0.92% FabArray::Xpay() 739 0.007913 0.007913 0.007913 0.80% MLMG::addInterpCorrection() 410 0.007191 0.007191 0.007191 0.72% amrex::average_down 410 0.006322 0.006322 0.006322 0.64% Castro::estTimeStep() 21 0.00615 0.00615 0.00615 0.62% Castro::reset_internal_energy(MultiFab) 63 0.004887 0.004887 0.004887 0.49% BndryData::define() 11 0.0041 0.0041 0.0041 0.41% amrex::Add() 82 0.003606 0.003606 0.003606 0.36% Castro::construct_new_gravity_source() 10 0.00354 0.00354 0.00354 0.36% Castro::construct_old_gravity_source() 10 0.002946 0.002946 0.002946 0.30% Castro::enforce_speed_limit() 62 0.002327 0.002327 0.002327 0.23% check_for_negative_density() 10 0.001912 0.001912 0.001912 0.19% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001822 0.001822 0.001822 0.18% Castro::reset_internal_energy(Fab) 504 0.001767 0.001767 0.001767 0.18% MLCGSolver::bicgstab 82 0.001652 0.001652 0.001652 0.17% MLCellLinOp::setLevelBC() 11 0.001622 0.001622 0.001622 0.16% Gravity::actual_solve_with_mlmg() 11 0.001588 0.001588 0.001588 0.16% Castro::initData() 1 0.001544 0.001544 0.001544 0.16% FabArray::mult() 43 0.001398 0.001398 0.001398 0.14% MLCellLinOp::prepareForSolve() 11 0.001378 0.001378 0.001378 0.14% FabArray::setDomainBndry() 41 0.001374 0.001374 0.001374 0.14% MultiFab::contains_nan() 20 0.001308 0.001308 0.001308 0.13% MLCellLinOp::smooth() 1640 0.001132 0.001132 0.001132 0.11% MLCellLinOp::compGrad() 11 0.001107 0.001107 0.001107 0.11% MLMG::prepareForSolve() 11 0.0009851 0.0009851 0.0009851 0.10% FabArray::FillBoundary() 3941 0.000858 0.000858 0.000858 0.09% FabArrayBase::getCPC() 1323 0.0008075 0.0008075 0.0008075 0.08% FabArrayBase::CPC::define() 454 0.0006864 0.0006864 0.0006864 0.07% FabArrayBase::getFB() 3941 0.0006291 0.0006291 0.0006291 0.06% Gravity::get_new_grav_vector() 11 0.0006253 0.0006253 0.0006253 0.06% Amr::InitAmr() 1 0.0005745 0.0005745 0.0005745 0.06% Amr::coarseTimeStep() 10 0.000511 0.000511 0.000511 0.05% Gravity::get_old_grav_vector() 10 0.0004863 0.0004863 0.0004863 0.05% MLCellLinOp::apply() 1060 0.0004274 0.0004274 0.0004274 0.04% AmrLevel::FillPatch() 41 0.0004165 0.0004165 0.0004165 0.04% MLCGSolver::ParallelAllReduce 1832 0.0003644 0.0003644 0.0003644 0.04% MultiFab::max() 11 0.0003349 0.0003349 0.0003349 0.03% main() 1 0.0003072 0.0003072 0.0003072 0.03% MLCellLinOp::defineBC() 11 0.0002878 0.0002878 0.0002878 0.03% FabArray::ParallelCopy() 861 0.0002476 0.0002476 0.0002476 0.02% FillPatchIterator::Initialize 41 0.0002119 0.0002119 0.0002119 0.02% MLMG::mgVcycle() 82 0.0002033 0.0002033 0.0002033 0.02% Castro::subcycle_advance_ctu() 10 0.0001846 0.0001846 0.0001846 0.02% MLLinOp::defineGrids() 11 0.0001768 0.0001768 0.0001768 0.02% MLCellLinOp::correctionResidual() 410 0.0001733 0.0001733 0.0001733 0.02% Amr::timeStep() 10 0.0001637 0.0001637 0.0001637 0.02% Castro::create_source_corrector() 10 0.0001637 0.0001637 0.0001637 0.02% StateData::checkPoint() 12 0.0001375 0.0001375 0.0001375 0.01% Gravity::update_max_rhs() 11 0.0001286 0.0001286 0.0001286 0.01% MLMG:computeResOfCorrection() 410 0.0001244 0.0001244 0.0001244 0.01% Gravity::solve_for_phi() 10 0.0001177 0.0001177 0.0001177 0.01% Castro::Castro() 1 9.386e-05 9.386e-05 9.386e-05 0.01% FabArrayBase::FB::FB() 56 9.084e-05 9.084e-05 9.084e-05 0.01% MLMG::actualBottomSolve() 82 8.926e-05 8.926e-05 8.926e-05 0.01% Castro::initialize_advance() 10 8.605e-05 8.605e-05 8.605e-05 0.01% MLMG::mgVcycle_down::0 82 8.524e-05 8.524e-05 8.524e-05 0.01% MLMG::mgVcycle_down::1 82 8.154e-05 8.154e-05 8.154e-05 0.01% MLMG::mgVcycle_down::2 82 7.88e-05 7.88e-05 7.88e-05 0.01% AmrLevel::checkPoint() 3 7.729e-05 7.729e-05 7.729e-05 0.01% Castro::clean_state() 62 7.686e-05 7.686e-05 7.686e-05 0.01% Castro::post_timestep() 10 7.602e-05 7.602e-05 7.602e-05 0.01% MLMG::mgVcycle_down::4 82 7.565e-05 7.565e-05 7.565e-05 0.01% MLMG::mgVcycle_down::3 82 7.324e-05 7.324e-05 7.324e-05 0.01% Castro::construct_new_source() 50 7.135e-05 7.135e-05 7.135e-05 0.01% MLMG::solve() 11 7.043e-05 7.043e-05 7.043e-05 0.01% Castro::advance() 10 6.961e-05 6.961e-05 6.961e-05 0.01% Castro::enforce_consistent_e() 1 6.301e-05 6.301e-05 6.301e-05 0.01% Castro::finalize_advance() 10 6.19e-05 6.19e-05 6.19e-05 0.01% MLMG::mgVcycle_up::4 82 6.169e-05 6.169e-05 6.169e-05 0.01% Castro::initialize_do_advance() 10 6.01e-05 6.01e-05 6.01e-05 0.01% MLMG::oneIter() 82 5.41e-05 5.41e-05 5.41e-05 0.01% Castro::do_advance_ctu() 10 5.372e-05 5.372e-05 5.372e-05 0.01% MLMG::mgVcycle_up::3 82 5.162e-05 5.162e-05 5.162e-05 0.01% MLMG::mgVcycle_up::0 82 5.157e-05 5.157e-05 5.157e-05 0.01% MLCellLinOp::solutionResidual() 93 5.057e-05 5.057e-05 5.057e-05 0.01% MLMG::mgVcycle_up::1 82 5.031e-05 5.031e-05 5.031e-05 0.01% MLMG::mgVcycle_up::2 82 4.958e-05 4.958e-05 4.958e-05 0.00% FillPatchIterator::FillFromLevel0() 41 4.764e-05 4.764e-05 4.764e-05 0.00% Castro::do_new_sources() 10 4.532e-05 4.532e-05 4.532e-05 0.00% Castro::finalize_do_advance() 10 4.27e-05 4.27e-05 4.27e-05 0.00% Castro::swap_state_time_levels() 10 3.612e-05 3.612e-05 3.612e-05 0.00% Castro::initMFs() 1 3.572e-05 3.572e-05 3.572e-05 0.00% StateData::define() 4 3.526e-05 3.526e-05 3.526e-05 0.00% Amr::writeSmallPlotFile() 1 3.509e-05 3.509e-05 3.509e-05 0.00% MLMG::mgVcycle_bottom 82 3.496e-05 3.496e-05 3.496e-05 0.00% FillPatchSingleLevel 41 3.394e-05 3.394e-05 3.394e-05 0.00% MLMG::ResNormInf() 93 3.376e-05 3.376e-05 3.376e-05 0.00% MLMG::computeResidual() 82 3.345e-05 3.345e-05 3.345e-05 0.00% Castro::construct_new_gravity() 10 3.195e-05 3.195e-05 3.195e-05 0.00% Castro::buildMetrics() 1 2.745e-05 2.745e-05 2.745e-05 0.00% Amr::defBaseLevel() 1 2.646e-05 2.646e-05 2.646e-05 0.00% makeSFC 55 2.565e-05 2.565e-05 2.565e-05 0.00% MLPoisson::define() 11 2.418e-05 2.418e-05 2.418e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.312e-05 2.312e-05 2.312e-05 0.00% Castro::do_old_sources() 10 2.172e-05 2.172e-05 2.172e-05 0.00% Castro::construct_old_source() 50 1.865e-05 1.865e-05 1.865e-05 0.00% Amr::FinalizeInit() 1 1.86e-05 1.86e-05 1.86e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.816e-05 1.816e-05 1.816e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.728e-05 1.728e-05 1.728e-05 0.00% DistributionMapping::Distribute() 56 1.574e-05 1.574e-05 1.574e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.501e-05 1.501e-05 1.501e-05 0.00% MLPoisson::prepareForSolve() 11 1.442e-05 1.442e-05 1.442e-05 0.00% MLLinOp::define() 11 1.431e-05 1.431e-05 1.431e-05 0.00% Castro::construct_old_gravity() 10 1.24e-05 1.24e-05 1.24e-05 0.00% Castro::check_for_nan() 20 1.193e-05 1.193e-05 1.193e-05 0.00% Castro::apply_source_to_state() 20 1.181e-05 1.181e-05 1.181e-05 0.00% MLMG::computeMLResidual() 11 9.787e-06 9.787e-06 9.787e-06 0.00% Castro::post_init() 1 9.33e-06 9.33e-06 9.33e-06 0.00% Amr::initSubcycle() 1 9.038e-06 9.038e-06 9.038e-06 0.00% Gravity::swapTimeLevels() 10 8.781e-06 8.781e-06 8.781e-06 0.00% Gravity::actual_multilevel_solve() 1 7.964e-06 7.964e-06 7.964e-06 0.00% Castro::computeNewDt() 9 6.63e-06 6.63e-06 6.63e-06 0.00% MLMG::getGradSolution() 11 6.44e-06 6.44e-06 6.44e-06 0.00% Gravity::set_mass_offset() 11 5.675e-06 5.675e-06 5.675e-06 0.00% AmrLevel::checkPointPost() 3 5.516e-06 5.516e-06 5.516e-06 0.00% Castro::expand_state() 10 5.496e-06 5.496e-06 5.496e-06 0.00% Amr::InitializeInit() 1 4.649e-06 4.649e-06 4.649e-06 0.00% Castro::retry_advance_ctu() 10 4.331e-06 4.331e-06 4.331e-06 0.00% MLMG::MLRhsNormInf() 11 4.095e-06 4.095e-06 4.095e-06 0.00% MLMG::MLResNormInf() 11 3.799e-06 3.799e-06 3.799e-06 0.00% AmrLevel::checkPointPre() 3 3.001e-06 3.001e-06 3.001e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.943e-06 2.943e-06 2.943e-06 0.00% Castro::FluxRegCrseInit 10 2.884e-06 2.884e-06 2.884e-06 0.00% Castro::computeInitialDt() 2 2.54e-06 2.54e-06 2.54e-06 0.00% Amr::init() 1 2.382e-06 2.382e-06 2.382e-06 0.00% Castro::FluxRegFineAdd() 10 2.168e-06 2.168e-06 2.168e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.121e-06 2.121e-06 2.121e-06 0.00% Castro::post_regrid() 1 1.172e-06 1.172e-06 1.172e-06 0.00% Amr::initialInit() 1 1.013e-06 1.013e-06 1.013e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.9952 0.9952 0.9952 100.00% Amr::coarseTimeStep() 10 0.8076 0.8076 0.8076 81.15% Amr::timeStep() 10 0.6863 0.6863 0.6863 68.95% Castro::advance() 10 0.6744 0.6744 0.6744 67.76% Castro::subcycle_advance_ctu() 10 0.6606 0.6606 0.6606 66.37% Castro::do_advance_ctu() 10 0.6604 0.6604 0.6604 66.35% Castro::construct_ctu_hydro_source() 10 0.3151 0.3151 0.3151 31.66% Gravity::solve_phi_with_mlmg() 11 0.2984 0.2984 0.2984 29.98% Gravity::actual_solve_with_mlmg() 11 0.2887 0.2887 0.2887 29.01% Castro::construct_new_gravity() 10 0.2695 0.2695 0.2695 27.07% MLMG::solve() 11 0.2663 0.2663 0.2663 26.76% Gravity::solve_for_phi() 10 0.2526 0.2526 0.2526 25.38% MLMG::oneIter() 82 0.2506 0.2506 0.2506 25.18% MLMG::mgVcycle() 82 0.2469 0.2469 0.2469 24.81% VisMF::Write(FabArray) 11 0.1793 0.1793 0.1793 18.02% Amr::checkPoint() 3 0.1636 0.1636 0.1636 16.43% AmrLevel::checkPoint() 3 0.1308 0.1308 0.1308 13.14% StateData::checkPoint() 12 0.1307 0.1307 0.1307 13.14% Amr::init() 1 0.1266 0.1266 0.1266 12.72% MLCellLinOp::smooth() 1640 0.123 0.123 0.123 12.36% MLCellLinOp::applyBC() 4351 0.1147 0.1147 0.1147 11.52% Amr::writePlotFile() 2 0.08549 0.08549 0.08549 8.59% MLMG::mgVcycle_bottom 82 0.074 0.074 0.074 7.44% MLMG::actualBottomSolve() 82 0.07397 0.07397 0.07397 7.43% MLCGSolver::bicgstab 82 0.07312 0.07312 0.07312 7.35% Castro::clean_state() 62 0.05931 0.05931 0.05931 5.96% Amr::initialInit() 1 0.05511 0.05511 0.05511 5.54% Amr::FinalizeInit() 1 0.05001 0.05001 0.05001 5.03% AmrLevel::FillPatch() 41 0.04982 0.04982 0.04982 5.01% Castro::post_init() 1 0.04858 0.04858 0.04858 4.88% Gravity::multilevel_solve_for_new_phi() 1 0.04618 0.04618 0.04618 4.64% Gravity::actual_multilevel_solve() 1 0.04616 0.04616 0.04616 4.64% FillPatchIterator::Initialize 41 0.04551 0.04551 0.04551 4.57% FillPatchIterator::FillFromLevel0() 41 0.04393 0.04393 0.04393 4.41% FillPatchSingleLevel 41 0.04388 0.04388 0.04388 4.41% StateDataPhysBCFunct::() 41 0.03978 0.03978 0.03978 4.00% MLCellLinOp::apply() 1060 0.03697 0.03697 0.03697 3.71% MLMG::mgVcycle_down::0 82 0.03491 0.03491 0.03491 3.51% MLPoisson::Fsmooth() 3280 0.03378 0.03378 0.03378 3.39% FabArray::FillBoundary() 3941 0.03227 0.03227 0.03227 3.24% FillBoundary_nowait() 3941 0.03141 0.03141 0.03141 3.16% StateData::FillBoundary(geom) 328 0.02745 0.02745 0.02745 2.76% MLMG::mgVcycle_up::0 82 0.02637 0.02637 0.02637 2.65% Castro::computeTemp() 63 0.02366 0.02366 0.02366 2.38% Castro::initialize_do_advance() 10 0.02183 0.02183 0.02183 2.19% amrex::Dot() 1114 0.02179 0.02179 0.02179 2.19% Castro::normalize_species() 62 0.02129 0.02129 0.02129 2.14% MLMG:computeResOfCorrection() 410 0.02071 0.02071 0.02071 2.08% MLCellLinOp::correctionResidual() 410 0.02059 0.02059 0.02059 2.07% FabArray::norminf() 1061 0.02023 0.02023 0.02023 2.03% Castro::do_old_sources() 10 0.01992 0.01992 0.01992 2.00% Gravity::get_new_grav_vector() 11 0.01875 0.01875 0.01875 1.88% MLPoisson::define() 11 0.01759 0.01759 0.01759 1.77% MLMG::mgVcycle_down::1 82 0.01694 0.01694 0.01694 1.70% MLMG::mgVcycle_down::2 82 0.01575 0.01575 0.01575 1.58% Castro::construct_old_gravity() 10 0.01568 0.01568 0.01568 1.58% Gravity::get_old_grav_vector() 10 0.01567 0.01567 0.01567 1.57% MLMG::mgVcycle_down::3 82 0.01533 0.01533 0.01533 1.54% MLMG::mgVcycle_down::4 82 0.01524 0.01524 0.01524 1.53% FabArray::ParallelCopy() 861 0.015 0.015 0.015 1.51% Castro::do_new_sources() 10 0.01491 0.01491 0.01491 1.50% FabArray::ParallelCopy_nowait() 861 0.01476 0.01476 0.01476 1.48% FabArray::setVal() 1062 0.01351 0.01351 0.01351 1.36% FabArray::Saxpy() 1370 0.0132 0.0132 0.0132 1.33% Castro::initialize_advance() 10 0.01314 0.01314 0.01314 1.32% MLCGSolver::ParallelAllReduce 1832 0.01313 0.01313 0.01313 1.32% Castro::enforce_min_density() 62 0.01293 0.01293 0.01293 1.30% MLMG::addInterpCorrection() 410 0.01263 0.01263 0.01263 1.27% MLMG::mgVcycle_up::1 82 0.01227 0.01227 0.01227 1.23% MLMG::mgVcycle_up::4 82 0.01217 0.01217 0.01217 1.22% Castro::expand_state() 10 0.01208 0.01208 0.01208 1.21% MLMG::mgVcycle_up::2 82 0.01198 0.01198 0.01198 1.20% MLCellLinOp::defineAuxData() 11 0.01189 0.01189 0.01189 1.19% amrex::average_down 410 0.01183 0.01183 0.01183 1.19% MLMG::mgVcycle_up::3 82 0.01175 0.01175 0.01175 1.18% Castro::post_timestep() 10 0.01169 0.01169 0.01169 1.18% amrex::Copy() 472 0.0109 0.0109 0.0109 1.10% MLPoisson::Fapply() 1060 0.01045 0.01045 0.01045 1.05% Gravity::fill_multipole_BCs() 11 0.009413 0.009413 0.009413 0.95% FabArray::Xpay() 739 0.007913 0.007913 0.007913 0.80% MLCellLinOp::solutionResidual() 93 0.007909 0.007909 0.007909 0.79% Castro::reset_internal_energy(MultiFab) 63 0.006653 0.006653 0.006653 0.67% MLMG::computeResidual() 82 0.006543 0.006543 0.006543 0.66% Castro::estTimeStep() 21 0.00615 0.00615 0.00615 0.62% MLCellLinOp::defineBC() 11 0.005432 0.005432 0.005432 0.55% MLMG::prepareForSolve() 11 0.005268 0.005268 0.005268 0.53% BndryData::define() 11 0.005144 0.005144 0.005144 0.52% Amr::InitializeInit() 1 0.005093 0.005093 0.005093 0.51% Amr::defBaseLevel() 1 0.005089 0.005089 0.005089 0.51% Castro::initData() 1 0.0044 0.0044 0.0044 0.44% Castro::construct_new_source() 50 0.003611 0.003611 0.003611 0.36% amrex::Add() 82 0.003606 0.003606 0.003606 0.36% Castro::construct_new_gravity_source() 10 0.00354 0.00354 0.00354 0.36% Castro::construct_old_source() 50 0.002965 0.002965 0.002965 0.30% Castro::construct_old_gravity_source() 10 0.002946 0.002946 0.002946 0.30% Castro::computeNewDt() 9 0.002608 0.002608 0.002608 0.26% Castro::finalize_do_advance() 10 0.002545 0.002545 0.002545 0.26% Castro::enforce_speed_limit() 62 0.002327 0.002327 0.002327 0.23% MLMG::ResNormInf() 93 0.0022 0.0022 0.0022 0.22% check_for_negative_density() 10 0.001912 0.001912 0.001912 0.19% Castro::apply_source_to_state() 20 0.00187 0.00187 0.00187 0.19% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001822 0.001822 0.001822 0.18% Castro::reset_internal_energy(Fab) 504 0.001767 0.001767 0.001767 0.18% MLCellLinOp::setLevelBC() 11 0.001622 0.001622 0.001622 0.16% MLMG::getGradSolution() 11 0.001609 0.001609 0.001609 0.16% MLCellLinOp::compGrad() 11 0.001602 0.001602 0.001602 0.16% FabArrayBase::getCPC() 1323 0.001494 0.001494 0.001494 0.15% MLMG::computeMLResidual() 11 0.00141 0.00141 0.00141 0.14% FabArray::mult() 43 0.001398 0.001398 0.001398 0.14% MLPoisson::prepareForSolve() 11 0.001392 0.001392 0.001392 0.14% MLCellLinOp::prepareForSolve() 11 0.001378 0.001378 0.001378 0.14% FabArray::setDomainBndry() 41 0.001374 0.001374 0.001374 0.14% Castro::check_for_nan() 20 0.00132 0.00132 0.00132 0.13% MultiFab::contains_nan() 20 0.001308 0.001308 0.001308 0.13% Castro::post_regrid() 1 0.001133 0.001133 0.001133 0.11% Castro::computeInitialDt() 2 0.001048 0.001048 0.001048 0.11% Gravity::update_max_rhs() 11 0.000999 0.000999 0.000999 0.10% FabArrayBase::getFB() 3941 0.00072 0.00072 0.00072 0.07% FabArrayBase::CPC::define() 454 0.0006864 0.0006864 0.0006864 0.07% Castro::finalize_advance() 10 0.0006087 0.0006087 0.0006087 0.06% Castro::Castro() 1 0.0006051 0.0006051 0.0006051 0.06% Amr::InitAmr() 1 0.0005835 0.0005835 0.0005835 0.06% Gravity::swapTimeLevels() 10 0.0004635 0.0004635 0.0004635 0.05% MLMG::MLResNormInf() 11 0.000341 0.000341 0.000341 0.03% MultiFab::max() 11 0.0003349 0.0003349 0.0003349 0.03% Castro::buildMetrics() 1 0.0002983 0.0002983 0.0002983 0.03% MLLinOp::define() 11 0.0002484 0.0002484 0.0002484 0.02% MLLinOp::defineGrids() 11 0.0002341 0.0002341 0.0002341 0.02% MLMG::MLRhsNormInf() 11 0.0002303 0.0002303 0.0002303 0.02% Castro::create_source_corrector() 10 0.0001637 0.0001637 0.0001637 0.02% FabArrayBase::FB::FB() 56 9.084e-05 9.084e-05 9.084e-05 0.01% Castro::enforce_consistent_e() 1 6.301e-05 6.301e-05 6.301e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.517e-05 5.517e-05 5.517e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.342e-05 5.342e-05 5.342e-05 0.01% makeSFC 55 4.016e-05 4.016e-05 4.016e-05 0.00% Castro::swap_state_time_levels() 10 3.612e-05 3.612e-05 3.612e-05 0.00% Castro::initMFs() 1 3.572e-05 3.572e-05 3.572e-05 0.00% StateData::define() 4 3.526e-05 3.526e-05 3.526e-05 0.00% Amr::writeSmallPlotFile() 1 3.509e-05 3.509e-05 3.509e-05 0.00% DistributionMapping::Distribute() 56 1.574e-05 1.574e-05 1.574e-05 0.00% Amr::initSubcycle() 1 9.038e-06 9.038e-06 9.038e-06 0.00% Gravity::set_mass_offset() 11 5.675e-06 5.675e-06 5.675e-06 0.00% AmrLevel::checkPointPost() 3 5.516e-06 5.516e-06 5.516e-06 0.00% Castro::retry_advance_ctu() 10 4.331e-06 4.331e-06 4.331e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.169e-06 4.169e-06 4.169e-06 0.00% AmrLevel::checkPointPre() 3 3.001e-06 3.001e-06 3.001e-06 0.00% Castro::FluxRegCrseInit 10 2.884e-06 2.884e-06 2.884e-06 0.00% Castro::FluxRegFineAdd() 10 2.168e-06 2.168e-06 2.168e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.121e-06 2.121e-06 2.121e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Device Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Arena::Initialize() 1 1 97 MiB 9042 MiB Castro::initMFs() 48 48 67 MiB 68 MiB Castro::swap_state_time_levels() 32 32 48 MiB 55 MiB StateData::define() 32 32 55 MiB 55 MiB FillPatchIterator::Initialize 328 328 969 KiB 39 MiB Castro::initialize_do_advance() 80 80 25 MiB 39 MiB ResizeRandomSeed 1 1 30 MiB 30 MiB Amr::writePlotFile() 16 16 1442 KiB 28 MiB Castro::initialize_advance() 80 80 15 MiB 23 MiB Castro::buildMetrics() 32 32 15 MiB 15 MiB Castro::Castro() 48 48 7615 KiB 14 MiB MLMG::prepareForSolve() 660 660 3290 KiB 12 MiB Gravity::get_new_grav_vector() 91 91 195 KiB 10 MiB Gravity::get_old_grav_vector() 80 80 161 KiB 10 MiB Gravity::multilevel_solve_for_new_phi() 24 24 7522 KiB 7586 KiB Gravity::fill_multipole_BCs() 154 154 15 KiB 2053 KiB Gravity::update_max_rhs() 88 88 1945 B 2048 KiB Gravity::solve_for_phi() 80 80 519 KiB 2048 KiB Gravity::actual_multilevel_solve() 8 8 94 KiB 2048 KiB BndryData::define() 1056 1056 301 KiB 1095 KiB MLCellLinOp::defineAuxData() 1716 1716 191 KiB 671 KiB Castro::estTimeStep() 21 21 3012 B 480 KiB VisMF::Write(FabArray) 656 656 3119 B 320 KiB Castro::normalize_species() 62 62 6963 B 320 KiB amrex::average_down 1067 1067 1545 B 257 KiB MLMG::addInterpCorrection() 1066 1066 1086 B 257 KiB amrex::Dot() 1360 1360 3222 B 160 KiB FabArray::norminf() 1143 1143 3140 B 160 KiB check_for_negative_density() 10 10 302 B 160 KiB Castro::initData() 1 1 47 B 160 KiB MultiFab::max() 11 11 52 B 160 KiB FabArray::setVal() 106 106 25 KiB 30 KiB MultiFab::contains_nan() 20 20 25 B 20 KiB MLPoisson::Fsmooth() 132 132 3211 B 12 KiB FabArray::ParallelCopy_nowait() 861 861 40 B 10 KiB FillBoundary_nowait() 760 760 269 B 9648 B MLCellLinOp::applyBC() 8702 8702 206 B 9344 B MLCellLinOp::prepareForSolve() 66 66 3 B 7792 B amrex::Copy() 100 100 3878 B 6144 B StateData::FillBoundary(geom) 1992 1992 41 B 2688 B FabArray::setVal(val, thecmd, scomp, ncomp) 132 132 1 B 1616 B MLCellLinOp::defineBC() 66 66 339 B 1248 B MLCGSolver::bicgstab 410 410 88 B 1216 B ------------------------------------------------------------------------------ Managed Memory Usage: ----------------------------------------------------------------- Name Nalloc Nfree AvgMem MaxMem ----------------------------------------------------------------- The_Managed_Arena::Initialize() 1 1 530 B 8192 KiB ----------------------------------------------------------------- Pinned Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Pinned_Arena::Initialize() 1 1 37 KiB 8192 KiB VisMF::Write(FabArray) 744 744 395 KiB 3584 KiB FabArray::setVal() 106 106 25 KiB 30 KiB MLPoisson::Fsmooth() 132 132 3211 B 12 KiB FabArray::ParallelCopy_nowait() 861 861 40 B 10 KiB FillBoundary_nowait() 760 760 269 B 9648 B MLCellLinOp::applyBC() 4351 4351 204 B 9328 B MLCellLinOp::prepareForSolve() 66 66 3 B 7792 B amrex::Copy() 100 100 3878 B 6144 B Gravity::get_new_grav_vector() 3 3 2903 B 3072 B Gravity::fill_multipole_BCs() 33 33 4 B 2832 B StateData::FillBoundary(geom) 1992 1992 41 B 2688 B amrex::average_down 83 83 617 B 1648 B FabArray::setVal(val, thecmd, scomp, ncomp) 132 132 1 B 1616 B MLMG::addInterpCorrection() 82 82 2 B 1024 B MLMG::prepareForSolve() 11 11 274 B 1024 B MLCellLinOp::setLevelBC() 66 66 0 B 768 B amrex::Dot() 1360 1360 23 B 400 B FabArray::norminf() 1143 1143 9 B 144 B Castro::estTimeStep() 21 21 0 B 32 B check_for_negative_density() 10 10 0 B 16 B MultiFab::max() 11 11 0 B 16 B MultiFab::contains_nan() 20 20 0 B 16 B Castro::normalize_species() 62 62 0 B 16 B Castro::initData() 1 1 0 B 16 B ------------------------------------------------------------------------------ Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2167 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (24.02-30-g2ecafcff4013) finalized Initializing AMReX (24.02-30-g2ecafcff4013)... Initializing CUDA... CUDA initialized with 1 device. AMReX (24.02-30-g2ecafcff4013) initialized Starting run at 09:30:10 UTC on 2024-02-28. Successfully read inputs file ... Castro git describe: 24.02-23-g9a07326d4 AMReX git describe: 24.02-30-g2ecafcff4 Microphysics git describe: 24.02-27-gf336cab4 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.551249806 Restart time = 0.072980116 seconds. [Level 0 step 6] ADVANCE at time 0.0002509599271 with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.070017305 [STEP 6] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE at time 0.0003089253456 with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.047941065 [STEP 7] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE at time 0.000369789035 with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.071349991 [STEP 8] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE at time 0.000433695909 with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.081242831 [STEP 9] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE at time 0.0005007981265 with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.070686762 [STEP 10] FAB kilobyte spread across MPI nodes: [368280 ... 368280] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.029337741 seconds Ending run at 09:30:10 UTC on 2024-02-28. Run time = 0.444759243 Run time without initialization = 0.371072272 Average number of zones advanced per microsecond: 3.532 Average number of zones advanced per microsecond per rank: 3.532 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.4448 ... 0.4448 ... 0.4448 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1375 0.1375 0.1375 30.92% VisMF::Read() 3 0.06131 0.06131 0.06131 13.78% MLCellLinOp::applyBC() 1910 0.03587 0.03587 0.03587 8.06% VisMF::Write(FabArray) 1 0.02673 0.02673 0.02673 6.01% MLPoisson::Fsmooth() 1440 0.02153 0.02153 0.02153 4.84% StateData::FillBoundary(geom) 160 0.01329 0.01329 0.01329 2.99% FillBoundary_nowait() 1730 0.01291 0.01291 0.01291 2.90% Castro::normalize_species() 30 0.01131 0.01131 0.01131 2.54% amrex::Dot() 484 0.009263 0.009263 0.009263 2.08% FabArray::norminf() 465 0.008715 0.008715 0.008715 1.96% Castro::computeTemp() 30 0.007379 0.007379 0.007379 1.66% FabArray::setVal() 501 0.006611 0.006611 0.006611 1.49% Castro::enforce_min_density() 30 0.00641 0.00641 0.00641 1.44% FabArray::ParallelCopy_nowait() 380 0.006242 0.006242 0.006242 1.40% FabArray::Saxpy() 597 0.005847 0.005847 0.005847 1.31% Gravity::fill_multipole_BCs() 6 0.005756 0.005756 0.005756 1.29% MLCellLinOp::defineAuxData() 6 0.005651 0.005651 0.005651 1.27% amrex::Copy() 221 0.005447 0.005447 0.005447 1.22% StateDataPhysBCFunct::() 20 0.00508 0.00508 0.00508 1.14% Amr::restart() 1 0.004819 0.004819 0.004819 1.08% MLPoisson::Fapply() 464 0.00452 0.00452 0.00452 1.02% FabArray::Xpay() 325 0.003491 0.003491 0.003491 0.78% MLMG::addInterpCorrection() 180 0.003191 0.003191 0.003191 0.72% amrex::average_down 180 0.003041 0.003041 0.003041 0.68% Castro::estTimeStep() 10 0.00303 0.00303 0.00303 0.68% Amr::writePlotFile() 1 0.002454 0.002454 0.002454 0.55% Castro::reset_internal_energy(MultiFab) 30 0.002264 0.002264 0.002264 0.51% BndryData::define() 6 0.002215 0.002215 0.002215 0.50% Castro::construct_new_gravity_source() 5 0.001861 0.001861 0.001861 0.42% amrex::Add() 36 0.001624 0.001624 0.001624 0.37% Castro::construct_old_gravity_source() 5 0.001533 0.001533 0.001533 0.34% Castro::enforce_speed_limit() 30 0.001346 0.001346 0.001346 0.30% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009769 0.0009769 0.0009769 0.22% check_for_negative_density() 5 0.0009717 0.0009717 0.0009717 0.22% Castro::reset_internal_energy(Fab) 240 0.000902 0.000902 0.000902 0.20% Gravity::actual_solve_with_mlmg() 6 0.0008931 0.0008931 0.0008931 0.20% MLCellLinOp::setLevelBC() 6 0.0008761 0.0008761 0.0008761 0.20% MLCellLinOp::prepareForSolve() 6 0.0007637 0.0007637 0.0007637 0.17% FabArray::setDomainBndry() 20 0.0007167 0.0007167 0.0007167 0.16% MLCGSolver::bicgstab 36 0.0007154 0.0007154 0.0007154 0.16% FabArray::mult() 22 0.0007023 0.0007023 0.0007023 0.16% MultiFab::contains_nan() 10 0.0006642 0.0006642 0.0006642 0.15% MLCellLinOp::compGrad() 6 0.0006299 0.0006299 0.0006299 0.14% MLMG::prepareForSolve() 6 0.0005605 0.0005605 0.0005605 0.13% Amr::InitAmr() 1 0.0005231 0.0005231 0.0005231 0.12% MLCellLinOp::smooth() 720 0.0005076 0.0005076 0.0005076 0.11% FabArrayBase::CPC::define() 244 0.0004314 0.0004314 0.0004314 0.10% FabArrayBase::getCPC() 632 0.0003884 0.0003884 0.0003884 0.09% FabArray::FillBoundary() 1730 0.000375 0.000375 0.000375 0.08% Gravity::get_old_grav_vector() 5 0.0003255 0.0003255 0.0003255 0.07% FabArrayBase::getFB() 1730 0.0002903 0.0002903 0.0002903 0.07% main() 1 0.0002734 0.0002734 0.0002734 0.06% Gravity::get_new_grav_vector() 5 0.0002727 0.0002727 0.0002727 0.06% MultiFab::max() 6 0.0002236 0.0002236 0.0002236 0.05% Amr::coarseTimeStep() 5 0.000222 0.000222 0.000222 0.05% AmrLevel::FillPatch() 20 0.0001998 0.0001998 0.0001998 0.04% MLCellLinOp::apply() 464 0.0001918 0.0001918 0.0001918 0.04% MLCGSolver::ParallelAllReduce 798 0.0001585 0.0001585 0.0001585 0.04% MLCellLinOp::defineBC() 6 0.0001492 0.0001492 0.0001492 0.03% Castro::subcycle_advance_ctu() 5 0.0001162 0.0001162 0.0001162 0.03% Castro::create_source_corrector() 5 0.0001109 0.0001109 0.0001109 0.02% FabArray::ParallelCopy() 380 0.0001083 0.0001083 0.0001083 0.02% FillPatchIterator::Initialize 20 0.0001062 0.0001062 0.0001062 0.02% Castro::construct_new_source() 25 0.000104 0.000104 0.000104 0.02% Castro::do_advance_ctu() 5 0.0001038 0.0001038 0.0001038 0.02% MLLinOp::defineGrids() 6 9.382e-05 9.382e-05 9.382e-05 0.02% MLMG::mgVcycle() 36 9.192e-05 9.192e-05 9.192e-05 0.02% Amr::timeStep() 5 9.126e-05 9.126e-05 9.126e-05 0.02% AmrLevel::restart() 1 7.715e-05 7.715e-05 7.715e-05 0.02% MLCellLinOp::correctionResidual() 180 7.641e-05 7.641e-05 7.641e-05 0.02% StateData::restartDoit() 4 6.848e-05 6.848e-05 6.848e-05 0.02% Castro::initialize_do_advance() 5 6.648e-05 6.648e-05 6.648e-05 0.01% Gravity::update_max_rhs() 6 6.617e-05 6.617e-05 6.617e-05 0.01% FabArrayBase::FB::FB() 26 6.118e-05 6.118e-05 6.118e-05 0.01% MLMG:computeResOfCorrection() 180 5.689e-05 5.689e-05 5.689e-05 0.01% Gravity::solve_for_phi() 5 5.523e-05 5.523e-05 5.523e-05 0.01% Castro::post_timestep() 5 5.072e-05 5.072e-05 5.072e-05 0.01% Castro::finalize_do_advance() 5 4.37e-05 4.37e-05 4.37e-05 0.01% MLMG::actualBottomSolve() 36 4.158e-05 4.158e-05 4.158e-05 0.01% MLMG::mgVcycle_down::0 36 3.764e-05 3.764e-05 3.764e-05 0.01% MLMG::mgVcycle_down::1 36 3.671e-05 3.671e-05 3.671e-05 0.01% Castro::initialize_advance() 5 3.609e-05 3.609e-05 3.609e-05 0.01% Castro::do_new_sources() 5 3.556e-05 3.556e-05 3.556e-05 0.01% MLMG::solve() 6 3.51e-05 3.51e-05 3.51e-05 0.01% Castro::advance() 5 3.298e-05 3.298e-05 3.298e-05 0.01% Castro::clean_state() 30 3.253e-05 3.253e-05 3.253e-05 0.01% Amr::writeSmallPlotFile() 1 3.246e-05 3.246e-05 3.246e-05 0.01% MLMG::mgVcycle_down::2 36 3.214e-05 3.214e-05 3.214e-05 0.01% MLMG::mgVcycle_down::4 36 3.1e-05 3.1e-05 3.1e-05 0.01% MLMG::mgVcycle_down::3 36 2.988e-05 2.988e-05 2.988e-05 0.01% MLMG::oneIter() 36 2.946e-05 2.946e-05 2.946e-05 0.01% Castro::post_restart() 1 2.94e-05 2.94e-05 2.94e-05 0.01% Castro::finalize_advance() 5 2.936e-05 2.936e-05 2.936e-05 0.01% Castro::buildMetrics() 1 2.898e-05 2.898e-05 2.898e-05 0.01% MLMG::mgVcycle_up::4 36 2.85e-05 2.85e-05 2.85e-05 0.01% Castro::construct_old_source() 25 2.764e-05 2.764e-05 2.764e-05 0.01% MLCellLinOp::solutionResidual() 42 2.692e-05 2.692e-05 2.692e-05 0.01% Castro::initMFs() 1 2.414e-05 2.414e-05 2.414e-05 0.01% Castro::swap_state_time_levels() 5 2.411e-05 2.411e-05 2.411e-05 0.01% MLMG::mgVcycle_up::3 36 2.383e-05 2.383e-05 2.383e-05 0.01% MLMG::mgVcycle_up::0 36 2.233e-05 2.233e-05 2.233e-05 0.01% FillPatchIterator::FillFromLevel0() 20 2.165e-05 2.165e-05 2.165e-05 0.00% MLMG::mgVcycle_up::2 36 2.15e-05 2.15e-05 2.15e-05 0.00% MLMG::computeResidual() 36 2.149e-05 2.149e-05 2.149e-05 0.00% MLMG::mgVcycle_up::1 36 2.075e-05 2.075e-05 2.075e-05 0.00% MLMG::ResNormInf() 42 1.811e-05 1.811e-05 1.811e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.723e-05 1.723e-05 1.723e-05 0.00% Castro::construct_new_gravity() 5 1.692e-05 1.692e-05 1.692e-05 0.00% FillPatchSingleLevel 20 1.641e-05 1.641e-05 1.641e-05 0.00% MLMG::mgVcycle_bottom 36 1.575e-05 1.575e-05 1.575e-05 0.00% makeSFC 30 1.554e-05 1.554e-05 1.554e-05 0.00% MLPoisson::define() 6 1.432e-05 1.432e-05 1.432e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.411e-05 1.411e-05 1.411e-05 0.00% Castro::do_old_sources() 5 1.141e-05 1.141e-05 1.141e-05 0.00% MLPoisson::prepareForSolve() 6 1.044e-05 1.044e-05 1.044e-05 0.00% DistributionMapping::Distribute() 31 1.012e-05 1.012e-05 1.012e-05 0.00% Amr::initSubcycle() 1 9.235e-06 9.235e-06 9.235e-06 0.00% Gravity::actual_multilevel_solve() 1 8.65e-06 8.65e-06 8.65e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.935e-06 7.935e-06 7.935e-06 0.00% Castro::check_for_nan() 10 7.448e-06 7.448e-06 7.448e-06 0.00% Castro::construct_old_gravity() 5 7.228e-06 7.228e-06 7.228e-06 0.00% MLLinOp::define() 6 6.736e-06 6.736e-06 6.736e-06 0.00% Castro::apply_source_to_state() 10 6.175e-06 6.175e-06 6.175e-06 0.00% MLMG::computeMLResidual() 6 4.519e-06 4.519e-06 4.519e-06 0.00% Gravity::swapTimeLevels() 5 4.156e-06 4.156e-06 4.156e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.671e-06 3.671e-06 3.671e-06 0.00% MLMG::getGradSolution() 6 3.207e-06 3.207e-06 3.207e-06 0.00% Castro::expand_state() 5 3.056e-06 3.056e-06 3.056e-06 0.00% Castro::computeNewDt() 5 3.006e-06 3.006e-06 3.006e-06 0.00% MLMG::MLResNormInf() 6 2.821e-06 2.821e-06 2.821e-06 0.00% MLMG::MLRhsNormInf() 6 2.364e-06 2.364e-06 2.364e-06 0.00% Gravity::set_mass_offset() 6 2.302e-06 2.302e-06 2.302e-06 0.00% Castro::retry_advance_ctu() 5 1.98e-06 1.98e-06 1.98e-06 0.00% Castro::FluxRegCrseInit 5 1.439e-06 1.439e-06 1.439e-06 0.00% Castro::FluxRegFineAdd() 5 1.374e-06 1.374e-06 1.374e-06 0.00% Amr::init() 1 1.146e-06 1.146e-06 1.146e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.059e-06 1.059e-06 1.059e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.4448 0.4448 0.4448 100.00% Amr::coarseTimeStep() 5 0.3415 0.3415 0.3415 76.77% Amr::timeStep() 5 0.3393 0.3393 0.3393 76.29% Castro::advance() 5 0.3339 0.3339 0.3339 75.07% Castro::subcycle_advance_ctu() 5 0.3261 0.3261 0.3261 73.31% Castro::do_advance_ctu() 5 0.326 0.326 0.326 73.29% Castro::construct_ctu_hydro_source() 5 0.1441 0.1441 0.1441 32.41% Castro::construct_new_gravity() 5 0.1436 0.1436 0.1436 32.29% Gravity::solve_phi_with_mlmg() 6 0.1412 0.1412 0.1412 31.75% Gravity::solve_for_phi() 5 0.1355 0.1355 0.1355 30.47% Gravity::actual_solve_with_mlmg() 6 0.1352 0.1352 0.1352 30.40% MLMG::solve() 6 0.123 0.123 0.123 27.65% MLMG::oneIter() 36 0.1152 0.1152 0.1152 25.89% MLMG::mgVcycle() 36 0.1135 0.1135 0.1135 25.52% Amr::init() 1 0.07304 0.07304 0.07304 16.42% Amr::restart() 1 0.07304 0.07304 0.07304 16.42% AmrLevel::restart() 1 0.06168 0.06168 0.06168 13.87% StateData::restartDoit() 4 0.0616 0.0616 0.0616 13.85% VisMF::Read() 3 0.06131 0.06131 0.06131 13.78% MLCellLinOp::smooth() 720 0.05978 0.05978 0.05978 13.44% MLCellLinOp::applyBC() 1910 0.04951 0.04951 0.04951 11.13% MLMG::mgVcycle_bottom 36 0.03175 0.03175 0.03175 7.14% MLMG::actualBottomSolve() 36 0.03174 0.03174 0.03174 7.14% MLCGSolver::bicgstab 36 0.03137 0.03137 0.03137 7.05% Castro::clean_state() 30 0.02964 0.02964 0.02964 6.66% Amr::writePlotFile() 1 0.02944 0.02944 0.02944 6.62% VisMF::Write(FabArray) 1 0.02673 0.02673 0.02673 6.01% AmrLevel::FillPatch() 20 0.02338 0.02338 0.02338 5.26% MLPoisson::Fsmooth() 1440 0.02153 0.02153 0.02153 4.84% FillPatchIterator::Initialize 20 0.02128 0.02128 0.02128 4.78% FillPatchIterator::FillFromLevel0() 20 0.02045 0.02045 0.02045 4.60% FillPatchSingleLevel 20 0.02043 0.02043 0.02043 4.59% StateDataPhysBCFunct::() 20 0.01837 0.01837 0.01837 4.13% MLMG::mgVcycle_up::0 36 0.01808 0.01808 0.01808 4.06% MLCellLinOp::apply() 464 0.01621 0.01621 0.01621 3.64% MLMG::mgVcycle_down::0 36 0.01523 0.01523 0.01523 3.42% FabArray::FillBoundary() 1730 0.01364 0.01364 0.01364 3.07% StateData::FillBoundary(geom) 160 0.01329 0.01329 0.01329 2.99% FillBoundary_nowait() 1730 0.01327 0.01327 0.01327 2.98% Castro::normalize_species() 30 0.01131 0.01131 0.01131 2.54% Castro::initialize_do_advance() 5 0.0112 0.0112 0.0112 2.52% Castro::computeTemp() 30 0.01054 0.01054 0.01054 2.37% Castro::do_old_sources() 5 0.01019 0.01019 0.01019 2.29% MLPoisson::define() 6 0.009548 0.009548 0.009548 2.15% amrex::Dot() 484 0.009263 0.009263 0.009263 2.08% MLMG:computeResOfCorrection() 180 0.008962 0.008962 0.008962 2.01% MLCellLinOp::correctionResidual() 180 0.008905 0.008905 0.008905 2.00% FabArray::norminf() 465 0.008715 0.008715 0.008715 1.96% Gravity::get_new_grav_vector() 5 0.007984 0.007984 0.007984 1.79% Castro::construct_old_gravity() 5 0.007754 0.007754 0.007754 1.74% Gravity::get_old_grav_vector() 5 0.007747 0.007747 0.007747 1.74% Castro::initialize_advance() 5 0.007486 0.007486 0.007486 1.68% MLMG::mgVcycle_down::1 36 0.00748 0.00748 0.00748 1.68% Castro::do_new_sources() 5 0.007321 0.007321 0.007321 1.65% MLMG::mgVcycle_down::2 36 0.006834 0.006834 0.006834 1.54% FabArray::ParallelCopy() 380 0.006764 0.006764 0.006764 1.52% FabArray::ParallelCopy_nowait() 380 0.006655 0.006655 0.006655 1.50% MLMG::mgVcycle_down::3 36 0.006645 0.006645 0.006645 1.49% FabArray::setVal() 501 0.006611 0.006611 0.006611 1.49% MLMG::mgVcycle_down::4 36 0.006573 0.006573 0.006573 1.48% MLCellLinOp::defineAuxData() 6 0.006451 0.006451 0.006451 1.45% Castro::enforce_min_density() 30 0.00641 0.00641 0.00641 1.44% Castro::post_restart() 1 0.006359 0.006359 0.006359 1.43% Castro::expand_state() 5 0.006152 0.006152 0.006152 1.38% Gravity::multilevel_solve_for_new_phi() 1 0.005966 0.005966 0.005966 1.34% Gravity::actual_multilevel_solve() 1 0.005948 0.005948 0.005948 1.34% Gravity::fill_multipole_BCs() 6 0.005881 0.005881 0.005881 1.32% FabArray::Saxpy() 597 0.005847 0.005847 0.005847 1.31% MLCGSolver::ParallelAllReduce 798 0.005657 0.005657 0.005657 1.27% MLMG::addInterpCorrection() 180 0.005523 0.005523 0.005523 1.24% amrex::Copy() 221 0.005447 0.005447 0.005447 1.22% amrex::average_down 180 0.005421 0.005421 0.005421 1.22% Castro::post_timestep() 5 0.005328 0.005328 0.005328 1.20% MLMG::mgVcycle_up::1 36 0.005317 0.005317 0.005317 1.20% MLMG::mgVcycle_up::4 36 0.005304 0.005304 0.005304 1.19% MLMG::mgVcycle_up::2 36 0.005151 0.005151 0.005151 1.16% MLMG::mgVcycle_up::3 36 0.005066 0.005066 0.005066 1.14% MLPoisson::Fapply() 464 0.00452 0.00452 0.00452 1.02% MLCellLinOp::solutionResidual() 42 0.003728 0.003728 0.003728 0.84% FabArray::Xpay() 325 0.003491 0.003491 0.003491 0.78% Castro::reset_internal_energy(MultiFab) 30 0.003166 0.003166 0.003166 0.71% Castro::estTimeStep() 10 0.00303 0.00303 0.00303 0.68% MLMG::computeResidual() 36 0.002949 0.002949 0.002949 0.66% MLCellLinOp::defineBC() 6 0.002948 0.002948 0.002948 0.66% MLMG::prepareForSolve() 6 0.002901 0.002901 0.002901 0.65% BndryData::define() 6 0.002799 0.002799 0.002799 0.63% Castro::construct_new_source() 25 0.001964 0.001964 0.001964 0.44% Castro::computeNewDt() 5 0.001899 0.001899 0.001899 0.43% Castro::construct_new_gravity_source() 5 0.001861 0.001861 0.001861 0.42% amrex::Add() 36 0.001624 0.001624 0.001624 0.37% Castro::construct_old_source() 25 0.001561 0.001561 0.001561 0.35% Castro::construct_old_gravity_source() 5 0.001533 0.001533 0.001533 0.34% Castro::enforce_speed_limit() 30 0.001346 0.001346 0.001346 0.30% Castro::finalize_do_advance() 5 0.001178 0.001178 0.001178 0.26% MLMG::ResNormInf() 42 0.001006 0.001006 0.001006 0.23% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009769 0.0009769 0.0009769 0.22% check_for_negative_density() 5 0.0009717 0.0009717 0.0009717 0.22% Castro::apply_source_to_state() 10 0.0009603 0.0009603 0.0009603 0.22% MLMG::getGradSolution() 6 0.0009116 0.0009116 0.0009116 0.20% MLCellLinOp::compGrad() 6 0.0009084 0.0009084 0.0009084 0.20% Castro::reset_internal_energy(Fab) 240 0.000902 0.000902 0.000902 0.20% MLCellLinOp::setLevelBC() 6 0.0008761 0.0008761 0.0008761 0.20% FabArrayBase::getCPC() 632 0.0008198 0.0008198 0.0008198 0.18% MLMG::computeMLResidual() 6 0.0008053 0.0008053 0.0008053 0.18% MLPoisson::prepareForSolve() 6 0.0007741 0.0007741 0.0007741 0.17% MLCellLinOp::prepareForSolve() 6 0.0007637 0.0007637 0.0007637 0.17% Gravity::update_max_rhs() 6 0.0007411 0.0007411 0.0007411 0.17% FabArray::setDomainBndry() 20 0.0007167 0.0007167 0.0007167 0.16% FabArray::mult() 22 0.0007023 0.0007023 0.0007023 0.16% Castro::check_for_nan() 10 0.0006717 0.0006717 0.0006717 0.15% MultiFab::contains_nan() 10 0.0006642 0.0006642 0.0006642 0.15% Amr::InitAmr() 1 0.0005323 0.0005323 0.0005323 0.12% FabArrayBase::CPC::define() 244 0.0004314 0.0004314 0.0004314 0.10% FabArrayBase::getFB() 1730 0.0003514 0.0003514 0.0003514 0.08% Castro::finalize_advance() 5 0.0002982 0.0002982 0.0002982 0.07% Gravity::swapTimeLevels() 5 0.0002454 0.0002454 0.0002454 0.06% MultiFab::max() 6 0.0002236 0.0002236 0.0002236 0.05% MLMG::MLResNormInf() 6 0.0001937 0.0001937 0.0001937 0.04% Castro::buildMetrics() 1 0.0001585 0.0001585 0.0001585 0.04% MLLinOp::define() 6 0.0001339 0.0001339 0.0001339 0.03% MLLinOp::defineGrids() 6 0.0001271 0.0001271 0.0001271 0.03% MLMG::MLRhsNormInf() 6 0.0001217 0.0001217 0.0001217 0.03% Castro::create_source_corrector() 5 0.0001109 0.0001109 0.0001109 0.02% FabArrayBase::FB::FB() 26 6.118e-05 6.118e-05 6.118e-05 0.01% Amr::writeSmallPlotFile() 1 3.246e-05 3.246e-05 3.246e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 3.224e-05 3.224e-05 3.224e-05 0.01% makeSFC 30 2.43e-05 2.43e-05 2.43e-05 0.01% Castro::initMFs() 1 2.414e-05 2.414e-05 2.414e-05 0.01% Castro::swap_state_time_levels() 5 2.411e-05 2.411e-05 2.411e-05 0.01% DistributionMapping::Distribute() 31 1.012e-05 1.012e-05 1.012e-05 0.00% Amr::initSubcycle() 1 9.235e-06 9.235e-06 9.235e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.031e-06 5.031e-06 5.031e-06 0.00% Gravity::set_mass_offset() 6 2.302e-06 2.302e-06 2.302e-06 0.00% Castro::retry_advance_ctu() 5 1.98e-06 1.98e-06 1.98e-06 0.00% Castro::FluxRegCrseInit 5 1.439e-06 1.439e-06 1.439e-06 0.00% Castro::FluxRegFineAdd() 5 1.374e-06 1.374e-06 1.374e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.059e-06 1.059e-06 1.059e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Device Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Arena::Initialize() 1 1 179 MiB 9042 MiB Castro::initMFs() 48 48 57 MiB 68 MiB Castro::swap_state_time_levels() 32 32 46 MiB 55 MiB StateData::restartDoit() 32 32 52 MiB 55 MiB FillPatchIterator::Initialize 160 160 1050 KiB 39 MiB Castro::initialize_do_advance() 40 40 28 MiB 39 MiB ResizeRandomSeed 1 1 30 MiB 30 MiB Amr::writePlotFile() 8 8 1771 KiB 28 MiB Castro::initialize_advance() 40 40 17 MiB 23 MiB Castro::buildMetrics() 32 32 13 MiB 15 MiB Castro::post_restart() 48 48 6433 KiB 14 MiB MLMG::prepareForSolve() 361 361 3396 KiB 12 MiB Gravity::get_old_grav_vector() 43 43 180 KiB 10 MiB Gravity::get_new_grav_vector() 40 40 183 KiB 10 MiB Gravity::multilevel_solve_for_new_phi() 24 24 6419 KiB 7586 KiB Gravity::fill_multipole_BCs() 84 84 22 KiB 2053 KiB Gravity::update_max_rhs() 48 48 3312 B 2048 KiB Gravity::solve_for_phi() 40 40 622 KiB 2048 KiB Gravity::actual_multilevel_solve() 8 8 27 KiB 2048 KiB BndryData::define() 576 576 312 KiB 1095 KiB MLCellLinOp::defineAuxData() 936 936 200 KiB 671 KiB Castro::estTimeStep() 10 10 3260 B 480 KiB VisMF::Write(FabArray) 112 112 2253 B 320 KiB Castro::normalize_species() 30 30 8279 B 320 KiB amrex::average_down 469 469 1524 B 257 KiB MLMG::addInterpCorrection() 468 468 1066 B 257 KiB amrex::Dot() 592 592 3071 B 160 KiB FabArray::norminf() 501 501 3026 B 160 KiB check_for_negative_density() 5 5 350 B 160 KiB MultiFab::max() 6 6 78 B 160 KiB FabArray::setVal() 66 66 20 KiB 27 KiB MultiFab::contains_nan() 10 10 29 B 20 KiB MLPoisson::Fsmooth() 60 60 3295 B 12 KiB FabArray::ParallelCopy_nowait() 380 380 44 B 10 KiB FillBoundary_nowait() 336 336 251 B 9648 B MLCellLinOp::applyBC() 3820 3820 201 B 9344 B amrex::Copy() 56 56 5880 B 8816 B MLCellLinOp::prepareForSolve() 36 36 4 B 7792 B StateData::FillBoundary(geom) 960 960 43 B 2880 B FabArray::setVal(val, thecmd, scomp, ncomp) 72 72 1 B 1616 B MLCellLinOp::defineBC() 36 36 351 B 1248 B MLCGSolver::bicgstab 180 180 85 B 1216 B ------------------------------------------------------------------------------ Managed Memory Usage: ------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------ The_Managed_Arena::Initialize() 1 1 1089 B 8192 KiB ------------------------------------------------------------------ Pinned Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Pinned_Arena::Initialize() 1 1 83 KiB 8192 KiB VisMF::Write(FabArray) 120 120 160 KiB 3584 KiB VisMF::Read() 24 24 208 KiB 3000 KiB FabArray::setVal() 66 66 20 KiB 27 KiB MLPoisson::Fsmooth() 60 60 3295 B 12 KiB FabArray::ParallelCopy_nowait() 380 380 44 B 10 KiB FillBoundary_nowait() 336 336 252 B 9648 B MLCellLinOp::applyBC() 1910 1910 200 B 9328 B amrex::Copy() 56 56 5880 B 8816 B MLCellLinOp::prepareForSolve() 36 36 4 B 7792 B Gravity::get_old_grav_vector() 3 3 2521 B 3072 B StateData::FillBoundary(geom) 960 960 43 B 2880 B Gravity::fill_multipole_BCs() 18 18 5 B 2832 B MLMG::prepareForSolve() 7 7 804 B 1648 B amrex::average_down 37 37 462 B 1648 B FabArray::setVal(val, thecmd, scomp, ncomp) 72 72 1 B 1616 B MLMG::addInterpCorrection() 36 36 2 B 1024 B MLCellLinOp::setLevelBC() 36 36 0 B 768 B amrex::Dot() 592 592 22 B 400 B FabArray::norminf() 501 501 8 B 144 B Castro::estTimeStep() 10 10 0 B 32 B check_for_negative_density() 5 5 0 B 16 B MultiFab::max() 6 6 0 B 16 B MultiFab::contains_nan() 10 10 0 B 16 B Castro::normalize_species() 30 30 0 B 16 B ------------------------------------------------------------------------------ Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2167 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (24.02-30-g2ecafcff4013) finalized