Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.08-12-g4f639294606d) initialized Starting run at 08:35:25 UTC on 2022-08-15. Successfully read inputs file ... Castro git describe: 22.08-11-ga978fcf88 AMReX git describe: 22.08-12-g4f6392946 Microphysics git describe: 22.08-2-gd7421d4a reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.051629823 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.029662544 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.048230933 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.051507049 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.05062344 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.057634938 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.068286088 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.047476265 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.079737922 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.060255659 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.060441196 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.049007641 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.055296674 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.047402513 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.029412564 seconds Ending run at 08:35:25 UTC on 2022-08-15. Run time = 0.839212201 Run time without initialization = 0.706020859 Average number of zones advanced per microsecond: 3.713 Average number of zones advanced per microsecond per rank: 3.713 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.8392 ... 0.8392 ... 0.8392 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.1975 0.1975 0.1975 23.53% Castro::construct_ctu_hydro_source() 10 0.1942 0.1942 0.1942 23.14% MLCellLinOp::applyBC() 4433 0.07907 0.07907 0.07907 9.42% MLPoisson::Fsmooth() 3280 0.06331 0.06331 0.06331 7.54% MLCGSolver::bicgstab 82 0.0237 0.0237 0.0237 2.82% StateData::FillBoundary(geom) 328 0.02339 0.02339 0.02339 2.79% MultiFab::Dot() 1114 0.02191 0.02191 0.02191 2.61% StateDataPhysBCFunct::() 41 0.02102 0.02102 0.02102 2.50% MultiFab::LinComb() 1586 0.0142 0.0142 0.0142 1.69% FillBoundary_nowait() 4023 0.01414 0.01414 0.01414 1.69% FabArray::setVal() 1144 0.01406 0.01406 0.01406 1.68% Castro::computeTemp() 63 0.01394 0.01394 0.01394 1.66% Castro::normalize_species() 62 0.0132 0.0132 0.0132 1.57% FabArray::ParallelCopy_nowait() 861 0.0129 0.0129 0.0129 1.54% MLPoisson::Fapply() 1142 0.01154 0.01154 0.01154 1.37% MLCellLinOp::defineAuxData() 11 0.01121 0.01121 0.01121 1.34% Castro::enforce_min_density() 62 0.009772 0.009772 0.009772 1.16% Gravity::fill_multipole_BCs() 11 0.008241 0.008241 0.008241 0.98% MLMG::addInterpCorrection() 410 0.007657 0.007657 0.007657 0.91% amrex::average_down 410 0.00677 0.00677 0.00677 0.81% MultiFab::Xpay() 585 0.006493 0.006493 0.006493 0.77% Amr::checkPoint() 3 0.005251 0.005251 0.005251 0.63% Castro::estTimeStep() 21 0.004943 0.004943 0.004943 0.59% Castro::do_advance_ctu() 10 0.00455 0.00455 0.00455 0.54% Castro::reset_internal_energy(MultiFab) 63 0.004283 0.004283 0.004283 0.51% BndryData::define() 11 0.003711 0.003711 0.003711 0.44% Castro::construct_new_gravity_source() 10 0.003281 0.003281 0.003281 0.39% Amr::writePlotFile() 2 0.002888 0.002888 0.002888 0.34% Castro::construct_old_gravity_source() 10 0.002636 0.002636 0.002636 0.31% MLMG::ResNormInf() 93 0.002097 0.002097 0.002097 0.25% Gravity::get_new_grav_vector() 11 0.001911 0.001911 0.001911 0.23% MultiFab::Saxpy() 20 0.001809 0.001809 0.001809 0.22% Castro::expand_state() 10 0.001727 0.001727 0.001727 0.21% Gravity::get_old_grav_vector() 10 0.001716 0.001716 0.001716 0.20% MultiFab::Add() 82 0.001647 0.001647 0.001647 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001606 0.001606 0.001606 0.19% Castro::enforce_speed_limit() 62 0.001538 0.001538 0.001538 0.18% MLCellLinOp::setLevelBC() 11 0.001509 0.001509 0.001509 0.18% Castro::reset_internal_energy(Fab) 504 0.001483 0.001483 0.001483 0.18% Gravity::actual_solve_with_mlmg() 11 0.001419 0.001419 0.001419 0.17% FabArray::mult() 43 0.001317 0.001317 0.001317 0.16% FabArray::setDomainBndry() 41 0.001284 0.001284 0.001284 0.15% Castro::initData() 1 0.00123 0.00123 0.00123 0.15% MLMG::prepareForSolve() 11 0.001229 0.001229 0.001229 0.15% MultiFab::contains_nan() 20 0.001176 0.001176 0.001176 0.14% MLCellLinOp::smooth() 1640 0.001165 0.001165 0.001165 0.14% MLCellLinOp::prepareForSolve() 11 0.001146 0.001146 0.001146 0.14% MLCellLinOp::compGrad() 11 0.0009045 0.0009045 0.0009045 0.11% FabArray::FillBoundary() 4023 0.0008526 0.0008526 0.0008526 0.10% FabArrayBase::getCPC() 1323 0.0007691 0.0007691 0.0007691 0.09% FabArrayBase::CPC::define() 454 0.0006684 0.0006684 0.0006684 0.08% FabArrayBase::getFB() 4023 0.0005876 0.0005876 0.0005876 0.07% MLCellLinOp::apply() 1142 0.0004981 0.0004981 0.0004981 0.06% Amr::InitAmr() 1 0.0004687 0.0004687 0.0004687 0.06% Gravity::solve_for_phi() 10 0.0004418 0.0004418 0.0004418 0.05% Amr::coarseTimeStep() 10 0.0004164 0.0004164 0.0004164 0.05% Gravity::update_max_rhs() 11 0.0004131 0.0004131 0.0004131 0.05% CGSolver::sxay() 1586 0.0003588 0.0003588 0.0003588 0.04% MultiFab::Copy() 11 0.0003212 0.0003212 0.0003212 0.04% MLCGSolver::ParallelAllReduce 1514 0.0002986 0.0002986 0.0002986 0.04% FillPatchIterator::Initialize 41 0.0002926 0.0002926 0.0002926 0.03% MLCellLinOp::defineBC() 11 0.000274 0.000274 0.000274 0.03% main() 1 0.0002666 0.0002666 0.0002666 0.03% FabArray::ParallelCopy() 861 0.0002579 0.0002579 0.0002579 0.03% MultiFab::max() 11 0.0002537 0.0002537 0.0002537 0.03% Castro::subcycle_advance_ctu() 10 0.0002489 0.0002489 0.0002489 0.03% MLCellLinOp::correctionResidual() 492 0.0002208 0.0002208 0.0002208 0.03% MLMG::MLRhsNormInf() 11 0.0002139 0.0002139 0.0002139 0.03% Amr::timeStep() 10 0.0002071 0.0002071 0.0002071 0.02% Castro::construct_new_gravity() 10 0.0002069 0.0002069 0.0002069 0.02% MLMG::mgVcycle() 82 0.0001937 0.0001937 0.0001937 0.02% MLMG:computeResOfCorrection() 410 0.0001446 0.0001446 0.0001446 0.02% StateData::checkPoint() 12 0.0001313 0.0001313 0.0001313 0.02% MLLinOp::defineGrids() 11 0.0001295 0.0001295 0.0001295 0.02% MLMG::mgVcycle_down::0 82 0.0001128 0.0001128 0.0001128 0.01% MLMG::mgVcycle_down::1 82 9.493e-05 9.493e-05 9.493e-05 0.01% MLMG::mgVcycle_down::2 82 9.327e-05 9.327e-05 9.327e-05 0.01% MLMG::mgVcycle_down::3 82 8.651e-05 8.651e-05 8.651e-05 0.01% MLMG::mgVcycle_down::4 82 8.614e-05 8.614e-05 8.614e-05 0.01% Castro::clean_state() 62 8.501e-05 8.501e-05 8.501e-05 0.01% Castro::Castro() 1 8.419e-05 8.419e-05 8.419e-05 0.01% FabArrayBase::FB::FB() 56 8.392e-05 8.392e-05 8.392e-05 0.01% Castro::initialize_advance() 10 8.32e-05 8.32e-05 8.32e-05 0.01% Castro::finalize_advance() 10 8.163e-05 8.163e-05 8.163e-05 0.01% MLMG::actualBottomSolve() 82 7.878e-05 7.878e-05 7.878e-05 0.01% MLMG::mgVcycle_up::4 82 7.254e-05 7.254e-05 7.254e-05 0.01% AmrLevel::checkPoint() 3 7.235e-05 7.235e-05 7.235e-05 0.01% MLMG::solve() 11 7.123e-05 7.123e-05 7.123e-05 0.01% Castro::initialize_do_advance() 10 6.452e-05 6.452e-05 6.452e-05 0.01% MLMG::mgVcycle_up::0 82 6.314e-05 6.314e-05 6.314e-05 0.01% MLMG::oneIter() 82 5.977e-05 5.977e-05 5.977e-05 0.01% MLMG::mgVcycle_up::3 82 5.836e-05 5.836e-05 5.836e-05 0.01% MLMG::mgVcycle_up::1 82 5.737e-05 5.737e-05 5.737e-05 0.01% MLMG::mgVcycle_up::2 82 5.576e-05 5.576e-05 5.576e-05 0.01% MLCellLinOp::solutionResidual() 93 5.244e-05 5.244e-05 5.244e-05 0.01% StateData::define() 4 4.309e-05 4.309e-05 4.309e-05 0.01% Castro::advance() 10 4.211e-05 4.211e-05 4.211e-05 0.01% Castro::swap_state_time_levels() 10 4.043e-05 4.043e-05 4.043e-05 0.00% Castro::construct_new_source() 50 3.951e-05 3.951e-05 3.951e-05 0.00% MLMG::computeResidual() 82 3.707e-05 3.707e-05 3.707e-05 0.00% Castro::enforce_consistent_e() 1 3.225e-05 3.225e-05 3.225e-05 0.00% Castro::finalize_do_advance() 10 3.169e-05 3.169e-05 3.169e-05 0.00% MLMG::mgVcycle_bottom 82 3.142e-05 3.142e-05 3.142e-05 0.00% Gravity::actual_multilevel_solve() 1 3.029e-05 3.029e-05 3.029e-05 0.00% MLPoisson::define() 11 2.945e-05 2.945e-05 2.945e-05 0.00% Castro::initMFs() 1 2.792e-05 2.792e-05 2.792e-05 0.00% FillPatchSingleLevel 41 2.787e-05 2.787e-05 2.787e-05 0.00% Amr::defBaseLevel() 1 2.644e-05 2.644e-05 2.644e-05 0.00% makeSFC 55 2.613e-05 2.613e-05 2.613e-05 0.00% Amr::writeSmallPlotFile() 1 2.573e-05 2.573e-05 2.573e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.379e-05 2.379e-05 2.379e-05 0.00% Castro::buildMetrics() 1 2.315e-05 2.315e-05 2.315e-05 0.00% Amr::FinalizeInit() 1 2.264e-05 2.264e-05 2.264e-05 0.00% MLLinOp::define() 11 2.203e-05 2.203e-05 2.203e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.791e-05 1.791e-05 1.791e-05 0.00% Castro::construct_old_source() 50 1.78e-05 1.78e-05 1.78e-05 0.00% Castro::do_new_sources() 10 1.617e-05 1.617e-05 1.617e-05 0.00% Castro::do_old_sources() 10 1.563e-05 1.563e-05 1.563e-05 0.00% DistributionMapping::Distribute() 56 1.438e-05 1.438e-05 1.438e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.313e-05 1.313e-05 1.313e-05 0.00% Castro::check_for_nan() 20 1.121e-05 1.121e-05 1.121e-05 0.00% Castro::construct_old_gravity() 10 1.018e-05 1.018e-05 1.018e-05 0.00% Castro::apply_source_to_state() 20 1.014e-05 1.014e-05 1.014e-05 0.00% MLPoisson::prepareForSolve() 11 9.798e-06 9.798e-06 9.798e-06 0.00% MLMG::computeMLResidual() 11 9.573e-06 9.573e-06 9.573e-06 0.00% Gravity::swapTimeLevels() 10 9.499e-06 9.499e-06 9.499e-06 0.00% Castro::post_timestep() 10 8.763e-06 8.763e-06 8.763e-06 0.00% Amr::initSubcycle() 1 8.401e-06 8.401e-06 8.401e-06 0.00% Amr::InitializeInit() 1 7.032e-06 7.032e-06 7.032e-06 0.00% AmrLevel::AmrLevel(dm) 1 6.838e-06 6.838e-06 6.838e-06 0.00% Castro::computeNewDt() 9 6.26e-06 6.26e-06 6.26e-06 0.00% MLMG::getGradSolution() 11 6.193e-06 6.193e-06 6.193e-06 0.00% AmrLevel::checkPointPost() 3 5.742e-06 5.742e-06 5.742e-06 0.00% Gravity::set_mass_offset() 11 4.049e-06 4.049e-06 4.049e-06 0.00% Castro::retry_advance_ctu() 10 3.997e-06 3.997e-06 3.997e-06 0.00% Castro::create_source_corrector() 10 3.701e-06 3.701e-06 3.701e-06 0.00% Castro::post_init() 1 3.604e-06 3.604e-06 3.604e-06 0.00% MLMG::MLResNormInf() 11 3.498e-06 3.498e-06 3.498e-06 0.00% Castro::FluxRegCrseInit 10 2.792e-06 2.792e-06 2.792e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.683e-06 2.683e-06 2.683e-06 0.00% Amr::init() 1 2.478e-06 2.478e-06 2.478e-06 0.00% Castro::computeInitialDt() 2 2.311e-06 2.311e-06 2.311e-06 0.00% Castro::FluxRegFineAdd() 10 2.173e-06 2.173e-06 2.173e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.996e-06 1.996e-06 1.996e-06 0.00% AmrLevel::checkPointPre() 3 1.861e-06 1.861e-06 1.861e-06 0.00% Amr::initialInit() 1 1.275e-06 1.275e-06 1.275e-06 0.00% Castro::post_regrid() 1 1.07e-06 1.07e-06 1.07e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8392 0.8392 0.8392 100.00% Amr::coarseTimeStep() 10 0.6764 0.6764 0.6764 80.60% Amr::timeStep() 10 0.5784 0.5784 0.5784 68.92% Castro::advance() 10 0.5716 0.5716 0.5716 68.11% Castro::subcycle_advance_ctu() 10 0.5605 0.5605 0.5605 66.79% Castro::do_advance_ctu() 10 0.5603 0.5603 0.5603 66.76% Gravity::solve_phi_with_mlmg() 11 0.3103 0.3103 0.3103 36.97% Gravity::actual_solve_with_mlmg() 11 0.3018 0.3018 0.3018 35.96% Castro::construct_new_gravity() 10 0.2824 0.2824 0.2824 33.65% MLMG::solve() 11 0.2798 0.2798 0.2798 33.34% Gravity::solve_for_phi() 10 0.267 0.267 0.267 31.81% MLMG::oneIter() 82 0.265 0.265 0.265 31.58% MLMG::mgVcycle() 82 0.2633 0.2633 0.2633 31.38% VisMF::Write(FabArray) 11 0.1975 0.1975 0.1975 23.53% Castro::construct_ctu_hydro_source() 10 0.1942 0.1942 0.1942 23.14% Amr::checkPoint() 3 0.1466 0.1466 0.1466 17.47% AmrLevel::checkPoint() 3 0.1414 0.1414 0.1414 16.85% StateData::checkPoint() 12 0.1413 0.1413 0.1413 16.84% MLCellLinOp::smooth() 1640 0.135 0.135 0.135 16.08% Amr::init() 1 0.1326 0.1326 0.1326 15.80% MLCellLinOp::applyBC() 4433 0.09474 0.09474 0.09474 11.29% MLMG::mgVcycle_bottom 82 0.08082 0.08082 0.08082 9.63% MLMG::actualBottomSolve() 82 0.08079 0.08079 0.08079 9.63% MLCGSolver::bicgstab 82 0.07998 0.07998 0.07998 9.53% MLPoisson::Fsmooth() 3280 0.06331 0.06331 0.06331 7.54% Amr::writePlotFile() 2 0.05922 0.05922 0.05922 7.06% Amr::initialInit() 1 0.05117 0.05117 0.05117 6.10% FillPatchIterator::Initialize 41 0.04999 0.04999 0.04999 5.96% FillPatchSingleLevel 41 0.04841 0.04841 0.04841 5.77% Amr::FinalizeInit() 1 0.04713 0.04713 0.04713 5.62% Castro::post_init() 1 0.04571 0.04571 0.04571 5.45% StateDataPhysBCFunct::() 41 0.0444 0.0444 0.0444 5.29% Gravity::multilevel_solve_for_new_phi() 1 0.04381 0.04381 0.04381 5.22% Gravity::actual_multilevel_solve() 1 0.04379 0.04379 0.04379 5.22% Castro::clean_state() 62 0.04344 0.04344 0.04344 5.18% MLCellLinOp::apply() 1142 0.03577 0.03577 0.03577 4.26% MLMG::mgVcycle_down::0 82 0.03512 0.03512 0.03512 4.18% MLMG::mgVcycle_up::0 82 0.03017 0.03017 0.03017 3.59% StateData::FillBoundary(geom) 328 0.02339 0.02339 0.02339 2.79% MultiFab::Dot() 1114 0.02191 0.02191 0.02191 2.61% MLCellLinOp::correctionResidual() 492 0.02094 0.02094 0.02094 2.50% Castro::initialize_do_advance() 10 0.02028 0.02028 0.02028 2.42% Castro::computeTemp() 63 0.0197 0.0197 0.0197 2.35% MLMG:computeResOfCorrection() 410 0.01809 0.01809 0.01809 2.16% MLPoisson::define() 11 0.01765 0.01765 0.01765 2.10% MLMG::mgVcycle_down::1 82 0.01748 0.01748 0.01748 2.08% MLMG::mgVcycle_down::2 82 0.01707 0.01707 0.01707 2.03% Gravity::get_new_grav_vector() 11 0.01702 0.01702 0.01702 2.03% MLMG::mgVcycle_down::3 82 0.01619 0.01619 0.01619 1.93% FabArray::FillBoundary() 4023 0.01567 0.01567 0.01567 1.87% MLMG::mgVcycle_down::4 82 0.01539 0.01539 0.01539 1.83% Castro::construct_old_gravity() 10 0.01485 0.01485 0.01485 1.77% Gravity::get_old_grav_vector() 10 0.01484 0.01484 0.01484 1.77% FillBoundary_nowait() 4023 0.01482 0.01482 0.01482 1.77% CGSolver::sxay() 1586 0.01456 0.01456 0.01456 1.73% MultiFab::LinComb() 1586 0.0142 0.0142 0.0142 1.69% FabArray::setVal() 1144 0.01406 0.01406 0.01406 1.68% FabArray::ParallelCopy() 861 0.01397 0.01397 0.01397 1.66% FabArray::ParallelCopy_nowait() 861 0.01372 0.01372 0.01372 1.63% Castro::normalize_species() 62 0.0132 0.0132 0.0132 1.57% MLMG::mgVcycle_up::2 82 0.01315 0.01315 0.01315 1.57% MLCGSolver::ParallelAllReduce 1514 0.01307 0.01307 0.01307 1.56% MLMG::mgVcycle_up::1 82 0.01289 0.01289 0.01289 1.54% MLMG::addInterpCorrection() 410 0.01262 0.01262 0.01262 1.50% MLCellLinOp::defineAuxData() 11 0.01251 0.01251 0.01251 1.49% MLMG::mgVcycle_up::3 82 0.01245 0.01245 0.01245 1.48% MLMG::mgVcycle_up::4 82 0.01239 0.01239 0.01239 1.48% Castro::expand_state() 10 0.01188 0.01188 0.01188 1.42% amrex::average_down 410 0.0118 0.0118 0.0118 1.41% MLPoisson::Fapply() 1142 0.01154 0.01154 0.01154 1.37% Castro::do_new_sources() 10 0.01117 0.01117 0.01117 1.33% Castro::initialize_advance() 10 0.01092 0.01092 0.01092 1.30% Castro::do_old_sources() 10 0.01079 0.01079 0.01079 1.29% Castro::enforce_min_density() 62 0.009772 0.009772 0.009772 1.16% Gravity::fill_multipole_BCs() 11 0.008241 0.008241 0.008241 0.98% MLCellLinOp::solutionResidual() 93 0.007068 0.007068 0.007068 0.84% Castro::post_timestep() 10 0.006664 0.006664 0.006664 0.79% MultiFab::Xpay() 585 0.006493 0.006493 0.006493 0.77% MLMG::computeResidual() 82 0.006102 0.006102 0.006102 0.73% Castro::reset_internal_energy(MultiFab) 63 0.005766 0.005766 0.005766 0.69% MLMG::prepareForSolve() 11 0.005292 0.005292 0.005292 0.63% Castro::estTimeStep() 21 0.004943 0.004943 0.004943 0.59% MLCellLinOp::defineBC() 11 0.004907 0.004907 0.004907 0.58% BndryData::define() 11 0.004633 0.004633 0.004633 0.55% Amr::InitializeInit() 1 0.004037 0.004037 0.004037 0.48% Amr::defBaseLevel() 1 0.00403 0.00403 0.00403 0.48% Castro::initData() 1 0.003519 0.003519 0.003519 0.42% Castro::construct_new_source() 50 0.003321 0.003321 0.003321 0.40% Castro::construct_new_gravity_source() 10 0.003281 0.003281 0.003281 0.39% Castro::construct_old_source() 50 0.002654 0.002654 0.002654 0.32% Castro::construct_old_gravity_source() 10 0.002636 0.002636 0.002636 0.31% MLMG::ResNormInf() 93 0.002097 0.002097 0.002097 0.25% Castro::computeNewDt() 9 0.002064 0.002064 0.002064 0.25% Castro::apply_source_to_state() 20 0.001819 0.001819 0.001819 0.22% MultiFab::Saxpy() 20 0.001809 0.001809 0.001809 0.22% MultiFab::Add() 82 0.001647 0.001647 0.001647 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001606 0.001606 0.001606 0.19% Castro::enforce_speed_limit() 62 0.001538 0.001538 0.001538 0.18% MLCellLinOp::setLevelBC() 11 0.001509 0.001509 0.001509 0.18% Castro::reset_internal_energy(Fab) 504 0.001483 0.001483 0.001483 0.18% FabArrayBase::getCPC() 1323 0.001438 0.001438 0.001438 0.17% MLMG::getGradSolution() 11 0.001411 0.001411 0.001411 0.17% MLCellLinOp::compGrad() 11 0.001404 0.001404 0.001404 0.17% FabArray::mult() 43 0.001317 0.001317 0.001317 0.16% FabArray::setDomainBndry() 41 0.001284 0.001284 0.001284 0.15% Castro::check_for_nan() 20 0.001188 0.001188 0.001188 0.14% MultiFab::contains_nan() 20 0.001176 0.001176 0.001176 0.14% Castro::post_regrid() 1 0.001159 0.001159 0.001159 0.14% MLPoisson::prepareForSolve() 11 0.001155 0.001155 0.001155 0.14% MLCellLinOp::prepareForSolve() 11 0.001146 0.001146 0.001146 0.14% MLMG::computeMLResidual() 11 0.001013 0.001013 0.001013 0.12% Gravity::update_max_rhs() 11 0.00081 0.00081 0.00081 0.10% Castro::computeInitialDt() 2 0.0007359 0.0007359 0.0007359 0.09% FabArrayBase::getFB() 4023 0.0006715 0.0006715 0.0006715 0.08% FabArrayBase::CPC::define() 454 0.0006684 0.0006684 0.0006684 0.08% Amr::InitAmr() 1 0.0004771 0.0004771 0.0004771 0.06% Gravity::swapTimeLevels() 10 0.000434 0.000434 0.000434 0.05% Castro::Castro() 1 0.0004312 0.0004312 0.0004312 0.05% MultiFab::Copy() 11 0.0003212 0.0003212 0.0003212 0.04% MLMG::MLResNormInf() 11 0.0002761 0.0002761 0.0002761 0.03% MultiFab::max() 11 0.0002537 0.0002537 0.0002537 0.03% MLMG::MLRhsNormInf() 11 0.0002139 0.0002139 0.0002139 0.03% MLLinOp::define() 11 0.0002061 0.0002061 0.0002061 0.02% MLLinOp::defineGrids() 11 0.0001841 0.0001841 0.0001841 0.02% Castro::buildMetrics() 1 0.0001485 0.0001485 0.0001485 0.02% Castro::finalize_advance() 10 8.66e-05 8.66e-05 8.66e-05 0.01% FabArrayBase::FB::FB() 56 8.392e-05 8.392e-05 8.392e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.261e-05 5.261e-05 5.261e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.993e-05 4.993e-05 4.993e-05 0.01% StateData::define() 4 4.309e-05 4.309e-05 4.309e-05 0.01% Castro::swap_state_time_levels() 10 4.043e-05 4.043e-05 4.043e-05 0.00% makeSFC 55 3.949e-05 3.949e-05 3.949e-05 0.00% Castro::enforce_consistent_e() 1 3.225e-05 3.225e-05 3.225e-05 0.00% Castro::finalize_do_advance() 10 3.169e-05 3.169e-05 3.169e-05 0.00% Castro::initMFs() 1 2.792e-05 2.792e-05 2.792e-05 0.00% Amr::writeSmallPlotFile() 1 2.573e-05 2.573e-05 2.573e-05 0.00% DistributionMapping::Distribute() 56 1.438e-05 1.438e-05 1.438e-05 0.00% Amr::initSubcycle() 1 8.401e-06 8.401e-06 8.401e-06 0.00% AmrLevel::checkPointPost() 3 5.742e-06 5.742e-06 5.742e-06 0.00% Gravity::set_mass_offset() 11 4.049e-06 4.049e-06 4.049e-06 0.00% Castro::retry_advance_ctu() 10 3.997e-06 3.997e-06 3.997e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.713e-06 3.713e-06 3.713e-06 0.00% Castro::create_source_corrector() 10 3.701e-06 3.701e-06 3.701e-06 0.00% Castro::FluxRegCrseInit 10 2.792e-06 2.792e-06 2.792e-06 0.00% Castro::FluxRegFineAdd() 10 2.173e-06 2.173e-06 2.173e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.996e-06 1.996e-06 1.996e-06 0.00% AmrLevel::checkPointPre() 3 1.861e-06 1.861e-06 1.861e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.08-12-g4f639294606d) finalized Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.08-12-g4f639294606d) initialized Starting run at 08:35:26 UTC on 2022-08-15. Successfully read inputs file ... Castro git describe: 22.08-11-ga978fcf88 AMReX git describe: 22.08-12-g4f6392946 Microphysics git describe: 22.08-2-gd7421d4a reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.457045763 Restart time = 0.048050747 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.053879622 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.051606829 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.050238575 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.070988953 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.078070737 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.031579838 seconds Ending run at 08:35:27 UTC on 2022-08-15. Run time = 0.385377072 Run time without initialization = 0.336774252 Average number of zones advanced per microsecond: 3.892 Average number of zones advanced per microsecond per rank: 3.892 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.3854 ... 0.3854 ... 0.3854 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1031 0.1031 0.1031 26.75% VisMF::Read() 3 0.04023 0.04023 0.04023 10.44% MLCellLinOp::applyBC() 1946 0.03452 0.03452 0.03452 8.96% VisMF::Write(FabArray) 1 0.02995 0.02995 0.02995 7.77% MLPoisson::Fsmooth() 1440 0.02726 0.02726 0.02726 7.07% StateData::FillBoundary(geom) 160 0.01136 0.01136 0.01136 2.95% MLCGSolver::bicgstab 36 0.01018 0.01018 0.01018 2.64% Castro::computeTemp() 30 0.01008 0.01008 0.01008 2.62% MultiFab::Dot() 484 0.009415 0.009415 0.009415 2.44% Castro::normalize_species() 30 0.007707 0.007707 0.007707 2.00% StateDataPhysBCFunct::() 20 0.007206 0.007206 0.007206 1.87% FabArray::setVal() 537 0.006684 0.006684 0.006684 1.73% FillBoundary_nowait() 1766 0.006237 0.006237 0.006237 1.62% MLCellLinOp::defineAuxData() 6 0.00612 0.00612 0.00612 1.59% MultiFab::LinComb() 690 0.006111 0.006111 0.006111 1.59% FabArray::ParallelCopy_nowait() 380 0.005901 0.005901 0.005901 1.53% MLPoisson::Fapply() 500 0.005 0.005 0.005 1.30% Gravity::fill_multipole_BCs() 6 0.00452 0.00452 0.00452 1.17% Castro::enforce_min_density() 30 0.004443 0.004443 0.004443 1.15% Amr::restart() 1 0.003591 0.003591 0.003591 0.93% MLMG::addInterpCorrection() 180 0.003316 0.003316 0.003316 0.86% Castro::estTimeStep() 10 0.002973 0.002973 0.002973 0.77% amrex::average_down 180 0.002937 0.002937 0.002937 0.76% MultiFab::Xpay() 258 0.002839 0.002839 0.002839 0.74% Castro::enforce_speed_limit() 30 0.00267 0.00267 0.00267 0.69% Castro::do_advance_ctu() 5 0.002262 0.002262 0.002262 0.59% BndryData::define() 6 0.002108 0.002108 0.002108 0.55% Castro::construct_new_gravity_source() 5 0.001741 0.001741 0.001741 0.45% Amr::writePlotFile() 1 0.001715 0.001715 0.001715 0.45% Castro::reset_internal_energy(MultiFab) 30 0.001705 0.001705 0.001705 0.44% Castro::construct_old_gravity_source() 5 0.001599 0.001599 0.001599 0.41% Castro::reset_internal_energy(Fab) 240 0.001065 0.001065 0.001065 0.28% Gravity::get_old_grav_vector() 5 0.0009391 0.0009391 0.0009391 0.24% MLMG::ResNormInf() 42 0.0009323 0.0009323 0.0009323 0.24% MultiFab::Saxpy() 10 0.0009206 0.0009206 0.0009206 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.00088 0.00088 0.00088 0.23% Castro::expand_state() 5 0.0008679 0.0008679 0.0008679 0.23% Gravity::get_new_grav_vector() 5 0.0008604 0.0008604 0.0008604 0.22% MLCellLinOp::setLevelBC() 6 0.0008287 0.0008287 0.0008287 0.22% Gravity::actual_solve_with_mlmg() 6 0.0008037 0.0008037 0.0008037 0.21% MultiFab::Add() 36 0.0007157 0.0007157 0.0007157 0.19% MLMG::prepareForSolve() 6 0.0006622 0.0006622 0.0006622 0.17% FabArray::mult() 22 0.0006438 0.0006438 0.0006438 0.17% MLCellLinOp::prepareForSolve() 6 0.000635 0.000635 0.000635 0.16% FabArray::setDomainBndry() 20 0.0006334 0.0006334 0.0006334 0.16% MultiFab::contains_nan() 10 0.0005925 0.0005925 0.0005925 0.15% MLCellLinOp::smooth() 720 0.0004989 0.0004989 0.0004989 0.13% MLCellLinOp::compGrad() 6 0.000494 0.000494 0.000494 0.13% FabArray::FillBoundary() 1766 0.0004019 0.0004019 0.0004019 0.10% FabArrayBase::CPC::define() 244 0.0003963 0.0003963 0.0003963 0.10% Amr::InitAmr() 1 0.0003881 0.0003881 0.0003881 0.10% FabArrayBase::getCPC() 632 0.0003593 0.0003593 0.0003593 0.09% FabArrayBase::getFB() 1766 0.0002533 0.0002533 0.0002533 0.07% main() 1 0.000252 0.000252 0.000252 0.07% Castro::subcycle_advance_ctu() 5 0.000243 0.000243 0.000243 0.06% Gravity::update_max_rhs() 6 0.0002285 0.0002285 0.0002285 0.06% MLCellLinOp::apply() 500 0.0002085 0.0002085 0.0002085 0.05% Gravity::solve_for_phi() 5 0.0002046 0.0002046 0.0002046 0.05% Castro::create_source_corrector() 5 0.0001961 0.0001961 0.0001961 0.05% MultiFab::Copy() 6 0.0001776 0.0001776 0.0001776 0.05% Amr::coarseTimeStep() 5 0.0001753 0.0001753 0.0001753 0.05% Castro::construct_new_source() 25 0.0001644 0.0001644 0.0001644 0.04% Castro::construct_new_gravity() 5 0.0001642 0.0001642 0.0001642 0.04% CGSolver::sxay() 690 0.0001589 0.0001589 0.0001589 0.04% MLCellLinOp::defineBC() 6 0.0001485 0.0001485 0.0001485 0.04% FillPatchIterator::Initialize 20 0.0001375 0.0001375 0.0001375 0.04% MultiFab::max() 6 0.000136 0.000136 0.000136 0.04% Castro::construct_old_source() 25 0.0001207 0.0001207 0.0001207 0.03% MLCGSolver::ParallelAllReduce 659 0.0001199 0.0001199 0.0001199 0.03% FabArray::ParallelCopy() 380 0.0001182 0.0001182 0.0001182 0.03% MLMG::MLRhsNormInf() 6 0.000114 0.000114 0.000114 0.03% Amr::timeStep() 5 0.0001074 0.0001074 0.0001074 0.03% MLCellLinOp::correctionResidual() 216 8.974e-05 8.974e-05 8.974e-05 0.02% Castro::initialize_do_advance() 5 8.952e-05 8.952e-05 8.952e-05 0.02% Castro::post_timestep() 5 8.853e-05 8.853e-05 8.853e-05 0.02% MLLinOp::defineGrids() 6 8.641e-05 8.641e-05 8.641e-05 0.02% MLMG::mgVcycle() 36 8.456e-05 8.456e-05 8.456e-05 0.02% Castro::advance() 5 8.396e-05 8.396e-05 8.396e-05 0.02% Castro::computeNewDt() 5 7.792e-05 7.792e-05 7.792e-05 0.02% AmrLevel::restart() 1 7.499e-05 7.499e-05 7.499e-05 0.02% StateData::restartDoit() 4 7.277e-05 7.277e-05 7.277e-05 0.02% Castro::finalize_advance() 5 6.702e-05 6.702e-05 6.702e-05 0.02% MLMG:computeResOfCorrection() 180 6.384e-05 6.384e-05 6.384e-05 0.02% Castro::construct_old_gravity() 5 5.811e-05 5.811e-05 5.811e-05 0.02% FabArrayBase::FB::FB() 26 5.642e-05 5.642e-05 5.642e-05 0.01% Castro::initialize_advance() 5 5.262e-05 5.262e-05 5.262e-05 0.01% MLMG::mgVcycle_down::0 36 4.554e-05 4.554e-05 4.554e-05 0.01% MLMG::mgVcycle_down::1 36 4.273e-05 4.273e-05 4.273e-05 0.01% MLMG::mgVcycle_down::2 36 4.034e-05 4.034e-05 4.034e-05 0.01% Castro::clean_state() 30 4.001e-05 4.001e-05 4.001e-05 0.01% MLMG::mgVcycle_down::4 36 3.897e-05 3.897e-05 3.897e-05 0.01% MLMG::mgVcycle_down::3 36 3.759e-05 3.759e-05 3.759e-05 0.01% MLMG::solve() 6 3.435e-05 3.435e-05 3.435e-05 0.01% MLMG::actualBottomSolve() 36 3.433e-05 3.433e-05 3.433e-05 0.01% MLMG::mgVcycle_up::4 36 3.389e-05 3.389e-05 3.389e-05 0.01% Castro::buildMetrics() 1 3.26e-05 3.26e-05 3.26e-05 0.01% Castro::post_restart() 1 3.06e-05 3.06e-05 3.06e-05 0.01% Gravity::actual_multilevel_solve() 1 2.961e-05 2.961e-05 2.961e-05 0.01% MLMG::mgVcycle_up::0 36 2.898e-05 2.898e-05 2.898e-05 0.01% Castro::initMFs() 1 2.843e-05 2.843e-05 2.843e-05 0.01% Castro::swap_state_time_levels() 5 2.778e-05 2.778e-05 2.778e-05 0.01% MLMG::mgVcycle_up::3 36 2.733e-05 2.733e-05 2.733e-05 0.01% MLMG::oneIter() 36 2.703e-05 2.703e-05 2.703e-05 0.01% Amr::writeSmallPlotFile() 1 2.634e-05 2.634e-05 2.634e-05 0.01% MLMG::mgVcycle_up::2 36 2.615e-05 2.615e-05 2.615e-05 0.01% MLMG::mgVcycle_up::1 36 2.499e-05 2.499e-05 2.499e-05 0.01% MLCellLinOp::solutionResidual() 42 2.185e-05 2.185e-05 2.185e-05 0.01% MLPoisson::define() 6 2.148e-05 2.148e-05 2.148e-05 0.01% MLLinOp::define() 6 2.105e-05 2.105e-05 2.105e-05 0.01% Castro::finalize_do_advance() 5 1.792e-05 1.792e-05 1.792e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.752e-05 1.752e-05 1.752e-05 0.00% MLMG::computeResidual() 36 1.644e-05 1.644e-05 1.644e-05 0.00% MLMG::mgVcycle_bottom 36 1.4e-05 1.4e-05 1.4e-05 0.00% makeSFC 30 1.346e-05 1.346e-05 1.346e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.326e-05 1.326e-05 1.326e-05 0.00% FillPatchSingleLevel 20 1.307e-05 1.307e-05 1.307e-05 0.00% Castro::do_new_sources() 5 9.634e-06 9.634e-06 9.634e-06 0.00% Castro::check_for_nan() 10 8.741e-06 8.741e-06 8.741e-06 0.00% Amr::initSubcycle() 1 8.43e-06 8.43e-06 8.43e-06 0.00% DistributionMapping::Distribute() 31 8.219e-06 8.219e-06 8.219e-06 0.00% Castro::do_old_sources() 5 8.213e-06 8.213e-06 8.213e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 6.889e-06 6.889e-06 6.889e-06 0.00% MLPoisson::prepareForSolve() 6 5.269e-06 5.269e-06 5.269e-06 0.00% Castro::apply_source_to_state() 10 5.161e-06 5.161e-06 5.161e-06 0.00% MLMG::computeMLResidual() 6 4.955e-06 4.955e-06 4.955e-06 0.00% Gravity::swapTimeLevels() 5 4.33e-06 4.33e-06 4.33e-06 0.00% MLMG::getGradSolution() 6 3.093e-06 3.093e-06 3.093e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.797e-06 2.797e-06 2.797e-06 0.00% MLMG::MLResNormInf() 6 2.419e-06 2.419e-06 2.419e-06 0.00% Gravity::set_mass_offset() 6 2.295e-06 2.295e-06 2.295e-06 0.00% Castro::retry_advance_ctu() 5 1.756e-06 1.756e-06 1.756e-06 0.00% Castro::FluxRegCrseInit 5 1.54e-06 1.54e-06 1.54e-06 0.00% Castro::FluxRegFineAdd() 5 1.102e-06 1.102e-06 1.102e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.034e-06 1.034e-06 1.034e-06 0.00% AmrLevel::AmrLevel() 1 1.006e-06 1.006e-06 1.006e-06 0.00% Amr::init() 1 9.42e-07 9.42e-07 9.42e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3854 0.3854 0.3854 100.00% Amr::coarseTimeStep() 5 0.305 0.305 0.305 79.12% Amr::timeStep() 5 0.3032 0.3032 0.3032 78.67% Castro::advance() 5 0.2992 0.2992 0.2992 77.63% Castro::subcycle_advance_ctu() 5 0.2913 0.2913 0.2913 75.57% Castro::do_advance_ctu() 5 0.291 0.291 0.291 75.51% Castro::construct_new_gravity() 5 0.1436 0.1436 0.1436 37.26% Gravity::solve_phi_with_mlmg() 6 0.1388 0.1388 0.1388 36.02% Gravity::solve_for_phi() 5 0.1353 0.1353 0.1353 35.10% Gravity::actual_solve_with_mlmg() 6 0.1342 0.1342 0.1342 34.81% MLMG::solve() 6 0.122 0.122 0.122 31.65% MLMG::oneIter() 36 0.1148 0.1148 0.1148 29.78% MLMG::mgVcycle() 36 0.114 0.114 0.114 29.58% Castro::construct_ctu_hydro_source() 5 0.1031 0.1031 0.1031 26.75% MLCellLinOp::smooth() 720 0.05851 0.05851 0.05851 15.18% Amr::init() 1 0.0481 0.0481 0.0481 12.48% Amr::restart() 1 0.0481 0.0481 0.0481 12.48% MLCellLinOp::applyBC() 1946 0.04147 0.04147 0.04147 10.76% AmrLevel::restart() 1 0.04044 0.04044 0.04044 10.49% StateData::restartDoit() 4 0.04036 0.04036 0.04036 10.47% VisMF::Read() 3 0.04023 0.04023 0.04023 10.44% MLMG::mgVcycle_bottom 36 0.0348 0.0348 0.0348 9.03% MLMG::actualBottomSolve() 36 0.03479 0.03479 0.03479 9.03% MLCGSolver::bicgstab 36 0.03444 0.03444 0.03444 8.94% Amr::writePlotFile() 1 0.03166 0.03166 0.03166 8.22% VisMF::Write(FabArray) 1 0.02995 0.02995 0.02995 7.77% Castro::clean_state() 30 0.02771 0.02771 0.02771 7.19% MLPoisson::Fsmooth() 1440 0.02726 0.02726 0.02726 7.07% FillPatchIterator::Initialize 20 0.02134 0.02134 0.02134 5.54% FillPatchSingleLevel 20 0.02057 0.02057 0.02057 5.34% StateDataPhysBCFunct::() 20 0.01857 0.01857 0.01857 4.82% MLCellLinOp::apply() 500 0.01565 0.01565 0.01565 4.06% MLMG::mgVcycle_down::0 36 0.01527 0.01527 0.01527 3.96% MLMG::mgVcycle_up::0 36 0.01311 0.01311 0.01311 3.40% Castro::computeTemp() 30 0.01285 0.01285 0.01285 3.34% StateData::FillBoundary(geom) 160 0.01136 0.01136 0.01136 2.95% Castro::initialize_do_advance() 5 0.01133 0.01133 0.01133 2.94% MLPoisson::define() 6 0.0098 0.0098 0.0098 2.54% MultiFab::Dot() 484 0.009415 0.009415 0.009415 2.44% MLCellLinOp::correctionResidual() 216 0.009109 0.009109 0.009109 2.36% Gravity::get_new_grav_vector() 5 0.008156 0.008156 0.008156 2.12% Castro::construct_old_gravity() 5 0.007873 0.007873 0.007873 2.04% MLMG:computeResOfCorrection() 180 0.007867 0.007867 0.007867 2.04% Gravity::get_old_grav_vector() 5 0.007815 0.007815 0.007815 2.03% Castro::initialize_advance() 5 0.00779 0.00779 0.00779 2.02% Castro::normalize_species() 30 0.007707 0.007707 0.007707 2.00% MLMG::mgVcycle_down::1 36 0.007609 0.007609 0.007609 1.97% Castro::do_new_sources() 5 0.007473 0.007473 0.007473 1.94% MLMG::mgVcycle_down::2 36 0.007388 0.007388 0.007388 1.92% MLMG::mgVcycle_down::3 36 0.00699 0.00699 0.00699 1.81% FabArray::FillBoundary() 1766 0.006948 0.006948 0.006948 1.80% MLCellLinOp::defineAuxData() 6 0.006853 0.006853 0.006853 1.78% MLMG::mgVcycle_down::4 36 0.006708 0.006708 0.006708 1.74% FabArray::setVal() 537 0.006684 0.006684 0.006684 1.73% FillBoundary_nowait() 1766 0.006547 0.006547 0.006547 1.70% Castro::do_old_sources() 5 0.006425 0.006425 0.006425 1.67% FabArray::ParallelCopy() 380 0.006389 0.006389 0.006389 1.66% FabArray::ParallelCopy_nowait() 380 0.006271 0.006271 0.006271 1.63% CGSolver::sxay() 690 0.00627 0.00627 0.00627 1.63% MultiFab::LinComb() 690 0.006111 0.006111 0.006111 1.59% Castro::expand_state() 5 0.005842 0.005842 0.005842 1.52% MLMG::mgVcycle_up::2 36 0.005672 0.005672 0.005672 1.47% MLCGSolver::ParallelAllReduce 659 0.005642 0.005642 0.005642 1.46% MLMG::mgVcycle_up::1 36 0.005614 0.005614 0.005614 1.46% MLMG::addInterpCorrection() 180 0.005495 0.005495 0.005495 1.43% MLMG::mgVcycle_up::3 36 0.005409 0.005409 0.005409 1.40% MLMG::mgVcycle_up::4 36 0.005357 0.005357 0.005357 1.39% amrex::average_down 180 0.005153 0.005153 0.005153 1.34% MLPoisson::Fapply() 500 0.005 0.005 0.005 1.30% Gravity::fill_multipole_BCs() 6 0.00452 0.00452 0.00452 1.17% Castro::enforce_min_density() 30 0.004443 0.004443 0.004443 1.15% Castro::post_restart() 1 0.003894 0.003894 0.003894 1.01% Castro::post_timestep() 5 0.003867 0.003867 0.003867 1.00% Gravity::multilevel_solve_for_new_phi() 1 0.003771 0.003771 0.003771 0.98% Gravity::actual_multilevel_solve() 1 0.003753 0.003753 0.003753 0.97% MLCellLinOp::solutionResidual() 42 0.003203 0.003203 0.003203 0.83% Castro::estTimeStep() 10 0.002973 0.002973 0.002973 0.77% MLMG::prepareForSolve() 6 0.002902 0.002902 0.002902 0.75% MultiFab::Xpay() 258 0.002839 0.002839 0.002839 0.74% MLCellLinOp::defineBC() 6 0.00279 0.00279 0.00279 0.72% Castro::reset_internal_energy(MultiFab) 30 0.002769 0.002769 0.002769 0.72% Castro::enforce_speed_limit() 30 0.00267 0.00267 0.00267 0.69% MLMG::computeResidual() 36 0.002656 0.002656 0.002656 0.69% BndryData::define() 6 0.002641 0.002641 0.002641 0.69% Castro::construct_new_source() 25 0.001906 0.001906 0.001906 0.49% Castro::construct_new_gravity_source() 5 0.001741 0.001741 0.001741 0.45% Castro::construct_old_source() 25 0.00172 0.00172 0.00172 0.45% Castro::construct_old_gravity_source() 5 0.001599 0.001599 0.001599 0.41% Castro::computeNewDt() 5 0.001593 0.001593 0.001593 0.41% Castro::reset_internal_energy(Fab) 240 0.001065 0.001065 0.001065 0.28% MLMG::ResNormInf() 42 0.0009323 0.0009323 0.0009323 0.24% Castro::apply_source_to_state() 10 0.0009258 0.0009258 0.0009258 0.24% MultiFab::Saxpy() 10 0.0009206 0.0009206 0.0009206 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.00088 0.00088 0.00088 0.23% MLCellLinOp::setLevelBC() 6 0.0008287 0.0008287 0.0008287 0.22% MLMG::getGradSolution() 6 0.0007718 0.0007718 0.0007718 0.20% MLCellLinOp::compGrad() 6 0.0007687 0.0007687 0.0007687 0.20% FabArrayBase::getCPC() 632 0.0007556 0.0007556 0.0007556 0.20% MultiFab::Add() 36 0.0007157 0.0007157 0.0007157 0.19% FabArray::mult() 22 0.0006438 0.0006438 0.0006438 0.17% MLPoisson::prepareForSolve() 6 0.0006403 0.0006403 0.0006403 0.17% MLCellLinOp::prepareForSolve() 6 0.000635 0.000635 0.000635 0.16% FabArray::setDomainBndry() 20 0.0006334 0.0006334 0.0006334 0.16% Castro::check_for_nan() 10 0.0006013 0.0006013 0.0006013 0.16% MultiFab::contains_nan() 10 0.0005925 0.0005925 0.0005925 0.15% MLMG::computeMLResidual() 6 0.0005689 0.0005689 0.0005689 0.15% Gravity::update_max_rhs() 6 0.0004405 0.0004405 0.0004405 0.11% Amr::InitAmr() 1 0.0003965 0.0003965 0.0003965 0.10% FabArrayBase::CPC::define() 244 0.0003963 0.0003963 0.0003963 0.10% FabArrayBase::getFB() 1766 0.0003098 0.0003098 0.0003098 0.08% Gravity::swapTimeLevels() 5 0.0002277 0.0002277 0.0002277 0.06% Castro::create_source_corrector() 5 0.0001961 0.0001961 0.0001961 0.05% MultiFab::Copy() 6 0.0001776 0.0001776 0.0001776 0.05% Castro::buildMetrics() 1 0.0001471 0.0001471 0.0001471 0.04% MLMG::MLResNormInf() 6 0.0001468 0.0001468 0.0001468 0.04% MultiFab::max() 6 0.000136 0.000136 0.000136 0.04% MLLinOp::define() 6 0.0001359 0.0001359 0.0001359 0.04% MLLinOp::defineGrids() 6 0.0001148 0.0001148 0.0001148 0.03% MLMG::MLRhsNormInf() 6 0.000114 0.000114 0.000114 0.03% Castro::finalize_advance() 5 6.966e-05 6.966e-05 6.966e-05 0.02% FabArrayBase::FB::FB() 26 5.642e-05 5.642e-05 5.642e-05 0.01% Castro::initMFs() 1 2.843e-05 2.843e-05 2.843e-05 0.01% Castro::swap_state_time_levels() 5 2.778e-05 2.778e-05 2.778e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.74e-05 2.74e-05 2.74e-05 0.01% Amr::writeSmallPlotFile() 1 2.634e-05 2.634e-05 2.634e-05 0.01% makeSFC 30 2.051e-05 2.051e-05 2.051e-05 0.01% Castro::finalize_do_advance() 5 1.792e-05 1.792e-05 1.792e-05 0.00% Amr::initSubcycle() 1 8.43e-06 8.43e-06 8.43e-06 0.00% DistributionMapping::Distribute() 31 8.219e-06 8.219e-06 8.219e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.969e-06 3.969e-06 3.969e-06 0.00% Gravity::set_mass_offset() 6 2.295e-06 2.295e-06 2.295e-06 0.00% Castro::retry_advance_ctu() 5 1.756e-06 1.756e-06 1.756e-06 0.00% Castro::FluxRegCrseInit 5 1.54e-06 1.54e-06 1.54e-06 0.00% Castro::FluxRegFineAdd() 5 1.102e-06 1.102e-06 1.102e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.034e-06 1.034e-06 1.034e-06 0.00% AmrLevel::AmrLevel() 1 1.006e-06 1.006e-06 1.006e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.08-12-g4f639294606d) finalized