Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-29-g225c605680e8) initialized Starting run at 10:11:58 UTC on 2023-01-30. Successfully read inputs file ... Castro git describe: 23.01-22-g69c150804 AMReX git describe: 23.01-29-g225c60568 Microphysics git describe: 23.01-7-g5e1d020c reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.05796072 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.03344252 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.04534865 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.050531259 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.049160084 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.064545695 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.074266987 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.057361223 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.059717738 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.05069936 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.059592457 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.058730078 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.063655236 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.057258895 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.033264165 seconds Ending run at 10:11:59 UTC on 2023-01-30. Run time = 0.864779392 Run time without initialization = 0.724800303 Average number of zones advanced per microsecond: 3.617 Average number of zones advanced per microsecond per rank: 3.617 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.8648 ... 0.8648 ... 0.8648 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2333 0.2333 0.2333 26.98% Castro::construct_ctu_hydro_source() 10 0.216 0.216 0.216 24.97% MLCellLinOp::applyBC() 4433 0.07519 0.07519 0.07519 8.69% MLPoisson::Fsmooth() 3280 0.03246 0.03246 0.03246 3.75% FillBoundary_nowait() 4023 0.03245 0.03245 0.03245 3.75% StateData::FillBoundary(geom) 328 0.0234 0.0234 0.0234 2.71% amrex::Dot() 1114 0.02045 0.02045 0.02045 2.36% StateDataPhysBCFunct::() 41 0.01719 0.01719 0.01719 1.99% amrex::Copy() 1029 0.01487 0.01487 0.01487 1.72% Castro::normalize_species() 62 0.01462 0.01462 0.01462 1.69% FabArray::norminf() 743 0.01439 0.01439 0.01439 1.66% Castro::computeTemp() 63 0.01423 0.01423 0.01423 1.65% FabArray::setVal() 1144 0.01327 0.01327 0.01327 1.53% FabArray::ParallelCopy_nowait() 861 0.01316 0.01316 0.01316 1.52% Castro::enforce_min_density() 62 0.01097 0.01097 0.01097 1.27% MLPoisson::Fapply() 1142 0.01047 0.01047 0.01047 1.21% MLCellLinOp::defineAuxData() 11 0.009756 0.009756 0.009756 1.13% FabArray::Saxpy() 813 0.008143 0.008143 0.008143 0.94% FabArray::Xpay() 821 0.008119 0.008119 0.008119 0.94% MLMG::addInterpCorrection() 410 0.006609 0.006609 0.006609 0.76% Gravity::fill_multipole_BCs() 11 0.006223 0.006223 0.006223 0.72% amrex::average_down 410 0.005853 0.005853 0.005853 0.68% FabArray::LinComb() 557 0.004548 0.004548 0.004548 0.53% amrex::Add() 164 0.0043 0.0043 0.0043 0.50% Castro::estTimeStep() 21 0.004289 0.004289 0.004289 0.50% Castro::reset_internal_energy(MultiFab) 63 0.004056 0.004056 0.004056 0.47% BndryData::define() 11 0.003704 0.003704 0.003704 0.43% Amr::checkPoint() 3 0.003489 0.003489 0.003489 0.40% Castro::construct_new_gravity_source() 10 0.002855 0.002855 0.002855 0.33% Castro::do_advance_ctu() 10 0.002677 0.002677 0.002677 0.31% Amr::writePlotFile() 2 0.002112 0.002112 0.002112 0.24% MLCGSolver::bicgstab 82 0.002023 0.002023 0.002023 0.23% Castro::construct_old_gravity_source() 10 0.001956 0.001956 0.001956 0.23% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.00168 0.00168 0.00168 0.19% Castro::reset_internal_energy(Fab) 504 0.001581 0.001581 0.001581 0.18% MLCellLinOp::setLevelBC() 11 0.001392 0.001392 0.001392 0.16% Gravity::actual_solve_with_mlmg() 11 0.001363 0.001363 0.001363 0.16% FabArray::mult() 43 0.001337 0.001337 0.001337 0.15% Castro::initData() 1 0.001304 0.001304 0.001304 0.15% FabArray::setDomainBndry() 41 0.001298 0.001298 0.001298 0.15% MultiFab::contains_nan() 20 0.001186 0.001186 0.001186 0.14% MLCellLinOp::smooth() 1640 0.00115 0.00115 0.00115 0.13% MLCellLinOp::prepareForSolve() 11 0.001114 0.001114 0.001114 0.13% Castro::enforce_speed_limit() 62 0.001057 0.001057 0.001057 0.12% MLCellLinOp::compGrad() 11 0.0009125 0.0009125 0.0009125 0.11% MLMG::prepareForSolve() 11 0.0008277 0.0008277 0.0008277 0.10% FabArray::FillBoundary() 4023 0.0007873 0.0007873 0.0007873 0.09% Castro::create_source_corrector() 10 0.0007747 0.0007747 0.0007747 0.09% FabArrayBase::getCPC() 1323 0.0007477 0.0007477 0.0007477 0.09% FabArrayBase::CPC::define() 454 0.0006799 0.0006799 0.0006799 0.08% Gravity::get_new_grav_vector() 11 0.0006002 0.0006002 0.0006002 0.07% FabArrayBase::getFB() 4023 0.0005864 0.0005864 0.0005864 0.07% Gravity::get_old_grav_vector() 10 0.0005247 0.0005247 0.0005247 0.06% Amr::InitAmr() 1 0.000494 0.000494 0.000494 0.06% MLCellLinOp::apply() 1142 0.0004727 0.0004727 0.0004727 0.05% MLMG::mgVcycle() 82 0.000378 0.000378 0.000378 0.04% Amr::coarseTimeStep() 10 0.000377 0.000377 0.000377 0.04% MLCGSolver::ParallelAllReduce 1514 0.0002943 0.0002943 0.0002943 0.03% main() 1 0.0002836 0.0002836 0.0002836 0.03% MultiFab::max() 11 0.0002567 0.0002567 0.0002567 0.03% FabArray::ParallelCopy() 861 0.00024 0.00024 0.00024 0.03% MLCellLinOp::correctionResidual() 492 0.0002081 0.0002081 0.0002081 0.02% FillPatchIterator::Initialize 41 0.0002055 0.0002055 0.0002055 0.02% MLCellLinOp::defineBC() 11 0.0001964 0.0001964 0.0001964 0.02% Gravity::solve_for_phi() 10 0.0001581 0.0001581 0.0001581 0.02% Amr::timeStep() 10 0.0001472 0.0001472 0.0001472 0.02% MLLinOp::defineGrids() 11 0.0001454 0.0001454 0.0001454 0.02% Castro::subcycle_advance_ctu() 10 0.0001335 0.0001335 0.0001335 0.02% StateData::checkPoint() 12 0.0001322 0.0001322 0.0001322 0.02% Castro::advance() 10 0.0001238 0.0001238 0.0001238 0.01% MLMG:computeResOfCorrection() 410 0.0001104 0.0001104 0.0001104 0.01% Gravity::update_max_rhs() 11 0.0001052 0.0001052 0.0001052 0.01% MLMG::mgVcycle_down::0 82 9.664e-05 9.664e-05 9.664e-05 0.01% Castro::finalize_advance() 10 9.216e-05 9.216e-05 9.216e-05 0.01% MLMG::actualBottomSolve() 82 8.838e-05 8.838e-05 8.838e-05 0.01% FabArrayBase::FB::FB() 56 8.657e-05 8.657e-05 8.657e-05 0.01% Castro::clean_state() 62 8.177e-05 8.177e-05 8.177e-05 0.01% MLMG::mgVcycle_down::1 82 8.008e-05 8.008e-05 8.008e-05 0.01% Castro::Castro() 1 7.885e-05 7.885e-05 7.885e-05 0.01% MLMG::mgVcycle_down::2 82 7.544e-05 7.544e-05 7.544e-05 0.01% Castro::expand_state() 10 7.518e-05 7.518e-05 7.518e-05 0.01% MLMG::solve() 11 7.445e-05 7.445e-05 7.445e-05 0.01% AmrLevel::checkPoint() 3 7.325e-05 7.325e-05 7.325e-05 0.01% MLMG::mgVcycle_down::3 82 7.181e-05 7.181e-05 7.181e-05 0.01% MLMG::mgVcycle_down::4 82 7.113e-05 7.113e-05 7.113e-05 0.01% Castro::initialize_advance() 10 6.396e-05 6.396e-05 6.396e-05 0.01% MLMG::mgVcycle_up::4 82 6.183e-05 6.183e-05 6.183e-05 0.01% MLMG::mgVcycle_up::0 82 5.549e-05 5.549e-05 5.549e-05 0.01% Castro::initialize_do_advance() 10 5.532e-05 5.532e-05 5.532e-05 0.01% MLMG::mgVcycle_up::1 82 5.327e-05 5.327e-05 5.327e-05 0.01% MLMG::mgVcycle_up::3 82 5.276e-05 5.276e-05 5.276e-05 0.01% MLMG::oneIter() 82 5.199e-05 5.199e-05 5.199e-05 0.01% MLMG::mgVcycle_up::2 82 5.127e-05 5.127e-05 5.127e-05 0.01% MLCellLinOp::solutionResidual() 93 5.069e-05 5.069e-05 5.069e-05 0.01% Castro::finalize_do_advance() 10 4.393e-05 4.393e-05 4.393e-05 0.01% Castro::construct_new_source() 50 4.183e-05 4.183e-05 4.183e-05 0.00% Castro::enforce_consistent_e() 1 3.414e-05 3.414e-05 3.414e-05 0.00% MLMG::ResNormInf() 93 3.396e-05 3.396e-05 3.396e-05 0.00% Castro::post_timestep() 10 3.372e-05 3.372e-05 3.372e-05 0.00% Castro::swap_state_time_levels() 10 3.35e-05 3.35e-05 3.35e-05 0.00% MLMG::mgVcycle_bottom 82 3.187e-05 3.187e-05 3.187e-05 0.00% Gravity::solve_phi_with_mlmg() 11 3.141e-05 3.141e-05 3.141e-05 0.00% MLMG::computeResidual() 82 3.059e-05 3.059e-05 3.059e-05 0.00% FillPatchSingleLevel 41 2.786e-05 2.786e-05 2.786e-05 0.00% StateData::define() 4 2.553e-05 2.553e-05 2.553e-05 0.00% makeSFC 55 2.539e-05 2.539e-05 2.539e-05 0.00% Amr::writeSmallPlotFile() 1 2.492e-05 2.492e-05 2.492e-05 0.00% Castro::construct_new_gravity() 10 2.403e-05 2.403e-05 2.403e-05 0.00% MLPoisson::define() 11 2.217e-05 2.217e-05 2.217e-05 0.00% Castro::initMFs() 1 2.008e-05 2.008e-05 2.008e-05 0.00% Amr::FinalizeInit() 1 1.987e-05 1.987e-05 1.987e-05 0.00% Amr::defBaseLevel() 1 1.953e-05 1.953e-05 1.953e-05 0.00% Castro::do_new_sources() 10 1.659e-05 1.659e-05 1.659e-05 0.00% Castro::buildMetrics() 1 1.639e-05 1.639e-05 1.639e-05 0.00% Castro::construct_old_source() 50 1.628e-05 1.628e-05 1.628e-05 0.00% DistributionMapping::Distribute() 56 1.581e-05 1.581e-05 1.581e-05 0.00% Castro::do_old_sources() 10 1.508e-05 1.508e-05 1.508e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.423e-05 1.423e-05 1.423e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.402e-05 1.402e-05 1.402e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.31e-05 1.31e-05 1.31e-05 0.00% Castro::check_for_nan() 20 1.216e-05 1.216e-05 1.216e-05 0.00% Castro::apply_source_to_state() 20 1.053e-05 1.053e-05 1.053e-05 0.00% Amr::InitializeInit() 1 1.039e-05 1.039e-05 1.039e-05 0.00% Castro::post_init() 1 1.013e-05 1.013e-05 1.013e-05 0.00% MLLinOp::define() 11 9.509e-06 9.509e-06 9.509e-06 0.00% Castro::construct_old_gravity() 10 8.93e-06 8.93e-06 8.93e-06 0.00% Gravity::swapTimeLevels() 10 8.7e-06 8.7e-06 8.7e-06 0.00% Amr::initSubcycle() 1 8.519e-06 8.519e-06 8.519e-06 0.00% Castro::computeNewDt() 9 7.704e-06 7.704e-06 7.704e-06 0.00% MLMG::computeMLResidual() 11 7.689e-06 7.689e-06 7.689e-06 0.00% MLPoisson::prepareForSolve() 11 7.599e-06 7.599e-06 7.599e-06 0.00% Gravity::actual_multilevel_solve() 1 7.47e-06 7.47e-06 7.47e-06 0.00% MLMG::getGradSolution() 11 5.447e-06 5.447e-06 5.447e-06 0.00% Gravity::set_mass_offset() 11 4.673e-06 4.673e-06 4.673e-06 0.00% AmrLevel::checkPointPost() 3 4.397e-06 4.397e-06 4.397e-06 0.00% Castro::retry_advance_ctu() 10 4.114e-06 4.114e-06 4.114e-06 0.00% MLMG::MLRhsNormInf() 11 3.831e-06 3.831e-06 3.831e-06 0.00% MLMG::MLResNormInf() 11 3.523e-06 3.523e-06 3.523e-06 0.00% Castro::computeInitialDt() 2 3.016e-06 3.016e-06 3.016e-06 0.00% Castro::FluxRegCrseInit 10 2.998e-06 2.998e-06 2.998e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.787e-06 2.787e-06 2.787e-06 0.00% Amr::init() 1 2.269e-06 2.269e-06 2.269e-06 0.00% Castro::FluxRegFineAdd() 10 2.035e-06 2.035e-06 2.035e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.006e-06 2.006e-06 2.006e-06 0.00% AmrLevel::checkPointPre() 3 1.661e-06 1.661e-06 1.661e-06 0.00% Castro::post_regrid() 1 1.228e-06 1.228e-06 1.228e-06 0.00% Amr::initialInit() 1 9.12e-07 9.12e-07 9.12e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8648 0.8648 0.8648 100.00% Amr::coarseTimeStep() 10 0.6913 0.6913 0.6913 79.94% Amr::timeStep() 10 0.5737 0.5737 0.5737 66.34% Castro::advance() 10 0.5665 0.5665 0.5665 65.50% Castro::subcycle_advance_ctu() 10 0.5542 0.5542 0.5542 64.08% Castro::do_advance_ctu() 10 0.554 0.554 0.554 64.06% Gravity::solve_phi_with_mlmg() 11 0.2823 0.2823 0.2823 32.64% Gravity::actual_solve_with_mlmg() 11 0.2756 0.2756 0.2756 31.87% Castro::construct_new_gravity() 10 0.2577 0.2577 0.2577 29.80% MLMG::solve() 11 0.2553 0.2553 0.2553 29.52% Gravity::solve_for_phi() 10 0.2423 0.2423 0.2423 28.02% MLMG::oneIter() 82 0.2411 0.2411 0.2411 27.88% MLMG::mgVcycle() 82 0.2375 0.2375 0.2375 27.46% VisMF::Write(FabArray) 11 0.2333 0.2333 0.2333 26.98% Castro::construct_ctu_hydro_source() 10 0.216 0.216 0.216 24.97% Amr::checkPoint() 3 0.1727 0.1727 0.1727 19.97% AmrLevel::checkPoint() 3 0.1692 0.1692 0.1692 19.57% StateData::checkPoint() 12 0.1692 0.1692 0.1692 19.56% Amr::init() 1 0.1393 0.1393 0.1393 16.11% MLCellLinOp::smooth() 1640 0.117 0.117 0.117 13.53% MLCellLinOp::applyBC() 4433 0.1091 0.1091 0.1091 12.61% MLMG::mgVcycle_bottom 82 0.07348 0.07348 0.07348 8.50% MLMG::actualBottomSolve() 82 0.07344 0.07344 0.07344 8.49% MLCGSolver::bicgstab 82 0.07274 0.07274 0.07274 8.41% Amr::writePlotFile() 2 0.0669 0.0669 0.0669 7.74% Amr::initialInit() 1 0.04775 0.04775 0.04775 5.52% FillPatchIterator::Initialize 41 0.04612 0.04612 0.04612 5.33% Castro::clean_state() 62 0.04578 0.04578 0.04578 5.29% FillPatchSingleLevel 41 0.04462 0.04462 0.04462 5.16% Amr::FinalizeInit() 1 0.04362 0.04362 0.04362 5.04% Castro::post_init() 1 0.04231 0.04231 0.04231 4.89% StateDataPhysBCFunct::() 41 0.0406 0.0406 0.0406 4.69% Gravity::multilevel_solve_for_new_phi() 1 0.04043 0.04043 0.04043 4.68% Gravity::actual_multilevel_solve() 1 0.04042 0.04042 0.04042 4.67% MLCellLinOp::apply() 1142 0.0362 0.0362 0.0362 4.19% MLMG::mgVcycle_down::0 82 0.03433 0.03433 0.03433 3.97% FabArray::FillBoundary() 4023 0.03391 0.03391 0.03391 3.92% FillBoundary_nowait() 4023 0.03312 0.03312 0.03312 3.83% MLPoisson::Fsmooth() 3280 0.03246 0.03246 0.03246 3.75% MLMG::mgVcycle_up::0 82 0.02593 0.02593 0.02593 3.00% Castro::initialize_do_advance() 10 0.02362 0.02362 0.02362 2.73% StateData::FillBoundary(geom) 328 0.0234 0.0234 0.0234 2.71% MLCellLinOp::correctionResidual() 492 0.02226 0.02226 0.02226 2.57% amrex::Dot() 1114 0.02045 0.02045 0.02045 2.36% Castro::computeTemp() 63 0.01987 0.01987 0.01987 2.30% MLMG:computeResOfCorrection() 410 0.01963 0.01963 0.01963 2.27% Gravity::get_new_grav_vector() 11 0.01694 0.01694 0.01694 1.96% MLPoisson::define() 11 0.01619 0.01619 0.01619 1.87% Castro::expand_state() 10 0.01589 0.01589 0.01589 1.84% MLMG::mgVcycle_down::1 82 0.0157 0.0157 0.0157 1.81% amrex::Copy() 1029 0.01487 0.01487 0.01487 1.72% Castro::normalize_species() 62 0.01462 0.01462 0.01462 1.69% Castro::construct_old_gravity() 10 0.01461 0.01461 0.01461 1.69% Gravity::get_old_grav_vector() 10 0.0146 0.0146 0.0146 1.69% MLMG::mgVcycle_down::2 82 0.01457 0.01457 0.01457 1.68% FabArray::norminf() 743 0.01439 0.01439 0.01439 1.66% MLMG::mgVcycle_down::3 82 0.01426 0.01426 0.01426 1.65% FabArray::ParallelCopy() 861 0.01421 0.01421 0.01421 1.64% MLMG::mgVcycle_down::4 82 0.01402 0.01402 0.01402 1.62% FabArray::ParallelCopy_nowait() 861 0.01397 0.01397 0.01397 1.62% FabArray::setVal() 1144 0.01327 0.01327 0.01327 1.53% Castro::do_new_sources() 10 0.01267 0.01267 0.01267 1.46% MLCGSolver::ParallelAllReduce 1514 0.01225 0.01225 0.01225 1.42% MLMG::addInterpCorrection() 410 0.01174 0.01174 0.01174 1.36% Castro::initialize_advance() 10 0.01157 0.01157 0.01157 1.34% MLMG::mgVcycle_up::4 82 0.01146 0.01146 0.01146 1.33% MLMG::mgVcycle_up::1 82 0.01139 0.01139 0.01139 1.32% MLCellLinOp::defineAuxData() 11 0.01111 0.01111 0.01111 1.28% MLMG::mgVcycle_up::2 82 0.0111 0.0111 0.0111 1.28% Castro::enforce_min_density() 62 0.01097 0.01097 0.01097 1.27% amrex::average_down 410 0.01094 0.01094 0.01094 1.26% MLMG::mgVcycle_up::3 82 0.01087 0.01087 0.01087 1.26% MLPoisson::Fapply() 1142 0.01047 0.01047 0.01047 1.21% Castro::do_old_sources() 10 0.00947 0.00947 0.00947 1.10% FabArray::Saxpy() 813 0.008143 0.008143 0.008143 0.94% FabArray::Xpay() 821 0.008119 0.008119 0.008119 0.94% MLCellLinOp::solutionResidual() 93 0.007134 0.007134 0.007134 0.82% Castro::post_timestep() 10 0.00707 0.00707 0.00707 0.82% Gravity::fill_multipole_BCs() 11 0.006466 0.006466 0.006466 0.75% MLMG::computeResidual() 82 0.006167 0.006167 0.006167 0.71% Castro::reset_internal_energy(MultiFab) 63 0.005637 0.005637 0.005637 0.65% MLCellLinOp::defineBC() 11 0.00485 0.00485 0.00485 0.56% BndryData::define() 11 0.004653 0.004653 0.004653 0.54% MLMG::prepareForSolve() 11 0.004642 0.004642 0.004642 0.54% FabArray::LinComb() 557 0.004548 0.004548 0.004548 0.53% amrex::Add() 164 0.0043 0.0043 0.0043 0.50% Castro::estTimeStep() 21 0.004289 0.004289 0.004289 0.50% Amr::InitializeInit() 1 0.004133 0.004133 0.004133 0.48% Amr::defBaseLevel() 1 0.004123 0.004123 0.004123 0.48% Castro::initData() 1 0.00364 0.00364 0.00364 0.42% Castro::construct_new_source() 50 0.002897 0.002897 0.002897 0.33% Castro::construct_new_gravity_source() 10 0.002855 0.002855 0.002855 0.33% MLMG::ResNormInf() 93 0.002087 0.002087 0.002087 0.24% Castro::construct_old_source() 50 0.001973 0.001973 0.001973 0.23% Castro::construct_old_gravity_source() 10 0.001956 0.001956 0.001956 0.23% Castro::computeNewDt() 9 0.001856 0.001856 0.001856 0.21% Castro::apply_source_to_state() 20 0.001822 0.001822 0.001822 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.00168 0.00168 0.00168 0.19% Castro::reset_internal_energy(Fab) 504 0.001581 0.001581 0.001581 0.18% FabArrayBase::getCPC() 1323 0.001428 0.001428 0.001428 0.17% MLCellLinOp::setLevelBC() 11 0.001392 0.001392 0.001392 0.16% MLMG::getGradSolution() 11 0.001376 0.001376 0.001376 0.16% MLCellLinOp::compGrad() 11 0.00137 0.00137 0.00137 0.16% FabArray::mult() 43 0.001337 0.001337 0.001337 0.15% FabArray::setDomainBndry() 41 0.001298 0.001298 0.001298 0.15% Castro::check_for_nan() 20 0.001198 0.001198 0.001198 0.14% MultiFab::contains_nan() 20 0.001186 0.001186 0.001186 0.14% Castro::post_regrid() 1 0.001127 0.001127 0.001127 0.13% MLPoisson::prepareForSolve() 11 0.001122 0.001122 0.001122 0.13% MLCellLinOp::prepareForSolve() 11 0.001114 0.001114 0.001114 0.13% Castro::enforce_speed_limit() 62 0.001057 0.001057 0.001057 0.12% MLMG::computeMLResidual() 11 0.001005 0.001005 0.001005 0.12% Castro::computeInitialDt() 2 0.0008398 0.0008398 0.0008398 0.10% Gravity::update_max_rhs() 11 0.0008083 0.0008083 0.0008083 0.09% Castro::create_source_corrector() 10 0.0007747 0.0007747 0.0007747 0.09% FabArrayBase::CPC::define() 454 0.0006799 0.0006799 0.0006799 0.08% FabArrayBase::getFB() 4023 0.0006729 0.0006729 0.0006729 0.08% Castro::finalize_advance() 10 0.0006105 0.0006105 0.0006105 0.07% Amr::InitAmr() 1 0.0005025 0.0005025 0.0005025 0.06% Gravity::swapTimeLevels() 10 0.0004402 0.0004402 0.0004402 0.05% Castro::Castro() 1 0.0004192 0.0004192 0.0004192 0.05% MLMG::MLResNormInf() 11 0.0002939 0.0002939 0.0002939 0.03% MultiFab::max() 11 0.0002567 0.0002567 0.0002567 0.03% MLMG::MLRhsNormInf() 11 0.0002212 0.0002212 0.0002212 0.03% MLLinOp::define() 11 0.00021 0.00021 0.00021 0.02% MLLinOp::defineGrids() 11 0.0002005 0.0002005 0.0002005 0.02% Castro::buildMetrics() 1 0.0001514 0.0001514 0.0001514 0.02% FabArrayBase::FB::FB() 56 8.657e-05 8.657e-05 8.657e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.312e-05 5.312e-05 5.312e-05 0.01% Castro::finalize_do_advance() 10 4.393e-05 4.393e-05 4.393e-05 0.01% makeSFC 55 4.002e-05 4.002e-05 4.002e-05 0.00% AmrLevel::AmrLevel(dm) 1 3.955e-05 3.955e-05 3.955e-05 0.00% Castro::enforce_consistent_e() 1 3.414e-05 3.414e-05 3.414e-05 0.00% Castro::swap_state_time_levels() 10 3.35e-05 3.35e-05 3.35e-05 0.00% StateData::define() 4 2.553e-05 2.553e-05 2.553e-05 0.00% Amr::writeSmallPlotFile() 1 2.492e-05 2.492e-05 2.492e-05 0.00% Castro::initMFs() 1 2.008e-05 2.008e-05 2.008e-05 0.00% DistributionMapping::Distribute() 56 1.581e-05 1.581e-05 1.581e-05 0.00% Amr::initSubcycle() 1 8.519e-06 8.519e-06 8.519e-06 0.00% Gravity::set_mass_offset() 11 4.673e-06 4.673e-06 4.673e-06 0.00% AmrLevel::checkPointPost() 3 4.397e-06 4.397e-06 4.397e-06 0.00% Castro::retry_advance_ctu() 10 4.114e-06 4.114e-06 4.114e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.967e-06 3.967e-06 3.967e-06 0.00% Castro::FluxRegCrseInit 10 2.998e-06 2.998e-06 2.998e-06 0.00% Castro::FluxRegFineAdd() 10 2.035e-06 2.035e-06 2.035e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.006e-06 2.006e-06 2.006e-06 0.00% AmrLevel::checkPointPre() 3 1.661e-06 1.661e-06 1.661e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-29-g225c605680e8) finalized Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-29-g225c605680e8) initialized Starting run at 10:12:00 UTC on 2023-01-30. Successfully read inputs file ... Castro git describe: 23.01-22-g69c150804 AMReX git describe: 23.01-29-g225c60568 Microphysics git describe: 23.01-7-g5e1d020c reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.481200259 Restart time = 0.046982632 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.048438449 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.048140489 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.057054658 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.057227957 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.062721524 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.032868173 seconds Ending run at 10:12:00 UTC on 2023-01-30. Run time = 0.354425495 Run time without initialization = 0.306831884 Average number of zones advanced per microsecond: 4.272 Average number of zones advanced per microsecond per rank: 4.272 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.3545 ... 0.3545 ... 0.3545 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0986 0.0986 0.0986 27.81% VisMF::Read() 3 0.04086 0.04086 0.04086 11.53% MLCellLinOp::applyBC() 1946 0.03239 0.03239 0.03239 9.14% VisMF::Write(FabArray) 1 0.03149 0.03149 0.03149 8.89% MLPoisson::Fsmooth() 1440 0.01398 0.01398 0.01398 3.94% FillBoundary_nowait() 1766 0.01319 0.01319 0.01319 3.72% StateData::FillBoundary(geom) 160 0.01145 0.01145 0.01145 3.23% amrex::Dot() 484 0.00869 0.00869 0.00869 2.45% amrex::Copy() 463 0.006932 0.006932 0.006932 1.96% Castro::normalize_species() 30 0.00649 0.00649 0.00649 1.83% Castro::computeTemp() 30 0.006431 0.006431 0.006431 1.81% FabArray::setVal() 537 0.006221 0.006221 0.006221 1.75% FabArray::norminf() 326 0.006162 0.006162 0.006162 1.74% FabArray::ParallelCopy_nowait() 380 0.005888 0.005888 0.005888 1.66% StateDataPhysBCFunct::() 20 0.005854 0.005854 0.005854 1.65% MLCellLinOp::defineAuxData() 6 0.005198 0.005198 0.005198 1.47% MLPoisson::Fapply() 500 0.004459 0.004459 0.004459 1.26% Castro::enforce_min_density() 30 0.003631 0.003631 0.003631 1.02% FabArray::Saxpy() 355 0.003588 0.003588 0.003588 1.01% FabArray::Xpay() 361 0.003516 0.003516 0.003516 0.99% Gravity::fill_multipole_BCs() 6 0.003091 0.003091 0.003091 0.87% MLMG::addInterpCorrection() 180 0.00281 0.00281 0.00281 0.79% Castro::estTimeStep() 10 0.002521 0.002521 0.002521 0.71% amrex::average_down 180 0.002506 0.002506 0.002506 0.71% Amr::restart() 1 0.002441 0.002441 0.002441 0.69% BndryData::define() 6 0.001995 0.001995 0.001995 0.56% Castro::reset_internal_energy(MultiFab) 30 0.001927 0.001927 0.001927 0.54% FabArray::LinComb() 242 0.001912 0.001912 0.001912 0.54% amrex::Add() 72 0.00184 0.00184 0.00184 0.52% Castro::construct_new_gravity_source() 5 0.001435 0.001435 0.001435 0.40% Amr::writePlotFile() 1 0.001205 0.001205 0.001205 0.34% Castro::do_advance_ctu() 5 0.0011 0.0011 0.0011 0.31% Castro::construct_old_gravity_source() 5 0.0009791 0.0009791 0.0009791 0.28% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008767 0.0008767 0.0008767 0.25% MLCGSolver::bicgstab 36 0.0008697 0.0008697 0.0008697 0.25% MLCellLinOp::setLevelBC() 6 0.0007454 0.0007454 0.0007454 0.21% Castro::reset_internal_energy(Fab) 240 0.000732 0.000732 0.000732 0.21% Gravity::actual_solve_with_mlmg() 6 0.0007094 0.0007094 0.0007094 0.20% FabArray::mult() 22 0.0006588 0.0006588 0.0006588 0.19% FabArray::setDomainBndry() 20 0.0006439 0.0006439 0.0006439 0.18% MLCellLinOp::prepareForSolve() 6 0.0005946 0.0005946 0.0005946 0.17% MultiFab::contains_nan() 10 0.0005836 0.0005836 0.0005836 0.16% MLCellLinOp::compGrad() 6 0.0004886 0.0004886 0.0004886 0.14% MLCellLinOp::smooth() 720 0.000487 0.000487 0.000487 0.14% MLMG::prepareForSolve() 6 0.0004424 0.0004424 0.0004424 0.12% Castro::enforce_speed_limit() 30 0.0004221 0.0004221 0.0004221 0.12% Amr::InitAmr() 1 0.0004125 0.0004125 0.0004125 0.12% FabArrayBase::CPC::define() 244 0.0003966 0.0003966 0.0003966 0.11% FabArrayBase::getCPC() 632 0.0003442 0.0003442 0.0003442 0.10% FabArray::FillBoundary() 1766 0.0003384 0.0003384 0.0003384 0.10% Gravity::get_old_grav_vector() 5 0.0002914 0.0002914 0.0002914 0.08% main() 1 0.0002771 0.0002771 0.0002771 0.08% Gravity::get_new_grav_vector() 5 0.0002642 0.0002642 0.0002642 0.07% FabArrayBase::getFB() 1766 0.0002372 0.0002372 0.0002372 0.07% MLCellLinOp::apply() 500 0.0001957 0.0001957 0.0001957 0.06% MLMG::mgVcycle() 36 0.0001692 0.0001692 0.0001692 0.05% Amr::coarseTimeStep() 5 0.0001567 0.0001567 0.0001567 0.04% MultiFab::max() 6 0.0001338 0.0001338 0.0001338 0.04% MLCGSolver::ParallelAllReduce 659 0.0001197 0.0001197 0.0001197 0.03% FabArray::ParallelCopy() 380 0.0001078 0.0001078 0.0001078 0.03% MLLinOp::defineGrids() 6 0.0001055 0.0001055 0.0001055 0.03% MLCellLinOp::defineBC() 6 0.0001055 0.0001055 0.0001055 0.03% FillPatchIterator::Initialize 20 9.948e-05 9.948e-05 9.948e-05 0.03% MLCellLinOp::correctionResidual() 216 9.057e-05 9.057e-05 9.057e-05 0.03% Amr::timeStep() 5 7.847e-05 7.847e-05 7.847e-05 0.02% Castro::subcycle_advance_ctu() 5 7.4e-05 7.4e-05 7.4e-05 0.02% Gravity::solve_for_phi() 5 7.049e-05 7.049e-05 7.049e-05 0.02% AmrLevel::restart() 1 6.873e-05 6.873e-05 6.873e-05 0.02% StateData::restartDoit() 4 5.743e-05 5.743e-05 5.743e-05 0.02% FabArrayBase::FB::FB() 26 5.679e-05 5.679e-05 5.679e-05 0.02% Gravity::update_max_rhs() 6 5.431e-05 5.431e-05 5.431e-05 0.02% MLMG:computeResOfCorrection() 180 4.787e-05 4.787e-05 4.787e-05 0.01% Castro::finalize_advance() 5 4.081e-05 4.081e-05 4.081e-05 0.01% Castro::clean_state() 30 4.006e-05 4.006e-05 4.006e-05 0.01% MLMG::mgVcycle_down::0 36 3.919e-05 3.919e-05 3.919e-05 0.01% MLMG::actualBottomSolve() 36 3.835e-05 3.835e-05 3.835e-05 0.01% Castro::expand_state() 5 3.79e-05 3.79e-05 3.79e-05 0.01% MLMG::solve() 6 3.471e-05 3.471e-05 3.471e-05 0.01% MLMG::mgVcycle_down::1 36 3.43e-05 3.43e-05 3.43e-05 0.01% MLMG::mgVcycle_down::3 36 3.428e-05 3.428e-05 3.428e-05 0.01% Castro::advance() 5 3.262e-05 3.262e-05 3.262e-05 0.01% MLMG::mgVcycle_down::4 36 3.22e-05 3.22e-05 3.22e-05 0.01% MLMG::mgVcycle_down::2 36 3.166e-05 3.166e-05 3.166e-05 0.01% Castro::initialize_advance() 5 3.054e-05 3.054e-05 3.054e-05 0.01% MLMG::mgVcycle_up::4 36 2.86e-05 2.86e-05 2.86e-05 0.01% Castro::construct_new_source() 25 2.604e-05 2.604e-05 2.604e-05 0.01% Amr::writeSmallPlotFile() 1 2.499e-05 2.499e-05 2.499e-05 0.01% MLMG::mgVcycle_up::0 36 2.419e-05 2.419e-05 2.419e-05 0.01% Castro::buildMetrics() 1 2.347e-05 2.347e-05 2.347e-05 0.01% Castro::initialize_do_advance() 5 2.282e-05 2.282e-05 2.282e-05 0.01% MLMG::oneIter() 36 2.216e-05 2.216e-05 2.216e-05 0.01% MLMG::mgVcycle_up::2 36 2.203e-05 2.203e-05 2.203e-05 0.01% MLMG::mgVcycle_up::3 36 2.173e-05 2.173e-05 2.173e-05 0.01% Castro::post_restart() 1 2.141e-05 2.141e-05 2.141e-05 0.01% MLCellLinOp::solutionResidual() 42 2.131e-05 2.131e-05 2.131e-05 0.01% Castro::swap_state_time_levels() 5 2.131e-05 2.131e-05 2.131e-05 0.01% MLMG::mgVcycle_up::1 36 2.1e-05 2.1e-05 2.1e-05 0.01% Castro::initMFs() 1 2.098e-05 2.098e-05 2.098e-05 0.01% Castro::create_source_corrector() 5 2.088e-05 2.088e-05 2.088e-05 0.01% Castro::construct_new_gravity() 5 1.796e-05 1.796e-05 1.796e-05 0.01% Castro::finalize_do_advance() 5 1.765e-05 1.765e-05 1.765e-05 0.00% MLMG::ResNormInf() 42 1.641e-05 1.641e-05 1.641e-05 0.00% MLLinOp::define() 6 1.601e-05 1.601e-05 1.601e-05 0.00% MLMG::mgVcycle_bottom 36 1.474e-05 1.474e-05 1.474e-05 0.00% MLPoisson::define() 6 1.443e-05 1.443e-05 1.443e-05 0.00% FillPatchSingleLevel 20 1.416e-05 1.416e-05 1.416e-05 0.00% MLMG::computeResidual() 36 1.386e-05 1.386e-05 1.386e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.334e-05 1.334e-05 1.334e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.308e-05 1.308e-05 1.308e-05 0.00% makeSFC 30 1.277e-05 1.277e-05 1.277e-05 0.00% Castro::construct_old_source() 25 9.032e-06 9.032e-06 9.032e-06 0.00% DistributionMapping::Distribute() 31 8.985e-06 8.985e-06 8.985e-06 0.00% Castro::do_new_sources() 5 8.82e-06 8.82e-06 8.82e-06 0.00% Amr::initSubcycle() 1 8.635e-06 8.635e-06 8.635e-06 0.00% Castro::do_old_sources() 5 8.148e-06 8.148e-06 8.148e-06 0.00% Castro::check_for_nan() 10 7.203e-06 7.203e-06 7.203e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.154e-06 7.154e-06 7.154e-06 0.00% Gravity::actual_multilevel_solve() 1 6.359e-06 6.359e-06 6.359e-06 0.00% Castro::post_timestep() 5 5.735e-06 5.735e-06 5.735e-06 0.00% Castro::apply_source_to_state() 10 5.076e-06 5.076e-06 5.076e-06 0.00% Castro::construct_old_gravity() 5 5.025e-06 5.025e-06 5.025e-06 0.00% Gravity::swapTimeLevels() 5 4.108e-06 4.108e-06 4.108e-06 0.00% MLPoisson::prepareForSolve() 6 4.105e-06 4.105e-06 4.105e-06 0.00% MLMG::computeMLResidual() 6 3.656e-06 3.656e-06 3.656e-06 0.00% Castro::computeNewDt() 5 3.624e-06 3.624e-06 3.624e-06 0.00% MLMG::getGradSolution() 6 2.961e-06 2.961e-06 2.961e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.943e-06 2.943e-06 2.943e-06 0.00% MLMG::MLRhsNormInf() 6 2.206e-06 2.206e-06 2.206e-06 0.00% MLMG::MLResNormInf() 6 2.169e-06 2.169e-06 2.169e-06 0.00% Gravity::set_mass_offset() 6 1.782e-06 1.782e-06 1.782e-06 0.00% Castro::retry_advance_ctu() 5 1.696e-06 1.696e-06 1.696e-06 0.00% Castro::FluxRegCrseInit 5 1.511e-06 1.511e-06 1.511e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.208e-06 1.208e-06 1.208e-06 0.00% Castro::FluxRegFineAdd() 5 1.087e-06 1.087e-06 1.087e-06 0.00% Amr::init() 1 1.028e-06 1.028e-06 1.028e-06 0.00% AmrLevel::AmrLevel() 1 8.61e-07 8.61e-07 8.61e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3544 0.3544 0.3544 100.00% Amr::coarseTimeStep() 5 0.2737 0.2737 0.2737 77.23% Amr::timeStep() 5 0.2718 0.2718 0.2718 76.69% Castro::advance() 5 0.2692 0.2692 0.2692 75.94% Castro::subcycle_advance_ctu() 5 0.263 0.263 0.263 74.19% Castro::do_advance_ctu() 5 0.2629 0.2629 0.2629 74.17% Castro::construct_new_gravity() 5 0.1279 0.1279 0.1279 36.09% Gravity::solve_phi_with_mlmg() 6 0.1234 0.1234 0.1234 34.81% Gravity::solve_for_phi() 5 0.1204 0.1204 0.1204 33.96% Gravity::actual_solve_with_mlmg() 6 0.1201 0.1201 0.1201 33.87% MLMG::solve() 6 0.1091 0.1091 0.1091 30.79% MLMG::oneIter() 36 0.1024 0.1024 0.1024 28.88% MLMG::mgVcycle() 36 0.1008 0.1008 0.1008 28.45% Castro::construct_ctu_hydro_source() 5 0.09858 0.09858 0.09858 27.81% MLCellLinOp::smooth() 720 0.04967 0.04967 0.04967 14.01% Amr::init() 1 0.04703 0.04703 0.04703 13.27% Amr::restart() 1 0.04703 0.04703 0.04703 13.27% MLCellLinOp::applyBC() 1946 0.04622 0.04622 0.04622 13.04% AmrLevel::restart() 1 0.04105 0.04105 0.04105 11.58% StateData::restartDoit() 4 0.04097 0.04097 0.04097 11.56% VisMF::Read() 3 0.04086 0.04086 0.04086 11.53% Amr::writePlotFile() 1 0.03295 0.03295 0.03295 9.30% VisMF::Write(FabArray) 1 0.03149 0.03149 0.03149 8.89% MLMG::mgVcycle_bottom 36 0.03122 0.03122 0.03122 8.81% MLMG::actualBottomSolve() 36 0.0312 0.0312 0.0312 8.80% MLCGSolver::bicgstab 36 0.0309 0.0309 0.0309 8.72% FillPatchIterator::Initialize 20 0.02003 0.02003 0.02003 5.65% Castro::clean_state() 30 0.01967 0.01967 0.01967 5.55% FillPatchSingleLevel 20 0.01929 0.01929 0.01929 5.44% StateDataPhysBCFunct::() 20 0.01731 0.01731 0.01731 4.88% MLCellLinOp::apply() 500 0.01541 0.01541 0.01541 4.35% MLMG::mgVcycle_down::0 36 0.01431 0.01431 0.01431 4.04% MLPoisson::Fsmooth() 1440 0.01398 0.01398 0.01398 3.94% FabArray::FillBoundary() 1766 0.01382 0.01382 0.01382 3.90% FillBoundary_nowait() 1766 0.01348 0.01348 0.01348 3.80% StateData::FillBoundary(geom) 160 0.01145 0.01145 0.01145 3.23% MLMG::mgVcycle_up::0 36 0.01083 0.01083 0.01083 3.06% MLCellLinOp::correctionResidual() 216 0.009369 0.009369 0.009369 2.64% Castro::initialize_do_advance() 5 0.009367 0.009367 0.009367 2.64% Castro::computeTemp() 30 0.00909 0.00909 0.00909 2.56% MLPoisson::define() 6 0.008718 0.008718 0.008718 2.46% amrex::Dot() 484 0.00869 0.00869 0.00869 2.45% MLMG:computeResOfCorrection() 180 0.008226 0.008226 0.008226 2.32% Gravity::get_new_grav_vector() 5 0.007441 0.007441 0.007441 2.10% Castro::do_new_sources() 5 0.007361 0.007361 0.007361 2.08% Castro::construct_old_gravity() 5 0.007322 0.007322 0.007322 2.07% Gravity::get_old_grav_vector() 5 0.007317 0.007317 0.007317 2.06% amrex::Copy() 463 0.006932 0.006932 0.006932 1.96% MLMG::mgVcycle_down::1 36 0.006707 0.006707 0.006707 1.89% Castro::normalize_species() 30 0.00649 0.00649 0.00649 1.83% FabArray::ParallelCopy() 380 0.006359 0.006359 0.006359 1.79% FabArray::ParallelCopy_nowait() 380 0.006251 0.006251 0.006251 1.76% MLMG::mgVcycle_down::2 36 0.006228 0.006228 0.006228 1.76% FabArray::setVal() 537 0.006221 0.006221 0.006221 1.75% FabArray::norminf() 326 0.006162 0.006162 0.006162 1.74% MLMG::mgVcycle_down::3 36 0.006096 0.006096 0.006096 1.72% MLMG::mgVcycle_down::4 36 0.00605 0.00605 0.00605 1.71% Castro::expand_state() 5 0.005999 0.005999 0.005999 1.69% MLCellLinOp::defineAuxData() 6 0.005914 0.005914 0.005914 1.67% Castro::initialize_advance() 5 0.005877 0.005877 0.005877 1.66% MLCGSolver::ParallelAllReduce 659 0.005247 0.005247 0.005247 1.48% MLMG::addInterpCorrection() 180 0.005016 0.005016 0.005016 1.42% MLMG::mgVcycle_up::4 36 0.004894 0.004894 0.004894 1.38% MLMG::mgVcycle_up::1 36 0.004874 0.004874 0.004874 1.38% MLMG::mgVcycle_up::2 36 0.004778 0.004778 0.004778 1.35% amrex::average_down 180 0.004687 0.004687 0.004687 1.32% MLMG::mgVcycle_up::3 36 0.004684 0.004684 0.004684 1.32% MLPoisson::Fapply() 500 0.004459 0.004459 0.004459 1.26% Castro::do_old_sources() 5 0.003786 0.003786 0.003786 1.07% Castro::enforce_min_density() 30 0.003631 0.003631 0.003631 1.02% FabArray::Saxpy() 355 0.003588 0.003588 0.003588 1.01% FabArray::Xpay() 361 0.003516 0.003516 0.003516 0.99% Castro::post_restart() 1 0.003374 0.003374 0.003374 0.95% Gravity::multilevel_solve_for_new_phi() 1 0.003253 0.003253 0.003253 0.92% Gravity::actual_multilevel_solve() 1 0.003239 0.003239 0.003239 0.91% Gravity::fill_multipole_BCs() 6 0.003214 0.003214 0.003214 0.91% MLCellLinOp::solutionResidual() 42 0.003208 0.003208 0.003208 0.91% MLMG::computeResidual() 36 0.002666 0.002666 0.002666 0.75% Castro::reset_internal_energy(MultiFab) 30 0.002659 0.002659 0.002659 0.75% MLCellLinOp::defineBC() 6 0.002639 0.002639 0.002639 0.74% Castro::post_timestep() 5 0.002571 0.002571 0.002571 0.73% BndryData::define() 6 0.002534 0.002534 0.002534 0.71% Castro::estTimeStep() 10 0.002521 0.002521 0.002521 0.71% MLMG::prepareForSolve() 6 0.002454 0.002454 0.002454 0.69% FabArray::LinComb() 242 0.001912 0.001912 0.001912 0.54% amrex::Add() 72 0.00184 0.00184 0.00184 0.52% Castro::computeNewDt() 5 0.001753 0.001753 0.001753 0.49% Castro::construct_new_source() 25 0.001461 0.001461 0.001461 0.41% Castro::construct_new_gravity_source() 5 0.001435 0.001435 0.001435 0.40% Castro::construct_old_source() 25 0.0009882 0.0009882 0.0009882 0.28% Castro::construct_old_gravity_source() 5 0.0009791 0.0009791 0.0009791 0.28% MLMG::ResNormInf() 42 0.0009282 0.0009282 0.0009282 0.26% Castro::apply_source_to_state() 10 0.000919 0.000919 0.000919 0.26% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008767 0.0008767 0.0008767 0.25% MLMG::getGradSolution() 6 0.0007506 0.0007506 0.0007506 0.21% MLCellLinOp::compGrad() 6 0.0007476 0.0007476 0.0007476 0.21% MLCellLinOp::setLevelBC() 6 0.0007454 0.0007454 0.0007454 0.21% FabArrayBase::getCPC() 632 0.0007408 0.0007408 0.0007408 0.21% Castro::reset_internal_energy(Fab) 240 0.000732 0.000732 0.000732 0.21% FabArray::mult() 22 0.0006588 0.0006588 0.0006588 0.19% FabArray::setDomainBndry() 20 0.0006439 0.0006439 0.0006439 0.18% MLPoisson::prepareForSolve() 6 0.0005987 0.0005987 0.0005987 0.17% MLCellLinOp::prepareForSolve() 6 0.0005946 0.0005946 0.0005946 0.17% Castro::check_for_nan() 10 0.0005908 0.0005908 0.0005908 0.17% MultiFab::contains_nan() 10 0.0005836 0.0005836 0.0005836 0.16% MLMG::computeMLResidual() 6 0.0005598 0.0005598 0.0005598 0.16% Gravity::update_max_rhs() 6 0.0004482 0.0004482 0.0004482 0.13% Castro::enforce_speed_limit() 30 0.0004221 0.0004221 0.0004221 0.12% Amr::InitAmr() 1 0.0004212 0.0004212 0.0004212 0.12% FabArrayBase::CPC::define() 244 0.0003966 0.0003966 0.0003966 0.11% Castro::finalize_advance() 5 0.000303 0.000303 0.000303 0.09% FabArrayBase::getFB() 1766 0.000294 0.000294 0.000294 0.08% Gravity::swapTimeLevels() 5 0.0002273 0.0002273 0.0002273 0.06% MLMG::MLResNormInf() 6 0.0001503 0.0001503 0.0001503 0.04% MLLinOp::define() 6 0.0001498 0.0001498 0.0001498 0.04% Castro::buildMetrics() 1 0.0001429 0.0001429 0.0001429 0.04% MLLinOp::defineGrids() 6 0.0001338 0.0001338 0.0001338 0.04% MultiFab::max() 6 0.0001338 0.0001338 0.0001338 0.04% MLMG::MLRhsNormInf() 6 0.000117 0.000117 0.000117 0.03% FabArrayBase::FB::FB() 26 5.679e-05 5.679e-05 5.679e-05 0.02% MLLinOp::makeAgglomeratedDMap 6 2.709e-05 2.709e-05 2.709e-05 0.01% Amr::writeSmallPlotFile() 1 2.499e-05 2.499e-05 2.499e-05 0.01% Castro::swap_state_time_levels() 5 2.131e-05 2.131e-05 2.131e-05 0.01% Castro::initMFs() 1 2.098e-05 2.098e-05 2.098e-05 0.01% Castro::create_source_corrector() 5 2.088e-05 2.088e-05 2.088e-05 0.01% makeSFC 30 1.994e-05 1.994e-05 1.994e-05 0.01% Castro::finalize_do_advance() 5 1.765e-05 1.765e-05 1.765e-05 0.00% DistributionMapping::Distribute() 31 8.985e-06 8.985e-06 8.985e-06 0.00% Amr::initSubcycle() 1 8.635e-06 8.635e-06 8.635e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.761e-06 4.761e-06 4.761e-06 0.00% Gravity::set_mass_offset() 6 1.782e-06 1.782e-06 1.782e-06 0.00% Castro::retry_advance_ctu() 5 1.696e-06 1.696e-06 1.696e-06 0.00% Castro::FluxRegCrseInit 5 1.511e-06 1.511e-06 1.511e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.208e-06 1.208e-06 1.208e-06 0.00% Castro::FluxRegFineAdd() 5 1.087e-06 1.087e-06 1.087e-06 0.00% AmrLevel::AmrLevel() 1 8.61e-07 8.61e-07 8.61e-07 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-29-g225c605680e8) finalized