Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-23-ge472c108e721) initialized Starting run at 10:12:22 UTC on 2023-01-24. Successfully read inputs file ... Castro git describe: 23.01-18-gbb2758482 AMReX git describe: 23.01-23-ge472c108e Microphysics git describe: 23.01-4-gd64aa25b reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.056330899 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.03247262 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.045448876 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.049031959 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.056741699 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.058511752 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.057136047 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.055769295 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.065133797 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.059827929 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.048103854 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.056448602 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.058658238 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.055421149 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.032109177 seconds Ending run at 10:12:23 UTC on 2023-01-24. Run time = 0.834984557 Run time without initialization = 0.69894041 Average number of zones advanced per microsecond: 3.751 Average number of zones advanced per microsecond per rank: 3.751 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.835 ... 0.835 ... 0.835 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2260 0.2260 0.2260 27.07% Castro::construct_ctu_hydro_source() 10 0.2054 0.2054 0.2054 24.60% MLCellLinOp::applyBC() 4433 0.07341 0.07341 0.07341 8.79% MLPoisson::Fsmooth() 3280 0.03153 0.03153 0.03153 3.78% FillBoundary_nowait() 4023 0.03114 0.03114 0.03114 3.73% StateData::FillBoundary(geom) 328 0.02246 0.02246 0.02246 2.69% amrex::Dot() 1114 0.01977 0.01977 0.01977 2.37% StateDataPhysBCFunct::() 41 0.01527 0.01527 0.01527 1.83% amrex::Copy() 1029 0.01456 0.01456 0.01456 1.74% Castro::normalize_species() 62 0.0142 0.0142 0.0142 1.70% FabArray::norminf() 743 0.01393 0.01393 0.01393 1.67% Castro::computeTemp() 63 0.0139 0.0139 0.0139 1.66% FabArray::setVal() 1144 0.01287 0.01287 0.01287 1.54% FabArray::ParallelCopy_nowait() 861 0.01273 0.01273 0.01273 1.52% Castro::enforce_min_density() 62 0.0116 0.0116 0.0116 1.39% MLPoisson::Fapply() 1142 0.01013 0.01013 0.01013 1.21% MLCellLinOp::defineAuxData() 11 0.009435 0.009435 0.009435 1.13% FabArray::Saxpy() 813 0.007933 0.007933 0.007933 0.95% FabArray::Xpay() 821 0.007929 0.007929 0.007929 0.95% MLMG::addInterpCorrection() 410 0.006399 0.006399 0.006399 0.77% Gravity::fill_multipole_BCs() 11 0.005865 0.005865 0.005865 0.70% amrex::average_down 410 0.005638 0.005638 0.005638 0.68% Castro::estTimeStep() 21 0.005026 0.005026 0.005026 0.60% FabArray::LinComb() 557 0.004368 0.004368 0.004368 0.52% amrex::Add() 164 0.004285 0.004285 0.004285 0.51% Castro::reset_internal_energy(MultiFab) 63 0.004001 0.004001 0.004001 0.48% Amr::checkPoint() 3 0.003589 0.003589 0.003589 0.43% BndryData::define() 11 0.003431 0.003431 0.003431 0.41% Castro::construct_new_gravity_source() 10 0.002849 0.002849 0.002849 0.34% Castro::do_advance_ctu() 10 0.002582 0.002582 0.002582 0.31% Castro::construct_old_gravity_source() 10 0.002236 0.002236 0.002236 0.27% Amr::writePlotFile() 2 0.002067 0.002067 0.002067 0.25% MLCGSolver::bicgstab 82 0.00202 0.00202 0.00202 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001596 0.001596 0.001596 0.19% Castro::reset_internal_energy(Fab) 504 0.001526 0.001526 0.001526 0.18% Gravity::actual_solve_with_mlmg() 11 0.001344 0.001344 0.001344 0.16% MLCellLinOp::setLevelBC() 11 0.001332 0.001332 0.001332 0.16% FabArray::mult() 43 0.001325 0.001325 0.001325 0.16% FabArray::setDomainBndry() 41 0.001281 0.001281 0.001281 0.15% Castro::initData() 1 0.001277 0.001277 0.001277 0.15% MultiFab::contains_nan() 20 0.001194 0.001194 0.001194 0.14% MLCellLinOp::smooth() 1640 0.001176 0.001176 0.001176 0.14% MLCellLinOp::prepareForSolve() 11 0.001061 0.001061 0.001061 0.13% Castro::enforce_speed_limit() 62 0.0009804 0.0009804 0.0009804 0.12% MLCellLinOp::compGrad() 11 0.0008866 0.0008866 0.0008866 0.11% MLMG::prepareForSolve() 11 0.0007993 0.0007993 0.0007993 0.10% FabArray::FillBoundary() 4023 0.0007712 0.0007712 0.0007712 0.09% FabArrayBase::getCPC() 1323 0.0007149 0.0007149 0.0007149 0.09% FabArrayBase::CPC::define() 454 0.000682 0.000682 0.000682 0.08% Gravity::get_new_grav_vector() 11 0.0006003 0.0006003 0.0006003 0.07% FabArrayBase::getFB() 4023 0.0005812 0.0005812 0.0005812 0.07% Gravity::get_old_grav_vector() 10 0.0005482 0.0005482 0.0005482 0.07% Amr::InitAmr() 1 0.000498 0.000498 0.000498 0.06% MLCellLinOp::apply() 1142 0.0004661 0.0004661 0.0004661 0.06% MLMG::mgVcycle() 82 0.0003634 0.0003634 0.0003634 0.04% Amr::coarseTimeStep() 10 0.0003152 0.0003152 0.0003152 0.04% main() 1 0.000302 0.000302 0.000302 0.04% MLCGSolver::ParallelAllReduce 1514 0.0002892 0.0002892 0.0002892 0.03% MultiFab::max() 11 0.0002571 0.0002571 0.0002571 0.03% FabArray::ParallelCopy() 861 0.000239 0.000239 0.000239 0.03% FillPatchIterator::Initialize 41 0.0002155 0.0002155 0.0002155 0.03% MLCellLinOp::correctionResidual() 492 0.0002139 0.0002139 0.0002139 0.03% MLLinOp::defineGrids() 11 0.0002028 0.0002028 0.0002028 0.02% MLCellLinOp::defineBC() 11 0.0001959 0.0001959 0.0001959 0.02% Castro::subcycle_advance_ctu() 10 0.0001559 0.0001559 0.0001559 0.02% Amr::timeStep() 10 0.0001546 0.0001546 0.0001546 0.02% StateData::checkPoint() 12 0.0001352 0.0001352 0.0001352 0.02% Gravity::solve_for_phi() 10 0.000135 0.000135 0.000135 0.02% Gravity::update_max_rhs() 11 0.0001063 0.0001063 0.0001063 0.01% MLMG:computeResOfCorrection() 410 0.0001051 0.0001051 0.0001051 0.01% Castro::advance() 10 0.0001035 0.0001035 0.0001035 0.01% MLMG::mgVcycle_down::0 82 9.456e-05 9.456e-05 9.456e-05 0.01% MLMG::actualBottomSolve() 82 8.965e-05 8.965e-05 8.965e-05 0.01% MLMG::mgVcycle_down::1 82 8.213e-05 8.213e-05 8.213e-05 0.01% FabArrayBase::FB::FB() 56 8.114e-05 8.114e-05 8.114e-05 0.01% Castro::clean_state() 62 7.908e-05 7.908e-05 7.908e-05 0.01% Castro::Castro() 1 7.661e-05 7.661e-05 7.661e-05 0.01% AmrLevel::checkPoint() 3 7.557e-05 7.557e-05 7.557e-05 0.01% Castro::expand_state() 10 7.473e-05 7.473e-05 7.473e-05 0.01% MLMG::mgVcycle_down::2 82 7.415e-05 7.415e-05 7.415e-05 0.01% MLMG::solve() 11 7.302e-05 7.302e-05 7.302e-05 0.01% MLMG::mgVcycle_down::3 82 7.11e-05 7.11e-05 7.11e-05 0.01% MLMG::mgVcycle_down::4 82 7.032e-05 7.032e-05 7.032e-05 0.01% Castro::finalize_advance() 10 6.984e-05 6.984e-05 6.984e-05 0.01% Castro::initialize_advance() 10 6.572e-05 6.572e-05 6.572e-05 0.01% MLMG::mgVcycle_up::4 82 5.798e-05 5.798e-05 5.798e-05 0.01% MLMG::mgVcycle_up::0 82 5.478e-05 5.478e-05 5.478e-05 0.01% MLMG::oneIter() 82 5.312e-05 5.312e-05 5.312e-05 0.01% MLMG::mgVcycle_up::3 82 5.036e-05 5.036e-05 5.036e-05 0.01% MLMG::mgVcycle_up::1 82 4.997e-05 4.997e-05 4.997e-05 0.01% MLCellLinOp::solutionResidual() 93 4.99e-05 4.99e-05 4.99e-05 0.01% Castro::initialize_do_advance() 10 4.858e-05 4.858e-05 4.858e-05 0.01% MLMG::mgVcycle_up::2 82 4.841e-05 4.841e-05 4.841e-05 0.01% Castro::create_source_corrector() 10 4.114e-05 4.114e-05 4.114e-05 0.00% Castro::construct_new_gravity() 10 3.817e-05 3.817e-05 3.817e-05 0.00% Castro::swap_state_time_levels() 10 3.458e-05 3.458e-05 3.458e-05 0.00% Castro::enforce_consistent_e() 1 3.374e-05 3.374e-05 3.374e-05 0.00% MLMG::ResNormInf() 93 3.349e-05 3.349e-05 3.349e-05 0.00% Castro::finalize_do_advance() 10 3.241e-05 3.241e-05 3.241e-05 0.00% MLMG::mgVcycle_bottom 82 3.087e-05 3.087e-05 3.087e-05 0.00% MLMG::computeResidual() 82 3.078e-05 3.078e-05 3.078e-05 0.00% Gravity::solve_phi_with_mlmg() 11 3.034e-05 3.034e-05 3.034e-05 0.00% FillPatchSingleLevel 41 2.896e-05 2.896e-05 2.896e-05 0.00% makeSFC 55 2.811e-05 2.811e-05 2.811e-05 0.00% StateData::define() 4 2.703e-05 2.703e-05 2.703e-05 0.00% Amr::writeSmallPlotFile() 1 2.323e-05 2.323e-05 2.323e-05 0.00% MLPoisson::define() 11 2.215e-05 2.215e-05 2.215e-05 0.00% Amr::FinalizeInit() 1 2.067e-05 2.067e-05 2.067e-05 0.00% Amr::defBaseLevel() 1 1.886e-05 1.886e-05 1.886e-05 0.00% Castro::construct_old_source() 50 1.813e-05 1.813e-05 1.813e-05 0.00% Castro::initMFs() 1 1.812e-05 1.812e-05 1.812e-05 0.00% Castro::do_new_sources() 10 1.687e-05 1.687e-05 1.687e-05 0.00% Castro::construct_new_source() 50 1.647e-05 1.647e-05 1.647e-05 0.00% DistributionMapping::Distribute() 56 1.594e-05 1.594e-05 1.594e-05 0.00% Castro::buildMetrics() 1 1.552e-05 1.552e-05 1.552e-05 0.00% Castro::do_old_sources() 10 1.512e-05 1.512e-05 1.512e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.432e-05 1.432e-05 1.432e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.43e-05 1.43e-05 1.43e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.361e-05 1.361e-05 1.361e-05 0.00% Castro::check_for_nan() 20 1.339e-05 1.339e-05 1.339e-05 0.00% Amr::InitializeInit() 1 1.11e-05 1.11e-05 1.11e-05 0.00% MLLinOp::define() 11 1.104e-05 1.104e-05 1.104e-05 0.00% Castro::apply_source_to_state() 20 1.027e-05 1.027e-05 1.027e-05 0.00% Castro::post_init() 1 1.019e-05 1.019e-05 1.019e-05 0.00% Gravity::swapTimeLevels() 10 9.001e-06 9.001e-06 9.001e-06 0.00% Castro::construct_old_gravity() 10 8.783e-06 8.783e-06 8.783e-06 0.00% Castro::post_timestep() 10 8.549e-06 8.549e-06 8.549e-06 0.00% Amr::initSubcycle() 1 7.962e-06 7.962e-06 7.962e-06 0.00% MLMG::computeMLResidual() 11 7.801e-06 7.801e-06 7.801e-06 0.00% Gravity::actual_multilevel_solve() 1 7.57e-06 7.57e-06 7.57e-06 0.00% MLPoisson::prepareForSolve() 11 7.376e-06 7.376e-06 7.376e-06 0.00% Castro::computeNewDt() 9 6.551e-06 6.551e-06 6.551e-06 0.00% AmrLevel::checkPointPost() 3 6.506e-06 6.506e-06 6.506e-06 0.00% MLMG::getGradSolution() 11 6.316e-06 6.316e-06 6.316e-06 0.00% Gravity::set_mass_offset() 11 4.13e-06 4.13e-06 4.13e-06 0.00% Castro::retry_advance_ctu() 10 4.004e-06 4.004e-06 4.004e-06 0.00% MLMG::MLRhsNormInf() 11 3.975e-06 3.975e-06 3.975e-06 0.00% MLMG::MLResNormInf() 11 3.752e-06 3.752e-06 3.752e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.171e-06 3.171e-06 3.171e-06 0.00% Castro::FluxRegCrseInit 10 2.718e-06 2.718e-06 2.718e-06 0.00% Castro::computeInitialDt() 2 2.602e-06 2.602e-06 2.602e-06 0.00% Castro::FluxRegFineAdd() 10 2.374e-06 2.374e-06 2.374e-06 0.00% Amr::init() 1 2.313e-06 2.313e-06 2.313e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.111e-06 2.111e-06 2.111e-06 0.00% AmrLevel::checkPointPre() 3 1.756e-06 1.756e-06 1.756e-06 0.00% Castro::post_regrid() 1 1.169e-06 1.169e-06 1.169e-06 0.00% Amr::initialInit() 1 8.93e-07 8.93e-07 8.93e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.835 0.835 0.835 100.00% Amr::coarseTimeStep() 10 0.6666 0.6666 0.6666 79.83% Amr::timeStep() 10 0.5517 0.5517 0.5517 66.07% Castro::advance() 10 0.5438 0.5438 0.5438 65.13% Castro::subcycle_advance_ctu() 10 0.5321 0.5321 0.5321 63.72% Castro::do_advance_ctu() 10 0.532 0.532 0.532 63.70% Gravity::solve_phi_with_mlmg() 11 0.2735 0.2735 0.2735 32.75% Gravity::actual_solve_with_mlmg() 11 0.2671 0.2671 0.2671 31.99% Castro::construct_new_gravity() 10 0.2495 0.2495 0.2495 29.89% MLMG::solve() 11 0.2475 0.2475 0.2475 29.65% Gravity::solve_for_phi() 10 0.2345 0.2345 0.2345 28.09% MLMG::oneIter() 82 0.2337 0.2337 0.2337 27.99% MLMG::mgVcycle() 82 0.2301 0.2301 0.2301 27.55% VisMF::Write(FabArray) 11 0.226 0.226 0.226 27.07% Castro::construct_ctu_hydro_source() 10 0.2054 0.2054 0.2054 24.60% Amr::checkPoint() 3 0.1677 0.1677 0.1677 20.08% AmrLevel::checkPoint() 3 0.1641 0.1641 0.1641 19.65% StateData::checkPoint() 12 0.164 0.164 0.164 19.64% Amr::init() 1 0.1354 0.1354 0.1354 16.21% MLCellLinOp::smooth() 1640 0.1137 0.1137 0.1137 13.62% MLCellLinOp::applyBC() 4433 0.106 0.106 0.106 12.69% MLMG::mgVcycle_bottom 82 0.071 0.071 0.071 8.50% MLMG::actualBottomSolve() 82 0.07097 0.07097 0.07097 8.50% MLCGSolver::bicgstab 82 0.07029 0.07029 0.07029 8.42% Amr::writePlotFile() 2 0.06473 0.06473 0.06473 7.75% Amr::initialInit() 1 0.04644 0.04644 0.04644 5.56% Castro::clean_state() 62 0.04554 0.04554 0.04554 5.45% FillPatchIterator::Initialize 41 0.04323 0.04323 0.04323 5.18% Amr::FinalizeInit() 1 0.04246 0.04246 0.04246 5.08% FillPatchSingleLevel 41 0.04173 0.04173 0.04173 5.00% Castro::post_init() 1 0.0412 0.0412 0.0412 4.93% Gravity::multilevel_solve_for_new_phi() 1 0.03941 0.03941 0.03941 4.72% Gravity::actual_multilevel_solve() 1 0.0394 0.0394 0.0394 4.72% StateDataPhysBCFunct::() 41 0.03773 0.03773 0.03773 4.52% MLCellLinOp::apply() 1142 0.03513 0.03513 0.03513 4.21% MLMG::mgVcycle_down::0 82 0.03313 0.03313 0.03313 3.97% FabArray::FillBoundary() 4023 0.03258 0.03258 0.03258 3.90% FillBoundary_nowait() 4023 0.03181 0.03181 0.03181 3.81% MLPoisson::Fsmooth() 3280 0.03153 0.03153 0.03153 3.78% MLMG::mgVcycle_up::0 82 0.02519 0.02519 0.02519 3.02% StateData::FillBoundary(geom) 328 0.02246 0.02246 0.02246 2.69% MLCellLinOp::correctionResidual() 492 0.02158 0.02158 0.02158 2.58% Castro::initialize_do_advance() 10 0.02109 0.02109 0.02109 2.53% amrex::Dot() 1114 0.01977 0.01977 0.01977 2.37% Castro::computeTemp() 63 0.01943 0.01943 0.01943 2.33% MLMG:computeResOfCorrection() 410 0.01904 0.01904 0.01904 2.28% Gravity::get_new_grav_vector() 11 0.01649 0.01649 0.01649 1.97% MLPoisson::define() 11 0.01556 0.01556 0.01556 1.86% MLMG::mgVcycle_down::1 82 0.01517 0.01517 0.01517 1.82% Castro::construct_old_gravity() 10 0.01459 0.01459 0.01459 1.75% Gravity::get_old_grav_vector() 10 0.01458 0.01458 0.01458 1.75% amrex::Copy() 1029 0.01456 0.01456 0.01456 1.74% Castro::normalize_species() 62 0.0142 0.0142 0.0142 1.70% MLMG::mgVcycle_down::2 82 0.01417 0.01417 0.01417 1.70% FabArray::norminf() 743 0.01393 0.01393 0.01393 1.67% MLMG::mgVcycle_down::3 82 0.01391 0.01391 0.01391 1.67% FabArray::ParallelCopy() 861 0.01376 0.01376 0.01376 1.65% MLMG::mgVcycle_down::4 82 0.01367 0.01367 0.01367 1.64% FabArray::ParallelCopy_nowait() 861 0.01352 0.01352 0.01352 1.62% Castro::expand_state() 10 0.01346 0.01346 0.01346 1.61% FabArray::setVal() 1144 0.01287 0.01287 0.01287 1.54% MLCGSolver::ParallelAllReduce 1514 0.01189 0.01189 0.01189 1.42% Castro::enforce_min_density() 62 0.0116 0.0116 0.0116 1.39% MLMG::addInterpCorrection() 410 0.01129 0.01129 0.01129 1.35% Castro::do_new_sources() 10 0.01124 0.01124 0.01124 1.35% MLMG::mgVcycle_up::4 82 0.01111 0.01111 0.01111 1.33% MLMG::mgVcycle_up::1 82 0.01103 0.01103 0.01103 1.32% Castro::initialize_advance() 10 0.01103 0.01103 0.01103 1.32% MLMG::mgVcycle_up::2 82 0.01077 0.01077 0.01077 1.29% MLCellLinOp::defineAuxData() 11 0.01074 0.01074 0.01074 1.29% Castro::do_old_sources() 10 0.01058 0.01058 0.01058 1.27% MLMG::mgVcycle_up::3 82 0.01058 0.01058 0.01058 1.27% amrex::average_down 410 0.01053 0.01053 0.01053 1.26% MLPoisson::Fapply() 1142 0.01013 0.01013 0.01013 1.21% FabArray::Saxpy() 813 0.007933 0.007933 0.007933 0.95% FabArray::Xpay() 821 0.007929 0.007929 0.007929 0.95% Castro::post_timestep() 10 0.007739 0.007739 0.007739 0.93% MLCellLinOp::solutionResidual() 93 0.007026 0.007026 0.007026 0.84% Gravity::fill_multipole_BCs() 11 0.006117 0.006117 0.006117 0.73% MLMG::computeResidual() 82 0.006077 0.006077 0.006077 0.73% Castro::reset_internal_energy(MultiFab) 63 0.005527 0.005527 0.005527 0.66% Castro::estTimeStep() 21 0.005026 0.005026 0.005026 0.60% MLCellLinOp::defineBC() 11 0.004523 0.004523 0.004523 0.54% MLMG::prepareForSolve() 11 0.004424 0.004424 0.004424 0.53% FabArray::LinComb() 557 0.004368 0.004368 0.004368 0.52% BndryData::define() 11 0.004328 0.004328 0.004328 0.52% amrex::Add() 164 0.004285 0.004285 0.004285 0.51% Amr::InitializeInit() 1 0.003984 0.003984 0.003984 0.48% Amr::defBaseLevel() 1 0.003973 0.003973 0.003973 0.48% Castro::initData() 1 0.003504 0.003504 0.003504 0.42% Castro::construct_new_source() 50 0.002865 0.002865 0.002865 0.34% Castro::construct_new_gravity_source() 10 0.002849 0.002849 0.002849 0.34% Castro::computeNewDt() 9 0.002575 0.002575 0.002575 0.31% Castro::construct_old_source() 50 0.002254 0.002254 0.002254 0.27% Castro::construct_old_gravity_source() 10 0.002236 0.002236 0.002236 0.27% MLMG::ResNormInf() 93 0.002079 0.002079 0.002079 0.25% Castro::apply_source_to_state() 20 0.001816 0.001816 0.001816 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001596 0.001596 0.001596 0.19% Castro::reset_internal_energy(Fab) 504 0.001526 0.001526 0.001526 0.18% FabArrayBase::getCPC() 1323 0.001397 0.001397 0.001397 0.17% MLMG::getGradSolution() 11 0.001348 0.001348 0.001348 0.16% MLCellLinOp::compGrad() 11 0.001342 0.001342 0.001342 0.16% MLCellLinOp::setLevelBC() 11 0.001332 0.001332 0.001332 0.16% FabArray::mult() 43 0.001325 0.001325 0.001325 0.16% FabArray::setDomainBndry() 41 0.001281 0.001281 0.001281 0.15% Castro::check_for_nan() 20 0.001208 0.001208 0.001208 0.14% MultiFab::contains_nan() 20 0.001194 0.001194 0.001194 0.14% Castro::post_regrid() 1 0.001083 0.001083 0.001083 0.13% MLPoisson::prepareForSolve() 11 0.001069 0.001069 0.001069 0.13% MLCellLinOp::prepareForSolve() 11 0.001061 0.001061 0.001061 0.13% MLMG::computeMLResidual() 11 0.0009882 0.0009882 0.0009882 0.12% Castro::enforce_speed_limit() 62 0.0009804 0.0009804 0.0009804 0.12% Castro::computeInitialDt() 2 0.000877 0.000877 0.000877 0.11% Gravity::update_max_rhs() 11 0.0008028 0.0008028 0.0008028 0.10% FabArrayBase::CPC::define() 454 0.000682 0.000682 0.000682 0.08% FabArrayBase::getFB() 4023 0.0006623 0.0006623 0.0006623 0.08% Castro::finalize_advance() 10 0.0005857 0.0005857 0.0005857 0.07% Amr::InitAmr() 1 0.000506 0.000506 0.000506 0.06% Gravity::swapTimeLevels() 10 0.0004574 0.0004574 0.0004574 0.05% Castro::Castro() 1 0.0004038 0.0004038 0.0004038 0.05% MLMG::MLResNormInf() 11 0.0002892 0.0002892 0.0002892 0.03% MLLinOp::define() 11 0.0002725 0.0002725 0.0002725 0.03% MLLinOp::defineGrids() 11 0.0002614 0.0002614 0.0002614 0.03% MultiFab::max() 11 0.0002571 0.0002571 0.0002571 0.03% MLMG::MLRhsNormInf() 11 0.0002171 0.0002171 0.0002171 0.03% Castro::buildMetrics() 1 0.0001506 0.0001506 0.0001506 0.02% FabArrayBase::FB::FB() 56 8.114e-05 8.114e-05 8.114e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.651e-05 5.651e-05 5.651e-05 0.01% makeSFC 55 4.29e-05 4.29e-05 4.29e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.135e-05 4.135e-05 4.135e-05 0.00% Castro::create_source_corrector() 10 4.114e-05 4.114e-05 4.114e-05 0.00% Castro::swap_state_time_levels() 10 3.458e-05 3.458e-05 3.458e-05 0.00% Castro::enforce_consistent_e() 1 3.374e-05 3.374e-05 3.374e-05 0.00% Castro::finalize_do_advance() 10 3.241e-05 3.241e-05 3.241e-05 0.00% StateData::define() 4 2.703e-05 2.703e-05 2.703e-05 0.00% Amr::writeSmallPlotFile() 1 2.323e-05 2.323e-05 2.323e-05 0.00% Castro::initMFs() 1 1.812e-05 1.812e-05 1.812e-05 0.00% DistributionMapping::Distribute() 56 1.594e-05 1.594e-05 1.594e-05 0.00% Amr::initSubcycle() 1 7.962e-06 7.962e-06 7.962e-06 0.00% AmrLevel::checkPointPost() 3 6.506e-06 6.506e-06 6.506e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.323e-06 4.323e-06 4.323e-06 0.00% Gravity::set_mass_offset() 11 4.13e-06 4.13e-06 4.13e-06 0.00% Castro::retry_advance_ctu() 10 4.004e-06 4.004e-06 4.004e-06 0.00% Castro::FluxRegCrseInit 10 2.718e-06 2.718e-06 2.718e-06 0.00% Castro::FluxRegFineAdd() 10 2.374e-06 2.374e-06 2.374e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.111e-06 2.111e-06 2.111e-06 0.00% AmrLevel::checkPointPre() 3 1.756e-06 1.756e-06 1.756e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-23-ge472c108e721) finalized Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-23-ge472c108e721) initialized Starting run at 10:12:24 UTC on 2023-01-24. Successfully read inputs file ... Castro git describe: 23.01-18-gbb2758482 AMReX git describe: 23.01-23-ge472c108e Microphysics git describe: 23.01-4-gd64aa25b reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.458673563 Restart time = 0.086943654 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.046795561 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.04515403 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.054598794 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.057131694 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.065877094 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.034081708 seconds Ending run at 10:12:24 UTC on 2023-01-24. Run time = 0.391603308 Run time without initialization = 0.304079726 Average number of zones advanced per microsecond: 4.310 Average number of zones advanced per microsecond per rank: 4.310 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.3916 ... 0.3916 ... 0.3916 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0919 0.0919 0.0919 23.46% Amr::restart() 1 0.04178 0.04178 0.04178 10.67% VisMF::Read() 3 0.04152 0.04152 0.04152 10.60% MLCellLinOp::applyBC() 1946 0.03667 0.03667 0.03667 9.36% VisMF::Write(FabArray) 1 0.03176 0.03176 0.03176 8.11% MLPoisson::Fsmooth() 1440 0.0137 0.0137 0.0137 3.50% FillBoundary_nowait() 1766 0.01271 0.01271 0.01271 3.25% StateData::FillBoundary(geom) 160 0.01112 0.01112 0.01112 2.84% amrex::Dot() 484 0.0084 0.0084 0.0084 2.14% amrex::Copy() 463 0.006857 0.006857 0.006857 1.75% Castro::computeTemp() 30 0.006089 0.006089 0.006089 1.55% FabArray::setVal() 537 0.006082 0.006082 0.006082 1.55% Castro::normalize_species() 30 0.00608 0.00608 0.00608 1.55% FabArray::norminf() 326 0.006025 0.006025 0.006025 1.54% StateDataPhysBCFunct::() 20 0.005811 0.005811 0.005811 1.48% FabArray::ParallelCopy_nowait() 380 0.005784 0.005784 0.005784 1.48% Castro::enforce_min_density() 30 0.005305 0.005305 0.005305 1.35% MLCellLinOp::defineAuxData() 6 0.005053 0.005053 0.005053 1.29% MLPoisson::Fapply() 500 0.004387 0.004387 0.004387 1.12% FabArray::Saxpy() 355 0.003567 0.003567 0.003567 0.91% FabArray::Xpay() 361 0.003439 0.003439 0.003439 0.88% Gravity::fill_multipole_BCs() 6 0.002988 0.002988 0.002988 0.76% MLMG::addInterpCorrection() 180 0.002812 0.002812 0.002812 0.72% amrex::average_down 180 0.002475 0.002475 0.002475 0.63% Amr::writePlotFile() 1 0.00219 0.00219 0.00219 0.56% Castro::estTimeStep() 10 0.001997 0.001997 0.001997 0.51% BndryData::define() 6 0.001905 0.001905 0.001905 0.49% FabArray::LinComb() 242 0.001878 0.001878 0.001878 0.48% amrex::Add() 72 0.001825 0.001825 0.001825 0.47% Castro::reset_internal_energy(MultiFab) 30 0.001688 0.001688 0.001688 0.43% Castro::construct_new_gravity_source() 5 0.001503 0.001503 0.001503 0.38% Castro::do_advance_ctu() 5 0.00116 0.00116 0.00116 0.30% Castro::construct_old_gravity_source() 5 0.001095 0.001095 0.001095 0.28% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008676 0.0008676 0.0008676 0.22% MLCGSolver::bicgstab 36 0.0008643 0.0008643 0.0008643 0.22% Castro::enforce_speed_limit() 30 0.000797 0.000797 0.000797 0.20% MLCellLinOp::setLevelBC() 6 0.0007227 0.0007227 0.0007227 0.18% Gravity::actual_solve_with_mlmg() 6 0.0007173 0.0007173 0.0007173 0.18% Castro::reset_internal_energy(Fab) 240 0.0006982 0.0006982 0.0006982 0.18% FabArray::mult() 22 0.0006669 0.0006669 0.0006669 0.17% FabArray::setDomainBndry() 20 0.0006421 0.0006421 0.0006421 0.16% MultiFab::contains_nan() 10 0.0005919 0.0005919 0.0005919 0.15% MLCellLinOp::prepareForSolve() 6 0.000581 0.000581 0.000581 0.15% MLCellLinOp::compGrad() 6 0.0005187 0.0005187 0.0005187 0.13% MLCellLinOp::smooth() 720 0.0005048 0.0005048 0.0005048 0.13% MLMG::prepareForSolve() 6 0.000444 0.000444 0.000444 0.11% FabArrayBase::CPC::define() 244 0.0004252 0.0004252 0.0004252 0.11% Amr::InitAmr() 1 0.0003915 0.0003915 0.0003915 0.10% FabArrayBase::getCPC() 632 0.0003595 0.0003595 0.0003595 0.09% FabArray::FillBoundary() 1766 0.0003429 0.0003429 0.0003429 0.09% main() 1 0.0002946 0.0002946 0.0002946 0.08% Gravity::get_old_grav_vector() 5 0.0002943 0.0002943 0.0002943 0.08% Gravity::get_new_grav_vector() 5 0.0002731 0.0002731 0.0002731 0.07% FabArrayBase::getFB() 1766 0.0002576 0.0002576 0.0002576 0.07% MLCellLinOp::apply() 500 0.0002075 0.0002075 0.0002075 0.05% Amr::coarseTimeStep() 5 0.0001621 0.0001621 0.0001621 0.04% MLMG::mgVcycle() 36 0.0001614 0.0001614 0.0001614 0.04% MultiFab::max() 6 0.0001329 0.0001329 0.0001329 0.03% MLCGSolver::ParallelAllReduce 659 0.000125 0.000125 0.000125 0.03% FabArray::ParallelCopy() 380 0.0001167 0.0001167 0.0001167 0.03% MLCellLinOp::defineBC() 6 0.0001054 0.0001054 0.0001054 0.03% FillPatchIterator::Initialize 20 0.0001031 0.0001031 0.0001031 0.03% MLLinOp::defineGrids() 6 9.348e-05 9.348e-05 9.348e-05 0.02% MLCellLinOp::correctionResidual() 216 9.194e-05 9.194e-05 9.194e-05 0.02% Amr::timeStep() 5 8.053e-05 8.053e-05 8.053e-05 0.02% Gravity::solve_for_phi() 5 7.143e-05 7.143e-05 7.143e-05 0.02% AmrLevel::restart() 1 7.134e-05 7.134e-05 7.134e-05 0.02% Castro::subcycle_advance_ctu() 5 6.939e-05 6.939e-05 6.939e-05 0.02% Castro::finalize_advance() 5 5.894e-05 5.894e-05 5.894e-05 0.02% FabArrayBase::FB::FB() 26 5.701e-05 5.701e-05 5.701e-05 0.01% StateData::restartDoit() 4 5.672e-05 5.672e-05 5.672e-05 0.01% Gravity::update_max_rhs() 6 5.286e-05 5.286e-05 5.286e-05 0.01% MLMG:computeResOfCorrection() 180 4.885e-05 4.885e-05 4.885e-05 0.01% Castro::clean_state() 30 4.076e-05 4.076e-05 4.076e-05 0.01% MLMG::mgVcycle_down::0 36 3.959e-05 3.959e-05 3.959e-05 0.01% MLMG::actualBottomSolve() 36 3.892e-05 3.892e-05 3.892e-05 0.01% Castro::expand_state() 5 3.802e-05 3.802e-05 3.802e-05 0.01% MLMG::mgVcycle_down::1 36 3.626e-05 3.626e-05 3.626e-05 0.01% MLMG::solve() 6 3.461e-05 3.461e-05 3.461e-05 0.01% MLMG::mgVcycle_down::2 36 3.169e-05 3.169e-05 3.169e-05 0.01% Castro::buildMetrics() 1 3.092e-05 3.092e-05 3.092e-05 0.01% MLMG::mgVcycle_down::4 36 3.068e-05 3.068e-05 3.068e-05 0.01% Castro::advance() 5 2.978e-05 2.978e-05 2.978e-05 0.01% Castro::initialize_advance() 5 2.97e-05 2.97e-05 2.97e-05 0.01% MLMG::mgVcycle_down::3 36 2.948e-05 2.948e-05 2.948e-05 0.01% MLMG::mgVcycle_up::4 36 2.845e-05 2.845e-05 2.845e-05 0.01% Amr::writeSmallPlotFile() 1 2.788e-05 2.788e-05 2.788e-05 0.01% MLMG::mgVcycle_up::2 36 2.56e-05 2.56e-05 2.56e-05 0.01% MLMG::mgVcycle_up::0 36 2.504e-05 2.504e-05 2.504e-05 0.01% MLMG::oneIter() 36 2.339e-05 2.339e-05 2.339e-05 0.01% Castro::initialize_do_advance() 5 2.239e-05 2.239e-05 2.239e-05 0.01% Castro::post_restart() 1 2.221e-05 2.221e-05 2.221e-05 0.01% MLCellLinOp::solutionResidual() 42 2.158e-05 2.158e-05 2.158e-05 0.01% MLMG::mgVcycle_up::3 36 2.152e-05 2.152e-05 2.152e-05 0.01% Castro::swap_state_time_levels() 5 2.148e-05 2.148e-05 2.148e-05 0.01% Castro::initMFs() 1 2.022e-05 2.022e-05 2.022e-05 0.01% MLMG::mgVcycle_up::1 36 1.964e-05 1.964e-05 1.964e-05 0.01% Castro::finalize_do_advance() 5 1.837e-05 1.837e-05 1.837e-05 0.00% MLMG::ResNormInf() 42 1.698e-05 1.698e-05 1.698e-05 0.00% FillPatchSingleLevel 20 1.504e-05 1.504e-05 1.504e-05 0.00% MLPoisson::define() 6 1.49e-05 1.49e-05 1.49e-05 0.00% Castro::construct_new_gravity() 5 1.429e-05 1.429e-05 1.429e-05 0.00% MLMG::mgVcycle_bottom 36 1.405e-05 1.405e-05 1.405e-05 0.00% MLMG::computeResidual() 36 1.375e-05 1.375e-05 1.375e-05 0.00% makeSFC 30 1.362e-05 1.362e-05 1.362e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.289e-05 1.289e-05 1.289e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.183e-05 1.183e-05 1.183e-05 0.00% Castro::construct_old_source() 25 1.028e-05 1.028e-05 1.028e-05 0.00% DistributionMapping::Distribute() 31 9.317e-06 9.317e-06 9.317e-06 0.00% Castro::construct_new_source() 25 9.188e-06 9.188e-06 9.188e-06 0.00% Castro::do_new_sources() 5 8.447e-06 8.447e-06 8.447e-06 0.00% Amr::initSubcycle() 1 8.161e-06 8.161e-06 8.161e-06 0.00% Castro::do_old_sources() 5 7.93e-06 7.93e-06 7.93e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.126e-06 7.126e-06 7.126e-06 0.00% Gravity::actual_multilevel_solve() 1 6.447e-06 6.447e-06 6.447e-06 0.00% Castro::check_for_nan() 10 6.081e-06 6.081e-06 6.081e-06 0.00% Castro::post_timestep() 5 5.972e-06 5.972e-06 5.972e-06 0.00% Castro::apply_source_to_state() 10 5.559e-06 5.559e-06 5.559e-06 0.00% Castro::construct_old_gravity() 5 5.419e-06 5.419e-06 5.419e-06 0.00% MLLinOp::define() 6 5.104e-06 5.104e-06 5.104e-06 0.00% Gravity::swapTimeLevels() 5 4.147e-06 4.147e-06 4.147e-06 0.00% MLPoisson::prepareForSolve() 6 4.091e-06 4.091e-06 4.091e-06 0.00% MLMG::computeMLResidual() 6 3.515e-06 3.515e-06 3.515e-06 0.00% Castro::computeNewDt() 5 3.403e-06 3.403e-06 3.403e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.219e-06 3.219e-06 3.219e-06 0.00% MLMG::getGradSolution() 6 2.986e-06 2.986e-06 2.986e-06 0.00% MLMG::MLResNormInf() 6 2.164e-06 2.164e-06 2.164e-06 0.00% Gravity::set_mass_offset() 6 2.101e-06 2.101e-06 2.101e-06 0.00% MLMG::MLRhsNormInf() 6 2.082e-06 2.082e-06 2.082e-06 0.00% Castro::retry_advance_ctu() 5 2.003e-06 2.003e-06 2.003e-06 0.00% Castro::create_source_corrector() 5 1.931e-06 1.931e-06 1.931e-06 0.00% Castro::FluxRegCrseInit 5 1.49e-06 1.49e-06 1.49e-06 0.00% AmrLevel::AmrLevel() 1 1.242e-06 1.242e-06 1.242e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.183e-06 1.183e-06 1.183e-06 0.00% Castro::FluxRegFineAdd() 5 1.178e-06 1.178e-06 1.178e-06 0.00% Amr::init() 1 1.085e-06 1.085e-06 1.085e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3916 0.3916 0.3916 100.00% Amr::coarseTimeStep() 5 0.2697 0.2697 0.2697 68.87% Amr::timeStep() 5 0.2683 0.2683 0.2683 68.50% Castro::advance() 5 0.265 0.265 0.265 67.66% Castro::subcycle_advance_ctu() 5 0.2587 0.2587 0.2587 66.05% Castro::do_advance_ctu() 5 0.2586 0.2586 0.2586 66.03% Castro::construct_new_gravity() 5 0.1302 0.1302 0.1302 33.26% Gravity::solve_phi_with_mlmg() 6 0.1257 0.1257 0.1257 32.09% Gravity::solve_for_phi() 5 0.1227 0.1227 0.1227 31.33% Gravity::actual_solve_with_mlmg() 6 0.1224 0.1224 0.1224 31.26% MLMG::solve() 6 0.1117 0.1117 0.1117 28.53% MLMG::oneIter() 36 0.105 0.105 0.105 26.82% MLMG::mgVcycle() 36 0.1035 0.1035 0.1035 26.43% Castro::construct_ctu_hydro_source() 5 0.09186 0.09186 0.09186 23.46% Amr::init() 1 0.08699 0.08699 0.08699 22.21% Amr::restart() 1 0.08698 0.08698 0.08698 22.21% MLCellLinOp::smooth() 720 0.05341 0.05341 0.05341 13.64% MLCellLinOp::applyBC() 1946 0.05004 0.05004 0.05004 12.78% AmrLevel::restart() 1 0.04171 0.04171 0.04171 10.65% StateData::restartDoit() 4 0.04163 0.04163 0.04163 10.63% VisMF::Read() 3 0.04152 0.04152 0.04152 10.60% Amr::writePlotFile() 1 0.0342 0.0342 0.0342 8.73% VisMF::Write(FabArray) 1 0.03176 0.03176 0.03176 8.11% MLMG::mgVcycle_bottom 36 0.03043 0.03043 0.03043 7.77% MLMG::actualBottomSolve() 36 0.03042 0.03042 0.03042 7.77% MLCGSolver::bicgstab 36 0.03012 0.03012 0.03012 7.69% Castro::clean_state() 30 0.0207 0.0207 0.0207 5.28% FillPatchIterator::Initialize 20 0.01969 0.01969 0.01969 5.03% FillPatchSingleLevel 20 0.01894 0.01894 0.01894 4.84% StateDataPhysBCFunct::() 20 0.01693 0.01693 0.01693 4.32% MLCellLinOp::apply() 500 0.01517 0.01517 0.01517 3.87% MLMG::mgVcycle_down::0 36 0.01399 0.01399 0.01399 3.57% MLPoisson::Fsmooth() 1440 0.0137 0.0137 0.0137 3.50% FabArray::FillBoundary() 1766 0.01337 0.01337 0.01337 3.41% FillBoundary_nowait() 1766 0.01303 0.01303 0.01303 3.33% StateData::FillBoundary(geom) 160 0.01112 0.01112 0.01112 2.84% MLMG::mgVcycle_up::0 36 0.01061 0.01061 0.01061 2.71% Castro::initialize_do_advance() 5 0.01025 0.01025 0.01025 2.62% MLMG::mgVcycle_up::3 36 0.009532 0.009532 0.009532 2.43% MLCellLinOp::correctionResidual() 216 0.009231 0.009231 0.009231 2.36% Castro::computeTemp() 30 0.008475 0.008475 0.008475 2.16% MLPoisson::define() 6 0.00847 0.00847 0.00847 2.16% amrex::Dot() 484 0.0084 0.0084 0.0084 2.14% MLMG:computeResOfCorrection() 180 0.008122 0.008122 0.008122 2.07% Gravity::get_new_grav_vector() 5 0.007424 0.007424 0.007424 1.90% Castro::construct_old_gravity() 5 0.006986 0.006986 0.006986 1.78% Gravity::get_old_grav_vector() 5 0.00698 0.00698 0.00698 1.78% amrex::Copy() 463 0.006857 0.006857 0.006857 1.75% MLMG::mgVcycle_down::1 36 0.006576 0.006576 0.006576 1.68% FabArray::ParallelCopy() 380 0.00629 0.00629 0.00629 1.61% FabArray::ParallelCopy_nowait() 380 0.006173 0.006173 0.006173 1.58% Castro::do_new_sources() 5 0.006127 0.006127 0.006127 1.56% MLMG::mgVcycle_down::2 36 0.006084 0.006084 0.006084 1.55% FabArray::setVal() 537 0.006082 0.006082 0.006082 1.55% Castro::normalize_species() 30 0.00608 0.00608 0.00608 1.55% FabArray::norminf() 326 0.006025 0.006025 0.006025 1.54% Castro::expand_state() 5 0.006009 0.006009 0.006009 1.53% MLMG::mgVcycle_down::3 36 0.005972 0.005972 0.005972 1.52% Castro::initialize_advance() 5 0.005946 0.005946 0.005946 1.52% MLMG::mgVcycle_down::4 36 0.005899 0.005899 0.005899 1.51% MLCellLinOp::defineAuxData() 6 0.005795 0.005795 0.005795 1.48% Castro::enforce_min_density() 30 0.005305 0.005305 0.005305 1.35% MLCGSolver::ParallelAllReduce 659 0.00508 0.00508 0.00508 1.30% MLMG::addInterpCorrection() 180 0.004962 0.004962 0.004962 1.27% MLMG::mgVcycle_up::4 36 0.004784 0.004784 0.004784 1.22% MLMG::mgVcycle_up::1 36 0.004768 0.004768 0.004768 1.22% MLMG::mgVcycle_up::2 36 0.004716 0.004716 0.004716 1.20% amrex::average_down 180 0.004624 0.004624 0.004624 1.18% MLPoisson::Fapply() 500 0.004387 0.004387 0.004387 1.12% Castro::do_old_sources() 5 0.004278 0.004278 0.004278 1.09% FabArray::Saxpy() 355 0.003567 0.003567 0.003567 0.91% FabArray::Xpay() 361 0.003439 0.003439 0.003439 0.88% Castro::post_restart() 1 0.003322 0.003322 0.003322 0.85% Gravity::multilevel_solve_for_new_phi() 1 0.003208 0.003208 0.003208 0.82% Castro::post_timestep() 5 0.003207 0.003207 0.003207 0.82% Gravity::actual_multilevel_solve() 1 0.003196 0.003196 0.003196 0.82% MLCellLinOp::solutionResidual() 42 0.003188 0.003188 0.003188 0.81% Gravity::fill_multipole_BCs() 6 0.003119 0.003119 0.003119 0.80% MLMG::computeResidual() 36 0.00265 0.00265 0.00265 0.68% MLCellLinOp::defineBC() 6 0.002532 0.002532 0.002532 0.65% BndryData::define() 6 0.002427 0.002427 0.002427 0.62% MLMG::prepareForSolve() 6 0.002407 0.002407 0.002407 0.61% Castro::reset_internal_energy(MultiFab) 30 0.002386 0.002386 0.002386 0.61% Castro::estTimeStep() 10 0.001997 0.001997 0.001997 0.51% FabArray::LinComb() 242 0.001878 0.001878 0.001878 0.48% amrex::Add() 72 0.001825 0.001825 0.001825 0.47% Castro::construct_new_source() 25 0.001512 0.001512 0.001512 0.39% Castro::construct_new_gravity_source() 5 0.001503 0.001503 0.001503 0.38% Castro::computeNewDt() 5 0.001296 0.001296 0.001296 0.33% Castro::construct_old_source() 25 0.001105 0.001105 0.001105 0.28% Castro::construct_old_gravity_source() 5 0.001095 0.001095 0.001095 0.28% MLMG::ResNormInf() 42 0.0009291 0.0009291 0.0009291 0.24% Castro::apply_source_to_state() 10 0.0009217 0.0009217 0.0009217 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008676 0.0008676 0.0008676 0.22% Castro::enforce_speed_limit() 30 0.000797 0.000797 0.000797 0.20% FabArrayBase::getCPC() 632 0.0007847 0.0007847 0.0007847 0.20% MLMG::getGradSolution() 6 0.0007835 0.0007835 0.0007835 0.20% MLCellLinOp::compGrad() 6 0.0007806 0.0007806 0.0007806 0.20% MLCellLinOp::setLevelBC() 6 0.0007227 0.0007227 0.0007227 0.18% Castro::reset_internal_energy(Fab) 240 0.0006982 0.0006982 0.0006982 0.18% FabArray::mult() 22 0.0006669 0.0006669 0.0006669 0.17% FabArray::setDomainBndry() 20 0.0006421 0.0006421 0.0006421 0.16% Castro::check_for_nan() 10 0.000598 0.000598 0.000598 0.15% MultiFab::contains_nan() 10 0.0005919 0.0005919 0.0005919 0.15% MLPoisson::prepareForSolve() 6 0.0005851 0.0005851 0.0005851 0.15% MLCellLinOp::prepareForSolve() 6 0.000581 0.000581 0.000581 0.15% MLMG::computeMLResidual() 6 0.0005554 0.0005554 0.0005554 0.14% Gravity::update_max_rhs() 6 0.0004326 0.0004326 0.0004326 0.11% FabArrayBase::CPC::define() 244 0.0004252 0.0004252 0.0004252 0.11% Amr::InitAmr() 1 0.0003996 0.0003996 0.0003996 0.10% Castro::finalize_advance() 5 0.000317 0.000317 0.000317 0.08% FabArrayBase::getFB() 1766 0.0003147 0.0003147 0.0003147 0.08% Gravity::swapTimeLevels() 5 0.0002186 0.0002186 0.0002186 0.06% MLMG::MLResNormInf() 6 0.0001561 0.0001561 0.0001561 0.04% Castro::buildMetrics() 1 0.0001488 0.0001488 0.0001488 0.04% MultiFab::max() 6 0.0001329 0.0001329 0.0001329 0.03% MLLinOp::define() 6 0.0001279 0.0001279 0.0001279 0.03% MLLinOp::defineGrids() 6 0.0001228 0.0001228 0.0001228 0.03% MLMG::MLRhsNormInf() 6 0.0001195 0.0001195 0.0001195 0.03% FabArrayBase::FB::FB() 26 5.701e-05 5.701e-05 5.701e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.816e-05 2.816e-05 2.816e-05 0.01% Amr::writeSmallPlotFile() 1 2.788e-05 2.788e-05 2.788e-05 0.01% Castro::swap_state_time_levels() 5 2.148e-05 2.148e-05 2.148e-05 0.01% makeSFC 30 2.104e-05 2.104e-05 2.104e-05 0.01% Castro::initMFs() 1 2.022e-05 2.022e-05 2.022e-05 0.01% Castro::finalize_do_advance() 5 1.837e-05 1.837e-05 1.837e-05 0.00% DistributionMapping::Distribute() 31 9.317e-06 9.317e-06 9.317e-06 0.00% Amr::initSubcycle() 1 8.161e-06 8.161e-06 8.161e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.117e-06 5.117e-06 5.117e-06 0.00% Gravity::set_mass_offset() 6 2.101e-06 2.101e-06 2.101e-06 0.00% Castro::retry_advance_ctu() 5 2.003e-06 2.003e-06 2.003e-06 0.00% Castro::create_source_corrector() 5 1.931e-06 1.931e-06 1.931e-06 0.00% Castro::FluxRegCrseInit 5 1.49e-06 1.49e-06 1.49e-06 0.00% AmrLevel::AmrLevel() 1 1.242e-06 1.242e-06 1.242e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.183e-06 1.183e-06 1.183e-06 0.00% Castro::FluxRegFineAdd() 5 1.178e-06 1.178e-06 1.178e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-23-ge472c108e721) finalized