Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-21-gb487434b948f) initialized Starting run at 10:11:37 UTC on 2023-01-23. Successfully read inputs file ... Castro git describe: 23.01-18-gbb2758482 AMReX git describe: 23.01-21-gb487434b9 Microphysics git describe: 23.01-4-gd64aa25b reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.057019022 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.032083137 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.045141311 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.048616829 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.046970419 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.06145616 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.071481146 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.056323681 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.060179342 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.046918505 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.056304782 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.054976333 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.057509697 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.055072646 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.03187594 seconds Ending run at 10:11:38 UTC on 2023-01-23. Run time = 0.829737129 Run time without initialization = 0.693445248 Average number of zones advanced per microsecond: 3.780 Average number of zones advanced per microsecond per rank: 3.780 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.8298 ... 0.8298 ... 0.8298 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2254 0.2254 0.2254 27.16% Castro::construct_ctu_hydro_source() 10 0.1966 0.1966 0.1966 23.69% MLCellLinOp::applyBC() 4433 0.07368 0.07368 0.07368 8.88% FillBoundary_nowait() 4023 0.0356 0.0356 0.0356 4.29% MLPoisson::Fsmooth() 3280 0.03163 0.03163 0.03163 3.81% StateData::FillBoundary(geom) 328 0.02181 0.02181 0.02181 2.63% amrex::Dot() 1114 0.01979 0.01979 0.01979 2.39% StateDataPhysBCFunct::() 41 0.01714 0.01714 0.01714 2.07% Castro::normalize_species() 62 0.01463 0.01463 0.01463 1.76% amrex::Copy() 1029 0.01461 0.01461 0.01461 1.76% FabArray::norminf() 743 0.014 0.014 0.014 1.69% Castro::computeTemp() 63 0.01347 0.01347 0.01347 1.62% FabArray::setVal() 1144 0.0129 0.0129 0.0129 1.55% FabArray::ParallelCopy_nowait() 861 0.01279 0.01279 0.01279 1.54% MLPoisson::Fapply() 1142 0.01017 0.01017 0.01017 1.23% MLCellLinOp::defineAuxData() 11 0.00932 0.00932 0.00932 1.12% FabArray::Saxpy() 813 0.007957 0.007957 0.007957 0.96% FabArray::Xpay() 821 0.007908 0.007908 0.007908 0.95% MLMG::addInterpCorrection() 410 0.006458 0.006458 0.006458 0.78% Gravity::fill_multipole_BCs() 11 0.006193 0.006193 0.006193 0.75% amrex::average_down 410 0.005691 0.005691 0.005691 0.69% Castro::enforce_min_density() 62 0.005483 0.005483 0.005483 0.66% Castro::estTimeStep() 21 0.005044 0.005044 0.005044 0.61% Amr::checkPoint() 3 0.00446 0.00446 0.00446 0.54% FabArray::LinComb() 557 0.00438 0.00438 0.00438 0.53% amrex::Add() 164 0.004297 0.004297 0.004297 0.52% Castro::reset_internal_energy(MultiFab) 63 0.004256 0.004256 0.004256 0.51% BndryData::define() 11 0.00351 0.00351 0.00351 0.42% Castro::construct_new_gravity_source() 10 0.003312 0.003312 0.003312 0.40% Castro::construct_old_gravity_source() 10 0.002636 0.002636 0.002636 0.32% Castro::do_advance_ctu() 10 0.002549 0.002549 0.002549 0.31% Castro::reset_internal_energy(Fab) 504 0.00232 0.00232 0.00232 0.28% Amr::writePlotFile() 2 0.002041 0.002041 0.002041 0.25% MLCGSolver::bicgstab 82 0.002022 0.002022 0.002022 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001599 0.001599 0.001599 0.19% Gravity::actual_solve_with_mlmg() 11 0.001455 0.001455 0.001455 0.18% MLCellLinOp::setLevelBC() 11 0.001345 0.001345 0.001345 0.16% FabArray::mult() 43 0.001335 0.001335 0.001335 0.16% FabArray::setDomainBndry() 41 0.001315 0.001315 0.001315 0.16% Castro::initData() 1 0.001265 0.001265 0.001265 0.15% MLCellLinOp::smooth() 1640 0.001192 0.001192 0.001192 0.14% MultiFab::contains_nan() 20 0.001187 0.001187 0.001187 0.14% MLCellLinOp::prepareForSolve() 11 0.001075 0.001075 0.001075 0.13% Castro::enforce_speed_limit() 62 0.00106 0.00106 0.00106 0.13% MLCellLinOp::compGrad() 11 0.0009235 0.0009235 0.0009235 0.11% MLMG::prepareForSolve() 11 0.0008213 0.0008213 0.0008213 0.10% FabArray::FillBoundary() 4023 0.0007998 0.0007998 0.0007998 0.10% FabArrayBase::getCPC() 1323 0.0007205 0.0007205 0.0007205 0.09% FabArrayBase::CPC::define() 454 0.0006818 0.0006818 0.0006818 0.08% Gravity::get_new_grav_vector() 11 0.0006086 0.0006086 0.0006086 0.07% FabArrayBase::getFB() 4023 0.0005905 0.0005905 0.0005905 0.07% Gravity::get_old_grav_vector() 10 0.0005268 0.0005268 0.0005268 0.06% Castro::subcycle_advance_ctu() 10 0.0005251 0.0005251 0.0005251 0.06% Amr::InitAmr() 1 0.0004707 0.0004707 0.0004707 0.06% MLCellLinOp::apply() 1142 0.0004652 0.0004652 0.0004652 0.06% MLMG::mgVcycle() 82 0.000362 0.000362 0.000362 0.04% Amr::coarseTimeStep() 10 0.0003353 0.0003353 0.0003353 0.04% main() 1 0.0003105 0.0003105 0.0003105 0.04% MLCGSolver::ParallelAllReduce 1514 0.0002794 0.0002794 0.0002794 0.03% MultiFab::max() 11 0.0002701 0.0002701 0.0002701 0.03% FabArray::ParallelCopy() 861 0.000244 0.000244 0.000244 0.03% MLCellLinOp::correctionResidual() 492 0.0002302 0.0002302 0.0002302 0.03% FillPatchIterator::Initialize 41 0.0002112 0.0002112 0.0002112 0.03% MLCellLinOp::defineBC() 11 0.0002 0.0002 0.0002 0.02% MLLinOp::defineGrids() 11 0.0001741 0.0001741 0.0001741 0.02% Amr::timeStep() 10 0.0001592 0.0001592 0.0001592 0.02% Gravity::solve_for_phi() 10 0.0001504 0.0001504 0.0001504 0.02% StateData::checkPoint() 12 0.0001352 0.0001352 0.0001352 0.02% Castro::finalize_advance() 10 0.0001301 0.0001301 0.0001301 0.02% Gravity::update_max_rhs() 11 0.0001118 0.0001118 0.0001118 0.01% Castro::advance() 10 0.000105 0.000105 0.000105 0.01% MLMG:computeResOfCorrection() 410 0.0001043 0.0001043 0.0001043 0.01% MLMG::mgVcycle_down::0 82 9.622e-05 9.622e-05 9.622e-05 0.01% MLMG::actualBottomSolve() 82 9.273e-05 9.273e-05 9.273e-05 0.01% Castro::Castro() 1 8.652e-05 8.652e-05 8.652e-05 0.01% FabArrayBase::FB::FB() 56 8.473e-05 8.473e-05 8.473e-05 0.01% MLMG::mgVcycle_down::1 82 8.34e-05 8.34e-05 8.34e-05 0.01% Castro::clean_state() 62 7.814e-05 7.814e-05 7.814e-05 0.01% MLMG::mgVcycle_down::2 82 7.524e-05 7.524e-05 7.524e-05 0.01% AmrLevel::checkPoint() 3 7.449e-05 7.449e-05 7.449e-05 0.01% Castro::expand_state() 10 7.327e-05 7.327e-05 7.327e-05 0.01% MLMG::solve() 11 7.141e-05 7.141e-05 7.141e-05 0.01% MLMG::mgVcycle_down::3 82 7.085e-05 7.085e-05 7.085e-05 0.01% MLMG::mgVcycle_down::4 82 7.05e-05 7.05e-05 7.05e-05 0.01% Castro::initialize_advance() 10 6.45e-05 6.45e-05 6.45e-05 0.01% MLMG::mgVcycle_up::4 82 6.126e-05 6.126e-05 6.126e-05 0.01% MLMG::mgVcycle_up::0 82 5.598e-05 5.598e-05 5.598e-05 0.01% MLMG::oneIter() 82 5.387e-05 5.387e-05 5.387e-05 0.01% MLCellLinOp::solutionResidual() 93 5.181e-05 5.181e-05 5.181e-05 0.01% MLMG::mgVcycle_up::1 82 5.084e-05 5.084e-05 5.084e-05 0.01% MLMG::mgVcycle_up::3 82 5.019e-05 5.019e-05 5.019e-05 0.01% Castro::initialize_do_advance() 10 4.979e-05 4.979e-05 4.979e-05 0.01% MLMG::mgVcycle_up::2 82 4.956e-05 4.956e-05 4.956e-05 0.01% Castro::finalize_do_advance() 10 3.72e-05 3.72e-05 3.72e-05 0.00% Castro::swap_state_time_levels() 10 3.676e-05 3.676e-05 3.676e-05 0.00% Castro::post_timestep() 10 3.451e-05 3.451e-05 3.451e-05 0.00% Castro::enforce_consistent_e() 1 3.355e-05 3.355e-05 3.355e-05 0.00% MLMG::ResNormInf() 93 3.344e-05 3.344e-05 3.344e-05 0.00% Castro::construct_new_gravity() 10 3.265e-05 3.265e-05 3.265e-05 0.00% MLMG::computeResidual() 82 3.032e-05 3.032e-05 3.032e-05 0.00% StateData::define() 4 3.026e-05 3.026e-05 3.026e-05 0.00% MLMG::mgVcycle_bottom 82 2.935e-05 2.935e-05 2.935e-05 0.00% FillPatchSingleLevel 41 2.869e-05 2.869e-05 2.869e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.844e-05 2.844e-05 2.844e-05 0.00% makeSFC 55 2.556e-05 2.556e-05 2.556e-05 0.00% Amr::writeSmallPlotFile() 1 2.505e-05 2.505e-05 2.505e-05 0.00% Amr::defBaseLevel() 1 2.496e-05 2.496e-05 2.496e-05 0.00% MLPoisson::define() 11 2.357e-05 2.357e-05 2.357e-05 0.00% Amr::FinalizeInit() 1 2.096e-05 2.096e-05 2.096e-05 0.00% Castro::construct_old_source() 50 1.784e-05 1.784e-05 1.784e-05 0.00% Castro::initMFs() 1 1.778e-05 1.778e-05 1.778e-05 0.00% Castro::do_new_sources() 10 1.733e-05 1.733e-05 1.733e-05 0.00% Castro::check_for_nan() 20 1.732e-05 1.732e-05 1.732e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.625e-05 1.625e-05 1.625e-05 0.00% DistributionMapping::Distribute() 56 1.622e-05 1.622e-05 1.622e-05 0.00% Castro::buildMetrics() 1 1.621e-05 1.621e-05 1.621e-05 0.00% Castro::construct_new_source() 50 1.619e-05 1.619e-05 1.619e-05 0.00% Castro::do_old_sources() 10 1.541e-05 1.541e-05 1.541e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.504e-05 1.504e-05 1.504e-05 0.00% Castro::computeNewDt() 9 1.434e-05 1.434e-05 1.434e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.244e-05 1.244e-05 1.244e-05 0.00% Amr::InitializeInit() 1 1.178e-05 1.178e-05 1.178e-05 0.00% Castro::post_init() 1 1.057e-05 1.057e-05 1.057e-05 0.00% MLLinOp::define() 11 1.014e-05 1.014e-05 1.014e-05 0.00% Castro::apply_source_to_state() 20 9.278e-06 9.278e-06 9.278e-06 0.00% Castro::construct_old_gravity() 10 9.018e-06 9.018e-06 9.018e-06 0.00% Amr::initSubcycle() 1 9e-06 9e-06 9e-06 0.00% Gravity::swapTimeLevels() 10 8.879e-06 8.879e-06 8.879e-06 0.00% MLPoisson::prepareForSolve() 11 8.184e-06 8.184e-06 8.184e-06 0.00% MLMG::computeMLResidual() 11 8.02e-06 8.02e-06 8.02e-06 0.00% Gravity::actual_multilevel_solve() 1 7.432e-06 7.432e-06 7.432e-06 0.00% MLMG::getGradSolution() 11 5.853e-06 5.853e-06 5.853e-06 0.00% AmrLevel::checkPointPost() 3 5.334e-06 5.334e-06 5.334e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.145e-06 4.145e-06 4.145e-06 0.00% Castro::retry_advance_ctu() 10 4.086e-06 4.086e-06 4.086e-06 0.00% Gravity::set_mass_offset() 11 4.008e-06 4.008e-06 4.008e-06 0.00% MLMG::MLRhsNormInf() 11 3.881e-06 3.881e-06 3.881e-06 0.00% Castro::create_source_corrector() 10 3.621e-06 3.621e-06 3.621e-06 0.00% MLMG::MLResNormInf() 11 3.312e-06 3.312e-06 3.312e-06 0.00% Castro::computeInitialDt() 2 2.965e-06 2.965e-06 2.965e-06 0.00% Castro::FluxRegCrseInit 10 2.92e-06 2.92e-06 2.92e-06 0.00% Amr::init() 1 2.79e-06 2.79e-06 2.79e-06 0.00% Castro::FluxRegFineAdd() 10 2.131e-06 2.131e-06 2.131e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.995e-06 1.995e-06 1.995e-06 0.00% AmrLevel::checkPointPre() 3 1.722e-06 1.722e-06 1.722e-06 0.00% Castro::post_regrid() 1 1.096e-06 1.096e-06 1.096e-06 0.00% Amr::initialInit() 1 9.61e-07 9.61e-07 9.61e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8297 0.8297 0.8297 100.00% Amr::coarseTimeStep() 10 0.6614 0.6614 0.6614 79.70% Amr::timeStep() 10 0.5463 0.5463 0.5463 65.84% Castro::advance() 10 0.54 0.54 0.54 65.08% Castro::subcycle_advance_ctu() 10 0.5282 0.5282 0.5282 63.66% Castro::do_advance_ctu() 10 0.5277 0.5277 0.5277 63.60% Gravity::solve_phi_with_mlmg() 11 0.2794 0.2794 0.2794 33.67% Gravity::actual_solve_with_mlmg() 11 0.2727 0.2727 0.2727 32.86% Castro::construct_new_gravity() 10 0.2553 0.2553 0.2553 30.77% MLMG::solve() 11 0.253 0.253 0.253 30.49% Gravity::solve_for_phi() 10 0.2405 0.2405 0.2405 28.98% MLMG::oneIter() 82 0.239 0.239 0.239 28.81% MLMG::mgVcycle() 82 0.2354 0.2354 0.2354 28.37% VisMF::Write(FabArray) 11 0.2254 0.2254 0.2254 27.16% Castro::construct_ctu_hydro_source() 10 0.1966 0.1966 0.1966 23.69% Amr::checkPoint() 3 0.1685 0.1685 0.1685 20.31% AmrLevel::checkPoint() 3 0.1641 0.1641 0.1641 19.77% StateData::checkPoint() 12 0.164 0.164 0.164 19.76% Amr::init() 1 0.1357 0.1357 0.1357 16.35% MLCellLinOp::smooth() 1640 0.1186 0.1186 0.1186 14.29% MLCellLinOp::applyBC() 4433 0.1108 0.1108 0.1108 13.35% MLMG::mgVcycle_bottom 82 0.07119 0.07119 0.07119 8.58% MLMG::actualBottomSolve() 82 0.07116 0.07116 0.07116 8.58% MLCGSolver::bicgstab 82 0.07047 0.07047 0.07047 8.49% Amr::writePlotFile() 2 0.06408 0.06408 0.06408 7.72% Amr::initialInit() 1 0.04643 0.04643 0.04643 5.60% FillPatchIterator::Initialize 41 0.04446 0.04446 0.04446 5.36% FillPatchSingleLevel 41 0.04293 0.04293 0.04293 5.17% Amr::FinalizeInit() 1 0.04241 0.04241 0.04241 5.11% Castro::post_init() 1 0.04112 0.04112 0.04112 4.96% Castro::clean_state() 62 0.04047 0.04047 0.04047 4.88% Gravity::multilevel_solve_for_new_phi() 1 0.03933 0.03933 0.03933 4.74% Gravity::actual_multilevel_solve() 1 0.03932 0.03932 0.03932 4.74% StateDataPhysBCFunct::() 41 0.03895 0.03895 0.03895 4.69% FabArray::FillBoundary() 4023 0.03708 0.03708 0.03708 4.47% FillBoundary_nowait() 4023 0.03628 0.03628 0.03628 4.37% MLCellLinOp::apply() 1142 0.03515 0.03515 0.03515 4.24% MLMG::mgVcycle_down::0 82 0.03312 0.03312 0.03312 3.99% MLPoisson::Fsmooth() 3280 0.03163 0.03163 0.03163 3.81% MLMG::mgVcycle_up::0 82 0.02997 0.02997 0.02997 3.61% Castro::initialize_do_advance() 10 0.02205 0.02205 0.02205 2.66% StateData::FillBoundary(geom) 328 0.02181 0.02181 0.02181 2.63% MLCellLinOp::correctionResidual() 492 0.02158 0.02158 0.02158 2.60% Castro::computeTemp() 63 0.02004 0.02004 0.02004 2.42% amrex::Dot() 1114 0.01979 0.01979 0.01979 2.39% MLMG:computeResOfCorrection() 410 0.01902 0.01902 0.01902 2.29% Gravity::get_new_grav_vector() 11 0.0163 0.0163 0.0163 1.96% Castro::expand_state() 10 0.01571 0.01571 0.01571 1.89% MLPoisson::define() 11 0.01549 0.01549 0.01549 1.87% MLMG::mgVcycle_down::1 82 0.01527 0.01527 0.01527 1.84% Castro::normalize_species() 62 0.01463 0.01463 0.01463 1.76% amrex::Copy() 1029 0.01461 0.01461 0.01461 1.76% MLMG::mgVcycle_down::2 82 0.01419 0.01419 0.01419 1.71% FabArray::norminf() 743 0.014 0.014 0.014 1.69% Castro::construct_old_gravity() 10 0.01393 0.01393 0.01393 1.68% Gravity::get_old_grav_vector() 10 0.01392 0.01392 0.01392 1.68% MLMG::mgVcycle_down::3 82 0.01391 0.01391 0.01391 1.68% FabArray::ParallelCopy() 861 0.01384 0.01384 0.01384 1.67% MLMG::mgVcycle_down::4 82 0.01379 0.01379 0.01379 1.66% FabArray::ParallelCopy_nowait() 861 0.0136 0.0136 0.0136 1.64% FabArray::setVal() 1144 0.0129 0.0129 0.0129 1.55% Castro::do_new_sources() 10 0.01283 0.01283 0.01283 1.55% MLCGSolver::ParallelAllReduce 1514 0.01187 0.01187 0.01187 1.43% MLMG::addInterpCorrection() 410 0.01142 0.01142 0.01142 1.38% MLMG::mgVcycle_up::4 82 0.01117 0.01117 0.01117 1.35% MLMG::mgVcycle_up::1 82 0.01107 0.01107 0.01107 1.33% Castro::initialize_advance() 10 0.01101 0.01101 0.01101 1.33% MLMG::mgVcycle_up::2 82 0.01081 0.01081 0.01081 1.30% amrex::average_down 410 0.01062 0.01062 0.01062 1.28% MLCellLinOp::defineAuxData() 11 0.01061 0.01061 0.01061 1.28% MLMG::mgVcycle_up::3 82 0.01059 0.01059 0.01059 1.28% MLPoisson::Fapply() 1142 0.01017 0.01017 0.01017 1.23% Castro::do_old_sources() 10 0.00941 0.00941 0.00941 1.13% FabArray::Saxpy() 813 0.007957 0.007957 0.007957 0.96% FabArray::Xpay() 821 0.007908 0.007908 0.007908 0.95% MLCellLinOp::solutionResidual() 93 0.007052 0.007052 0.007052 0.85% Castro::reset_internal_energy(MultiFab) 63 0.006576 0.006576 0.006576 0.79% Gravity::fill_multipole_BCs() 11 0.006468 0.006468 0.006468 0.78% Castro::post_timestep() 10 0.006185 0.006185 0.006185 0.75% MLMG::computeResidual() 82 0.006098 0.006098 0.006098 0.73% Castro::enforce_min_density() 62 0.005483 0.005483 0.005483 0.66% Castro::estTimeStep() 21 0.005044 0.005044 0.005044 0.61% MLCellLinOp::defineBC() 11 0.004619 0.004619 0.004619 0.56% MLMG::prepareForSolve() 11 0.004479 0.004479 0.004479 0.54% BndryData::define() 11 0.004419 0.004419 0.004419 0.53% FabArray::LinComb() 557 0.00438 0.00438 0.00438 0.53% amrex::Add() 164 0.004297 0.004297 0.004297 0.52% Amr::InitializeInit() 1 0.004015 0.004015 0.004015 0.48% Amr::defBaseLevel() 1 0.004003 0.004003 0.004003 0.48% Castro::initData() 1 0.003512 0.003512 0.003512 0.42% Castro::construct_new_source() 50 0.003328 0.003328 0.003328 0.40% Castro::construct_new_gravity_source() 10 0.003312 0.003312 0.003312 0.40% Castro::construct_old_source() 50 0.002653 0.002653 0.002653 0.32% Castro::construct_old_gravity_source() 10 0.002636 0.002636 0.002636 0.32% Castro::computeNewDt() 9 0.002542 0.002542 0.002542 0.31% Castro::reset_internal_energy(Fab) 504 0.00232 0.00232 0.00232 0.28% MLMG::ResNormInf() 93 0.002089 0.002089 0.002089 0.25% Castro::apply_source_to_state() 20 0.001818 0.001818 0.001818 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001599 0.001599 0.001599 0.19% FabArrayBase::getCPC() 1323 0.001402 0.001402 0.001402 0.17% MLMG::getGradSolution() 11 0.00139 0.00139 0.00139 0.17% MLCellLinOp::compGrad() 11 0.001384 0.001384 0.001384 0.17% MLCellLinOp::setLevelBC() 11 0.001345 0.001345 0.001345 0.16% FabArray::mult() 43 0.001335 0.001335 0.001335 0.16% FabArray::setDomainBndry() 41 0.001315 0.001315 0.001315 0.16% Castro::check_for_nan() 20 0.001204 0.001204 0.001204 0.15% MultiFab::contains_nan() 20 0.001187 0.001187 0.001187 0.14% Castro::post_regrid() 1 0.00112 0.00112 0.00112 0.13% MLPoisson::prepareForSolve() 11 0.001084 0.001084 0.001084 0.13% MLCellLinOp::prepareForSolve() 11 0.001075 0.001075 0.001075 0.13% Castro::enforce_speed_limit() 62 0.00106 0.00106 0.00106 0.13% MLMG::computeMLResidual() 11 0.0009919 0.0009919 0.0009919 0.12% Gravity::update_max_rhs() 11 0.0008249 0.0008249 0.0008249 0.10% Castro::computeInitialDt() 2 0.0007994 0.0007994 0.0007994 0.10% FabArrayBase::CPC::define() 454 0.0006818 0.0006818 0.0006818 0.08% FabArrayBase::getFB() 4023 0.0006752 0.0006752 0.0006752 0.08% Castro::finalize_advance() 10 0.0006445 0.0006445 0.0006445 0.08% Amr::InitAmr() 1 0.0004797 0.0004797 0.0004797 0.06% Gravity::swapTimeLevels() 10 0.0004338 0.0004338 0.0004338 0.05% Castro::Castro() 1 0.0004142 0.0004142 0.0004142 0.05% MLMG::MLResNormInf() 11 0.0002886 0.0002886 0.0002886 0.03% MultiFab::max() 11 0.0002701 0.0002701 0.0002701 0.03% MLLinOp::define() 11 0.000239 0.000239 0.000239 0.03% MLLinOp::defineGrids() 11 0.0002288 0.0002288 0.0002288 0.03% MLMG::MLRhsNormInf() 11 0.000218 0.000218 0.000218 0.03% Castro::buildMetrics() 1 0.0001521 0.0001521 0.0001521 0.02% FabArrayBase::FB::FB() 56 8.473e-05 8.473e-05 8.473e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.275e-05 5.275e-05 5.275e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.651e-05 4.651e-05 4.651e-05 0.01% makeSFC 55 4.031e-05 4.031e-05 4.031e-05 0.00% Castro::finalize_do_advance() 10 3.72e-05 3.72e-05 3.72e-05 0.00% Castro::swap_state_time_levels() 10 3.676e-05 3.676e-05 3.676e-05 0.00% Castro::enforce_consistent_e() 1 3.355e-05 3.355e-05 3.355e-05 0.00% StateData::define() 4 3.026e-05 3.026e-05 3.026e-05 0.00% Amr::writeSmallPlotFile() 1 2.505e-05 2.505e-05 2.505e-05 0.00% Castro::initMFs() 1 1.778e-05 1.778e-05 1.778e-05 0.00% DistributionMapping::Distribute() 56 1.622e-05 1.622e-05 1.622e-05 0.00% Amr::initSubcycle() 1 9e-06 9e-06 9e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.616e-06 5.616e-06 5.616e-06 0.00% AmrLevel::checkPointPost() 3 5.334e-06 5.334e-06 5.334e-06 0.00% Castro::retry_advance_ctu() 10 4.086e-06 4.086e-06 4.086e-06 0.00% Gravity::set_mass_offset() 11 4.008e-06 4.008e-06 4.008e-06 0.00% Castro::create_source_corrector() 10 3.621e-06 3.621e-06 3.621e-06 0.00% Castro::FluxRegCrseInit 10 2.92e-06 2.92e-06 2.92e-06 0.00% Castro::FluxRegFineAdd() 10 2.131e-06 2.131e-06 2.131e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.995e-06 1.995e-06 1.995e-06 0.00% AmrLevel::checkPointPre() 3 1.722e-06 1.722e-06 1.722e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-21-gb487434b948f) finalized Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-21-gb487434b948f) initialized Starting run at 10:11:39 UTC on 2023-01-23. Successfully read inputs file ... Castro git describe: 23.01-18-gbb2758482 AMReX git describe: 23.01-21-gb487434b9 Microphysics git describe: 23.01-4-gd64aa25b reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.466264203 Restart time = 0.047427242 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.047045275 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.045675497 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.056307651 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.055692231 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.059440774 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.031964862 seconds Ending run at 10:11:39 UTC on 2023-01-23. Run time = 0.344518557 Run time without initialization = 0.29651812 Average number of zones advanced per microsecond: 4.420 Average number of zones advanced per microsecond per rank: 4.420 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.3445 ... 0.3445 ... 0.3445 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0913 0.0913 0.0913 26.49% VisMF::Read() 3 0.04045 0.04045 0.04045 11.74% MLCellLinOp::applyBC() 1946 0.03173 0.03173 0.03173 9.21% VisMF::Write(FabArray) 1 0.03055 0.03055 0.03055 8.87% MLPoisson::Fsmooth() 1440 0.01361 0.01361 0.01361 3.95% FillBoundary_nowait() 1766 0.01285 0.01285 0.01285 3.73% StateData::FillBoundary(geom) 160 0.01109 0.01109 0.01109 3.22% amrex::Dot() 484 0.008366 0.008366 0.008366 2.43% amrex::Copy() 463 0.006838 0.006838 0.006838 1.98% FabArray::setVal() 537 0.006173 0.006173 0.006173 1.79% Castro::computeTemp() 30 0.006125 0.006125 0.006125 1.78% Castro::enforce_min_density() 30 0.00601 0.00601 0.00601 1.74% FabArray::norminf() 326 0.005983 0.005983 0.005983 1.74% Castro::normalize_species() 30 0.005811 0.005811 0.005811 1.69% FabArray::ParallelCopy_nowait() 380 0.005754 0.005754 0.005754 1.67% StateDataPhysBCFunct::() 20 0.005554 0.005554 0.005554 1.61% MLCellLinOp::defineAuxData() 6 0.00506 0.00506 0.00506 1.47% MLPoisson::Fapply() 500 0.004337 0.004337 0.004337 1.26% FabArray::Saxpy() 355 0.003525 0.003525 0.003525 1.02% FabArray::Xpay() 361 0.003409 0.003409 0.003409 0.99% Amr::restart() 1 0.003225 0.003225 0.003225 0.94% Gravity::fill_multipole_BCs() 6 0.002952 0.002952 0.002952 0.86% MLMG::addInterpCorrection() 180 0.002778 0.002778 0.002778 0.81% amrex::average_down 180 0.002451 0.002451 0.002451 0.71% Castro::estTimeStep() 10 0.002036 0.002036 0.002036 0.59% BndryData::define() 6 0.001885 0.001885 0.001885 0.55% FabArray::LinComb() 242 0.001856 0.001856 0.001856 0.54% Castro::reset_internal_energy(MultiFab) 30 0.001856 0.001856 0.001856 0.54% amrex::Add() 72 0.001821 0.001821 0.001821 0.53% Castro::construct_new_gravity_source() 5 0.001621 0.001621 0.001621 0.47% Castro::do_advance_ctu() 5 0.001425 0.001425 0.001425 0.41% Castro::construct_old_gravity_source() 5 0.001309 0.001309 0.001309 0.38% Amr::writePlotFile() 1 0.001248 0.001248 0.001248 0.36% MLCGSolver::bicgstab 36 0.0009002 0.0009002 0.0009002 0.26% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.000888 0.000888 0.000888 0.26% MLCellLinOp::setLevelBC() 6 0.000724 0.000724 0.000724 0.21% Gravity::actual_solve_with_mlmg() 6 0.0007197 0.0007197 0.0007197 0.21% Castro::reset_internal_energy(Fab) 240 0.0006819 0.0006819 0.0006819 0.20% FabArray::mult() 22 0.0006532 0.0006532 0.0006532 0.19% FabArray::setDomainBndry() 20 0.0006224 0.0006224 0.0006224 0.18% MultiFab::contains_nan() 10 0.0005857 0.0005857 0.0005857 0.17% MLCellLinOp::prepareForSolve() 6 0.0005835 0.0005835 0.0005835 0.17% MLCellLinOp::smooth() 720 0.0005012 0.0005012 0.0005012 0.15% MLCellLinOp::compGrad() 6 0.0004811 0.0004811 0.0004811 0.14% MLMG::prepareForSolve() 6 0.0004389 0.0004389 0.0004389 0.13% FabArrayBase::CPC::define() 244 0.0003972 0.0003972 0.0003972 0.12% Amr::InitAmr() 1 0.0003933 0.0003933 0.0003933 0.11% Castro::enforce_speed_limit() 30 0.0003888 0.0003888 0.0003888 0.11% FabArray::FillBoundary() 1766 0.0003443 0.0003443 0.0003443 0.10% FabArrayBase::getCPC() 632 0.0003431 0.0003431 0.0003431 0.10% Gravity::get_old_grav_vector() 5 0.0002897 0.0002897 0.0002897 0.08% main() 1 0.0002725 0.0002725 0.0002725 0.08% Gravity::get_new_grav_vector() 5 0.0002672 0.0002672 0.0002672 0.08% FabArrayBase::getFB() 1766 0.0002434 0.0002434 0.0002434 0.07% MLCellLinOp::apply() 500 0.0002005 0.0002005 0.0002005 0.06% MLMG::mgVcycle() 36 0.0001676 0.0001676 0.0001676 0.05% Amr::coarseTimeStep() 5 0.0001593 0.0001593 0.0001593 0.05% MultiFab::max() 6 0.0001328 0.0001328 0.0001328 0.04% MLCGSolver::ParallelAllReduce 659 0.0001237 0.0001237 0.0001237 0.04% FabArray::ParallelCopy() 380 0.0001116 0.0001116 0.0001116 0.03% MLLinOp::defineGrids() 6 0.000107 0.000107 0.000107 0.03% FillPatchIterator::Initialize 20 0.0001052 0.0001052 0.0001052 0.03% MLCellLinOp::defineBC() 6 0.0001034 0.0001034 0.0001034 0.03% MLCellLinOp::correctionResidual() 216 9.36e-05 9.36e-05 9.36e-05 0.03% Amr::timeStep() 5 7.734e-05 7.734e-05 7.734e-05 0.02% Castro::subcycle_advance_ctu() 5 7.588e-05 7.588e-05 7.588e-05 0.02% AmrLevel::restart() 1 6.742e-05 6.742e-05 6.742e-05 0.02% Gravity::solve_for_phi() 5 6.111e-05 6.111e-05 6.111e-05 0.02% FabArrayBase::FB::FB() 26 5.69e-05 5.69e-05 5.69e-05 0.02% StateData::restartDoit() 4 5.686e-05 5.686e-05 5.686e-05 0.02% Gravity::update_max_rhs() 6 5.392e-05 5.392e-05 5.392e-05 0.02% Castro::advance() 5 4.872e-05 4.872e-05 4.872e-05 0.01% MLMG:computeResOfCorrection() 180 4.732e-05 4.732e-05 4.732e-05 0.01% MLMG::mgVcycle_down::1 36 4.129e-05 4.129e-05 4.129e-05 0.01% MLMG::mgVcycle_down::0 36 4.032e-05 4.032e-05 4.032e-05 0.01% Castro::expand_state() 5 3.902e-05 3.902e-05 3.902e-05 0.01% MLMG::actualBottomSolve() 36 3.893e-05 3.893e-05 3.893e-05 0.01% Castro::clean_state() 30 3.891e-05 3.891e-05 3.891e-05 0.01% MLMG::mgVcycle_down::2 36 3.68e-05 3.68e-05 3.68e-05 0.01% MLMG::solve() 6 3.378e-05 3.378e-05 3.378e-05 0.01% MLMG::mgVcycle_down::4 36 3.199e-05 3.199e-05 3.199e-05 0.01% MLMG::mgVcycle_down::3 36 3.112e-05 3.112e-05 3.112e-05 0.01% Castro::initialize_advance() 5 3.09e-05 3.09e-05 3.09e-05 0.01% MLMG::mgVcycle_up::4 36 2.896e-05 2.896e-05 2.896e-05 0.01% Castro::finalize_advance() 5 2.877e-05 2.877e-05 2.877e-05 0.01% Amr::writeSmallPlotFile() 1 2.49e-05 2.49e-05 2.49e-05 0.01% MLMG::mgVcycle_up::0 36 2.454e-05 2.454e-05 2.454e-05 0.01% Castro::buildMetrics() 1 2.395e-05 2.395e-05 2.395e-05 0.01% MLMG::oneIter() 36 2.334e-05 2.334e-05 2.334e-05 0.01% Castro::initialize_do_advance() 5 2.278e-05 2.278e-05 2.278e-05 0.01% makeSFC 30 2.272e-05 2.272e-05 2.272e-05 0.01% MLMG::mgVcycle_up::3 36 2.196e-05 2.196e-05 2.196e-05 0.01% Castro::swap_state_time_levels() 5 2.184e-05 2.184e-05 2.184e-05 0.01% MLMG::mgVcycle_up::2 36 2.177e-05 2.177e-05 2.177e-05 0.01% MLCellLinOp::solutionResidual() 42 2.17e-05 2.17e-05 2.17e-05 0.01% Castro::post_restart() 1 2.098e-05 2.098e-05 2.098e-05 0.01% MLMG::mgVcycle_up::1 36 2.095e-05 2.095e-05 2.095e-05 0.01% Castro::initMFs() 1 2.031e-05 2.031e-05 2.031e-05 0.01% Castro::create_source_corrector() 5 1.986e-05 1.986e-05 1.986e-05 0.01% Castro::finalize_do_advance() 5 1.869e-05 1.869e-05 1.869e-05 0.01% Castro::check_for_nan() 10 1.787e-05 1.787e-05 1.787e-05 0.01% MLMG::ResNormInf() 42 1.718e-05 1.718e-05 1.718e-05 0.00% MLPoisson::define() 6 1.43e-05 1.43e-05 1.43e-05 0.00% FillPatchSingleLevel 20 1.411e-05 1.411e-05 1.411e-05 0.00% MLMG::mgVcycle_bottom 36 1.402e-05 1.402e-05 1.402e-05 0.00% MLMG::computeResidual() 36 1.352e-05 1.352e-05 1.352e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.321e-05 1.321e-05 1.321e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.272e-05 1.272e-05 1.272e-05 0.00% Castro::construct_new_gravity() 5 1.185e-05 1.185e-05 1.185e-05 0.00% Castro::construct_new_source() 25 1.104e-05 1.104e-05 1.104e-05 0.00% Castro::construct_old_source() 25 9.551e-06 9.551e-06 9.551e-06 0.00% DistributionMapping::Distribute() 31 9.41e-06 9.41e-06 9.41e-06 0.00% Castro::do_new_sources() 5 8.707e-06 8.707e-06 8.707e-06 0.00% Castro::do_old_sources() 5 8.152e-06 8.152e-06 8.152e-06 0.00% Amr::initSubcycle() 1 8.044e-06 8.044e-06 8.044e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.127e-06 7.127e-06 7.127e-06 0.00% Gravity::actual_multilevel_solve() 1 6.443e-06 6.443e-06 6.443e-06 0.00% Castro::apply_source_to_state() 10 6.033e-06 6.033e-06 6.033e-06 0.00% MLLinOp::define() 6 5.892e-06 5.892e-06 5.892e-06 0.00% Castro::construct_old_gravity() 5 5.133e-06 5.133e-06 5.133e-06 0.00% Castro::post_timestep() 5 4.867e-06 4.867e-06 4.867e-06 0.00% Gravity::swapTimeLevels() 5 4.64e-06 4.64e-06 4.64e-06 0.00% MLPoisson::prepareForSolve() 6 4.08e-06 4.08e-06 4.08e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.83e-06 3.83e-06 3.83e-06 0.00% MLMG::computeMLResidual() 6 3.509e-06 3.509e-06 3.509e-06 0.00% MLMG::getGradSolution() 6 3.135e-06 3.135e-06 3.135e-06 0.00% Castro::computeNewDt() 5 3.021e-06 3.021e-06 3.021e-06 0.00% MLMG::MLResNormInf() 6 2.13e-06 2.13e-06 2.13e-06 0.00% MLMG::MLRhsNormInf() 6 1.973e-06 1.973e-06 1.973e-06 0.00% Castro::retry_advance_ctu() 5 1.899e-06 1.899e-06 1.899e-06 0.00% Gravity::set_mass_offset() 6 1.847e-06 1.847e-06 1.847e-06 0.00% Castro::FluxRegCrseInit 5 1.29e-06 1.29e-06 1.29e-06 0.00% Amr::init() 1 1.179e-06 1.179e-06 1.179e-06 0.00% Castro::FluxRegFineAdd() 5 1.159e-06 1.159e-06 1.159e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.123e-06 1.123e-06 1.123e-06 0.00% AmrLevel::AmrLevel() 1 8.18e-07 8.18e-07 8.18e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3445 0.3445 0.3445 99.99% Amr::coarseTimeStep() 5 0.2643 0.2643 0.2643 76.71% Amr::timeStep() 5 0.2628 0.2628 0.2628 76.27% Castro::advance() 5 0.2601 0.2601 0.2601 75.48% Castro::subcycle_advance_ctu() 5 0.2537 0.2537 0.2537 73.63% Castro::do_advance_ctu() 5 0.2536 0.2536 0.2536 73.61% Castro::construct_new_gravity() 5 0.1248 0.1248 0.1248 36.21% Gravity::solve_phi_with_mlmg() 6 0.1204 0.1204 0.1204 34.95% Gravity::solve_for_phi() 5 0.1174 0.1174 0.1174 34.08% Gravity::actual_solve_with_mlmg() 6 0.1172 0.1172 0.1172 34.02% MLMG::solve() 6 0.1066 0.1066 0.1066 30.93% MLMG::oneIter() 36 0.09991 0.09991 0.09991 29.00% MLMG::mgVcycle() 36 0.09838 0.09838 0.09838 28.55% Castro::construct_ctu_hydro_source() 5 0.09127 0.09127 0.09127 26.49% MLCellLinOp::smooth() 720 0.04852 0.04852 0.04852 14.08% Amr::init() 1 0.04747 0.04747 0.04747 13.78% Amr::restart() 1 0.04747 0.04747 0.04747 13.78% MLCellLinOp::applyBC() 1946 0.04522 0.04522 0.04522 13.13% AmrLevel::restart() 1 0.04064 0.04064 0.04064 11.79% StateData::restartDoit() 4 0.04056 0.04056 0.04056 11.77% VisMF::Read() 3 0.04045 0.04045 0.04045 11.74% Amr::writePlotFile() 1 0.03205 0.03205 0.03205 9.30% VisMF::Write(FabArray) 1 0.03055 0.03055 0.03055 8.87% MLMG::mgVcycle_bottom 36 0.03032 0.03032 0.03032 8.80% MLMG::actualBottomSolve() 36 0.0303 0.0303 0.0303 8.80% MLCGSolver::bicgstab 36 0.03001 0.03001 0.03001 8.71% Castro::clean_state() 30 0.02091 0.02091 0.02091 6.07% FillPatchIterator::Initialize 20 0.01936 0.01936 0.01936 5.62% FillPatchSingleLevel 20 0.01863 0.01863 0.01863 5.41% StateDataPhysBCFunct::() 20 0.01664 0.01664 0.01664 4.83% MLCellLinOp::apply() 500 0.01509 0.01509 0.01509 4.38% MLMG::mgVcycle_down::0 36 0.01399 0.01399 0.01399 4.06% MLPoisson::Fsmooth() 1440 0.01361 0.01361 0.01361 3.95% FabArray::FillBoundary() 1766 0.01349 0.01349 0.01349 3.92% FillBoundary_nowait() 1766 0.01315 0.01315 0.01315 3.82% StateData::FillBoundary(geom) 160 0.01109 0.01109 0.01109 3.22% MLMG::mgVcycle_up::0 36 0.01066 0.01066 0.01066 3.10% MLCellLinOp::correctionResidual() 216 0.009173 0.009173 0.009173 2.66% Castro::initialize_do_advance() 5 0.009158 0.009158 0.009158 2.66% Castro::computeTemp() 30 0.008663 0.008663 0.008663 2.51% MLPoisson::define() 6 0.008472 0.008472 0.008472 2.46% amrex::Dot() 484 0.008366 0.008366 0.008366 2.43% MLMG:computeResOfCorrection() 180 0.008065 0.008065 0.008065 2.34% Castro::construct_old_gravity() 5 0.007329 0.007329 0.007329 2.13% Gravity::get_old_grav_vector() 5 0.007324 0.007324 0.007324 2.13% Gravity::get_new_grav_vector() 5 0.007239 0.007239 0.007239 2.10% Castro::do_new_sources() 5 0.007144 0.007144 0.007144 2.07% amrex::Copy() 463 0.006838 0.006838 0.006838 1.98% MLMG::mgVcycle_down::1 36 0.006541 0.006541 0.006541 1.90% FabArray::ParallelCopy() 380 0.006236 0.006236 0.006236 1.81% FabArray::setVal() 537 0.006173 0.006173 0.006173 1.79% FabArray::ParallelCopy_nowait() 380 0.006125 0.006125 0.006125 1.78% MLMG::mgVcycle_down::2 36 0.006096 0.006096 0.006096 1.77% Castro::initialize_advance() 5 0.006022 0.006022 0.006022 1.75% Castro::enforce_min_density() 30 0.00601 0.00601 0.00601 1.74% FabArray::norminf() 326 0.005983 0.005983 0.005983 1.74% MLMG::mgVcycle_down::3 36 0.005938 0.005938 0.005938 1.72% MLMG::mgVcycle_down::4 36 0.005874 0.005874 0.005874 1.70% Castro::normalize_species() 30 0.005811 0.005811 0.005811 1.69% MLCellLinOp::defineAuxData() 6 0.0058 0.0058 0.0058 1.68% Castro::expand_state() 5 0.005437 0.005437 0.005437 1.58% MLCGSolver::ParallelAllReduce 659 0.00504 0.00504 0.00504 1.46% MLMG::addInterpCorrection() 180 0.004914 0.004914 0.004914 1.43% MLMG::mgVcycle_up::4 36 0.00481 0.00481 0.00481 1.40% MLMG::mgVcycle_up::1 36 0.00476 0.00476 0.00476 1.38% MLMG::mgVcycle_up::2 36 0.004657 0.004657 0.004657 1.35% amrex::average_down 180 0.004577 0.004577 0.004577 1.33% MLMG::mgVcycle_up::3 36 0.004569 0.004569 0.004569 1.33% Castro::do_old_sources() 5 0.004557 0.004557 0.004557 1.32% MLPoisson::Fapply() 500 0.004337 0.004337 0.004337 1.26% FabArray::Saxpy() 355 0.003525 0.003525 0.003525 1.02% FabArray::Xpay() 361 0.003409 0.003409 0.003409 0.99% Castro::post_restart() 1 0.003353 0.003353 0.003353 0.97% Gravity::multilevel_solve_for_new_phi() 1 0.003241 0.003241 0.003241 0.94% Gravity::actual_multilevel_solve() 1 0.003228 0.003228 0.003228 0.94% MLCellLinOp::solutionResidual() 42 0.003163 0.003163 0.003163 0.92% Gravity::fill_multipole_BCs() 6 0.003073 0.003073 0.003073 0.89% Castro::post_timestep() 5 0.002668 0.002668 0.002668 0.77% MLMG::computeResidual() 36 0.00262 0.00262 0.00262 0.76% Castro::reset_internal_energy(MultiFab) 30 0.002538 0.002538 0.002538 0.74% MLCellLinOp::defineBC() 6 0.002506 0.002506 0.002506 0.73% MLMG::prepareForSolve() 6 0.002425 0.002425 0.002425 0.70% BndryData::define() 6 0.002402 0.002402 0.002402 0.70% Castro::estTimeStep() 10 0.002036 0.002036 0.002036 0.59% FabArray::LinComb() 242 0.001856 0.001856 0.001856 0.54% amrex::Add() 72 0.001821 0.001821 0.001821 0.53% Castro::construct_new_source() 25 0.001632 0.001632 0.001632 0.47% Castro::construct_new_gravity_source() 5 0.001621 0.001621 0.001621 0.47% Castro::computeNewDt() 5 0.001352 0.001352 0.001352 0.39% Castro::construct_old_source() 25 0.001318 0.001318 0.001318 0.38% Castro::construct_old_gravity_source() 5 0.001309 0.001309 0.001309 0.38% Castro::apply_source_to_state() 10 0.0009157 0.0009157 0.0009157 0.27% MLMG::ResNormInf() 42 0.0009102 0.0009102 0.0009102 0.26% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.000888 0.000888 0.000888 0.26% FabArrayBase::getCPC() 632 0.0007403 0.0007403 0.0007403 0.21% MLMG::getGradSolution() 6 0.0007388 0.0007388 0.0007388 0.21% MLCellLinOp::compGrad() 6 0.0007357 0.0007357 0.0007357 0.21% MLCellLinOp::setLevelBC() 6 0.000724 0.000724 0.000724 0.21% Castro::reset_internal_energy(Fab) 240 0.0006819 0.0006819 0.0006819 0.20% FabArray::mult() 22 0.0006532 0.0006532 0.0006532 0.19% FabArray::setDomainBndry() 20 0.0006224 0.0006224 0.0006224 0.18% Castro::check_for_nan() 10 0.0006036 0.0006036 0.0006036 0.18% MLPoisson::prepareForSolve() 6 0.0005876 0.0005876 0.0005876 0.17% MultiFab::contains_nan() 10 0.0005857 0.0005857 0.0005857 0.17% MLCellLinOp::prepareForSolve() 6 0.0005835 0.0005835 0.0005835 0.17% MLMG::computeMLResidual() 6 0.0005597 0.0005597 0.0005597 0.16% Gravity::update_max_rhs() 6 0.0004323 0.0004323 0.0004323 0.13% Amr::InitAmr() 1 0.0004014 0.0004014 0.0004014 0.12% FabArrayBase::CPC::define() 244 0.0003972 0.0003972 0.0003972 0.12% Castro::enforce_speed_limit() 30 0.0003888 0.0003888 0.0003888 0.11% FabArrayBase::getFB() 1766 0.0003003 0.0003003 0.0003003 0.09% Castro::finalize_advance() 5 0.0002872 0.0002872 0.0002872 0.08% Castro::buildMetrics() 1 0.0002343 0.0002343 0.0002343 0.07% Gravity::swapTimeLevels() 5 0.0002196 0.0002196 0.0002196 0.06% MLLinOp::define() 6 0.0001516 0.0001516 0.0001516 0.04% MLMG::MLResNormInf() 6 0.0001481 0.0001481 0.0001481 0.04% MLLinOp::defineGrids() 6 0.0001457 0.0001457 0.0001457 0.04% MultiFab::max() 6 0.0001328 0.0001328 0.0001328 0.04% MLMG::MLRhsNormInf() 6 0.000115 0.000115 0.000115 0.03% FabArrayBase::FB::FB() 26 5.69e-05 5.69e-05 5.69e-05 0.02% MLLinOp::makeAgglomeratedDMap 6 3.762e-05 3.762e-05 3.762e-05 0.01% makeSFC 30 3.049e-05 3.049e-05 3.049e-05 0.01% Amr::writeSmallPlotFile() 1 2.49e-05 2.49e-05 2.49e-05 0.01% Castro::swap_state_time_levels() 5 2.184e-05 2.184e-05 2.184e-05 0.01% Castro::initMFs() 1 2.031e-05 2.031e-05 2.031e-05 0.01% Castro::create_source_corrector() 5 1.986e-05 1.986e-05 1.986e-05 0.01% Castro::finalize_do_advance() 5 1.869e-05 1.869e-05 1.869e-05 0.01% DistributionMapping::Distribute() 31 9.41e-06 9.41e-06 9.41e-06 0.00% Amr::initSubcycle() 1 8.044e-06 8.044e-06 8.044e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.47e-06 5.47e-06 5.47e-06 0.00% Castro::retry_advance_ctu() 5 1.899e-06 1.899e-06 1.899e-06 0.00% Gravity::set_mass_offset() 6 1.847e-06 1.847e-06 1.847e-06 0.00% Castro::FluxRegCrseInit 5 1.29e-06 1.29e-06 1.29e-06 0.00% Castro::FluxRegFineAdd() 5 1.159e-06 1.159e-06 1.159e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.123e-06 1.123e-06 1.123e-06 0.00% AmrLevel::AmrLevel() 1 8.18e-07 8.18e-07 8.18e-07 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-21-gb487434b948f) finalized