Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-11-g42b7e2706638) initialized Starting run at 10:09:37 UTC on 2023-01-09. Successfully read inputs file ... Castro git describe: 23.01-12-g63195e409 AMReX git describe: 23.01-11-g42b7e2706 Microphysics git describe: 23.01 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.056970139 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.03259992 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.047379752 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.049190216 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.057417251 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.056980581 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.055932966 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.056068051 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.073806336 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.063023477 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.053512102 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.055726398 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.057855458 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.055873733 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.033527288 seconds Ending run at 10:09:38 UTC on 2023-01-09. Run time = 0.854453165 Run time without initialization = 0.716937451 Average number of zones advanced per microsecond: 3.656 Average number of zones advanced per microsecond per rank: 3.656 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.8545 ... 0.8545 ... 0.8545 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2282 0.2282 0.2282 26.71% Castro::construct_ctu_hydro_source() 10 0.2255 0.2255 0.2255 26.39% MLCellLinOp::applyBC() 4433 0.07311 0.07311 0.07311 8.56% MLPoisson::Fsmooth() 3280 0.03147 0.03147 0.03147 3.68% FillBoundary_nowait() 4023 0.03042 0.03042 0.03042 3.56% StateData::FillBoundary(geom) 328 0.02234 0.02234 0.02234 2.61% amrex::Dot() 1114 0.01957 0.01957 0.01957 2.29% amrex::Copy() 1029 0.01448 0.01448 0.01448 1.70% Castro::normalize_species() 62 0.01439 0.01439 0.01439 1.68% Castro::computeTemp() 63 0.01434 0.01434 0.01434 1.68% StateDataPhysBCFunct::() 41 0.01418 0.01418 0.01418 1.66% FabArray::norminf() 743 0.01385 0.01385 0.01385 1.62% FabArray::setVal() 1144 0.01277 0.01277 0.01277 1.49% FabArray::ParallelCopy_nowait() 861 0.01265 0.01265 0.01265 1.48% MLPoisson::Fapply() 1142 0.01009 0.01009 0.01009 1.18% MLCellLinOp::defineAuxData() 11 0.009214 0.009214 0.009214 1.08% FabArray::Saxpy() 813 0.007872 0.007872 0.007872 0.92% FabArray::Xpay() 821 0.007839 0.007839 0.007839 0.92% Castro::enforce_min_density() 62 0.007401 0.007401 0.007401 0.87% MLMG::addInterpCorrection() 410 0.006421 0.006421 0.006421 0.75% Gravity::fill_multipole_BCs() 11 0.006337 0.006337 0.006337 0.74% Castro::estTimeStep() 21 0.006093 0.006093 0.006093 0.71% amrex::average_down 410 0.005625 0.005625 0.005625 0.66% Castro::reset_internal_energy(MultiFab) 63 0.004424 0.004424 0.004424 0.52% FabArray::LinComb() 557 0.004356 0.004356 0.004356 0.51% amrex::Add() 164 0.00426 0.00426 0.00426 0.50% Amr::checkPoint() 3 0.004009 0.004009 0.004009 0.47% BndryData::define() 11 0.00348 0.00348 0.00348 0.41% Castro::construct_new_gravity_source() 10 0.003214 0.003214 0.003214 0.38% Castro::do_advance_ctu() 10 0.002689 0.002689 0.002689 0.31% Castro::construct_old_gravity_source() 10 0.002639 0.002639 0.002639 0.31% Amr::writePlotFile() 2 0.002333 0.002333 0.002333 0.27% MLCGSolver::bicgstab 82 0.002076 0.002076 0.002076 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001579 0.001579 0.001579 0.18% Castro::reset_internal_energy(Fab) 504 0.001451 0.001451 0.001451 0.17% Gravity::actual_solve_with_mlmg() 11 0.001404 0.001404 0.001404 0.16% MLCellLinOp::setLevelBC() 11 0.00134 0.00134 0.00134 0.16% FabArray::setDomainBndry() 41 0.001321 0.001321 0.001321 0.15% FabArray::mult() 43 0.001317 0.001317 0.001317 0.15% Castro::initData() 1 0.001253 0.001253 0.001253 0.15% MultiFab::contains_nan() 20 0.001172 0.001172 0.001172 0.14% MLCellLinOp::smooth() 1640 0.001137 0.001137 0.001137 0.13% MLCellLinOp::prepareForSolve() 11 0.001076 0.001076 0.001076 0.13% Castro::enforce_speed_limit() 62 0.0009392 0.0009392 0.0009392 0.11% MLCellLinOp::compGrad() 11 0.0008958 0.0008958 0.0008958 0.10% MLMG::prepareForSolve() 11 0.0008232 0.0008232 0.0008232 0.10% FabArray::FillBoundary() 4023 0.0007967 0.0007967 0.0007967 0.09% FabArrayBase::getCPC() 1323 0.0006951 0.0006951 0.0006951 0.08% FabArrayBase::CPC::define() 454 0.0006822 0.0006822 0.0006822 0.08% FabArrayBase::getFB() 4023 0.0006032 0.0006032 0.0006032 0.07% Gravity::get_new_grav_vector() 11 0.0005992 0.0005992 0.0005992 0.07% Castro::subcycle_advance_ctu() 10 0.000594 0.000594 0.000594 0.07% Gravity::get_old_grav_vector() 10 0.0005337 0.0005337 0.0005337 0.06% Amr::InitAmr() 1 0.0005005 0.0005005 0.0005005 0.06% MLCellLinOp::apply() 1142 0.0004468 0.0004468 0.0004468 0.05% MLMG::mgVcycle() 82 0.0003978 0.0003978 0.0003978 0.05% Amr::coarseTimeStep() 10 0.0003442 0.0003442 0.0003442 0.04% main() 1 0.000283 0.000283 0.000283 0.03% MLCGSolver::ParallelAllReduce 1514 0.0002757 0.0002757 0.0002757 0.03% MultiFab::max() 11 0.0002557 0.0002557 0.0002557 0.03% FabArray::ParallelCopy() 861 0.0002397 0.0002397 0.0002397 0.03% MLCellLinOp::correctionResidual() 492 0.0002272 0.0002272 0.0002272 0.03% MLCellLinOp::defineBC() 11 0.0002017 0.0002017 0.0002017 0.02% FillPatchIterator::Initialize 41 0.0002009 0.0002009 0.0002009 0.02% MLLinOp::defineGrids() 11 0.0001841 0.0001841 0.0001841 0.02% Gravity::solve_for_phi() 10 0.0001735 0.0001735 0.0001735 0.02% Amr::timeStep() 10 0.000165 0.000165 0.000165 0.02% StateData::checkPoint() 12 0.0001438 0.0001438 0.0001438 0.02% MLMG:computeResOfCorrection() 410 0.0001138 0.0001138 0.0001138 0.01% Gravity::update_max_rhs() 11 0.0001094 0.0001094 0.0001094 0.01% MLMG::mgVcycle_down::0 82 9.621e-05 9.621e-05 9.621e-05 0.01% MLMG::actualBottomSolve() 82 9.174e-05 9.174e-05 9.174e-05 0.01% FabArrayBase::FB::FB() 56 8.994e-05 8.994e-05 8.994e-05 0.01% Castro::finalize_advance() 10 8.971e-05 8.971e-05 8.971e-05 0.01% Castro::clean_state() 62 8.215e-05 8.215e-05 8.215e-05 0.01% Castro::advance() 10 8.001e-05 8.001e-05 8.001e-05 0.01% Castro::Castro() 1 7.876e-05 7.876e-05 7.876e-05 0.01% AmrLevel::checkPoint() 3 7.801e-05 7.801e-05 7.801e-05 0.01% MLMG::mgVcycle_down::1 82 7.742e-05 7.742e-05 7.742e-05 0.01% Castro::expand_state() 10 7.404e-05 7.404e-05 7.404e-05 0.01% MLMG::mgVcycle_down::2 82 7.388e-05 7.388e-05 7.388e-05 0.01% MLMG::solve() 11 7.203e-05 7.203e-05 7.203e-05 0.01% MLMG::mgVcycle_down::3 82 6.904e-05 6.904e-05 6.904e-05 0.01% MLMG::mgVcycle_down::4 82 6.804e-05 6.804e-05 6.804e-05 0.01% Castro::initialize_advance() 10 6.297e-05 6.297e-05 6.297e-05 0.01% MLMG::mgVcycle_up::4 82 5.381e-05 5.381e-05 5.381e-05 0.01% MLMG::oneIter() 82 5.073e-05 5.073e-05 5.073e-05 0.01% MLMG::mgVcycle_up::0 82 5.057e-05 5.057e-05 5.057e-05 0.01% MLCellLinOp::solutionResidual() 93 4.695e-05 4.695e-05 4.695e-05 0.01% MLMG::mgVcycle_up::1 82 4.619e-05 4.619e-05 4.619e-05 0.01% MLMG::mgVcycle_up::3 82 4.475e-05 4.475e-05 4.475e-05 0.01% MLMG::mgVcycle_up::2 82 4.373e-05 4.373e-05 4.373e-05 0.01% Castro::initialize_do_advance() 10 4.357e-05 4.357e-05 4.357e-05 0.01% Castro::swap_state_time_levels() 10 3.542e-05 3.542e-05 3.542e-05 0.00% Castro::enforce_consistent_e() 1 3.361e-05 3.361e-05 3.361e-05 0.00% MLMG::computeResidual() 82 3.317e-05 3.317e-05 3.317e-05 0.00% Castro::finalize_do_advance() 10 3.16e-05 3.16e-05 3.16e-05 0.00% MLMG::ResNormInf() 93 3.134e-05 3.134e-05 3.134e-05 0.00% Castro::construct_new_gravity() 10 3.124e-05 3.124e-05 3.124e-05 0.00% MLMG::mgVcycle_bottom 82 3.097e-05 3.097e-05 3.097e-05 0.00% StateData::define() 4 2.95e-05 2.95e-05 2.95e-05 0.00% FillPatchSingleLevel 41 2.949e-05 2.949e-05 2.949e-05 0.00% makeSFC 55 2.667e-05 2.667e-05 2.667e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.457e-05 2.457e-05 2.457e-05 0.00% Amr::writeSmallPlotFile() 1 2.442e-05 2.442e-05 2.442e-05 0.00% MLPoisson::define() 11 2.386e-05 2.386e-05 2.386e-05 0.00% Castro::create_source_corrector() 10 2.382e-05 2.382e-05 2.382e-05 0.00% Amr::FinalizeInit() 1 2.192e-05 2.192e-05 2.192e-05 0.00% Amr::defBaseLevel() 1 2.149e-05 2.149e-05 2.149e-05 0.00% Castro::initMFs() 1 1.827e-05 1.827e-05 1.827e-05 0.00% Castro::construct_old_source() 50 1.749e-05 1.749e-05 1.749e-05 0.00% Castro::do_new_sources() 10 1.722e-05 1.722e-05 1.722e-05 0.00% Castro::buildMetrics() 1 1.667e-05 1.667e-05 1.667e-05 0.00% Castro::construct_new_source() 50 1.634e-05 1.634e-05 1.634e-05 0.00% Amr::InitializeInit() 1 1.633e-05 1.633e-05 1.633e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.595e-05 1.595e-05 1.595e-05 0.00% Castro::do_old_sources() 10 1.59e-05 1.59e-05 1.59e-05 0.00% DistributionMapping::Distribute() 56 1.483e-05 1.483e-05 1.483e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.44e-05 1.44e-05 1.44e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.309e-05 1.309e-05 1.309e-05 0.00% Castro::check_for_nan() 20 1.244e-05 1.244e-05 1.244e-05 0.00% Castro::post_init() 1 1.106e-05 1.106e-05 1.106e-05 0.00% MLLinOp::define() 11 1.096e-05 1.096e-05 1.096e-05 0.00% Castro::construct_old_gravity() 10 1.017e-05 1.017e-05 1.017e-05 0.00% Castro::post_timestep() 10 9.871e-06 9.871e-06 9.871e-06 0.00% Castro::apply_source_to_state() 20 9.721e-06 9.721e-06 9.721e-06 0.00% MLPoisson::prepareForSolve() 11 9.191e-06 9.191e-06 9.191e-06 0.00% Gravity::swapTimeLevels() 10 9.027e-06 9.027e-06 9.027e-06 0.00% Amr::initSubcycle() 1 8.46e-06 8.46e-06 8.46e-06 0.00% MLMG::computeMLResidual() 11 8.455e-06 8.455e-06 8.455e-06 0.00% Gravity::actual_multilevel_solve() 1 7.665e-06 7.665e-06 7.665e-06 0.00% Castro::computeNewDt() 9 6.497e-06 6.497e-06 6.497e-06 0.00% MLMG::getGradSolution() 11 6.051e-06 6.051e-06 6.051e-06 0.00% AmrLevel::checkPointPost() 3 4.835e-06 4.835e-06 4.835e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.532e-06 4.532e-06 4.532e-06 0.00% MLMG::MLRhsNormInf() 11 3.79e-06 3.79e-06 3.79e-06 0.00% Castro::retry_advance_ctu() 10 3.64e-06 3.64e-06 3.64e-06 0.00% Gravity::set_mass_offset() 11 3.573e-06 3.573e-06 3.573e-06 0.00% MLMG::MLResNormInf() 11 3.412e-06 3.412e-06 3.412e-06 0.00% Castro::computeInitialDt() 2 2.865e-06 2.865e-06 2.865e-06 0.00% Castro::FluxRegCrseInit 10 2.759e-06 2.759e-06 2.759e-06 0.00% Amr::init() 1 2.686e-06 2.686e-06 2.686e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.036e-06 2.036e-06 2.036e-06 0.00% Castro::FluxRegFineAdd() 10 2.023e-06 2.023e-06 2.023e-06 0.00% AmrLevel::checkPointPre() 3 1.682e-06 1.682e-06 1.682e-06 0.00% Amr::initialInit() 1 1.143e-06 1.143e-06 1.143e-06 0.00% Castro::post_regrid() 1 1.077e-06 1.077e-06 1.077e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8545 0.8545 0.8545 100.00% Amr::coarseTimeStep() 10 0.6832 0.6832 0.6832 79.95% Amr::timeStep() 10 0.5668 0.5668 0.5668 66.33% Castro::advance() 10 0.5609 0.5609 0.5609 65.64% Castro::subcycle_advance_ctu() 10 0.5498 0.5498 0.5498 64.35% Castro::do_advance_ctu() 10 0.5492 0.5492 0.5492 64.28% Gravity::solve_phi_with_mlmg() 11 0.2722 0.2722 0.2722 31.86% Gravity::actual_solve_with_mlmg() 11 0.2654 0.2654 0.2654 31.06% Castro::construct_new_gravity() 10 0.2485 0.2485 0.2485 29.08% MLMG::solve() 11 0.2459 0.2459 0.2459 28.78% Gravity::solve_for_phi() 10 0.2333 0.2333 0.2333 27.31% MLMG::oneIter() 82 0.2321 0.2321 0.2321 27.16% MLMG::mgVcycle() 82 0.2285 0.2285 0.2285 26.74% VisMF::Write(FabArray) 11 0.2282 0.2282 0.2282 26.71% Castro::construct_ctu_hydro_source() 10 0.2255 0.2255 0.2255 26.39% Amr::checkPoint() 3 0.1691 0.1691 0.1691 19.78% AmrLevel::checkPoint() 3 0.165 0.165 0.165 19.31% StateData::checkPoint() 12 0.165 0.165 0.165 19.31% Amr::init() 1 0.1369 0.1369 0.1369 16.02% MLCellLinOp::smooth() 1640 0.1128 0.1128 0.1128 13.20% MLCellLinOp::applyBC() 4433 0.105 0.105 0.105 12.29% MLMG::mgVcycle_bottom 82 0.07056 0.07056 0.07056 8.26% MLMG::actualBottomSolve() 82 0.07053 0.07053 0.07053 8.25% MLCGSolver::bicgstab 82 0.06984 0.06984 0.06984 8.17% Amr::writePlotFile() 2 0.06626 0.06626 0.06626 7.75% Amr::initialInit() 1 0.04718 0.04718 0.04718 5.52% Amr::FinalizeInit() 1 0.04299 0.04299 0.04299 5.03% Castro::clean_state() 62 0.04219 0.04219 0.04219 4.94% FillPatchIterator::Initialize 41 0.04202 0.04202 0.04202 4.92% Castro::post_init() 1 0.04156 0.04156 0.04156 4.86% FillPatchSingleLevel 41 0.04049 0.04049 0.04049 4.74% Gravity::multilevel_solve_for_new_phi() 1 0.03935 0.03935 0.03935 4.60% Gravity::actual_multilevel_solve() 1 0.03933 0.03933 0.03933 4.60% StateDataPhysBCFunct::() 41 0.03652 0.03652 0.03652 4.27% MLCellLinOp::apply() 1142 0.03488 0.03488 0.03488 4.08% MLMG::mgVcycle_down::0 82 0.03278 0.03278 0.03278 3.84% FabArray::FillBoundary() 4023 0.03191 0.03191 0.03191 3.73% MLPoisson::Fsmooth() 3280 0.03147 0.03147 0.03147 3.68% FillBoundary_nowait() 4023 0.03112 0.03112 0.03112 3.64% MLMG::mgVcycle_up::0 82 0.02482 0.02482 0.02482 2.90% StateData::FillBoundary(geom) 328 0.02234 0.02234 0.02234 2.61% MLCellLinOp::correctionResidual() 492 0.02139 0.02139 0.02139 2.50% Castro::computeTemp() 63 0.02021 0.02021 0.02021 2.37% amrex::Dot() 1114 0.01957 0.01957 0.01957 2.29% Castro::initialize_do_advance() 10 0.01949 0.01949 0.01949 2.28% MLMG:computeResOfCorrection() 410 0.01886 0.01886 0.01886 2.21% Gravity::get_new_grav_vector() 11 0.01701 0.01701 0.01701 1.99% MLPoisson::define() 11 0.01534 0.01534 0.01534 1.80% MLMG::mgVcycle_down::1 82 0.01513 0.01513 0.01513 1.77% amrex::Copy() 1029 0.01448 0.01448 0.01448 1.70% Castro::normalize_species() 62 0.01439 0.01439 0.01439 1.68% Castro::construct_old_gravity() 10 0.01422 0.01422 0.01422 1.66% Gravity::get_old_grav_vector() 10 0.01421 0.01421 0.01421 1.66% MLMG::mgVcycle_down::2 82 0.01408 0.01408 0.01408 1.65% FabArray::norminf() 743 0.01385 0.01385 0.01385 1.62% MLMG::mgVcycle_down::3 82 0.01378 0.01378 0.01378 1.61% FabArray::ParallelCopy() 861 0.01367 0.01367 0.01367 1.60% MLMG::mgVcycle_down::4 82 0.0136 0.0136 0.0136 1.59% FabArray::ParallelCopy_nowait() 861 0.01343 0.01343 0.01343 1.57% FabArray::setVal() 1144 0.01277 0.01277 0.01277 1.49% Castro::do_new_sources() 10 0.01273 0.01273 0.01273 1.49% Castro::expand_state() 10 0.01191 0.01191 0.01191 1.39% MLCGSolver::ParallelAllReduce 1514 0.0117 0.0117 0.0117 1.37% MLMG::addInterpCorrection() 410 0.0113 0.0113 0.0113 1.32% MLMG::mgVcycle_up::4 82 0.0111 0.0111 0.0111 1.30% MLMG::mgVcycle_up::1 82 0.011 0.011 0.011 1.29% MLMG::mgVcycle_up::2 82 0.01074 0.01074 0.01074 1.26% MLMG::mgVcycle_up::3 82 0.01055 0.01055 0.01055 1.23% MLCellLinOp::defineAuxData() 11 0.01049 0.01049 0.01049 1.23% amrex::average_down 410 0.01048 0.01048 0.01048 1.23% Castro::initialize_advance() 10 0.01035 0.01035 0.01035 1.21% MLPoisson::Fapply() 1142 0.01009 0.01009 0.01009 1.18% Castro::do_old_sources() 10 0.009919 0.009919 0.009919 1.16% FabArray::Saxpy() 813 0.007872 0.007872 0.007872 0.92% FabArray::Xpay() 821 0.007839 0.007839 0.007839 0.92% Castro::enforce_min_density() 62 0.007401 0.007401 0.007401 0.87% MLCellLinOp::solutionResidual() 93 0.006979 0.006979 0.006979 0.82% Gravity::fill_multipole_BCs() 11 0.006597 0.006597 0.006597 0.77% Castro::estTimeStep() 21 0.006093 0.006093 0.006093 0.71% MLMG::computeResidual() 82 0.006032 0.006032 0.006032 0.71% Castro::reset_internal_energy(MultiFab) 63 0.005875 0.005875 0.005875 0.69% Castro::post_timestep() 10 0.005763 0.005763 0.005763 0.67% MLCellLinOp::defineBC() 11 0.004583 0.004583 0.004583 0.54% MLMG::prepareForSolve() 11 0.004452 0.004452 0.004452 0.52% BndryData::define() 11 0.004381 0.004381 0.004381 0.51% FabArray::LinComb() 557 0.004356 0.004356 0.004356 0.51% amrex::Add() 164 0.00426 0.00426 0.00426 0.50% Amr::InitializeInit() 1 0.00419 0.00419 0.00419 0.49% Amr::defBaseLevel() 1 0.004174 0.004174 0.004174 0.49% Castro::initData() 1 0.003694 0.003694 0.003694 0.43% Castro::computeNewDt() 9 0.003314 0.003314 0.003314 0.39% Castro::construct_new_source() 50 0.00323 0.00323 0.00323 0.38% Castro::construct_new_gravity_source() 10 0.003214 0.003214 0.003214 0.38% Castro::construct_old_source() 50 0.002656 0.002656 0.002656 0.31% Castro::construct_old_gravity_source() 10 0.002639 0.002639 0.002639 0.31% MLMG::ResNormInf() 93 0.002072 0.002072 0.002072 0.24% Castro::apply_source_to_state() 20 0.001808 0.001808 0.001808 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001579 0.001579 0.001579 0.18% Castro::reset_internal_energy(Fab) 504 0.001451 0.001451 0.001451 0.17% FabArrayBase::getCPC() 1323 0.001377 0.001377 0.001377 0.16% MLMG::getGradSolution() 11 0.001368 0.001368 0.001368 0.16% MLCellLinOp::compGrad() 11 0.001362 0.001362 0.001362 0.16% MLCellLinOp::setLevelBC() 11 0.00134 0.00134 0.00134 0.16% FabArray::setDomainBndry() 41 0.001321 0.001321 0.001321 0.15% FabArray::mult() 43 0.001317 0.001317 0.001317 0.15% Castro::check_for_nan() 20 0.001184 0.001184 0.001184 0.14% MultiFab::contains_nan() 20 0.001172 0.001172 0.001172 0.14% Castro::post_regrid() 1 0.001152 0.001152 0.001152 0.13% MLPoisson::prepareForSolve() 11 0.001085 0.001085 0.001085 0.13% MLCellLinOp::prepareForSolve() 11 0.001076 0.001076 0.001076 0.13% MLMG::computeMLResidual() 11 0.0009889 0.0009889 0.0009889 0.12% Castro::computeInitialDt() 2 0.0009492 0.0009492 0.0009492 0.11% Castro::enforce_speed_limit() 62 0.0009392 0.0009392 0.0009392 0.11% Gravity::update_max_rhs() 11 0.0008038 0.0008038 0.0008038 0.09% FabArrayBase::getFB() 4023 0.0006932 0.0006932 0.0006932 0.08% FabArrayBase::CPC::define() 454 0.0006822 0.0006822 0.0006822 0.08% Castro::finalize_advance() 10 0.000598 0.000598 0.000598 0.07% Amr::InitAmr() 1 0.000509 0.000509 0.000509 0.06% Gravity::swapTimeLevels() 10 0.0004276 0.0004276 0.0004276 0.05% Castro::Castro() 1 0.0004068 0.0004068 0.0004068 0.05% MLMG::MLResNormInf() 11 0.0002796 0.0002796 0.0002796 0.03% MultiFab::max() 11 0.0002557 0.0002557 0.0002557 0.03% MLLinOp::define() 11 0.0002504 0.0002504 0.0002504 0.03% MLLinOp::defineGrids() 11 0.0002394 0.0002394 0.0002394 0.03% MLMG::MLRhsNormInf() 11 0.0002213 0.0002213 0.0002213 0.03% Castro::buildMetrics() 1 0.0001514 0.0001514 0.0001514 0.02% FabArrayBase::FB::FB() 56 8.994e-05 8.994e-05 8.994e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.326e-05 5.326e-05 5.326e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.545e-05 4.545e-05 4.545e-05 0.01% makeSFC 55 4.017e-05 4.017e-05 4.017e-05 0.00% Castro::swap_state_time_levels() 10 3.542e-05 3.542e-05 3.542e-05 0.00% Castro::enforce_consistent_e() 1 3.361e-05 3.361e-05 3.361e-05 0.00% Castro::finalize_do_advance() 10 3.16e-05 3.16e-05 3.16e-05 0.00% StateData::define() 4 2.95e-05 2.95e-05 2.95e-05 0.00% Amr::writeSmallPlotFile() 1 2.442e-05 2.442e-05 2.442e-05 0.00% Castro::create_source_corrector() 10 2.382e-05 2.382e-05 2.382e-05 0.00% Castro::initMFs() 1 1.827e-05 1.827e-05 1.827e-05 0.00% DistributionMapping::Distribute() 56 1.483e-05 1.483e-05 1.483e-05 0.00% Amr::initSubcycle() 1 8.46e-06 8.46e-06 8.46e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.866e-06 5.866e-06 5.866e-06 0.00% AmrLevel::checkPointPost() 3 4.835e-06 4.835e-06 4.835e-06 0.00% Castro::retry_advance_ctu() 10 3.64e-06 3.64e-06 3.64e-06 0.00% Gravity::set_mass_offset() 11 3.573e-06 3.573e-06 3.573e-06 0.00% Castro::FluxRegCrseInit 10 2.759e-06 2.759e-06 2.759e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.036e-06 2.036e-06 2.036e-06 0.00% Castro::FluxRegFineAdd() 10 2.023e-06 2.023e-06 2.023e-06 0.00% AmrLevel::checkPointPre() 3 1.682e-06 1.682e-06 1.682e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-11-g42b7e2706638) finalized Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-11-g42b7e2706638) initialized Starting run at 10:09:39 UTC on 2023-01-09. Successfully read inputs file ... Castro git describe: 23.01-12-g63195e409 AMReX git describe: 23.01-11-g42b7e2706 Microphysics git describe: 23.01 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.460498443 Restart time = 0.047563769 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.053413202 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.06059234 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.059159812 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.06032784 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.064858459 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.032810989 seconds Ending run at 10:09:39 UTC on 2023-01-09. Run time = 0.379728039 Run time without initialization = 0.331566899 Average number of zones advanced per microsecond: 3.953 Average number of zones advanced per microsecond per rank: 3.953 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.3798 ... 0.3798 ... 0.3798 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1019 0.1019 0.1019 26.82% VisMF::Read() 3 0.04125 0.04125 0.04125 10.86% MLCellLinOp::applyBC() 1946 0.0351 0.0351 0.0351 9.24% VisMF::Write(FabArray) 1 0.03128 0.03128 0.03128 8.24% StateData::FillBoundary(geom) 160 0.02833 0.02833 0.02833 7.46% MLPoisson::Fsmooth() 1440 0.01482 0.01482 0.01482 3.90% FillBoundary_nowait() 1766 0.01287 0.01287 0.01287 3.39% amrex::Dot() 484 0.00894 0.00894 0.00894 2.35% amrex::Copy() 463 0.007113 0.007113 0.007113 1.87% FabArray::setVal() 537 0.006481 0.006481 0.006481 1.71% FabArray::ParallelCopy_nowait() 380 0.006322 0.006322 0.006322 1.66% FabArray::norminf() 326 0.00631 0.00631 0.00631 1.66% Castro::normalize_species() 30 0.005995 0.005995 0.005995 1.58% Castro::computeTemp() 30 0.005696 0.005696 0.005696 1.50% MLCellLinOp::defineAuxData() 6 0.005599 0.005599 0.005599 1.47% Castro::enforce_min_density() 30 0.005106 0.005106 0.005106 1.34% MLPoisson::Fapply() 500 0.004641 0.004641 0.004641 1.22% FabArray::Saxpy() 355 0.003706 0.003706 0.003706 0.98% FabArray::Xpay() 361 0.003586 0.003586 0.003586 0.94% MLMG::addInterpCorrection() 180 0.003052 0.003052 0.003052 0.80% Gravity::fill_multipole_BCs() 6 0.00285 0.00285 0.00285 0.75% amrex::average_down 180 0.002739 0.002739 0.002739 0.72% Amr::restart() 1 0.002569 0.002569 0.002569 0.68% StateDataPhysBCFunct::() 20 0.002499 0.002499 0.002499 0.66% Castro::estTimeStep() 10 0.002322 0.002322 0.002322 0.61% BndryData::define() 6 0.002057 0.002057 0.002057 0.54% FabArray::LinComb() 242 0.00197 0.00197 0.00197 0.52% amrex::Add() 72 0.001836 0.001836 0.001836 0.48% Castro::reset_internal_energy(MultiFab) 30 0.001743 0.001743 0.001743 0.46% Castro::construct_new_gravity_source() 5 0.001632 0.001632 0.001632 0.43% Castro::construct_old_gravity_source() 5 0.001358 0.001358 0.001358 0.36% Amr::writePlotFile() 1 0.001357 0.001357 0.001357 0.36% Castro::do_advance_ctu() 5 0.001288 0.001288 0.001288 0.34% MLCGSolver::bicgstab 36 0.001158 0.001158 0.001158 0.30% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009643 0.0009643 0.0009643 0.25% Castro::reset_internal_energy(Fab) 240 0.0008642 0.0008642 0.0008642 0.23% MLCellLinOp::smooth() 720 0.0007642 0.0007642 0.0007642 0.20% MLCellLinOp::setLevelBC() 6 0.0007446 0.0007446 0.0007446 0.20% Gravity::actual_solve_with_mlmg() 6 0.0007163 0.0007163 0.0007163 0.19% FabArray::mult() 22 0.0006654 0.0006654 0.0006654 0.18% FabArray::setDomainBndry() 20 0.0006551 0.0006551 0.0006551 0.17% MLCellLinOp::prepareForSolve() 6 0.0006521 0.0006521 0.0006521 0.17% Castro::enforce_speed_limit() 30 0.0006249 0.0006249 0.0006249 0.16% MultiFab::contains_nan() 10 0.0005911 0.0005911 0.0005911 0.16% MLMG::prepareForSolve() 6 0.0005175 0.0005175 0.0005175 0.14% MLCellLinOp::compGrad() 6 0.0004935 0.0004935 0.0004935 0.13% FabArray::FillBoundary() 1766 0.000431 0.000431 0.000431 0.11% FabArrayBase::CPC::define() 244 0.0004226 0.0004226 0.0004226 0.11% Amr::InitAmr() 1 0.000411 0.000411 0.000411 0.11% FabArrayBase::getCPC() 632 0.0003676 0.0003676 0.0003676 0.10% Gravity::get_old_grav_vector() 5 0.0003027 0.0003027 0.0003027 0.08% MLCellLinOp::apply() 500 0.0002841 0.0002841 0.0002841 0.07% Gravity::get_new_grav_vector() 5 0.0002741 0.0002741 0.0002741 0.07% main() 1 0.0002661 0.0002661 0.0002661 0.07% FabArrayBase::getFB() 1766 0.0002507 0.0002507 0.0002507 0.07% MLMG::mgVcycle() 36 0.0002324 0.0002324 0.0002324 0.06% Amr::coarseTimeStep() 5 0.0001804 0.0001804 0.0001804 0.05% MultiFab::max() 6 0.0001404 0.0001404 0.0001404 0.04% MLCGSolver::ParallelAllReduce 659 0.0001375 0.0001375 0.0001375 0.04% MLCellLinOp::correctionResidual() 216 0.0001263 0.0001263 0.0001263 0.03% FabArray::ParallelCopy() 380 0.0001238 0.0001238 0.0001238 0.03% FillPatchIterator::Initialize 20 0.0001152 0.0001152 0.0001152 0.03% MLCellLinOp::defineBC() 6 0.0001091 0.0001091 0.0001091 0.03% Amr::timeStep() 5 9.135e-05 9.135e-05 9.135e-05 0.02% MLLinOp::defineGrids() 6 9.094e-05 9.094e-05 9.094e-05 0.02% Castro::subcycle_advance_ctu() 5 7.322e-05 7.322e-05 7.322e-05 0.02% MLMG::mgVcycle_down::1 36 7.317e-05 7.317e-05 7.317e-05 0.02% AmrLevel::restart() 1 7.219e-05 7.219e-05 7.219e-05 0.02% Gravity::solve_for_phi() 5 7.085e-05 7.085e-05 7.085e-05 0.02% StateData::restartDoit() 4 6.308e-05 6.308e-05 6.308e-05 0.02% FabArrayBase::FB::FB() 26 6.176e-05 6.176e-05 6.176e-05 0.02% Gravity::update_max_rhs() 6 5.989e-05 5.989e-05 5.989e-05 0.02% MLMG:computeResOfCorrection() 180 5.495e-05 5.495e-05 5.495e-05 0.01% MLMG::mgVcycle_down::0 36 5.272e-05 5.272e-05 5.272e-05 0.01% MLMG::mgVcycle_down::2 36 5.019e-05 5.019e-05 5.019e-05 0.01% MLMG::mgVcycle_down::4 36 4.973e-05 4.973e-05 4.973e-05 0.01% MLMG::mgVcycle_down::3 36 4.861e-05 4.861e-05 4.861e-05 0.01% MLMG::actualBottomSolve() 36 4.848e-05 4.848e-05 4.848e-05 0.01% Castro::clean_state() 30 4.777e-05 4.777e-05 4.777e-05 0.01% Castro::expand_state() 5 4.197e-05 4.197e-05 4.197e-05 0.01% MLMG::solve() 6 4.152e-05 4.152e-05 4.152e-05 0.01% Castro::initialize_advance() 5 3.758e-05 3.758e-05 3.758e-05 0.01% Castro::advance() 5 3.391e-05 3.391e-05 3.391e-05 0.01% MLMG::mgVcycle_up::0 36 3.254e-05 3.254e-05 3.254e-05 0.01% MLMG::mgVcycle_up::4 36 3.112e-05 3.112e-05 3.112e-05 0.01% Castro::finalize_advance() 5 3.035e-05 3.035e-05 3.035e-05 0.01% MLMG::oneIter() 36 2.768e-05 2.768e-05 2.768e-05 0.01% Castro::buildMetrics() 1 2.76e-05 2.76e-05 2.76e-05 0.01% MLMG::mgVcycle_up::3 36 2.734e-05 2.734e-05 2.734e-05 0.01% MLMG::mgVcycle_up::2 36 2.694e-05 2.694e-05 2.694e-05 0.01% MLMG::mgVcycle_up::1 36 2.658e-05 2.658e-05 2.658e-05 0.01% MLCellLinOp::solutionResidual() 42 2.62e-05 2.62e-05 2.62e-05 0.01% Castro::initialize_do_advance() 5 2.557e-05 2.557e-05 2.557e-05 0.01% Amr::writeSmallPlotFile() 1 2.513e-05 2.513e-05 2.513e-05 0.01% Castro::swap_state_time_levels() 5 2.449e-05 2.449e-05 2.449e-05 0.01% Castro::post_restart() 1 2.332e-05 2.332e-05 2.332e-05 0.01% Castro::create_source_corrector() 5 2.148e-05 2.148e-05 2.148e-05 0.01% MLMG::ResNormInf() 42 2.042e-05 2.042e-05 2.042e-05 0.01% Castro::initMFs() 1 2.027e-05 2.027e-05 2.027e-05 0.01% Castro::finalize_do_advance() 5 2.001e-05 2.001e-05 2.001e-05 0.01% Castro::construct_new_gravity() 5 1.983e-05 1.983e-05 1.983e-05 0.01% FillPatchSingleLevel 20 1.925e-05 1.925e-05 1.925e-05 0.01% MLMG::mgVcycle_bottom 36 1.688e-05 1.688e-05 1.688e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.502e-05 1.502e-05 1.502e-05 0.00% MLPoisson::define() 6 1.501e-05 1.501e-05 1.501e-05 0.00% makeSFC 30 1.406e-05 1.406e-05 1.406e-05 0.00% MLMG::computeResidual() 36 1.4e-05 1.4e-05 1.4e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.308e-05 1.308e-05 1.308e-05 0.00% Castro::construct_new_source() 25 1.051e-05 1.051e-05 1.051e-05 0.00% Castro::do_new_sources() 5 1.048e-05 1.048e-05 1.048e-05 0.00% Castro::construct_old_source() 25 1.043e-05 1.043e-05 1.043e-05 0.00% Castro::do_old_sources() 5 9.928e-06 9.928e-06 9.928e-06 0.00% DistributionMapping::Distribute() 31 9.713e-06 9.713e-06 9.713e-06 0.00% Amr::initSubcycle() 1 8.406e-06 8.406e-06 8.406e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.357e-06 7.357e-06 7.357e-06 0.00% Gravity::actual_multilevel_solve() 1 6.964e-06 6.964e-06 6.964e-06 0.00% Castro::check_for_nan() 10 6.855e-06 6.855e-06 6.855e-06 0.00% Castro::apply_source_to_state() 10 6.319e-06 6.319e-06 6.319e-06 0.00% Castro::construct_old_gravity() 5 6.265e-06 6.265e-06 6.265e-06 0.00% Castro::post_timestep() 5 5.858e-06 5.858e-06 5.858e-06 0.00% Gravity::swapTimeLevels() 5 5.662e-06 5.662e-06 5.662e-06 0.00% MLLinOp::define() 6 5.263e-06 5.263e-06 5.263e-06 0.00% MLPoisson::prepareForSolve() 6 4.582e-06 4.582e-06 4.582e-06 0.00% Castro::computeNewDt() 5 3.982e-06 3.982e-06 3.982e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.824e-06 3.824e-06 3.824e-06 0.00% MLMG::computeMLResidual() 6 3.428e-06 3.428e-06 3.428e-06 0.00% MLMG::getGradSolution() 6 3.148e-06 3.148e-06 3.148e-06 0.00% Gravity::set_mass_offset() 6 2.498e-06 2.498e-06 2.498e-06 0.00% MLMG::MLRhsNormInf() 6 2.473e-06 2.473e-06 2.473e-06 0.00% MLMG::MLResNormInf() 6 2.303e-06 2.303e-06 2.303e-06 0.00% Castro::retry_advance_ctu() 5 2.16e-06 2.16e-06 2.16e-06 0.00% Castro::FluxRegCrseInit 5 1.359e-06 1.359e-06 1.359e-06 0.00% AmrLevel::AmrLevel() 1 1.229e-06 1.229e-06 1.229e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.149e-06 1.149e-06 1.149e-06 0.00% Amr::init() 1 1.062e-06 1.062e-06 1.062e-06 0.00% Castro::FluxRegFineAdd() 5 1.049e-06 1.049e-06 1.049e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3797 0.3797 0.3797 99.99% Amr::coarseTimeStep() 5 0.2985 0.2985 0.2985 78.61% Amr::timeStep() 5 0.2967 0.2967 0.2967 78.14% Castro::advance() 5 0.2937 0.2937 0.2937 77.34% Castro::subcycle_advance_ctu() 5 0.2874 0.2874 0.2874 75.67% Castro::do_advance_ctu() 5 0.2873 0.2873 0.2873 75.65% Castro::construct_new_gravity() 5 0.1401 0.1401 0.1401 36.90% Gravity::solve_phi_with_mlmg() 6 0.1302 0.1302 0.1302 34.29% Gravity::solve_for_phi() 5 0.1272 0.1272 0.1272 33.49% Gravity::actual_solve_with_mlmg() 6 0.1271 0.1271 0.1271 33.47% MLMG::solve() 6 0.1156 0.1156 0.1156 30.44% MLMG::oneIter() 36 0.1085 0.1085 0.1085 28.56% MLMG::mgVcycle() 36 0.1069 0.1069 0.1069 28.16% Castro::construct_ctu_hydro_source() 5 0.1019 0.1019 0.1019 26.82% MLCellLinOp::smooth() 720 0.05246 0.05246 0.05246 13.81% MLCellLinOp::applyBC() 1946 0.04872 0.04872 0.04872 12.83% Amr::init() 1 0.04761 0.04761 0.04761 12.54% Amr::restart() 1 0.04761 0.04761 0.04761 12.54% AmrLevel::restart() 1 0.04145 0.04145 0.04145 10.91% StateData::restartDoit() 4 0.04137 0.04137 0.04137 10.89% VisMF::Read() 3 0.04125 0.04125 0.04125 10.86% FillPatchIterator::Initialize 20 0.03361 0.03361 0.03361 8.85% Amr::writePlotFile() 1 0.0329 0.0329 0.0329 8.66% FillPatchSingleLevel 20 0.03284 0.03284 0.03284 8.65% MLMG::mgVcycle_bottom 36 0.03281 0.03281 0.03281 8.64% MLMG::actualBottomSolve() 36 0.03279 0.03279 0.03279 8.64% MLCGSolver::bicgstab 36 0.03247 0.03247 0.03247 8.55% VisMF::Write(FabArray) 1 0.03128 0.03128 0.03128 8.24% StateDataPhysBCFunct::() 20 0.03083 0.03083 0.03083 8.12% StateData::FillBoundary(geom) 160 0.02833 0.02833 0.02833 7.46% Castro::clean_state() 30 0.02008 0.02008 0.02008 5.29% MLCellLinOp::apply() 500 0.01649 0.01649 0.01649 4.34% MLPoisson::Fsmooth() 1440 0.01482 0.01482 0.01482 3.90% MLMG::mgVcycle_down::0 36 0.01462 0.01462 0.01462 3.85% FabArray::FillBoundary() 1766 0.01362 0.01362 0.01362 3.59% Castro::construct_old_gravity() 5 0.01337 0.01337 0.01337 3.52% Gravity::get_old_grav_vector() 5 0.01336 0.01336 0.01336 3.52% FillBoundary_nowait() 1766 0.01319 0.01319 0.01319 3.47% Gravity::get_new_grav_vector() 5 0.01283 0.01283 0.01283 3.38% MLMG::mgVcycle_up::0 36 0.01109 0.01109 0.01109 2.92% MLCellLinOp::correctionResidual() 216 0.00995 0.00995 0.00995 2.62% Castro::initialize_do_advance() 5 0.009807 0.009807 0.009807 2.58% MLPoisson::define() 6 0.009284 0.009284 0.009284 2.44% amrex::Dot() 484 0.00894 0.00894 0.00894 2.35% MLMG:computeResOfCorrection() 180 0.008727 0.008727 0.008727 2.30% Castro::computeTemp() 30 0.008304 0.008304 0.008304 2.19% MLMG::mgVcycle_down::1 36 0.00727 0.00727 0.00727 1.91% amrex::Copy() 463 0.007113 0.007113 0.007113 1.87% FabArray::ParallelCopy() 380 0.006824 0.006824 0.006824 1.80% MLMG::mgVcycle_down::2 36 0.006798 0.006798 0.006798 1.79% Castro::do_new_sources() 5 0.006741 0.006741 0.006741 1.78% FabArray::ParallelCopy_nowait() 380 0.0067 0.0067 0.0067 1.76% MLMG::mgVcycle_down::3 36 0.006637 0.006637 0.006637 1.75% MLMG::mgVcycle_down::4 36 0.006589 0.006589 0.006589 1.73% FabArray::setVal() 537 0.006481 0.006481 0.006481 1.71% MLCellLinOp::defineAuxData() 6 0.006399 0.006399 0.006399 1.68% FabArray::norminf() 326 0.00631 0.00631 0.00631 1.66% Castro::expand_state() 5 0.006249 0.006249 0.006249 1.65% Castro::initialize_advance() 5 0.006005 0.006005 0.006005 1.58% Castro::normalize_species() 30 0.005995 0.005995 0.005995 1.58% MLMG::addInterpCorrection() 180 0.005481 0.005481 0.005481 1.44% MLCGSolver::ParallelAllReduce 659 0.005396 0.005396 0.005396 1.42% MLMG::mgVcycle_up::4 36 0.005309 0.005309 0.005309 1.40% MLMG::mgVcycle_up::1 36 0.005293 0.005293 0.005293 1.39% MLMG::mgVcycle_up::2 36 0.0052 0.0052 0.0052 1.37% amrex::average_down 180 0.00514 0.00514 0.00514 1.35% Castro::enforce_min_density() 30 0.005106 0.005106 0.005106 1.34% MLMG::mgVcycle_up::3 36 0.005087 0.005087 0.005087 1.34% MLPoisson::Fapply() 500 0.004641 0.004641 0.004641 1.22% Castro::do_old_sources() 5 0.004453 0.004453 0.004453 1.17% FabArray::Saxpy() 355 0.003706 0.003706 0.003706 0.98% FabArray::Xpay() 361 0.003586 0.003586 0.003586 0.94% Castro::post_restart() 1 0.003415 0.003415 0.003415 0.90% MLCellLinOp::solutionResidual() 42 0.003325 0.003325 0.003325 0.88% Gravity::multilevel_solve_for_new_phi() 1 0.003296 0.003296 0.003296 0.87% Gravity::actual_multilevel_solve() 1 0.003283 0.003283 0.003283 0.86% Gravity::fill_multipole_BCs() 6 0.002979 0.002979 0.002979 0.78% Castro::post_timestep() 5 0.00292 0.00292 0.00292 0.77% MLMG::computeResidual() 36 0.002761 0.002761 0.002761 0.73% MLCellLinOp::defineBC() 6 0.002743 0.002743 0.002743 0.72% MLMG::prepareForSolve() 6 0.002674 0.002674 0.002674 0.70% BndryData::define() 6 0.002634 0.002634 0.002634 0.69% Castro::reset_internal_energy(MultiFab) 30 0.002608 0.002608 0.002608 0.69% Castro::estTimeStep() 10 0.002322 0.002322 0.002322 0.61% FabArray::LinComb() 242 0.00197 0.00197 0.00197 0.52% amrex::Add() 72 0.001836 0.001836 0.001836 0.48% Castro::construct_new_source() 25 0.001642 0.001642 0.001642 0.43% Castro::construct_new_gravity_source() 5 0.001632 0.001632 0.001632 0.43% Castro::computeNewDt() 5 0.001613 0.001613 0.001613 0.42% Castro::construct_old_source() 25 0.001369 0.001369 0.001369 0.36% Castro::construct_old_gravity_source() 5 0.001358 0.001358 0.001358 0.36% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009643 0.0009643 0.0009643 0.25% MLMG::ResNormInf() 42 0.000957 0.000957 0.000957 0.25% Castro::apply_source_to_state() 10 0.0009214 0.0009214 0.0009214 0.24% Castro::reset_internal_energy(Fab) 240 0.0008642 0.0008642 0.0008642 0.23% FabArrayBase::getCPC() 632 0.0007902 0.0007902 0.0007902 0.21% MLMG::getGradSolution() 6 0.0007731 0.0007731 0.0007731 0.20% MLCellLinOp::compGrad() 6 0.00077 0.00077 0.00077 0.20% MLCellLinOp::setLevelBC() 6 0.0007446 0.0007446 0.0007446 0.20% FabArray::mult() 22 0.0006654 0.0006654 0.0006654 0.18% MLPoisson::prepareForSolve() 6 0.0006567 0.0006567 0.0006567 0.17% FabArray::setDomainBndry() 20 0.0006551 0.0006551 0.0006551 0.17% MLCellLinOp::prepareForSolve() 6 0.0006521 0.0006521 0.0006521 0.17% Castro::enforce_speed_limit() 30 0.0006249 0.0006249 0.0006249 0.16% Castro::check_for_nan() 10 0.0005979 0.0005979 0.0005979 0.16% MultiFab::contains_nan() 10 0.0005911 0.0005911 0.0005911 0.16% MLMG::computeMLResidual() 6 0.0005815 0.0005815 0.0005815 0.15% Gravity::update_max_rhs() 6 0.0004713 0.0004713 0.0004713 0.12% FabArrayBase::CPC::define() 244 0.0004226 0.0004226 0.0004226 0.11% Amr::InitAmr() 1 0.0004194 0.0004194 0.0004194 0.11% FabArrayBase::getFB() 1766 0.0003124 0.0003124 0.0003124 0.08% Castro::finalize_advance() 5 0.0002975 0.0002975 0.0002975 0.08% Gravity::swapTimeLevels() 5 0.0002359 0.0002359 0.0002359 0.06% MLMG::MLResNormInf() 6 0.0001551 0.0001551 0.0001551 0.04% Castro::buildMetrics() 1 0.0001544 0.0001544 0.0001544 0.04% MultiFab::max() 6 0.0001404 0.0001404 0.0001404 0.04% MLLinOp::define() 6 0.0001267 0.0001267 0.0001267 0.03% MLLinOp::defineGrids() 6 0.0001214 0.0001214 0.0001214 0.03% MLMG::MLRhsNormInf() 6 0.0001188 0.0001188 0.0001188 0.03% FabArrayBase::FB::FB() 26 6.176e-05 6.176e-05 6.176e-05 0.02% MLLinOp::makeAgglomeratedDMap 6 2.934e-05 2.934e-05 2.934e-05 0.01% Amr::writeSmallPlotFile() 1 2.513e-05 2.513e-05 2.513e-05 0.01% Castro::swap_state_time_levels() 5 2.449e-05 2.449e-05 2.449e-05 0.01% makeSFC 30 2.198e-05 2.198e-05 2.198e-05 0.01% Castro::create_source_corrector() 5 2.148e-05 2.148e-05 2.148e-05 0.01% Castro::initMFs() 1 2.027e-05 2.027e-05 2.027e-05 0.01% Castro::finalize_do_advance() 5 2.001e-05 2.001e-05 2.001e-05 0.01% DistributionMapping::Distribute() 31 9.713e-06 9.713e-06 9.713e-06 0.00% Amr::initSubcycle() 1 8.406e-06 8.406e-06 8.406e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.614e-06 5.614e-06 5.614e-06 0.00% Gravity::set_mass_offset() 6 2.498e-06 2.498e-06 2.498e-06 0.00% Castro::retry_advance_ctu() 5 2.16e-06 2.16e-06 2.16e-06 0.00% Castro::FluxRegCrseInit 5 1.359e-06 1.359e-06 1.359e-06 0.00% AmrLevel::AmrLevel() 1 1.229e-06 1.229e-06 1.229e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.149e-06 1.149e-06 1.149e-06 0.00% Castro::FluxRegFineAdd() 5 1.049e-06 1.049e-06 1.049e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-11-g42b7e2706638) finalized