Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.11-34-g88fe04f00600) initialized Starting run at 09:58:10 UTC on 2022-12-01. Successfully read inputs file ... Castro git describe: 22.11-19-g093c32c96 AMReX git describe: 22.11-34-g88fe04f00 Microphysics git describe: 22.11-48-ge6ec0450 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.0533626 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.030861671 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.050365073 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.050924238 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.054411319 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.059333971 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.077243684 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.050043643 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.06825 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.051521944 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.05242604 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.060898641 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.064689642 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.086084399 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.031081918 seconds Ending run at 09:58:11 UTC on 2022-12-01. Run time = 0.89538798 Run time without initialization = 0.757953139 Average number of zones advanced per microsecond: 3.459 Average number of zones advanced per microsecond per rank: 3.459 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.8954 ... 0.8954 ... 0.8954 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2081 0.2081 0.2081 23.24% Castro::construct_ctu_hydro_source() 10 0.1937 0.1937 0.1937 21.64% MLCellLinOp::applyBC() 4433 0.08165 0.08165 0.08165 9.12% MLPoisson::Fsmooth() 3280 0.06487 0.06487 0.06487 7.25% Amr::checkPoint() 3 0.04056 0.04056 0.04056 4.53% StateData::FillBoundary(geom) 328 0.02477 0.02477 0.02477 2.77% amrex::Dot() 1114 0.02296 0.02296 0.02296 2.56% StateDataPhysBCFunct::() 41 0.01808 0.01808 0.01808 2.02% Castro::normalize_species() 62 0.0161 0.0161 0.0161 1.80% amrex::Copy() 1029 0.01598 0.01598 0.01598 1.78% FabArray::LinComb() 1586 0.01465 0.01465 0.01465 1.64% FabArray::setVal() 1144 0.01451 0.01451 0.01451 1.62% FillBoundary_nowait() 4023 0.01435 0.01435 0.01435 1.60% Castro::computeTemp() 63 0.01413 0.01413 0.01413 1.58% FabArray::ParallelCopy_nowait() 861 0.01346 0.01346 0.01346 1.50% FabArray::norminf() 639 0.01328 0.01328 0.01328 1.48% MLPoisson::Fapply() 1142 0.01203 0.01203 0.01203 1.34% MLCellLinOp::defineAuxData() 11 0.01202 0.01202 0.01202 1.34% Gravity::fill_multipole_BCs() 11 0.008503 0.008503 0.008503 0.95% MLMG::addInterpCorrection() 410 0.007763 0.007763 0.007763 0.87% amrex::average_down 410 0.006991 0.006991 0.006991 0.78% Castro::enforce_min_density() 62 0.006849 0.006849 0.006849 0.76% FabArray::Xpay() 585 0.006721 0.006721 0.006721 0.75% Castro::estTimeStep() 21 0.004448 0.004448 0.004448 0.50% BndryData::define() 11 0.004011 0.004011 0.004011 0.45% Castro::reset_internal_energy(MultiFab) 63 0.003835 0.003835 0.003835 0.43% MLCGSolver::bicgstab 82 0.003358 0.003358 0.003358 0.37% Castro::construct_new_gravity_source() 10 0.003307 0.003307 0.003307 0.37% Castro::do_advance_ctu() 10 0.002923 0.002923 0.002923 0.33% Castro::construct_old_gravity_source() 10 0.002654 0.002654 0.002654 0.30% amrex::Add() 164 0.002564 0.002564 0.002564 0.29% Castro::reset_internal_energy(Fab) 504 0.002437 0.002437 0.002437 0.27% Amr::writePlotFile() 2 0.002304 0.002304 0.002304 0.26% MLMG::ResNormInf() 93 0.002115 0.002115 0.002115 0.24% FabArray::Saxpy() 20 0.001815 0.001815 0.001815 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.00169 0.00169 0.00169 0.19% MLCellLinOp::setLevelBC() 11 0.00155 0.00155 0.00155 0.17% Gravity::actual_solve_with_mlmg() 11 0.001381 0.001381 0.001381 0.15% FabArray::setDomainBndry() 41 0.001356 0.001356 0.001356 0.15% FabArray::mult() 43 0.001336 0.001336 0.001336 0.15% MLMG::prepareForSolve() 11 0.001242 0.001242 0.001242 0.14% Castro::initData() 1 0.001223 0.001223 0.001223 0.14% MLCellLinOp::prepareForSolve() 11 0.001199 0.001199 0.001199 0.13% MultiFab::contains_nan() 20 0.001186 0.001186 0.001186 0.13% Castro::enforce_speed_limit() 62 0.001174 0.001174 0.001174 0.13% MLCellLinOp::smooth() 1640 0.001139 0.001139 0.001139 0.13% MLCellLinOp::compGrad() 11 0.0009241 0.0009241 0.0009241 0.10% FabArray::FillBoundary() 4023 0.0008206 0.0008206 0.0008206 0.09% FabArrayBase::getCPC() 1323 0.0007867 0.0007867 0.0007867 0.09% FabArrayBase::CPC::define() 454 0.0006639 0.0006639 0.0006639 0.07% Gravity::get_new_grav_vector() 11 0.0006295 0.0006295 0.0006295 0.07% FabArrayBase::getFB() 4023 0.0006011 0.0006011 0.0006011 0.07% Gravity::get_old_grav_vector() 10 0.0005823 0.0005823 0.0005823 0.07% Amr::InitAmr() 1 0.0004697 0.0004697 0.0004697 0.05% MLCellLinOp::apply() 1142 0.0004664 0.0004664 0.0004664 0.05% MLLinOp::defineGrids() 11 0.0003922 0.0003922 0.0003922 0.04% Amr::coarseTimeStep() 10 0.0003913 0.0003913 0.0003913 0.04% CGSolver::sxay() 1586 0.0003788 0.0003788 0.0003788 0.04% FillPatchIterator::Initialize 41 0.0003035 0.0003035 0.0003035 0.03% MLCGSolver::ParallelAllReduce 1514 0.0002931 0.0002931 0.0002931 0.03% FabArray::ParallelCopy() 861 0.0002824 0.0002824 0.0002824 0.03% MLCellLinOp::defineBC() 11 0.0002757 0.0002757 0.0002757 0.03% MultiFab::max() 11 0.0002692 0.0002692 0.0002692 0.03% main() 1 0.0002552 0.0002552 0.0002552 0.03% MLCellLinOp::correctionResidual() 492 0.0002542 0.0002542 0.0002542 0.03% MLMG::MLRhsNormInf() 11 0.0002243 0.0002243 0.0002243 0.03% MLMG::mgVcycle() 82 0.0002141 0.0002141 0.0002141 0.02% Castro::subcycle_advance_ctu() 10 0.0001673 0.0001673 0.0001673 0.02% Amr::timeStep() 10 0.0001572 0.0001572 0.0001572 0.02% MLMG:computeResOfCorrection() 410 0.0001547 0.0001547 0.0001547 0.02% Gravity::update_max_rhs() 11 0.0001364 0.0001364 0.0001364 0.02% StateData::checkPoint() 12 0.0001364 0.0001364 0.0001364 0.02% MLMG::mgVcycle_down::0 82 0.0001221 0.0001221 0.0001221 0.01% Gravity::solve_for_phi() 10 0.0001176 0.0001176 0.0001176 0.01% MLMG::mgVcycle_down::1 82 0.0001056 0.0001056 0.0001056 0.01% Castro::finalize_advance() 10 9.9e-05 9.9e-05 9.9e-05 0.01% MLMG::mgVcycle_down::2 82 9.817e-05 9.817e-05 9.817e-05 0.01% MLMG::mgVcycle_down::4 82 9.501e-05 9.501e-05 9.501e-05 0.01% MLMG::mgVcycle_down::3 82 9.308e-05 9.308e-05 9.308e-05 0.01% Castro::Castro() 1 8.729e-05 8.729e-05 8.729e-05 0.01% Castro::initialize_advance() 10 8.648e-05 8.648e-05 8.648e-05 0.01% MLMG::actualBottomSolve() 82 8.623e-05 8.623e-05 8.623e-05 0.01% FabArrayBase::FB::FB() 56 8.187e-05 8.187e-05 8.187e-05 0.01% Castro::clean_state() 62 8.168e-05 8.168e-05 8.168e-05 0.01% Castro::expand_state() 10 7.982e-05 7.982e-05 7.982e-05 0.01% MLMG::mgVcycle_up::4 82 7.849e-05 7.849e-05 7.849e-05 0.01% MLMG::solve() 11 7.655e-05 7.655e-05 7.655e-05 0.01% Castro::advance() 10 7.522e-05 7.522e-05 7.522e-05 0.01% AmrLevel::checkPoint() 3 7.392e-05 7.392e-05 7.392e-05 0.01% Castro::initialize_do_advance() 10 6.758e-05 6.758e-05 6.758e-05 0.01% MLMG::mgVcycle_up::3 82 6.457e-05 6.457e-05 6.457e-05 0.01% MLMG::mgVcycle_up::2 82 6.306e-05 6.306e-05 6.306e-05 0.01% MLMG::mgVcycle_up::1 82 6.208e-05 6.208e-05 6.208e-05 0.01% MLMG::mgVcycle_up::0 82 6.028e-05 6.028e-05 6.028e-05 0.01% MLMG::oneIter() 82 5.847e-05 5.847e-05 5.847e-05 0.01% MLCellLinOp::solutionResidual() 93 5.817e-05 5.817e-05 5.817e-05 0.01% MLMG::computeResidual() 82 4.554e-05 4.554e-05 4.554e-05 0.01% StateData::define() 4 4.371e-05 4.371e-05 4.371e-05 0.00% Castro::swap_state_time_levels() 10 4.01e-05 4.01e-05 4.01e-05 0.00% MLMG::mgVcycle_bottom 82 3.468e-05 3.468e-05 3.468e-05 0.00% Castro::finalize_do_advance() 10 3.435e-05 3.435e-05 3.435e-05 0.00% Castro::enforce_consistent_e() 1 3.387e-05 3.387e-05 3.387e-05 0.00% Castro::post_timestep() 10 3.347e-05 3.347e-05 3.347e-05 0.00% MultiFab::Add() 82 3.011e-05 3.011e-05 3.011e-05 0.00% MLPoisson::define() 11 2.993e-05 2.993e-05 2.993e-05 0.00% FillPatchSingleLevel 41 2.954e-05 2.954e-05 2.954e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.688e-05 2.688e-05 2.688e-05 0.00% makeSFC 55 2.633e-05 2.633e-05 2.633e-05 0.00% Castro::construct_new_gravity() 10 2.622e-05 2.622e-05 2.622e-05 0.00% MLLinOp::define() 11 2.615e-05 2.615e-05 2.615e-05 0.00% Castro::initMFs() 1 2.525e-05 2.525e-05 2.525e-05 0.00% Amr::writeSmallPlotFile() 1 2.484e-05 2.484e-05 2.484e-05 0.00% Castro::create_source_corrector() 10 2.215e-05 2.215e-05 2.215e-05 0.00% Castro::buildMetrics() 1 2.202e-05 2.202e-05 2.202e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.967e-05 1.967e-05 1.967e-05 0.00% Amr::defBaseLevel() 1 1.956e-05 1.956e-05 1.956e-05 0.00% Castro::construct_old_source() 50 1.93e-05 1.93e-05 1.93e-05 0.00% Amr::FinalizeInit() 1 1.905e-05 1.905e-05 1.905e-05 0.00% Castro::construct_new_source() 50 1.785e-05 1.785e-05 1.785e-05 0.00% Castro::do_new_sources() 10 1.61e-05 1.61e-05 1.61e-05 0.00% Castro::do_old_sources() 10 1.56e-05 1.56e-05 1.56e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.487e-05 1.487e-05 1.487e-05 0.00% DistributionMapping::Distribute() 56 1.474e-05 1.474e-05 1.474e-05 0.00% Castro::check_for_nan() 20 1.236e-05 1.236e-05 1.236e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.236e-05 1.236e-05 1.236e-05 0.00% Castro::apply_source_to_state() 20 1.149e-05 1.149e-05 1.149e-05 0.00% MLPoisson::prepareForSolve() 11 1.024e-05 1.024e-05 1.024e-05 0.00% Castro::post_init() 1 1.017e-05 1.017e-05 1.017e-05 0.00% MLMG::computeMLResidual() 11 9.733e-06 9.733e-06 9.733e-06 0.00% Castro::construct_old_gravity() 10 9.613e-06 9.613e-06 9.613e-06 0.00% Gravity::swapTimeLevels() 10 9.264e-06 9.264e-06 9.264e-06 0.00% Gravity::actual_multilevel_solve() 1 9.101e-06 9.101e-06 9.101e-06 0.00% Amr::initSubcycle() 1 8.505e-06 8.505e-06 8.505e-06 0.00% Castro::retry_advance_ctu() 10 8.395e-06 8.395e-06 8.395e-06 0.00% Castro::computeNewDt() 9 7.266e-06 7.266e-06 7.266e-06 0.00% MLMG::getGradSolution() 11 6.49e-06 6.49e-06 6.49e-06 0.00% MultiFab::Copy() 11 5.379e-06 5.379e-06 5.379e-06 0.00% Amr::InitializeInit() 1 5.11e-06 5.11e-06 5.11e-06 0.00% Gravity::set_mass_offset() 11 4.901e-06 4.901e-06 4.901e-06 0.00% AmrLevel::checkPointPost() 3 4.191e-06 4.191e-06 4.191e-06 0.00% MLMG::MLResNormInf() 11 3.573e-06 3.573e-06 3.573e-06 0.00% Castro::computeInitialDt() 2 3.335e-06 3.335e-06 3.335e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.262e-06 3.262e-06 3.262e-06 0.00% Castro::FluxRegCrseInit 10 3.069e-06 3.069e-06 3.069e-06 0.00% Amr::init() 1 2.67e-06 2.67e-06 2.67e-06 0.00% Castro::FluxRegFineAdd() 10 2.545e-06 2.545e-06 2.545e-06 0.00% AmrLevel::checkPointPre() 3 2.401e-06 2.401e-06 2.401e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.989e-06 1.989e-06 1.989e-06 0.00% Castro::post_regrid() 1 1.213e-06 1.213e-06 1.213e-06 0.00% Amr::initialInit() 1 8.99e-07 8.99e-07 8.99e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8954 0.8954 0.8954 100.00% Amr::coarseTimeStep() 10 0.7267 0.7267 0.7267 81.15% Amr::timeStep() 10 0.5875 0.5875 0.5875 65.61% Castro::advance() 10 0.5797 0.5797 0.5797 64.74% Castro::subcycle_advance_ctu() 10 0.5671 0.5671 0.5671 63.34% Castro::do_advance_ctu() 10 0.5669 0.5669 0.5669 63.32% Gravity::solve_phi_with_mlmg() 11 0.3216 0.3216 0.3216 35.92% Gravity::actual_solve_with_mlmg() 11 0.3127 0.3127 0.3127 34.92% Castro::construct_new_gravity() 10 0.2926 0.2926 0.2926 32.68% MLMG::solve() 11 0.2892 0.2892 0.2892 32.30% Gravity::solve_for_phi() 10 0.2769 0.2769 0.2769 30.92% MLMG::oneIter() 82 0.2739 0.2739 0.2739 30.59% MLMG::mgVcycle() 82 0.2721 0.2721 0.2721 30.39% VisMF::Write(FabArray) 11 0.2081 0.2081 0.2081 23.24% Castro::construct_ctu_hydro_source() 10 0.1937 0.1937 0.1937 21.64% Amr::checkPoint() 3 0.1896 0.1896 0.1896 21.18% AmrLevel::checkPoint() 3 0.1491 0.1491 0.1491 16.65% StateData::checkPoint() 12 0.149 0.149 0.149 16.64% MLCellLinOp::smooth() 1640 0.1384 0.1384 0.1384 15.46% Amr::init() 1 0.1368 0.1368 0.1368 15.28% MLCellLinOp::applyBC() 4433 0.0975 0.0975 0.0975 10.89% MLMG::mgVcycle_bottom 82 0.08433 0.08433 0.08433 9.42% MLMG::actualBottomSolve() 82 0.0843 0.0843 0.0843 9.41% MLCGSolver::bicgstab 82 0.08346 0.08346 0.08346 9.32% MLPoisson::Fsmooth() 3280 0.06487 0.06487 0.06487 7.25% Amr::writePlotFile() 2 0.06208 0.06208 0.06208 6.93% Amr::initialInit() 1 0.05249 0.05249 0.05249 5.86% FillPatchIterator::Initialize 41 0.04856 0.04856 0.04856 5.42% Amr::FinalizeInit() 1 0.04847 0.04847 0.04847 5.41% Castro::post_init() 1 0.04711 0.04711 0.04711 5.26% FillPatchSingleLevel 41 0.0469 0.0469 0.0469 5.24% Gravity::multilevel_solve_for_new_phi() 1 0.04522 0.04522 0.04522 5.05% Gravity::actual_multilevel_solve() 1 0.0452 0.0452 0.0452 5.05% Castro::clean_state() 62 0.04388 0.04388 0.04388 4.90% StateDataPhysBCFunct::() 41 0.04285 0.04285 0.04285 4.79% MLCellLinOp::apply() 1142 0.03709 0.03709 0.03709 4.14% MLMG::mgVcycle_down::0 82 0.0358 0.0358 0.0358 4.00% MLMG::mgVcycle_up::0 82 0.03067 0.03067 0.03067 3.42% StateData::FillBoundary(geom) 328 0.02477 0.02477 0.02477 2.77% Castro::initialize_do_advance() 10 0.02449 0.02449 0.02449 2.74% amrex::Dot() 1114 0.02296 0.02296 0.02296 2.56% MLCellLinOp::correctionResidual() 492 0.02175 0.02175 0.02175 2.43% Castro::computeTemp() 63 0.0204 0.0204 0.0204 2.28% MLPoisson::define() 11 0.01911 0.01911 0.01911 2.13% MLMG:computeResOfCorrection() 410 0.01879 0.01879 0.01879 2.10% MLMG::mgVcycle_down::1 82 0.01809 0.01809 0.01809 2.02% Castro::expand_state() 10 0.01774 0.01774 0.01774 1.98% MLMG::mgVcycle_down::2 82 0.01773 0.01773 0.01773 1.98% Gravity::get_new_grav_vector() 11 0.01734 0.01734 0.01734 1.94% MLMG::mgVcycle_down::3 82 0.01681 0.01681 0.01681 1.88% Castro::normalize_species() 62 0.0161 0.0161 0.0161 1.80% amrex::Copy() 1029 0.01598 0.01598 0.01598 1.78% MLMG::mgVcycle_down::4 82 0.01593 0.01593 0.01593 1.78% FabArray::FillBoundary() 4023 0.01585 0.01585 0.01585 1.77% Castro::construct_old_gravity() 10 0.01511 0.01511 0.01511 1.69% Gravity::get_old_grav_vector() 10 0.0151 0.0151 0.0151 1.69% FillBoundary_nowait() 4023 0.01503 0.01503 0.01503 1.68% CGSolver::sxay() 1586 0.01503 0.01503 0.01503 1.68% FabArray::LinComb() 1586 0.01465 0.01465 0.01465 1.64% FabArray::ParallelCopy() 861 0.01459 0.01459 0.01459 1.63% FabArray::setVal() 1144 0.01451 0.01451 0.01451 1.62% FabArray::ParallelCopy_nowait() 861 0.01431 0.01431 0.01431 1.60% MLCGSolver::ParallelAllReduce 1514 0.01368 0.01368 0.01368 1.53% MLMG::mgVcycle_up::2 82 0.01358 0.01358 0.01358 1.52% MLCellLinOp::defineAuxData() 11 0.01336 0.01336 0.01336 1.49% MLMG::mgVcycle_up::1 82 0.01333 0.01333 0.01333 1.49% FabArray::norminf() 639 0.01328 0.01328 0.01328 1.48% MLMG::addInterpCorrection() 410 0.01303 0.01303 0.01303 1.46% MLMG::mgVcycle_up::3 82 0.01287 0.01287 0.01287 1.44% MLMG::mgVcycle_up::4 82 0.01277 0.01277 0.01277 1.43% amrex::average_down 410 0.01229 0.01229 0.01229 1.37% Castro::do_new_sources() 10 0.01217 0.01217 0.01217 1.36% MLPoisson::Fapply() 1142 0.01203 0.01203 0.01203 1.34% Castro::initialize_advance() 10 0.01189 0.01189 0.01189 1.33% Castro::do_old_sources() 10 0.01063 0.01063 0.01063 1.19% Gravity::fill_multipole_BCs() 11 0.008737 0.008737 0.008737 0.98% Castro::post_timestep() 10 0.007611 0.007611 0.007611 0.85% MLCellLinOp::solutionResidual() 93 0.00732 0.00732 0.00732 0.82% Castro::enforce_min_density() 62 0.006849 0.006849 0.006849 0.76% FabArray::Xpay() 585 0.006721 0.006721 0.006721 0.75% MLMG::computeResidual() 82 0.006346 0.006346 0.006346 0.71% Castro::reset_internal_energy(MultiFab) 63 0.006272 0.006272 0.006272 0.70% MLMG::prepareForSolve() 11 0.005495 0.005495 0.005495 0.61% MLCellLinOp::defineBC() 11 0.005239 0.005239 0.005239 0.59% BndryData::define() 11 0.004963 0.004963 0.004963 0.55% Castro::estTimeStep() 21 0.004448 0.004448 0.004448 0.50% Amr::InitializeInit() 1 0.004021 0.004021 0.004021 0.45% Amr::defBaseLevel() 1 0.004016 0.004016 0.004016 0.45% Castro::initData() 1 0.003489 0.003489 0.003489 0.39% Castro::construct_new_source() 50 0.003325 0.003325 0.003325 0.37% Castro::construct_new_gravity_source() 10 0.003307 0.003307 0.003307 0.37% Castro::construct_old_source() 50 0.002674 0.002674 0.002674 0.30% Castro::construct_old_gravity_source() 10 0.002654 0.002654 0.002654 0.30% amrex::Add() 164 0.002564 0.002564 0.002564 0.29% Castro::reset_internal_energy(Fab) 504 0.002437 0.002437 0.002437 0.27% MLMG::ResNormInf() 93 0.002115 0.002115 0.002115 0.24% Castro::apply_source_to_state() 20 0.001826 0.001826 0.001826 0.20% FabArray::Saxpy() 20 0.001815 0.001815 0.001815 0.20% Castro::computeNewDt() 9 0.001798 0.001798 0.001798 0.20% MultiFab::Add() 82 0.00173 0.00173 0.00173 0.19% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.00169 0.00169 0.00169 0.19% MLCellLinOp::setLevelBC() 11 0.00155 0.00155 0.00155 0.17% FabArrayBase::getCPC() 1323 0.001451 0.001451 0.001451 0.16% MLMG::getGradSolution() 11 0.001437 0.001437 0.001437 0.16% MLCellLinOp::compGrad() 11 0.001431 0.001431 0.001431 0.16% FabArray::setDomainBndry() 41 0.001356 0.001356 0.001356 0.15% FabArray::mult() 43 0.001336 0.001336 0.001336 0.15% MLPoisson::prepareForSolve() 11 0.001209 0.001209 0.001209 0.14% MLCellLinOp::prepareForSolve() 11 0.001199 0.001199 0.001199 0.13% Castro::check_for_nan() 20 0.001198 0.001198 0.001198 0.13% MultiFab::contains_nan() 20 0.001186 0.001186 0.001186 0.13% Castro::post_regrid() 1 0.001178 0.001178 0.001178 0.13% Castro::enforce_speed_limit() 62 0.001174 0.001174 0.001174 0.13% MLMG::computeMLResidual() 11 0.001029 0.001029 0.001029 0.11% Castro::computeInitialDt() 2 0.0009135 0.0009135 0.0009135 0.10% Gravity::update_max_rhs() 11 0.0008586 0.0008586 0.0008586 0.10% FabArrayBase::getFB() 4023 0.000683 0.000683 0.000683 0.08% FabArrayBase::CPC::define() 454 0.0006639 0.0006639 0.0006639 0.07% Castro::finalize_advance() 10 0.000651 0.000651 0.000651 0.07% Amr::InitAmr() 1 0.0004782 0.0004782 0.0004782 0.05% MLLinOp::define() 11 0.0004753 0.0004753 0.0004753 0.05% MLLinOp::defineGrids() 11 0.0004492 0.0004492 0.0004492 0.05% Castro::Castro() 1 0.0004469 0.0004469 0.0004469 0.05% Gravity::swapTimeLevels() 10 0.000445 0.000445 0.000445 0.05% MultiFab::Copy() 11 0.0003313 0.0003313 0.0003313 0.04% MLMG::MLResNormInf() 11 0.0002824 0.0002824 0.0002824 0.03% MultiFab::max() 11 0.0002692 0.0002692 0.0002692 0.03% MLMG::MLRhsNormInf() 11 0.0002243 0.0002243 0.0002243 0.03% Castro::buildMetrics() 1 0.0001611 0.0001611 0.0001611 0.02% FabArrayBase::FB::FB() 56 8.187e-05 8.187e-05 8.187e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.607e-05 5.607e-05 5.607e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.496e-05 5.496e-05 5.496e-05 0.01% StateData::define() 4 4.371e-05 4.371e-05 4.371e-05 0.00% Castro::swap_state_time_levels() 10 4.01e-05 4.01e-05 4.01e-05 0.00% makeSFC 55 4.009e-05 4.009e-05 4.009e-05 0.00% Castro::finalize_do_advance() 10 3.435e-05 3.435e-05 3.435e-05 0.00% Castro::enforce_consistent_e() 1 3.387e-05 3.387e-05 3.387e-05 0.00% Castro::initMFs() 1 2.525e-05 2.525e-05 2.525e-05 0.00% Amr::writeSmallPlotFile() 1 2.484e-05 2.484e-05 2.484e-05 0.00% Castro::create_source_corrector() 10 2.215e-05 2.215e-05 2.215e-05 0.00% DistributionMapping::Distribute() 56 1.474e-05 1.474e-05 1.474e-05 0.00% Amr::initSubcycle() 1 8.505e-06 8.505e-06 8.505e-06 0.00% Castro::retry_advance_ctu() 10 8.395e-06 8.395e-06 8.395e-06 0.00% Gravity::set_mass_offset() 11 4.901e-06 4.901e-06 4.901e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.236e-06 4.236e-06 4.236e-06 0.00% AmrLevel::checkPointPost() 3 4.191e-06 4.191e-06 4.191e-06 0.00% Castro::FluxRegCrseInit 10 3.069e-06 3.069e-06 3.069e-06 0.00% Castro::FluxRegFineAdd() 10 2.545e-06 2.545e-06 2.545e-06 0.00% AmrLevel::checkPointPre() 3 2.401e-06 2.401e-06 2.401e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.989e-06 1.989e-06 1.989e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2545 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.11-34-g88fe04f00600) finalized Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.11-34-g88fe04f00600) initialized Starting run at 09:58:12 UTC on 2022-12-01. Successfully read inputs file ... Castro git describe: 22.11-19-g093c32c96 AMReX git describe: 22.11-34-g88fe04f00 Microphysics git describe: 22.11-48-ge6ec0450 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.479792069 Restart time = 0.048975239 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.056023908 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.051448023 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.061469794 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.062272457 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.064191544 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.032509549 seconds Ending run at 09:58:12 UTC on 2022-12-01. Run time = 0.377903263 Run time without initialization = 0.328366085 Average number of zones advanced per microsecond: 3.992 Average number of zones advanced per microsecond per rank: 3.992 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.3779 ... 0.3779 ... 0.3779 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0977 0.0977 0.0977 25.85% VisMF::Read() 3 0.04099 0.04099 0.04099 10.85% MLCellLinOp::applyBC() 1946 0.03485 0.03485 0.03485 9.22% VisMF::Write(FabArray) 1 0.03085 0.03085 0.03085 8.16% MLPoisson::Fsmooth() 1440 0.02711 0.02711 0.02711 7.17% StateData::FillBoundary(geom) 160 0.01181 0.01181 0.01181 3.13% amrex::Dot() 484 0.009617 0.009617 0.009617 2.54% Castro::normalize_species() 30 0.008202 0.008202 0.008202 2.17% Castro::computeTemp() 30 0.007542 0.007542 0.007542 2.00% amrex::Copy() 463 0.007416 0.007416 0.007416 1.96% FabArray::setVal() 537 0.006825 0.006825 0.006825 1.81% MLCellLinOp::defineAuxData() 6 0.006414 0.006414 0.006414 1.70% FillBoundary_nowait() 1766 0.006295 0.006295 0.006295 1.67% FabArray::LinComb() 690 0.006127 0.006127 0.006127 1.62% FabArray::ParallelCopy_nowait() 380 0.005938 0.005938 0.005938 1.57% FabArray::norminf() 278 0.005537 0.005537 0.005537 1.46% StateDataPhysBCFunct::() 20 0.00539 0.00539 0.00539 1.43% MLPoisson::Fapply() 500 0.005083 0.005083 0.005083 1.35% Gravity::fill_multipole_BCs() 6 0.004114 0.004114 0.004114 1.09% Amr::restart() 1 0.003603 0.003603 0.003603 0.95% MLMG::addInterpCorrection() 180 0.003354 0.003354 0.003354 0.89% Castro::enforce_min_density() 30 0.00301 0.00301 0.00301 0.80% Castro::estTimeStep() 10 0.002992 0.002992 0.002992 0.79% amrex::average_down 180 0.002974 0.002974 0.002974 0.79% FabArray::Xpay() 258 0.002881 0.002881 0.002881 0.76% BndryData::define() 6 0.002141 0.002141 0.002141 0.57% Castro::reset_internal_energy(Fab) 240 0.001803 0.001803 0.001803 0.48% Castro::reset_internal_energy(MultiFab) 30 0.001721 0.001721 0.001721 0.46% Castro::do_advance_ctu() 5 0.00162 0.00162 0.00162 0.43% Castro::construct_new_gravity_source() 5 0.00162 0.00162 0.00162 0.43% Amr::writePlotFile() 1 0.001484 0.001484 0.001484 0.39% Castro::construct_old_gravity_source() 5 0.001449 0.001449 0.001449 0.38% MLCGSolver::bicgstab 36 0.001414 0.001414 0.001414 0.37% amrex::Add() 72 0.001091 0.001091 0.001091 0.29% MLMG::ResNormInf() 42 0.0009254 0.0009254 0.0009254 0.24% FabArray::Saxpy() 10 0.0009188 0.0009188 0.0009188 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008953 0.0008953 0.0008953 0.24% Castro::post_timestep() 5 0.0008809 0.0008809 0.0008809 0.23% MLCellLinOp::setLevelBC() 6 0.0008144 0.0008144 0.0008144 0.22% Gravity::actual_solve_with_mlmg() 6 0.0007922 0.0007922 0.0007922 0.21% FabArray::setDomainBndry() 20 0.0006755 0.0006755 0.0006755 0.18% MLMG::prepareForSolve() 6 0.0006711 0.0006711 0.0006711 0.18% FabArray::mult() 22 0.000657 0.000657 0.000657 0.17% MLCellLinOp::prepareForSolve() 6 0.0006356 0.0006356 0.0006356 0.17% MultiFab::contains_nan() 10 0.0006019 0.0006019 0.0006019 0.16% MLCellLinOp::smooth() 720 0.0004902 0.0004902 0.0004902 0.13% MLCellLinOp::compGrad() 6 0.0004886 0.0004886 0.0004886 0.13% Castro::enforce_speed_limit() 30 0.0004756 0.0004756 0.0004756 0.13% Gravity::get_old_grav_vector() 5 0.0004233 0.0004233 0.0004233 0.11% FabArrayBase::CPC::define() 244 0.0003887 0.0003887 0.0003887 0.10% Amr::InitAmr() 1 0.0003884 0.0003884 0.0003884 0.10% Gravity::get_new_grav_vector() 5 0.0003664 0.0003664 0.0003664 0.10% FabArrayBase::getCPC() 632 0.0003656 0.0003656 0.0003656 0.10% FabArray::FillBoundary() 1766 0.0003567 0.0003567 0.0003567 0.09% FabArrayBase::getFB() 1766 0.0002684 0.0002684 0.0002684 0.07% main() 1 0.0002444 0.0002444 0.0002444 0.06% Amr::coarseTimeStep() 5 0.000227 0.000227 0.000227 0.06% MLCellLinOp::apply() 500 0.0002237 0.0002237 0.0002237 0.06% Castro::subcycle_advance_ctu() 5 0.0001866 0.0001866 0.0001866 0.05% CGSolver::sxay() 690 0.0001805 0.0001805 0.0001805 0.05% MLLinOp::defineGrids() 6 0.0001781 0.0001781 0.0001781 0.05% MLCellLinOp::defineBC() 6 0.0001535 0.0001535 0.0001535 0.04% MultiFab::max() 6 0.0001445 0.0001445 0.0001445 0.04% FillPatchIterator::Initialize 20 0.000138 0.000138 0.000138 0.04% MLCGSolver::ParallelAllReduce 659 0.0001307 0.0001307 0.0001307 0.03% FabArray::ParallelCopy() 380 0.0001242 0.0001242 0.0001242 0.03% MLMG::MLRhsNormInf() 6 0.0001149 0.0001149 0.0001149 0.03% Castro::advance() 5 0.0001101 0.0001101 0.0001101 0.03% Castro::create_source_corrector() 5 0.0001091 0.0001091 0.0001091 0.03% MLCellLinOp::correctionResidual() 216 0.0001077 0.0001077 0.0001077 0.03% Castro::construct_new_source() 25 0.0001005 0.0001005 0.0001005 0.03% Amr::timeStep() 5 9.369e-05 9.369e-05 9.369e-05 0.02% Gravity::update_max_rhs() 6 9.008e-05 9.008e-05 9.008e-05 0.02% MLMG::mgVcycle() 36 8.725e-05 8.725e-05 8.725e-05 0.02% StateData::restartDoit() 4 7.825e-05 7.825e-05 7.825e-05 0.02% Castro::finalize_advance() 5 7.685e-05 7.685e-05 7.685e-05 0.02% AmrLevel::restart() 1 7.56e-05 7.56e-05 7.56e-05 0.02% Castro::initialize_do_advance() 5 6.906e-05 6.906e-05 6.906e-05 0.02% MLMG:computeResOfCorrection() 180 6.588e-05 6.588e-05 6.588e-05 0.02% Castro::construct_new_gravity() 5 6.197e-05 6.197e-05 6.197e-05 0.02% MLMG::mgVcycle_down::0 36 6.136e-05 6.136e-05 6.136e-05 0.02% Gravity::solve_for_phi() 5 6.034e-05 6.034e-05 6.034e-05 0.02% FabArrayBase::FB::FB() 26 5.472e-05 5.472e-05 5.472e-05 0.01% Castro::initialize_advance() 5 5.133e-05 5.133e-05 5.133e-05 0.01% Castro::expand_state() 5 4.933e-05 4.933e-05 4.933e-05 0.01% MLMG::mgVcycle_down::1 36 4.704e-05 4.704e-05 4.704e-05 0.01% MLMG::mgVcycle_down::4 36 4.698e-05 4.698e-05 4.698e-05 0.01% MLMG::mgVcycle_down::2 36 4.431e-05 4.431e-05 4.431e-05 0.01% MLMG::mgVcycle_down::3 36 4.141e-05 4.141e-05 4.141e-05 0.01% Castro::clean_state() 30 4.101e-05 4.101e-05 4.101e-05 0.01% MLMG::actualBottomSolve() 36 3.889e-05 3.889e-05 3.889e-05 0.01% Castro::construct_old_source() 25 3.646e-05 3.646e-05 3.646e-05 0.01% MLMG::mgVcycle_up::4 36 3.618e-05 3.618e-05 3.618e-05 0.01% MLMG::solve() 6 3.38e-05 3.38e-05 3.38e-05 0.01% Castro::computeNewDt() 5 3.373e-05 3.373e-05 3.373e-05 0.01% Castro::post_restart() 1 3.277e-05 3.277e-05 3.277e-05 0.01% Castro::buildMetrics() 1 3.134e-05 3.134e-05 3.134e-05 0.01% Castro::swap_state_time_levels() 5 2.904e-05 2.904e-05 2.904e-05 0.01% MLMG::mgVcycle_up::0 36 2.89e-05 2.89e-05 2.89e-05 0.01% MLMG::mgVcycle_up::3 36 2.862e-05 2.862e-05 2.862e-05 0.01% MLMG::mgVcycle_up::2 36 2.736e-05 2.736e-05 2.736e-05 0.01% Castro::initMFs() 1 2.627e-05 2.627e-05 2.627e-05 0.01% MLMG::mgVcycle_up::1 36 2.586e-05 2.586e-05 2.586e-05 0.01% MLMG::oneIter() 36 2.584e-05 2.584e-05 2.584e-05 0.01% Castro::do_old_sources() 5 2.56e-05 2.56e-05 2.56e-05 0.01% Amr::writeSmallPlotFile() 1 2.491e-05 2.491e-05 2.491e-05 0.01% MLCellLinOp::solutionResidual() 42 2.489e-05 2.489e-05 2.489e-05 0.01% MLPoisson::define() 6 2.16e-05 2.16e-05 2.16e-05 0.01% MLLinOp::define() 6 2.119e-05 2.119e-05 2.119e-05 0.01% Castro::construct_old_gravity() 5 2.062e-05 2.062e-05 2.062e-05 0.01% MLMG::computeResidual() 36 1.983e-05 1.983e-05 1.983e-05 0.01% Castro::finalize_do_advance() 5 1.903e-05 1.903e-05 1.903e-05 0.01% Gravity::multilevel_solve_for_new_phi() 1 1.703e-05 1.703e-05 1.703e-05 0.00% MLMG::mgVcycle_bottom 36 1.578e-05 1.578e-05 1.578e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.482e-05 1.482e-05 1.482e-05 0.00% makeSFC 30 1.467e-05 1.467e-05 1.467e-05 0.00% FillPatchSingleLevel 20 1.421e-05 1.421e-05 1.421e-05 0.00% MultiFab::Add() 36 1.286e-05 1.286e-05 1.286e-05 0.00% Castro::do_new_sources() 5 9.54e-06 9.54e-06 9.54e-06 0.00% Amr::initSubcycle() 1 8.48e-06 8.48e-06 8.48e-06 0.00% Gravity::actual_multilevel_solve() 1 8.196e-06 8.196e-06 8.196e-06 0.00% Castro::check_for_nan() 10 8.191e-06 8.191e-06 8.191e-06 0.00% DistributionMapping::Distribute() 31 8.135e-06 8.135e-06 8.135e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.72e-06 7.72e-06 7.72e-06 0.00% Castro::apply_source_to_state() 10 5.878e-06 5.878e-06 5.878e-06 0.00% Gravity::swapTimeLevels() 5 5.828e-06 5.828e-06 5.828e-06 0.00% MLPoisson::prepareForSolve() 6 5.292e-06 5.292e-06 5.292e-06 0.00% MLMG::computeMLResidual() 6 4.451e-06 4.451e-06 4.451e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.674e-06 3.674e-06 3.674e-06 0.00% MLMG::getGradSolution() 6 3.138e-06 3.138e-06 3.138e-06 0.00% MultiFab::Copy() 6 2.892e-06 2.892e-06 2.892e-06 0.00% Gravity::set_mass_offset() 6 2.793e-06 2.793e-06 2.793e-06 0.00% MLMG::MLResNormInf() 6 2.166e-06 2.166e-06 2.166e-06 0.00% Castro::retry_advance_ctu() 5 1.981e-06 1.981e-06 1.981e-06 0.00% Castro::FluxRegCrseInit 5 1.879e-06 1.879e-06 1.879e-06 0.00% Amr::init() 1 1.511e-06 1.511e-06 1.511e-06 0.00% Castro::FluxRegFineAdd() 5 1.277e-06 1.277e-06 1.277e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.255e-06 1.255e-06 1.255e-06 0.00% AmrLevel::AmrLevel() 1 7.86e-07 7.86e-07 7.86e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3779 0.3779 0.3779 100.00% Amr::coarseTimeStep() 5 0.2956 0.2956 0.2956 78.22% Amr::timeStep() 5 0.2933 0.2933 0.2933 77.62% Castro::advance() 5 0.2879 0.2879 0.2879 76.18% Castro::subcycle_advance_ctu() 5 0.2827 0.2827 0.2827 74.80% Castro::do_advance_ctu() 5 0.2825 0.2825 0.2825 74.75% Castro::construct_new_gravity() 5 0.1443 0.1443 0.1443 38.17% Gravity::solve_phi_with_mlmg() 6 0.1401 0.1401 0.1401 37.07% Gravity::solve_for_phi() 5 0.1364 0.1364 0.1364 36.10% Gravity::actual_solve_with_mlmg() 6 0.1357 0.1357 0.1357 35.92% MLMG::solve() 6 0.1231 0.1231 0.1231 32.58% MLMG::oneIter() 36 0.1159 0.1159 0.1159 30.66% MLMG::mgVcycle() 36 0.1151 0.1151 0.1151 30.46% Castro::construct_ctu_hydro_source() 5 0.0977 0.0977 0.0977 25.85% MLCellLinOp::smooth() 720 0.05855 0.05855 0.05855 15.49% Amr::init() 1 0.04903 0.04903 0.04903 12.97% Amr::restart() 1 0.04903 0.04903 0.04903 12.97% MLCellLinOp::applyBC() 1946 0.04182 0.04182 0.04182 11.07% AmrLevel::restart() 1 0.0412 0.0412 0.0412 10.90% StateData::restartDoit() 4 0.04112 0.04112 0.04112 10.88% VisMF::Read() 3 0.04099 0.04099 0.04099 10.85% MLMG::mgVcycle_bottom 36 0.03547 0.03547 0.03547 9.39% MLMG::actualBottomSolve() 36 0.03546 0.03546 0.03546 9.38% MLCGSolver::bicgstab 36 0.0351 0.0351 0.0351 9.29% Amr::writePlotFile() 1 0.03259 0.03259 0.03259 8.62% VisMF::Write(FabArray) 1 0.03085 0.03085 0.03085 8.16% MLPoisson::Fsmooth() 1440 0.02711 0.02711 0.02711 7.17% Castro::clean_state() 30 0.02279 0.02279 0.02279 6.03% FillPatchIterator::Initialize 20 0.02001 0.02001 0.02001 5.29% FillPatchSingleLevel 20 0.01919 0.01919 0.01919 5.08% StateDataPhysBCFunct::() 20 0.0172 0.0172 0.0172 4.55% MLCellLinOp::apply() 500 0.01591 0.01591 0.01591 4.21% MLMG::mgVcycle_down::0 36 0.01536 0.01536 0.01536 4.06% MLMG::mgVcycle_up::0 36 0.01309 0.01309 0.01309 3.46% StateData::FillBoundary(geom) 160 0.01181 0.01181 0.01181 3.13% Castro::computeTemp() 30 0.01107 0.01107 0.01107 2.93% MLPoisson::define() 6 0.01023 0.01023 0.01023 2.71% amrex::Dot() 484 0.009617 0.009617 0.009617 2.54% Castro::initialize_do_advance() 5 0.009446 0.009446 0.009446 2.50% MLCellLinOp::correctionResidual() 216 0.009294 0.009294 0.009294 2.46% Castro::normalize_species() 30 0.008202 0.008202 0.008202 2.17% Castro::do_new_sources() 5 0.008193 0.008193 0.008193 2.17% MLMG:computeResOfCorrection() 180 0.008024 0.008024 0.008024 2.12% Gravity::get_new_grav_vector() 5 0.007666 0.007666 0.007666 2.03% MLMG::mgVcycle_down::1 36 0.007662 0.007662 0.007662 2.03% Castro::construct_old_gravity() 5 0.007583 0.007583 0.007583 2.01% Gravity::get_old_grav_vector() 5 0.007562 0.007562 0.007562 2.00% MLMG::mgVcycle_down::2 36 0.00746 0.00746 0.00746 1.97% amrex::Copy() 463 0.007416 0.007416 0.007416 1.96% MLCellLinOp::defineAuxData() 6 0.007147 0.007147 0.007147 1.89% MLMG::mgVcycle_down::3 36 0.007047 0.007047 0.007047 1.86% FabArray::FillBoundary() 1766 0.006975 0.006975 0.006975 1.85% FabArray::setVal() 537 0.006825 0.006825 0.006825 1.81% MLMG::mgVcycle_down::4 36 0.006762 0.006762 0.006762 1.79% FillBoundary_nowait() 1766 0.006618 0.006618 0.006618 1.75% FabArray::ParallelCopy() 380 0.006438 0.006438 0.006438 1.70% FabArray::ParallelCopy_nowait() 380 0.006314 0.006314 0.006314 1.67% CGSolver::sxay() 690 0.006307 0.006307 0.006307 1.67% FabArray::LinComb() 690 0.006127 0.006127 0.006127 1.62% MLCGSolver::ParallelAllReduce 659 0.005764 0.005764 0.005764 1.53% MLMG::mgVcycle_up::2 36 0.005718 0.005718 0.005718 1.51% MLMG::mgVcycle_up::1 36 0.005624 0.005624 0.005624 1.49% MLMG::addInterpCorrection() 180 0.005574 0.005574 0.005574 1.47% Castro::expand_state() 5 0.005566 0.005566 0.005566 1.47% FabArray::norminf() 278 0.005537 0.005537 0.005537 1.46% Castro::do_old_sources() 5 0.005504 0.005504 0.005504 1.46% MLMG::mgVcycle_up::4 36 0.005418 0.005418 0.005418 1.43% MLMG::mgVcycle_up::3 36 0.005408 0.005408 0.005408 1.43% Castro::post_timestep() 5 0.005357 0.005357 0.005357 1.42% amrex::average_down 180 0.005214 0.005214 0.005214 1.38% MLPoisson::Fapply() 500 0.005083 0.005083 0.005083 1.35% Castro::initialize_advance() 5 0.004753 0.004753 0.004753 1.26% Gravity::fill_multipole_BCs() 6 0.004236 0.004236 0.004236 1.12% Castro::post_restart() 1 0.004045 0.004045 0.004045 1.07% Gravity::multilevel_solve_for_new_phi() 1 0.003923 0.003923 0.003923 1.04% Gravity::actual_multilevel_solve() 1 0.003906 0.003906 0.003906 1.03% Castro::reset_internal_energy(MultiFab) 30 0.003524 0.003524 0.003524 0.93% MLCellLinOp::solutionResidual() 42 0.003246 0.003246 0.003246 0.86% Castro::enforce_min_density() 30 0.00301 0.00301 0.00301 0.80% Castro::estTimeStep() 10 0.002992 0.002992 0.002992 0.79% MLMG::prepareForSolve() 6 0.002924 0.002924 0.002924 0.77% FabArray::Xpay() 258 0.002881 0.002881 0.002881 0.76% MLCellLinOp::defineBC() 6 0.002836 0.002836 0.002836 0.75% MLMG::computeResidual() 36 0.002702 0.002702 0.002702 0.72% BndryData::define() 6 0.002682 0.002682 0.002682 0.71% Castro::computeNewDt() 5 0.00205 0.00205 0.00205 0.54% Castro::reset_internal_energy(Fab) 240 0.001803 0.001803 0.001803 0.48% Castro::construct_new_source() 25 0.00172 0.00172 0.00172 0.46% Castro::construct_new_gravity_source() 5 0.00162 0.00162 0.00162 0.43% Castro::construct_old_source() 25 0.001485 0.001485 0.001485 0.39% Castro::construct_old_gravity_source() 5 0.001449 0.001449 0.001449 0.38% amrex::Add() 72 0.001091 0.001091 0.001091 0.29% MLMG::ResNormInf() 42 0.0009254 0.0009254 0.0009254 0.24% Castro::apply_source_to_state() 10 0.0009247 0.0009247 0.0009247 0.24% FabArray::Saxpy() 10 0.0009188 0.0009188 0.0009188 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008953 0.0008953 0.0008953 0.24% MLCellLinOp::setLevelBC() 6 0.0008144 0.0008144 0.0008144 0.22% MLMG::getGradSolution() 6 0.0007626 0.0007626 0.0007626 0.20% MLCellLinOp::compGrad() 6 0.0007595 0.0007595 0.0007595 0.20% FabArrayBase::getCPC() 632 0.0007543 0.0007543 0.0007543 0.20% MultiFab::Add() 36 0.0007367 0.0007367 0.0007367 0.19% FabArray::setDomainBndry() 20 0.0006755 0.0006755 0.0006755 0.18% FabArray::mult() 22 0.000657 0.000657 0.000657 0.17% MLPoisson::prepareForSolve() 6 0.0006409 0.0006409 0.0006409 0.17% MLCellLinOp::prepareForSolve() 6 0.0006356 0.0006356 0.0006356 0.17% Castro::check_for_nan() 10 0.0006101 0.0006101 0.0006101 0.16% MultiFab::contains_nan() 10 0.0006019 0.0006019 0.0006019 0.16% MLMG::computeMLResidual() 6 0.0005682 0.0005682 0.0005682 0.15% Gravity::update_max_rhs() 6 0.0005027 0.0005027 0.0005027 0.13% Castro::enforce_speed_limit() 30 0.0004756 0.0004756 0.0004756 0.13% Amr::InitAmr() 1 0.0003969 0.0003969 0.0003969 0.11% FabArrayBase::CPC::define() 244 0.0003887 0.0003887 0.0003887 0.10% Castro::finalize_advance() 5 0.0003413 0.0003413 0.0003413 0.09% FabArrayBase::getFB() 1766 0.0003231 0.0003231 0.0003231 0.09% Gravity::swapTimeLevels() 5 0.0002469 0.0002469 0.0002469 0.07% MLLinOp::define() 6 0.0002299 0.0002299 0.0002299 0.06% MLLinOp::defineGrids() 6 0.0002087 0.0002087 0.0002087 0.06% MultiFab::Copy() 6 0.0001792 0.0001792 0.0001792 0.05% Castro::buildMetrics() 1 0.0001547 0.0001547 0.0001547 0.04% MLMG::MLResNormInf() 6 0.0001493 0.0001493 0.0001493 0.04% MultiFab::max() 6 0.0001445 0.0001445 0.0001445 0.04% MLMG::MLRhsNormInf() 6 0.0001149 0.0001149 0.0001149 0.03% Castro::create_source_corrector() 5 0.0001091 0.0001091 0.0001091 0.03% FabArrayBase::FB::FB() 26 5.472e-05 5.472e-05 5.472e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.935e-05 2.935e-05 2.935e-05 0.01% Castro::swap_state_time_levels() 5 2.904e-05 2.904e-05 2.904e-05 0.01% Castro::initMFs() 1 2.627e-05 2.627e-05 2.627e-05 0.01% Amr::writeSmallPlotFile() 1 2.491e-05 2.491e-05 2.491e-05 0.01% makeSFC 30 2.163e-05 2.163e-05 2.163e-05 0.01% Castro::finalize_do_advance() 5 1.903e-05 1.903e-05 1.903e-05 0.01% Amr::initSubcycle() 1 8.48e-06 8.48e-06 8.48e-06 0.00% DistributionMapping::Distribute() 31 8.135e-06 8.135e-06 8.135e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.851e-06 4.851e-06 4.851e-06 0.00% Gravity::set_mass_offset() 6 2.793e-06 2.793e-06 2.793e-06 0.00% Castro::retry_advance_ctu() 5 1.981e-06 1.981e-06 1.981e-06 0.00% Castro::FluxRegCrseInit 5 1.879e-06 1.879e-06 1.879e-06 0.00% Castro::FluxRegFineAdd() 5 1.277e-06 1.277e-06 1.277e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.255e-06 1.255e-06 1.255e-06 0.00% AmrLevel::AmrLevel() 1 7.86e-07 7.86e-07 7.86e-07 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2545 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.11-34-g88fe04f00600) finalized