Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.07-12-g8e40952af9ab) initialized
Starting run at 08:49:33 UTC on 2022-07-21.
Successfully read inputs file ...
Castro git describe: 22.07-10-gf80c3f7eb
AMReX git describe: 22.07-12-g8e40952af
Microphysics git describe: 22.07-15-ga4952214
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
  Level 0 8 grids 262144 cells 100 % of domain
  smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32
CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.041926767 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.024224849 seconds
[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.048146619
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05
[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.049981713
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05
[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.050452164
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05
[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.048621912
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05
[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.063638041
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05
CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.038703908 secs.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.077525063
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.0657509
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.051376414
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.052109217
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.067471849
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.038643754 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.023890636 seconds
Ending run at 08:49:34 UTC on 2022-07-21.
Run time = 0.791498115
Run time without initialization = 0.676920408
Average number of zones advanced per microsecond: 3.873
Average number of zones advanced per microsecond per rank: 3.873
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.7915 ... 0.7915 ... 0.7915
--------------------------------------------------------------------------------------------
Name  NCalls  Excl. Min  Excl. Avg  Excl. Max  Max %
--------------------------------------------------------------------------------------------
Castro::construct_ctu_hydro_source()  10  0.1980  0.1980  0.1980  25.02%
VisMF::Write(FabArray)  11  0.1606  0.1606  0.1606  20.29%
MLCellLinOp::applyBC()  4379  0.07897  0.07897  0.07897  9.98%
MLPoisson::Fsmooth()  3240  0.06268  0.06268  0.06268  7.92%
MLCGSolver::bicgstab  81  0.0235  0.0235  0.0235  2.97%
StateData::FillBoundary(geom)  328  0.02312  0.02312  0.02312  2.92%
MultiFab::Dot()  1100  0.02173  0.02173  0.02173  2.75%
Castro::computeTemp()  63  0.01495  0.01495  0.01495  1.89%
MultiFab::LinComb()  1566  0.01403  0.01403  0.01403  1.77%
FillBoundary_nowait()  3974  0.01395  0.01395  0.01395  1.76%
FabArray::setVal()  1135  0.01392  0.01392  0.01392  1.76%
FabArray::ParallelCopy_nowait()  851  0.01277  0.01277  0.01277  1.61%
Castro::normalize_species()  62  0.0126  0.0126  0.0126  1.59%
StateDataPhysBCFunct::()  41  0.01174  0.01174  0.01174  1.48%
MLPoisson::Fapply()  1128  0.01148  0.01148  0.01148  1.45%
MLCellLinOp::defineAuxData()  11  0.01134  0.01134  0.01134  1.43%
Castro::enforce_min_density()  62  0.009261  0.009261  0.009261  1.17%
Gravity::fill_multipole_BCs()  11  0.007985  0.007985  0.007985  1.01%
MLMG::addInterpCorrection()  405  0.007331  0.007331  0.007331  0.93%
amrex::average_down  405  0.006693  0.006693  0.006693  0.85%
MultiFab::Xpay()  578  0.006469  0.006469  0.006469  0.82%
Castro::estTimeStep()  21  0.004929  0.004929  0.004929  0.62%
Castro::do_advance_ctu()  10  0.00443  0.00443  0.00443  0.56%
Amr::checkPoint()  3  0.004348  0.004348  0.004348  0.55%
Castro::reset_internal_energy(MultiFab)  63  0.004296  0.004296  0.004296  0.54%
BndryData::define()  11  0.003727  0.003727  0.003727  0.47%
Castro::construct_new_gravity_source()  10  0.003215  0.003215  0.003215  0.41%
Castro::construct_old_gravity_source()  10  0.002602  0.002602  0.002602  0.33%
Amr::writePlotFile()  2  0.002527  0.002527  0.002527  0.32%
MLMG::ResNormInf()  92  0.001923  0.001923  0.001923  0.24%
Gravity::get_new_grav_vector()  11  0.001911  0.001911  0.001911  0.24%
MultiFab::Saxpy()  20  0.001809  0.001809  0.001809  0.23%
Castro::expand_state()  10  0.001728  0.001728  0.001728  0.22%
Gravity::get_old_grav_vector()  10  0.001717  0.001717  0.001717  0.22%
MLMG::oneIter()  81  0.001667  0.001667  0.001667  0.21%
FabArray::setVal(val, thecmd, scomp, ncomp)  462  0.001605  0.001605  0.001605  0.20%
MLCellLinOp::setLevelBC()  11  0.001512  0.001512  0.001512  0.19%
Castro::reset_internal_energy(Fab)  504  0.001462  0.001462  0.001462  0.18%
Gravity::actual_solve_with_mlmg()  11  0.00135  0.00135  0.00135  0.17%
FabArray::mult()  43  0.00133  0.00133  0.00133  0.17%
FabArray::setDomainBndry()  41  0.001308  0.001308  0.001308  0.17%
Castro::enforce_speed_limit()  62  0.001259  0.001259  0.001259  0.16%
Castro::initData()  1  0.001207  0.001207  0.001207  0.15%
MultiFab::contains_nan()  20  0.00118  0.00118  0.00118  0.15%
MLCellLinOp::smooth()  1620  0.001164  0.001164  0.001164  0.15%
MLCellLinOp::prepareForSolve()  11  0.001153  0.001153  0.001153  0.15%
MLMG::prepareForSolve()  11  0.001034  0.001034  0.001034  0.13%
MLCellLinOp::compGrad()  11  0.0009084  0.0009084  0.0009084  0.11%
FabArray::FillBoundary()  3974  0.0007677  0.0007677  0.0007677  0.10%
FabArrayBase::getCPC()  1313  0.0007257  0.0007257  0.0007257  0.09%
FabArrayBase::CPC::define()  454  0.0006687  0.0006687  0.0006687  0.08%
FabArrayBase::getFB()  3974  0.0006089  0.0006089  0.0006089  0.08%
Gravity::solve_for_phi()  10  0.0004898  0.0004898  0.0004898  0.06%
Amr::InitAmr()  1  0.0004593  0.0004593  0.0004593  0.06%
MLCellLinOp::apply()  1128  0.0004548  0.0004548  0.0004548  0.06%
Gravity::update_max_rhs()  11  0.0004064  0.0004064  0.0004064  0.05%
CGSolver::sxay()  1566  0.0003997  0.0003997  0.0003997  0.05%
Amr::coarseTimeStep()  10  0.0003238  0.0003238  0.0003238  0.04%
FillPatchIterator::Initialize  41  0.0002907  0.0002907  0.0002907  0.04%
MLCGSolver::ParallelAllReduce  1495  0.0002882  0.0002882  0.0002882  0.04%
MLCellLinOp::defineBC()  11  0.0002809  0.0002809  0.0002809  0.04%
FabArray::ParallelCopy()  851  0.0002791  0.0002791  0.0002791  0.04%
main()  1  0.0002601  0.0002601  0.0002601  0.03%
MultiFab::Copy()  11  0.0002559  0.0002559  0.0002559  0.03%
MultiFab::max()  11  0.0002543  0.0002543  0.0002543  0.03%
Castro::subcycle_advance_ctu()  10  0.0002254  0.0002254  0.0002254  0.03%
MLCellLinOp::correctionResidual()  486  0.0002227  0.0002227  0.0002227  0.03%
Amr::timeStep()  10  0.0002078  0.0002078  0.0002078  0.03%
MLMG::mgVcycle()  81  0.000207  0.000207  0.000207  0.03%
Castro::construct_new_gravity()  10  0.0002018  0.0002018  0.0002018  0.03%
MLMG::MLRhsNormInf()  11  0.0002004  0.0002004  0.0002004  0.03%
MLLinOp::defineGrids()  11  0.000153  0.000153  0.000153  0.02%
StateData::checkPoint()  12  0.0001322  0.0001322  0.0001322  0.02%
MLMG:computeResOfCorrection()  405  0.0001145  0.0001145  0.0001145  0.01%
MLMG::actualBottomSolve()  81  0.0001001  0.0001001  0.0001001  0.01%
MLMG::mgVcycle_down::0  81  9.211e-05  9.211e-05  9.211e-05  0.01%
Castro::initialize_advance()  10  8.522e-05  8.522e-05  8.522e-05  0.01%
Castro::Castro()  1  8.518e-05  8.518e-05  8.518e-05  0.01%
FabArrayBase::FB::FB()  56  8.232e-05  8.232e-05  8.232e-05  0.01%
Castro::clean_state()  62  7.648e-05  7.648e-05  7.648e-05  0.01%
MLMG::mgVcycle_down::1  81  7.618e-05  7.618e-05  7.618e-05  0.01%
MLMG::solve()  11  7.489e-05  7.489e-05  7.489e-05  0.01%
MLMG::mgVcycle_down::2  81  7.477e-05  7.477e-05  7.477e-05  0.01%
AmrLevel::checkPoint()  3  7.332e-05  7.332e-05  7.332e-05  0.01%
MLMG::mgVcycle_down::3  81  6.905e-05  6.905e-05  6.905e-05  0.01%
MLMG::mgVcycle_down::4  81  6.89e-05  6.89e-05  6.89e-05  0.01%
Castro::initialize_do_advance()  10  6.597e-05  6.597e-05  6.597e-05  0.01%
Castro::advance()  10  6.05e-05  6.05e-05  6.05e-05  0.01%
MLMG::mgVcycle_up::4  81  5.809e-05  5.809e-05  5.809e-05  0.01%
Castro::finalize_advance()  10  5.339e-05  5.339e-05  5.339e-05  0.01%
MLMG::mgVcycle_up::0  81  5.203e-05  5.203e-05  5.203e-05  0.01%
MLMG::mgVcycle_up::1  81  4.872e-05  4.872e-05  4.872e-05  0.01%
MLCellLinOp::solutionResidual()  92  4.859e-05  4.859e-05  4.859e-05  0.01%
MLMG::mgVcycle_up::2  81  4.829e-05  4.829e-05  4.829e-05  0.01%
MLMG::mgVcycle_up::3  81  4.804e-05  4.804e-05  4.804e-05  0.01%
Castro::construct_new_source()  50  4.085e-05  4.085e-05  4.085e-05  0.01%
Castro::swap_state_time_levels()  10  3.899e-05  3.899e-05  3.899e-05  0.00%
StateData::define()  4  3.707e-05  3.707e-05  3.707e-05  0.00%
Castro::finalize_do_advance()  10  3.55e-05  3.55e-05  3.55e-05  0.00%
Castro::enforce_consistent_e()  1  3.179e-05  3.179e-05  3.179e-05  0.00%
Gravity::actual_multilevel_solve()  1  3.037e-05  3.037e-05  3.037e-05  0.00%
MLMG::computeResidual()  81  2.967e-05  2.967e-05  2.967e-05  0.00%
FillPatchSingleLevel  41  2.931e-05  2.931e-05  2.931e-05  0.00%
MLMG::mgVcycle_bottom  81  2.892e-05  2.892e-05  2.892e-05  0.00%
makeSFC  55  2.722e-05  2.722e-05  2.722e-05  0.00%
Amr::writeSmallPlotFile()  1  2.623e-05  2.623e-05  2.623e-05  0.00%
Gravity::solve_phi_with_mlmg()  11  2.597e-05  2.597e-05  2.597e-05  0.00%
Castro::initMFs()  1  2.589e-05  2.589e-05  2.589e-05  0.00%
Amr::defBaseLevel()  1  2.515e-05  2.515e-05  2.515e-05  0.00%
MLLinOp::define()  11  2.417e-05  2.417e-05  2.417e-05  0.00%
MLPoisson::define()  11  2.386e-05  2.386e-05  2.386e-05  0.00%
Castro::buildMetrics()  1  2.363e-05  2.363e-05  2.363e-05  0.00%
Amr::FinalizeInit()  1  2.325e-05  2.325e-05  2.325e-05  0.00%
Gravity::multilevel_solve_for_new_phi()  1  1.88e-05  1.88e-05  1.88e-05  0.00%
Castro::construct_old_source()  50  1.833e-05  1.833e-05  1.833e-05  0.00%
Castro::do_new_sources()  10  1.781e-05  1.781e-05  1.781e-05  0.00%
DistributionMapping::Distribute()  56  1.599e-05  1.599e-05  1.599e-05  0.00%
Castro::do_old_sources()  10  1.589e-05  1.589e-05  1.589e-05  0.00%
MLLinOp::makeAgglomeratedDMap  11  1.349e-05  1.349e-05  1.349e-05  0.00%
Castro::apply_source_to_state()  20  1.228e-05  1.228e-05  1.228e-05  0.00%
Castro::check_for_nan()  20  1.146e-05  1.146e-05  1.146e-05  0.00%
Castro::construct_old_gravity()  10  9.964e-06  9.964e-06  9.964e-06  0.00%
Gravity::swapTimeLevels()  10  9.337e-06  9.337e-06  9.337e-06  0.00%
MLPoisson::prepareForSolve()  11  8.318e-06  8.318e-06  8.318e-06  0.00%
Castro::post_timestep()  10  8.172e-06  8.172e-06  8.172e-06  0.00%
Amr::initSubcycle()  1  8.004e-06  8.004e-06  8.004e-06  0.00%
AmrLevel::AmrLevel(dm)  1  7.461e-06  7.461e-06  7.461e-06  0.00%
Amr::InitializeInit()  1  7.091e-06  7.091e-06  7.091e-06  0.00%
MLMG::computeMLResidual()  11  6.79e-06  6.79e-06  6.79e-06  0.00%
Castro::computeNewDt()  9  6.299e-06  6.299e-06  6.299e-06  0.00%
MLMG::getGradSolution()  11  6.127e-06  6.127e-06  6.127e-06  0.00%
AmrLevel::checkPointPost()  3  6.018e-06  6.018e-06  6.018e-06  0.00%
MLMG::buildFineMask()  11  5.016e-06  5.016e-06  5.016e-06  0.00%
MLMG::MLResNormInf()  11  4.651e-06  4.651e-06  4.651e-06  0.00%
Castro::retry_advance_ctu()  10  4.217e-06  4.217e-06  4.217e-06  0.00%
Gravity::set_mass_offset()  11  4.007e-06  4.007e-06  4.007e-06  0.00%
Castro::post_init()  1  3.818e-06  3.818e-06  3.818e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.526e-06  3.526e-06  3.526e-06  0.00%
Castro::FluxRegFineAdd()  10  3.253e-06  3.253e-06  3.253e-06  0.00%
Castro::create_source_corrector()  10  3.134e-06  3.134e-06  3.134e-06  0.00%
Castro::FluxRegCrseInit  10  2.838e-06  2.838e-06  2.838e-06  0.00%
Amr::init()  1  2.405e-06  2.405e-06  2.405e-06  0.00%
Castro::computeInitialDt()  2  2.165e-06  2.165e-06  2.165e-06  0.00%
AmrLevel::checkPointPre()  3  2.148e-06  2.148e-06  2.148e-06  0.00%
MLLinOp::makeSubCommunicator()  11  1.817e-06  1.817e-06  1.817e-06  0.00%
Castro::post_regrid()  1  1.185e-06  1.185e-06  1.185e-06  0.00%
Amr::initialInit()  1  1.099e-06  1.099e-06  1.099e-06  0.00%
--------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.7915  0.7915  0.7915  100.00%
Amr::coarseTimeStep()  10  0.6528  0.6528  0.6528  82.48%
Amr::timeStep()  10  0.5723  0.5723  0.5723  72.31%
Castro::advance()  10  0.5651  0.5651  0.5651  71.40%
Castro::subcycle_advance_ctu()  10  0.5549  0.5549  0.5549  70.10%
Castro::do_advance_ctu()  10  0.5547  0.5547  0.5547  70.08%
Gravity::solve_phi_with_mlmg()  11  0.3072  0.3072  0.3072  38.82%
Gravity::actual_solve_with_mlmg()  11  0.299  0.299  0.299  37.78%
Castro::construct_new_gravity()  10  0.2821  0.2821  0.2821  35.64%
MLMG::solve()  11  0.277  0.277  0.277  34.99%
Gravity::solve_for_phi()  10  0.2673  0.2673  0.2673  33.77%
MLMG::oneIter()  81  0.2626  0.2626  0.2626  33.18%
MLMG::mgVcycle()  81  0.261  0.261  0.261  32.97%
Castro::construct_ctu_hydro_source()  10  0.198  0.198  0.198  25.02%
VisMF::Write(FabArray)  11  0.1606  0.1606  0.1606  20.29%
MLCellLinOp::smooth()  1620  0.1341  0.1341  0.1341  16.94%
Amr::checkPoint()  3  0.1194  0.1194  0.1194  15.09%
AmrLevel::checkPoint()  3  0.1151  0.1151  0.1151  14.54%
StateData::checkPoint()  12  0.115  0.115  0.115  14.53%
Amr::init()  1  0.114  0.114  0.114  14.40%
MLCellLinOp::applyBC()  4379  0.09439  0.09439  0.09439  11.92%
MLMG::mgVcycle_bottom  81  0.08013  0.08013  0.08013  10.12%
MLMG::actualBottomSolve()  81  0.0801  0.0801  0.0801  10.12%
MLCGSolver::bicgstab  81  0.07929  0.07929  0.07929  10.02%
MLPoisson::Fsmooth()  3240  0.06268  0.06268  0.06268  7.92%
Amr::writePlotFile()  2  0.04824  0.04824  0.04824  6.09%
Amr::initialInit()  1  0.04771  0.04771  0.04771  6.03%
Amr::FinalizeInit()  1  0.04364  0.04364  0.04364  5.51%
Castro::clean_state()  62  0.04308  0.04308  0.04308  5.44%
Castro::post_init()  1  0.04228  0.04228  0.04228  5.34%
Gravity::multilevel_solve_for_new_phi()  1  0.04052  0.04052  0.04052  5.12%
Gravity::actual_multilevel_solve()  1  0.0405  0.0405  0.0405  5.12%
FillPatchIterator::Initialize  41  0.04044  0.04044  0.04044  5.11%
FillPatchSingleLevel  41  0.03884  0.03884  0.03884  4.91%
MLCellLinOp::apply()  1128  0.03557  0.03557  0.03557  4.49%
MLMG::mgVcycle_down::0  81  0.03488  0.03488  0.03488  4.41%
StateDataPhysBCFunct::()  41  0.03486  0.03486  0.03486  4.40%
MLMG::mgVcycle_up::0  81  0.02991  0.02991  0.02991  3.78%
StateData::FillBoundary(geom)  328  0.02312  0.02312  0.02312  2.92%
MultiFab::Dot()  1100  0.02173  0.02173  0.02173  2.75%
MLCellLinOp::correctionResidual()  486  0.02084  0.02084  0.02084  2.63%
Castro::computeTemp()  63  0.02071  0.02071  0.02071  2.62%
Castro::initialize_do_advance()  10  0.01953  0.01953  0.01953  2.47%
MLMG:computeResOfCorrection()  405  0.01797  0.01797  0.01797  2.27%
MLPoisson::define()  11  0.01781  0.01781  0.01781  2.25%
MLMG::mgVcycle_down::1  81  0.01734  0.01734  0.01734  2.19%
MLMG::mgVcycle_down::2  81  0.01691  0.01691  0.01691  2.14%
Gravity::get_new_grav_vector()  11  0.01628  0.01628  0.01628  2.06%
MLMG::mgVcycle_down::3  81  0.01605  0.01605  0.01605  2.03%
FabArray::FillBoundary()  3974  0.01541  0.01541  0.01541  1.95%
MLMG::mgVcycle_down::4  81  0.01527  0.01527  0.01527  1.93%
FillBoundary_nowait()  3974  0.01464  0.01464  0.01464  1.85%
CGSolver::sxay()  1566  0.01443  0.01443  0.01443  1.82%
Castro::construct_old_gravity()  10  0.01433  0.01433  0.01433  1.81%
Gravity::get_old_grav_vector()  10  0.01432  0.01432  0.01432  1.81%
MultiFab::LinComb()  1566  0.01403  0.01403  0.01403  1.77%
FabArray::setVal()  1135  0.01392  0.01392  0.01392  1.76%
FabArray::ParallelCopy()  851  0.01384  0.01384  0.01384  1.75%
FabArray::ParallelCopy_nowait()  851  0.01356  0.01356  0.01356  1.71%
MLMG::mgVcycle_up::2  81  0.01302  0.01302  0.01302  1.64%
MLCGSolver::ParallelAllReduce  1495  0.01297  0.01297  0.01297  1.64%
MLMG::mgVcycle_up::1  81  0.01278  0.01278  0.01278  1.62%
MLCellLinOp::defineAuxData()  11  0.01264  0.01264  0.01264  1.60%
Castro::normalize_species()  62  0.0126  0.0126  0.0126  1.59%
MLMG::mgVcycle_up::3  81  0.01231  0.01231  0.01231  1.55%
MLMG::addInterpCorrection()  405  0.01226  0.01226  0.01226  1.55%
MLMG::mgVcycle_up::4  81  0.01215  0.01215  0.01215  1.53%
Castro::do_new_sources()  10  0.01204  0.01204  0.01204  1.52%
amrex::average_down  405  0.01166  0.01166  0.01166  1.47%
MLPoisson::Fapply()  1128  0.01148  0.01148  0.01148  1.45%
Castro::expand_state()  10  0.01132  0.01132  0.01132  1.43%
Castro::do_old_sources()  10  0.01013  0.01013  0.01013  1.28%
Castro::initialize_advance()  10  0.01011  0.01011  0.01011  1.28%
Castro::enforce_min_density()  62  0.009261  0.009261  0.009261  1.17%
Gravity::fill_multipole_BCs()  11  0.007985  0.007985  0.007985  1.01%
MLCellLinOp::solutionResidual()  92  0.007065  0.007065  0.007065  0.89%
Castro::post_timestep()  10  0.006998  0.006998  0.006998  0.88%
MultiFab::Xpay()  578  0.006469  0.006469  0.006469  0.82%
MLMG::computeResidual()  81  0.00608  0.00608  0.00608  0.77%
Castro::reset_internal_energy(MultiFab)  63  0.005759  0.005759  0.005759  0.73%
MLMG::prepareForSolve()  11  0.005039  0.005039  0.005039  0.64%
Castro::estTimeStep()  21  0.004929  0.004929  0.004929  0.62%
MLCellLinOp::defineBC()  11  0.00492  0.00492  0.00492  0.62%
BndryData::define()  11  0.004639  0.004639  0.004639  0.59%
Amr::InitializeInit()  1  0.004075  0.004075  0.004075  0.51%
Amr::defBaseLevel()  1  0.004068  0.004068  0.004068  0.51%
Castro::initData()  1  0.003561  0.003561  0.003561  0.45%
Castro::construct_new_source()  50  0.003255  0.003255  0.003255  0.41%
Castro::construct_new_gravity_source()  10  0.003215  0.003215  0.003215  0.41%
Castro::construct_old_source()  50  0.002621  0.002621  0.002621  0.33%
Castro::construct_old_gravity_source()  10  0.002602  0.002602  0.002602  0.33%
Castro::computeNewDt()  9  0.002247  0.002247  0.002247  0.28%
MLMG::ResNormInf()  92  0.001923  0.001923  0.001923  0.24%
Castro::apply_source_to_state()  20  0.001821  0.001821  0.001821  0.23%
MultiFab::Saxpy()  20  0.001809  0.001809  0.001809  0.23%
FabArray::setVal(val, thecmd, scomp, ncomp)  462  0.001605  0.001605  0.001605  0.20%
MLCellLinOp::setLevelBC()  11  0.001512  0.001512  0.001512  0.19%
Castro::reset_internal_energy(Fab)  504  0.001462  0.001462  0.001462  0.18%
FabArrayBase::getCPC()  1313  0.001394  0.001394  0.001394  0.18%
MLMG::getGradSolution()  11  0.001393  0.001393  0.001393  0.18%
MLCellLinOp::compGrad()  11  0.001387  0.001387  0.001387  0.18%
FabArray::mult()  43  0.00133  0.00133  0.00133  0.17%
FabArray::setDomainBndry()  41  0.001308  0.001308  0.001308  0.17%
Castro::enforce_speed_limit()  62  0.001259  0.001259  0.001259  0.16%
Castro::check_for_nan()  20  0.001192  0.001192  0.001192  0.15%
MultiFab::contains_nan()  20  0.00118  0.00118  0.00118  0.15%
Castro::post_regrid()  1  0.001163  0.001163  0.001163  0.15%
MLPoisson::prepareForSolve()  11  0.001161  0.001161  0.001161  0.15%
MLCellLinOp::prepareForSolve()  11  0.001153  0.001153  0.001153  0.15%
MLMG::computeMLResidual()  11  0.001021  0.001021  0.001021  0.13%
Gravity::update_max_rhs()  11  0.0008168  0.0008168  0.0008168  0.10%
FabArrayBase::getFB()  3974  0.0006912  0.0006912  0.0006912  0.09%
FabArrayBase::CPC::define()  454  0.0006687  0.0006687  0.0006687  0.08%
Castro::computeInitialDt()  2  0.0006519  0.0006519  0.0006519  0.08%
Amr::InitAmr()  1  0.0004673  0.0004673  0.0004673  0.06%
Gravity::swapTimeLevels()  10  0.0004363  0.0004363  0.0004363  0.06%
Castro::Castro()  1  0.0004328  0.0004328  0.0004328  0.05%
MultiFab::Copy()  11  0.0002559  0.0002559  0.0002559  0.03%
MultiFab::max()  11  0.0002543  0.0002543  0.0002543  0.03%
MLMG::MLResNormInf()  11  0.0002536  0.0002536  0.0002536  0.03%
MLLinOp::define()  11  0.0002344  0.0002344  0.0002344  0.03%
MLLinOp::defineGrids()  11  0.0002102  0.0002102  0.0002102  0.03%
MLMG::MLRhsNormInf()  11  0.0002004  0.0002004  0.0002004  0.03%
Castro::buildMetrics()  1  0.00016  0.00016  0.00016  0.02%
FabArrayBase::FB::FB()  56  8.232e-05  8.232e-05  8.232e-05  0.01%
Castro::finalize_advance()  10  5.948e-05  5.948e-05  5.948e-05  0.01%
MLLinOp::makeAgglomeratedDMap  11  5.542e-05  5.542e-05  5.542e-05  0.01%
AmrLevel::AmrLevel(dm)  1  4.453e-05  4.453e-05  4.453e-05  0.01%
makeSFC  55  4.193e-05  4.193e-05  4.193e-05  0.01%
Castro::swap_state_time_levels()  10  3.899e-05  3.899e-05  3.899e-05  0.00%
StateData::define()  4  3.707e-05  3.707e-05  3.707e-05  0.00%
Castro::finalize_do_advance()  10  3.55e-05  3.55e-05  3.55e-05  0.00%
Castro::enforce_consistent_e()  1  3.179e-05  3.179e-05  3.179e-05  0.00%
Amr::writeSmallPlotFile()  1  2.623e-05  2.623e-05  2.623e-05  0.00%
Castro::initMFs()  1  2.589e-05  2.589e-05  2.589e-05  0.00%
DistributionMapping::Distribute()  56  1.599e-05  1.599e-05  1.599e-05  0.00%
Amr::initSubcycle()  1  8.004e-06  8.004e-06  8.004e-06  0.00%
AmrLevel::checkPointPost()  3  6.018e-06  6.018e-06  6.018e-06  0.00%
MLMG::buildFineMask()  11  5.016e-06  5.016e-06  5.016e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  4.804e-06  4.804e-06  4.804e-06  0.00%
Castro::retry_advance_ctu()  10  4.217e-06  4.217e-06  4.217e-06  0.00%
Gravity::set_mass_offset()  11  4.007e-06  4.007e-06  4.007e-06  0.00%
Castro::FluxRegFineAdd()  10  3.253e-06  3.253e-06  3.253e-06  0.00%
Castro::create_source_corrector()  10  3.134e-06  3.134e-06  3.134e-06  0.00%
Castro::FluxRegCrseInit  10  2.838e-06  2.838e-06  2.838e-06  0.00%
AmrLevel::checkPointPre()  3  2.148e-06  2.148e-06  2.148e-06  0.00%
MLLinOp::makeSubCommunicator()  11  1.817e-06  1.817e-06  1.817e-06  0.00%
--------------------------------------------------------------------------------------------
Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]
Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.07-12-g8e40952af9ab) finalized
Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.07-12-g8e40952af9ab) initialized
Starting run at 08:49:35 UTC on 2022-07-21.
Successfully read inputs file ...
Castro git describe: 22.07-10-gf80c3f7eb
AMReX git describe: 22.07-12-g8e40952af
Microphysics git describe: 22.07-15-ga4952214
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.414151369
Restart time = 0.080742824 seconds.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.049983843
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.048380401
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.048944385
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.062820637
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.081565226
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.025499416 seconds
Ending run at 08:49:35 UTC on 2022-07-21.
Run time = 0.398860894
Run time without initialization = 0.317579493
Average number of zones advanced per microsecond: 4.127
Average number of zones advanced per microsecond per rank: 4.127
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.3989 ... 0.3989 ... 0.3989
--------------------------------------------------------------------------------------------
Name  NCalls  Excl. Min  Excl. Avg  Excl. Max  Max %
--------------------------------------------------------------------------------------------
Castro::construct_ctu_hydro_source()  5  0.0973  0.0973  0.0973  24.39%
VisMF::Read()  3  0.03841  0.03841  0.03841  9.63%
Amr::restart()  1  0.03819  0.03819  0.03819  9.57%
MLCellLinOp::applyBC()  1946  0.03389  0.03389  0.03389  8.50%
MLPoisson::Fsmooth()  1440  0.02649  0.02649  0.02649  6.64%
VisMF::Write(FabArray)  1  0.02405  0.02405  0.02405  6.03%
StateData::FillBoundary(geom)  160  0.01133  0.01133  0.01133  2.84%
MLCGSolver::bicgstab  36  0.01002  0.01002  0.01002  2.51%
Castro::normalize_species()  30  0.009342  0.009342  0.009342  2.34%
MultiFab::Dot()  484  0.009195  0.009195  0.009195  2.31%
Castro::computeTemp()  30  0.008433  0.008433  0.008433  2.11%
FabArray::setVal()  537  0.00662  0.00662  0.00662  1.66%
FillBoundary_nowait()  1766  0.006117  0.006117  0.006117  1.53%
MLCellLinOp::defineAuxData()  6  0.006016  0.006016  0.006016  1.51%
MultiFab::LinComb()  690  0.005922  0.005922  0.005922  1.48%
FabArray::ParallelCopy_nowait()  380  0.005789  0.005789  0.005789  1.45%
Gravity::fill_multipole_BCs()  6  0.005277  0.005277  0.005277  1.32%
MLPoisson::Fapply()  500  0.004918  0.004918  0.004918  1.23%
StateDataPhysBCFunct::()  20  0.004893  0.004893  0.004893  1.23%
Castro::enforce_min_density()  30  0.004733  0.004733  0.004733  1.19%
MLMG::addInterpCorrection()  180  0.003158  0.003158  0.003158  0.79%
amrex::average_down  180  0.002924  0.002924  0.002924  0.73%
MultiFab::Xpay()  258  0.002799  0.002799  0.002799  0.70%
Castro::estTimeStep()  10  0.00224  0.00224  0.00224  0.56%
Castro::do_advance_ctu()  5  0.002101  0.002101  0.002101  0.53%
BndryData::define()  6  0.001997  0.001997  0.001997  0.50%
Castro::reset_internal_energy(MultiFab)  30  0.001716  0.001716  0.001716  0.43%
Castro::construct_new_gravity_source()  5  0.001702  0.001702  0.001702  0.43%
Amr::writePlotFile()  1  0.001537  0.001537  0.001537  0.39%
Castro::construct_old_gravity_source()  5  0.001449  0.001449  0.001449  0.36%
Castro::enforce_speed_limit()  30  0.001169  0.001169  0.001169  0.29%
Gravity::get_old_grav_vector()  5  0.0009467  0.0009467  0.0009467  0.24%
MultiFab::Saxpy()  10  0.0009189  0.0009189  0.0009189  0.23%
Castro::reset_internal_energy(Fab)  240  0.0008754  0.0008754  0.0008754  0.22%
Castro::expand_state()  5  0.0008667  0.0008667  0.0008667  0.22%
Gravity::get_new_grav_vector()  5  0.0008568  0.0008568  0.0008568  0.21%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0008515  0.0008515  0.0008515  0.21%
MLMG::ResNormInf()  42  0.0008495  0.0008495  0.0008495  0.21%
MLCellLinOp::setLevelBC()  6  0.0007935  0.0007935  0.0007935  0.20%
Gravity::actual_solve_with_mlmg()  6  0.0007744  0.0007744  0.0007744  0.19%
MLMG::oneIter()  36  0.00073  0.00073  0.00073  0.18%
FabArray::mult()  22  0.0006502  0.0006502  0.0006502  0.16%
FabArray::setDomainBndry()  20  0.0006186  0.0006186  0.0006186  0.16%
MLCellLinOp::prepareForSolve()  6  0.0006093  0.0006093  0.0006093  0.15%
MultiFab::contains_nan()  10  0.0005907  0.0005907  0.0005907  0.15%
MLMG::prepareForSolve()  6  0.0005571  0.0005571  0.0005571  0.14%
MLCellLinOp::smooth()  720  0.0004971  0.0004971  0.0004971  0.12%
MLCellLinOp::compGrad()  6  0.0004899  0.0004899  0.0004899  0.12%
FabArrayBase::CPC::define()  244  0.0003995  0.0003995  0.0003995  0.10%
FabArrayBase::getCPC()  632  0.0003843  0.0003843  0.0003843  0.10%
Amr::InitAmr()  1  0.0003832  0.0003832  0.0003832  0.10%
FabArray::FillBoundary()  1766  0.0003725  0.0003725  0.0003725  0.09%
FabArrayBase::getFB()  1766  0.0002915  0.0002915  0.0002915  0.07%
main()  1  0.0002387  0.0002387  0.0002387  0.06%
Gravity::update_max_rhs()  6  0.0002253  0.0002253  0.0002253  0.06%
Castro::subcycle_advance_ctu()  5  0.0002163  0.0002163  0.0002163  0.05%
Gravity::solve_for_phi()  5  0.000216  0.000216  0.000216  0.05%
MLCellLinOp::apply()  500  0.0002031  0.0002031  0.0002031  0.05%
CGSolver::sxay()  690  0.0001822  0.0001822  0.0001822  0.05%
Castro::construct_new_gravity()  5  0.000159  0.000159  0.000159  0.04%
Amr::coarseTimeStep()  5  0.0001555  0.0001555  0.0001555  0.04%
Castro::create_source_corrector()  5  0.000155  0.000155  0.000155  0.04%
MLCellLinOp::defineBC()  6  0.0001468  0.0001468  0.0001468  0.04%
MLCGSolver::ParallelAllReduce  659  0.0001459  0.0001459  0.0001459  0.04%
Castro::construct_new_source()  25  0.0001378  0.0001378  0.0001378  0.03%
FillPatchIterator::Initialize  20  0.0001362  0.0001362  0.0001362  0.03%
FabArray::ParallelCopy()  380  0.0001349  0.0001349  0.0001349  0.03%
MultiFab::Copy()  6  0.0001346  0.0001346  0.0001346  0.03%
MultiFab::max()  6  0.0001335  0.0001335  0.0001335  0.03%
Castro::initialize_advance()  5  0.0001135  0.0001135  0.0001135  0.03%
MLMG::MLRhsNormInf()  6  0.0001048  0.0001048  0.0001048  0.03%
Amr::timeStep()  5  0.0001034  0.0001034  0.0001034  0.03%
MLLinOp::defineGrids()  6  0.0001021  0.0001021  0.0001021  0.03%
Castro::construct_old_source()  25  9.766e-05  9.766e-05  9.766e-05  0.02%
MLCellLinOp::correctionResidual()  216  9.326e-05  9.326e-05  9.326e-05  0.02%
Castro::initialize_do_advance()  5  8.282e-05  8.282e-05  8.282e-05  0.02%
MLMG::mgVcycle()  36  8.239e-05  8.239e-05  8.239e-05  0.02%
Castro::post_timestep()  5  8.055e-05  8.055e-05  8.055e-05  0.02%
Castro::advance()  5  7.88e-05  7.88e-05  7.88e-05  0.02%
AmrLevel::restart()  1  7.265e-05  7.265e-05  7.265e-05  0.02%
StateData::restartDoit()  4  6.246e-05  6.246e-05  6.246e-05  0.02%
MLMG:computeResOfCorrection()  180  5.896e-05  5.896e-05  5.896e-05  0.01%
FabArrayBase::FB::FB()  26  5.545e-05  5.545e-05  5.545e-05  0.01%
Castro::construct_old_gravity()  5  4.683e-05  4.683e-05  4.683e-05  0.01%
MLMG::actualBottomSolve()  36  4.272e-05  4.272e-05  4.272e-05  0.01%
Castro::clean_state()  30  3.861e-05  3.861e-05  3.861e-05  0.01%
MLMG::mgVcycle_down::0  36  3.81e-05  3.81e-05  3.81e-05  0.01%
MLMG::mgVcycle_down::4  36  3.581e-05  3.581e-05  3.581e-05  0.01%
MLMG::mgVcycle_down::1  36  3.52e-05  3.52e-05  3.52e-05  0.01%
MLMG::solve()  6  3.506e-05  3.506e-05  3.506e-05  0.01%
MLMG::mgVcycle_down::2  36  3.316e-05  3.316e-05  3.316e-05  0.01%
Castro::buildMetrics()  1  3.266e-05  3.266e-05  3.266e-05  0.01%
Castro::post_restart()  1  3.061e-05  3.061e-05  3.061e-05  0.01%
MLMG::mgVcycle_down::3  36  3.025e-05  3.025e-05  3.025e-05  0.01%
Gravity::actual_multilevel_solve()  1  2.956e-05  2.956e-05  2.956e-05  0.01%
Castro::swap_state_time_levels()  5  2.773e-05  2.773e-05  2.773e-05  0.01%
MLMG::mgVcycle_up::4  36  2.694e-05  2.694e-05  2.694e-05  0.01%
Castro::initMFs()  1  2.66e-05  2.66e-05  2.66e-05  0.01%
Amr::writeSmallPlotFile()  1  2.529e-05  2.529e-05  2.529e-05  0.01%
Castro::finalize_advance()  5  2.462e-05  2.462e-05  2.462e-05  0.01%
MLMG::mgVcycle_up::0  36  2.309e-05  2.309e-05  2.309e-05  0.01%
MLCellLinOp::solutionResidual()  42  2.308e-05  2.308e-05  2.308e-05  0.01%
MLMG::mgVcycle_up::3  36  2.221e-05  2.221e-05  2.221e-05  0.01%
MLMG::mgVcycle_up::2  36  2.2e-05  2.2e-05  2.2e-05  0.01%
MLLinOp::define()  6  2.001e-05  2.001e-05  2.001e-05  0.01%
MLMG::mgVcycle_up::1  36  1.99e-05  1.99e-05  1.99e-05  0.00%
Castro::finalize_do_advance()  5  1.804e-05  1.804e-05  1.804e-05  0.00%
makeSFC  30  1.798e-05  1.798e-05  1.798e-05  0.00%
Castro::do_new_sources()  5  1.737e-05  1.737e-05  1.737e-05  0.00%
Gravity::multilevel_solve_for_new_phi()  1  1.703e-05  1.703e-05  1.703e-05  0.00%
MLPoisson::define()  6  1.62e-05  1.62e-05  1.62e-05  0.00%
MLMG::mgVcycle_bottom  36  1.53e-05  1.53e-05  1.53e-05  0.00%
Gravity::solve_phi_with_mlmg()  6  1.482e-05  1.482e-05  1.482e-05  0.00%
MLMG::computeResidual()  36  1.442e-05  1.442e-05  1.442e-05  0.00%
MLLinOp::makeSubCommunicator()  6  1.417e-05  1.417e-05  1.417e-05  0.00%
FillPatchSingleLevel  20  1.316e-05  1.316e-05  1.316e-05  0.00%
DistributionMapping::Distribute()  31  1.228e-05  1.228e-05  1.228e-05  0.00%
MLLinOp::makeAgglomeratedDMap  6  9.937e-06  9.937e-06  9.937e-06  0.00%
Amr::initSubcycle()  1  9.221e-06  9.221e-06  9.221e-06  0.00%
Castro::do_old_sources()  5  8.78e-06  8.78e-06  8.78e-06  0.00%
Castro::apply_source_to_state()  10  5.854e-06  5.854e-06  5.854e-06  0.00%
Castro::check_for_nan()  10  5.745e-06  5.745e-06  5.745e-06  0.00%
MLPoisson::prepareForSolve()  6  4.759e-06  4.759e-06  4.759e-06  0.00%
Gravity::swapTimeLevels()  5  4.588e-06  4.588e-06  4.588e-06  0.00%
MLMG::getGradSolution()  6  3.119e-06  3.119e-06  3.119e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.033e-06  3.033e-06  3.033e-06  0.00%
MLMG::buildFineMask() 6 2.935e-06 2.935e-06 2.935e-06 0.00% Castro::computeNewDt() 5 2.879e-06 2.879e-06 2.879e-06 0.00% MLMG::computeMLResidual() 6 2.869e-06 2.869e-06 2.869e-06 0.00% Gravity::set_mass_offset() 6 2.432e-06 2.432e-06 2.432e-06 0.00% MLMG::MLResNormInf() 6 2.362e-06 2.362e-06 2.362e-06 0.00% Castro::FluxRegCrseInit 5 1.933e-06 1.933e-06 1.933e-06 0.00% Castro::retry_advance_ctu() 5 1.768e-06 1.768e-06 1.768e-06 0.00% Castro::FluxRegFineAdd() 5 1.34e-06 1.34e-06 1.34e-06 0.00% AmrLevel::AmrLevel() 1 1.219e-06 1.219e-06 1.219e-06 0.00% Amr::init() 1 1.142e-06 1.142e-06 1.142e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3989 0.3989 0.3989 100.00% Amr::coarseTimeStep() 5 0.2918 0.2918 0.2918 73.16% Amr::timeStep() 5 0.2904 0.2904 0.2904 72.80% Castro::advance() 5 0.2865 0.2865 0.2865 71.83% Castro::subcycle_advance_ctu() 5 0.2803 0.2803 0.2803 70.27% Castro::do_advance_ctu() 5 0.2801 0.2801 0.2801 70.21% Castro::construct_new_gravity() 5 0.1406 0.1406 0.1406 35.25% Gravity::solve_phi_with_mlmg() 6 0.1365 0.1365 0.1365 34.22% Gravity::solve_for_phi() 5 0.1331 0.1331 0.1331 33.36% Gravity::actual_solve_with_mlmg() 6 0.1311 0.1311 0.1311 32.87% MLMG::solve() 6 0.1192 0.1192 0.1192 29.88% MLMG::oneIter() 36 0.1123 0.1123 0.1123 28.16% MLMG::mgVcycle() 36 0.1116 0.1116 0.1116 27.98% Castro::construct_ctu_hydro_source() 5 0.09729 0.09729 0.09729 24.39% Amr::init() 1 0.08079 0.08079 0.08079 20.25% Amr::restart() 1 0.08079 0.08079 0.08079 20.25% MLCellLinOp::smooth() 720 0.05716 0.05716 0.05716 14.33% MLCellLinOp::applyBC() 1946 0.04073 0.04073 0.04073 10.21% AmrLevel::restart() 1 0.0386 0.0386 0.0386 9.68% StateData::restartDoit() 4 0.03852 0.03852 0.03852 9.66% 
VisMF::Read() 3 0.03841 0.03841 0.03841 9.63% MLMG::mgVcycle_bottom 36 0.03414 0.03414 0.03414 8.56% MLMG::actualBottomSolve() 36 0.03412 0.03412 0.03412 8.55% MLCGSolver::bicgstab 36 0.03377 0.03377 0.03377 8.47% MLPoisson::Fsmooth() 1440 0.02649 0.02649 0.02649 6.64% Castro::clean_state() 30 0.02631 0.02631 0.02631 6.60% Amr::writePlotFile() 1 0.02559 0.02559 0.02559 6.41% VisMF::Write(FabArray) 1 0.02405 0.02405 0.02405 6.03% FillPatchIterator::Initialize 20 0.01896 0.01896 0.01896 4.75% FillPatchSingleLevel 20 0.0182 0.0182 0.0182 4.56% StateDataPhysBCFunct::() 20 0.01623 0.01623 0.01623 4.07% MLCellLinOp::apply() 500 0.01541 0.01541 0.01541 3.86% MLMG::mgVcycle_down::0 36 0.01507 0.01507 0.01507 3.78% MLMG::mgVcycle_up::0 36 0.01282 0.01282 0.01282 3.21% Castro::initialize_do_advance() 5 0.01137 0.01137 0.01137 2.85% StateData::FillBoundary(geom) 160 0.01133 0.01133 0.01133 2.84% Castro::computeTemp() 30 0.01102 0.01102 0.01102 2.76% MLPoisson::define() 6 0.009595 0.009595 0.009595 2.41% Castro::normalize_species() 30 0.009342 0.009342 0.009342 2.34% MultiFab::Dot() 484 0.009195 0.009195 0.009195 2.31% MLCellLinOp::correctionResidual() 216 0.008982 0.008982 0.008982 2.25% Castro::do_new_sources() 5 0.007873 0.007873 0.007873 1.97% MLMG:computeResOfCorrection() 180 0.007755 0.007755 0.007755 1.94% MLMG::mgVcycle_down::1 36 0.007438 0.007438 0.007438 1.86% Gravity::get_new_grav_vector() 5 0.00736 0.00736 0.00736 1.85% MLMG::mgVcycle_down::2 36 0.007218 0.007218 0.007218 1.81% Castro::construct_old_gravity() 5 0.00719 0.00719 0.00719 1.80% Gravity::get_old_grav_vector() 5 0.007143 0.007143 0.007143 1.79% FabArray::FillBoundary() 1766 0.006837 0.006837 0.006837 1.71% MLMG::mgVcycle_down::3 36 0.006816 0.006816 0.006816 1.71% MLCellLinOp::defineAuxData() 6 0.006742 0.006742 0.006742 1.69% FabArray::setVal() 537 0.00662 0.00662 0.00662 1.66% MLMG::mgVcycle_down::4 36 0.006531 0.006531 0.006531 1.64% FillBoundary_nowait() 1766 0.006464 0.006464 0.006464 1.62% 
FabArray::ParallelCopy() 380 0.006314 0.006314 0.006314 1.58% FabArray::ParallelCopy_nowait() 380 0.006179 0.006179 0.006179 1.55% Castro::initialize_advance() 5 0.00614 0.00614 0.00614 1.54% CGSolver::sxay() 690 0.006104 0.006104 0.006104 1.53% MultiFab::LinComb() 690 0.005922 0.005922 0.005922 1.48% Castro::do_old_sources() 5 0.005864 0.005864 0.005864 1.47% MLCGSolver::ParallelAllReduce 659 0.005542 0.005542 0.005542 1.39% MLMG::mgVcycle_up::2 36 0.005533 0.005533 0.005533 1.39% MLMG::mgVcycle_up::1 36 0.005475 0.005475 0.005475 1.37% MLMG::addInterpCorrection() 180 0.005353 0.005353 0.005353 1.34% Gravity::fill_multipole_BCs() 6 0.005277 0.005277 0.005277 1.32% Castro::expand_state() 5 0.005259 0.005259 0.005259 1.32% MLMG::mgVcycle_up::3 36 0.005257 0.005257 0.005257 1.32% MLMG::mgVcycle_up::4 36 0.005216 0.005216 0.005216 1.31% amrex::average_down 180 0.005078 0.005078 0.005078 1.27% MLPoisson::Fapply() 500 0.004918 0.004918 0.004918 1.23% Castro::enforce_min_density() 30 0.004733 0.004733 0.004733 1.19% Castro::post_restart() 1 0.003812 0.003812 0.003812 0.96% Castro::post_timestep() 5 0.003774 0.003774 0.003774 0.95% Gravity::multilevel_solve_for_new_phi() 1 0.003689 0.003689 0.003689 0.92% Gravity::actual_multilevel_solve() 1 0.003672 0.003672 0.003672 0.92% MLCellLinOp::solutionResidual() 42 0.003172 0.003172 0.003172 0.80% MultiFab::Xpay() 258 0.002799 0.002799 0.002799 0.70% MLMG::prepareForSolve() 6 0.002681 0.002681 0.002681 0.67% MLCellLinOp::defineBC() 6 0.002663 0.002663 0.002663 0.67% MLMG::computeResidual() 36 0.002637 0.002637 0.002637 0.66% Castro::reset_internal_energy(MultiFab) 30 0.002592 0.002592 0.002592 0.65% BndryData::define() 6 0.002516 0.002516 0.002516 0.63% Castro::estTimeStep() 10 0.00224 0.00224 0.00224 0.56% Castro::construct_new_source() 25 0.00184 0.00184 0.00184 0.46% Castro::construct_new_gravity_source() 5 0.001702 0.001702 0.001702 0.43% Castro::construct_old_source() 25 0.001547 0.001547 0.001547 0.39% 
Castro::construct_old_gravity_source() 5 0.001449 0.001449 0.001449 0.36% Castro::computeNewDt() 5 0.001278 0.001278 0.001278 0.32% Castro::enforce_speed_limit() 30 0.001169 0.001169 0.001169 0.29% Castro::apply_source_to_state() 10 0.0009247 0.0009247 0.0009247 0.23% MultiFab::Saxpy() 10 0.0009189 0.0009189 0.0009189 0.23% Castro::reset_internal_energy(Fab) 240 0.0008754 0.0008754 0.0008754 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008515 0.0008515 0.0008515 0.21% MLMG::ResNormInf() 42 0.0008495 0.0008495 0.0008495 0.21% MLCellLinOp::setLevelBC() 6 0.0007935 0.0007935 0.0007935 0.20% FabArrayBase::getCPC() 632 0.0007838 0.0007838 0.0007838 0.20% MLMG::getGradSolution() 6 0.0007571 0.0007571 0.0007571 0.19% MLCellLinOp::compGrad() 6 0.000754 0.000754 0.000754 0.19% FabArray::mult() 22 0.0006502 0.0006502 0.0006502 0.16% FabArray::setDomainBndry() 20 0.0006186 0.0006186 0.0006186 0.16% MLPoisson::prepareForSolve() 6 0.0006141 0.0006141 0.0006141 0.15% MLCellLinOp::prepareForSolve() 6 0.0006093 0.0006093 0.0006093 0.15% Castro::check_for_nan() 10 0.0005964 0.0005964 0.0005964 0.15% MultiFab::contains_nan() 10 0.0005907 0.0005907 0.0005907 0.15% MLMG::computeMLResidual() 6 0.0005525 0.0005525 0.0005525 0.14% Gravity::update_max_rhs() 6 0.0004357 0.0004357 0.0004357 0.11% FabArrayBase::CPC::define() 244 0.0003995 0.0003995 0.0003995 0.10% Amr::InitAmr() 1 0.0003924 0.0003924 0.0003924 0.10% FabArrayBase::getFB() 1766 0.000347 0.000347 0.000347 0.09% Gravity::swapTimeLevels() 5 0.0002305 0.0002305 0.0002305 0.06% MLLinOp::define() 6 0.0001743 0.0001743 0.0001743 0.04% Castro::buildMetrics() 1 0.0001586 0.0001586 0.0001586 0.04% Castro::create_source_corrector() 5 0.000155 0.000155 0.000155 0.04% MLLinOp::defineGrids() 6 0.0001543 0.0001543 0.0001543 0.04% MultiFab::Copy() 6 0.0001346 0.0001346 0.0001346 0.03% MultiFab::max() 6 0.0001335 0.0001335 0.0001335 0.03% MLMG::MLResNormInf() 6 0.0001332 0.0001332 0.0001332 0.03% MLMG::MLRhsNormInf() 6 0.0001048 
0.0001048 0.0001048 0.03% FabArrayBase::FB::FB() 26 5.545e-05 5.545e-05 5.545e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 3.798e-05 3.798e-05 3.798e-05 0.01% makeSFC 30 2.805e-05 2.805e-05 2.805e-05 0.01% Castro::finalize_advance() 5 2.789e-05 2.789e-05 2.789e-05 0.01% Castro::swap_state_time_levels() 5 2.773e-05 2.773e-05 2.773e-05 0.01% Castro::initMFs() 1 2.66e-05 2.66e-05 2.66e-05 0.01% Amr::writeSmallPlotFile() 1 2.529e-05 2.529e-05 2.529e-05 0.01% Castro::finalize_do_advance() 5 1.804e-05 1.804e-05 1.804e-05 0.00% MLLinOp::makeSubCommunicator() 6 1.417e-05 1.417e-05 1.417e-05 0.00% DistributionMapping::Distribute() 31 1.228e-05 1.228e-05 1.228e-05 0.00% Amr::initSubcycle() 1 9.221e-06 9.221e-06 9.221e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.244e-06 5.244e-06 5.244e-06 0.00% MLMG::buildFineMask() 6 2.935e-06 2.935e-06 2.935e-06 0.00% Gravity::set_mass_offset() 6 2.432e-06 2.432e-06 2.432e-06 0.00% Castro::FluxRegCrseInit 5 1.933e-06 1.933e-06 1.933e-06 0.00% Castro::retry_advance_ctu() 5 1.768e-06 1.768e-06 1.768e-06 0.00% Castro::FluxRegFineAdd() 5 1.34e-06 1.34e-06 1.34e-06 0.00% AmrLevel::AmrLevel() 1 1.219e-06 1.219e-06 1.219e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.07-12-g8e40952af9ab) finalized