Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.11-12-g81e0635ce832) initialized
Starting run at 09:51:48 UTC on 2022-11-08.
Successfully read inputs file ...
Castro git describe: 22.11-3-g71af3f92d
AMReX git describe: 22.11-12-g81e0635ce
Microphysics git describe: 22.11-8-g0d57093f
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
Level 0 8 grids 262144 cells 100 % of domain
smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32
CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.052611905 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.03028535 seconds
[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.048559124
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05
[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.052538306
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05
[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.057089811
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05
[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.06086007
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05
[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.059703629
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05
CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.049181891 secs.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.078020224
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.066481473
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.051604101
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.055614623
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.062152873
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.075431862 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.030266129 seconds
Ending run at 09:51:49 UTC on 2022-11-08.
Run time = 0.882912471
Run time without initialization = 0.748140549
Average number of zones advanced per microsecond: 3.504
Average number of zones advanced per microsecond per rank: 3.504
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.8829 ... 0.8829 ...
0.8829 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.2135 0.2135 0.2135 24.18% VisMF::Write(FabArray) 11 0.2034 0.2034 0.2034 23.03% MLCellLinOp::applyBC() 4433 0.07902 0.07902 0.07902 8.95% MLPoisson::Fsmooth() 3280 0.06358 0.06358 0.06358 7.20% Amr::checkPoint() 3 0.0316 0.0316 0.0316 3.58% MLCGSolver::bicgstab 82 0.02344 0.02344 0.02344 2.65% StateData::FillBoundary(geom) 328 0.02225 0.02225 0.02225 2.52% MultiFab::Dot() 1114 0.02186 0.02186 0.02186 2.48% Castro::normalize_species() 62 0.01587 0.01587 0.01587 1.80% Castro::computeTemp() 63 0.01425 0.01425 0.01425 1.61% MultiFab::LinComb() 1586 0.01403 0.01403 0.01403 1.59% FillBoundary_nowait() 4023 0.01403 0.01403 0.01403 1.59% FabArray::setVal() 1144 0.01392 0.01392 0.01392 1.58% StateDataPhysBCFunct::() 41 0.01293 0.01293 0.01293 1.46% FabArray::ParallelCopy_nowait() 861 0.01289 0.01289 0.01289 1.46% MLPoisson::Fapply() 1142 0.01163 0.01163 0.01163 1.32% MLCellLinOp::defineAuxData() 11 0.01127 0.01127 0.01127 1.28% Gravity::fill_multipole_BCs() 11 0.008494 0.008494 0.008494 0.96% Castro::enforce_min_density() 62 0.008298 0.008298 0.008298 0.94% MLMG::addInterpCorrection() 410 0.007585 0.007585 0.007585 0.86% amrex::average_down 410 0.006652 0.006652 0.006652 0.75% MultiFab::Xpay() 585 0.006432 0.006432 0.006432 0.73% Castro::do_advance_ctu() 10 0.00477 0.00477 0.00477 0.54% Castro::reset_internal_energy(MultiFab) 63 0.004647 0.004647 0.004647 0.53% Castro::estTimeStep() 21 0.004562 0.004562 0.004562 0.52% BndryData::define() 11 0.003746 0.003746 0.003746 0.42% Castro::construct_new_gravity_source() 10 0.003318 0.003318 0.003318 0.38% Amr::writePlotFile() 2 0.002826 0.002826 0.002826 0.32% Castro::construct_old_gravity_source() 10 0.002646 0.002646 0.002646 0.30% MLMG::ResNormInf() 93 
0.002038 0.002038 0.002038 0.23% Gravity::get_new_grav_vector() 11 0.00192 0.00192 0.00192 0.22% MultiFab::Saxpy() 20 0.001798 0.001798 0.001798 0.20% Castro::expand_state() 10 0.001725 0.001725 0.001725 0.20% Gravity::get_old_grav_vector() 10 0.001724 0.001724 0.001724 0.20% MultiFab::Add() 82 0.001623 0.001623 0.001623 0.18% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.0016 0.0016 0.0016 0.18% Castro::reset_internal_energy(Fab) 504 0.001578 0.001578 0.001578 0.18% MLCellLinOp::setLevelBC() 11 0.00151 0.00151 0.00151 0.17% Gravity::actual_solve_with_mlmg() 11 0.00142 0.00142 0.00142 0.16% FabArray::setDomainBndry() 41 0.001307 0.001307 0.001307 0.15% FabArray::mult() 43 0.001305 0.001305 0.001305 0.15% Castro::initData() 1 0.001288 0.001288 0.001288 0.15% MLMG::prepareForSolve() 11 0.001216 0.001216 0.001216 0.14% Castro::enforce_speed_limit() 62 0.001206 0.001206 0.001206 0.14% MultiFab::contains_nan() 20 0.001174 0.001174 0.001174 0.13% MLCellLinOp::prepareForSolve() 11 0.001137 0.001137 0.001137 0.13% MLCellLinOp::smooth() 1640 0.001111 0.001111 0.001111 0.13% MLCellLinOp::compGrad() 11 0.0009151 0.0009151 0.0009151 0.10% FabArray::FillBoundary() 4023 0.0008385 0.0008385 0.0008385 0.09% FabArrayBase::getCPC() 1323 0.0007431 0.0007431 0.0007431 0.08% FabArrayBase::CPC::define() 454 0.000666 0.000666 0.000666 0.08% Castro::finalize_advance() 10 0.0006516 0.0006516 0.0006516 0.07% FabArrayBase::getFB() 4023 0.0005789 0.0005789 0.0005789 0.07% Amr::InitAmr() 1 0.0004964 0.0004964 0.0004964 0.06% MLCellLinOp::apply() 1142 0.000451 0.000451 0.000451 0.05% Gravity::solve_for_phi() 10 0.0004325 0.0004325 0.0004325 0.05% Gravity::update_max_rhs() 11 0.0004235 0.0004235 0.0004235 0.05% Amr::coarseTimeStep() 10 0.0003637 0.0003637 0.0003637 0.04% CGSolver::sxay() 1586 0.0003448 0.0003448 0.0003448 0.04% MultiFab::Copy() 11 0.0003149 0.0003149 0.0003149 0.04% main() 1 0.0002968 0.0002968 0.0002968 0.03% MLCGSolver::ParallelAllReduce 1514 0.0002848 0.0002848 0.0002848 
0.03% FillPatchIterator::Initialize 41 0.0002831 0.0002831 0.0002831 0.03% MLCellLinOp::defineBC() 11 0.0002705 0.0002705 0.0002705 0.03% FabArray::ParallelCopy() 861 0.0002628 0.0002628 0.0002628 0.03% MultiFab::max() 11 0.000251 0.000251 0.000251 0.03% MLMG::MLRhsNormInf() 11 0.0002193 0.0002193 0.0002193 0.02% MLMG::mgVcycle() 82 0.0002149 0.0002149 0.0002149 0.02% MLCellLinOp::correctionResidual() 492 0.0002081 0.0002081 0.0002081 0.02% Castro::construct_new_gravity() 10 0.0002005 0.0002005 0.0002005 0.02% MLMG:computeResOfCorrection() 410 0.0001659 0.0001659 0.0001659 0.02% Amr::timeStep() 10 0.0001637 0.0001637 0.0001637 0.02% Castro::subcycle_advance_ctu() 10 0.000153 0.000153 0.000153 0.02% MLLinOp::defineGrids() 11 0.0001503 0.0001503 0.0001503 0.02% StateData::checkPoint() 12 0.0001339 0.0001339 0.0001339 0.02% MLMG::mgVcycle_down::0 82 0.0001182 0.0001182 0.0001182 0.01% MLMG::mgVcycle_down::1 82 0.0001022 0.0001022 0.0001022 0.01% MLMG::mgVcycle_down::2 82 9.615e-05 9.615e-05 9.615e-05 0.01% MLMG::mgVcycle_down::3 82 8.946e-05 8.946e-05 8.946e-05 0.01% Castro::Castro() 1 8.861e-05 8.861e-05 8.861e-05 0.01% MLMG::mgVcycle_down::4 82 8.723e-05 8.723e-05 8.723e-05 0.01% FabArrayBase::FB::FB() 56 8.631e-05 8.631e-05 8.631e-05 0.01% Castro::clean_state() 62 8.597e-05 8.597e-05 8.597e-05 0.01% Castro::initialize_advance() 10 8.414e-05 8.414e-05 8.414e-05 0.01% MLMG::actualBottomSolve() 82 8.009e-05 8.009e-05 8.009e-05 0.01% AmrLevel::checkPoint() 3 7.74e-05 7.74e-05 7.74e-05 0.01% MLMG::solve() 11 7.396e-05 7.396e-05 7.396e-05 0.01% MLMG::mgVcycle_up::4 82 7.004e-05 7.004e-05 7.004e-05 0.01% MLMG::oneIter() 82 6.429e-05 6.429e-05 6.429e-05 0.01% Castro::initialize_do_advance() 10 6.375e-05 6.375e-05 6.375e-05 0.01% MLMG::mgVcycle_up::0 82 6.085e-05 6.085e-05 6.085e-05 0.01% MLMG::mgVcycle_up::3 82 5.855e-05 5.855e-05 5.855e-05 0.01% MLMG::mgVcycle_up::1 82 5.829e-05 5.829e-05 5.829e-05 0.01% Castro::advance() 10 5.755e-05 5.755e-05 5.755e-05 0.01% 
MLMG::mgVcycle_up::2 82 5.648e-05 5.648e-05 5.648e-05 0.01% MLCellLinOp::solutionResidual() 93 5.211e-05 5.211e-05 5.211e-05 0.01% StateData::define() 4 4.407e-05 4.407e-05 4.407e-05 0.00% Castro::swap_state_time_levels() 10 4.087e-05 4.087e-05 4.087e-05 0.00% MLMG::computeResidual() 82 4.086e-05 4.086e-05 4.086e-05 0.00% Castro::finalize_do_advance() 10 3.508e-05 3.508e-05 3.508e-05 0.00% Castro::enforce_consistent_e() 1 3.38e-05 3.38e-05 3.38e-05 0.00% MLMG::mgVcycle_bottom 82 3.199e-05 3.199e-05 3.199e-05 0.00% MLPoisson::define() 11 3.127e-05 3.127e-05 3.127e-05 0.00% Gravity::actual_multilevel_solve() 1 3.028e-05 3.028e-05 3.028e-05 0.00% FillPatchSingleLevel 41 2.753e-05 2.753e-05 2.753e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.687e-05 2.687e-05 2.687e-05 0.00% makeSFC 55 2.628e-05 2.628e-05 2.628e-05 0.00% Castro::initMFs() 1 2.595e-05 2.595e-05 2.595e-05 0.00% Amr::writeSmallPlotFile() 1 2.575e-05 2.575e-05 2.575e-05 0.00% Castro::buildMetrics() 1 2.306e-05 2.306e-05 2.306e-05 0.00% MLLinOp::define() 11 2.298e-05 2.298e-05 2.298e-05 0.00% Amr::defBaseLevel() 1 2.18e-05 2.18e-05 2.18e-05 0.00% Amr::FinalizeInit() 1 2.031e-05 2.031e-05 2.031e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.814e-05 1.814e-05 1.814e-05 0.00% Castro::construct_old_source() 50 1.731e-05 1.731e-05 1.731e-05 0.00% Castro::construct_new_source() 50 1.658e-05 1.658e-05 1.658e-05 0.00% Castro::do_new_sources() 10 1.626e-05 1.626e-05 1.626e-05 0.00% Castro::do_old_sources() 10 1.58e-05 1.58e-05 1.58e-05 0.00% Castro::post_timestep() 10 1.519e-05 1.519e-05 1.519e-05 0.00% DistributionMapping::Distribute() 56 1.456e-05 1.456e-05 1.456e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.327e-05 1.327e-05 1.327e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.295e-05 1.295e-05 1.295e-05 0.00% Castro::check_for_nan() 20 1.098e-05 1.098e-05 1.098e-05 0.00% Castro::apply_source_to_state() 20 1.064e-05 1.064e-05 1.064e-05 0.00% Castro::construct_old_gravity() 10 1.032e-05 1.032e-05 1.032e-05 0.00% 
MLMG::computeMLResidual() 11 1.011e-05 1.011e-05 1.011e-05 0.00% Amr::initSubcycle() 1 9.382e-06 9.382e-06 9.382e-06 0.00% Gravity::swapTimeLevels() 10 9.217e-06 9.217e-06 9.217e-06 0.00% MLPoisson::prepareForSolve() 11 7.984e-06 7.984e-06 7.984e-06 0.00% MLMG::getGradSolution() 11 6.645e-06 6.645e-06 6.645e-06 0.00% Castro::computeNewDt() 9 6.454e-06 6.454e-06 6.454e-06 0.00% AmrLevel::checkPointPost() 3 6.045e-06 6.045e-06 6.045e-06 0.00% Amr::InitializeInit() 1 5.244e-06 5.244e-06 5.244e-06 0.00% Castro::retry_advance_ctu() 10 3.985e-06 3.985e-06 3.985e-06 0.00% Castro::post_init() 1 3.884e-06 3.884e-06 3.884e-06 0.00% MLMG::MLResNormInf() 11 3.727e-06 3.727e-06 3.727e-06 0.00% Castro::create_source_corrector() 10 3.601e-06 3.601e-06 3.601e-06 0.00% Gravity::set_mass_offset() 11 3.572e-06 3.572e-06 3.572e-06 0.00% Castro::FluxRegCrseInit 10 3.227e-06 3.227e-06 3.227e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.985e-06 2.985e-06 2.985e-06 0.00% Castro::computeInitialDt() 2 2.826e-06 2.826e-06 2.826e-06 0.00% AmrLevel::checkPointPre() 3 2.709e-06 2.709e-06 2.709e-06 0.00% Amr::init() 1 2.503e-06 2.503e-06 2.503e-06 0.00% Castro::FluxRegFineAdd() 10 1.959e-06 1.959e-06 1.959e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.907e-06 1.907e-06 1.907e-06 0.00% Castro::post_regrid() 1 1.314e-06 1.314e-06 1.314e-06 0.00% Amr::initialInit() 1 1.043e-06 1.043e-06 1.043e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. 
Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8829 0.8829 0.8829 100.00% Amr::coarseTimeStep() 10 0.7177 0.7177 0.7177 81.28% Amr::timeStep() 10 0.5899 0.5899 0.5899 66.81% Castro::advance() 10 0.5822 0.5822 0.5822 65.94% Castro::subcycle_advance_ctu() 10 0.5695 0.5695 0.5695 64.50% Castro::do_advance_ctu() 10 0.5693 0.5693 0.5693 64.48% Gravity::solve_phi_with_mlmg() 11 0.3098 0.3098 0.3098 35.09% Gravity::actual_solve_with_mlmg() 11 0.3011 0.3011 0.3011 34.10% Castro::construct_new_gravity() 10 0.2814 0.2814 0.2814 31.87% MLMG::solve() 11 0.279 0.279 0.279 31.60% Gravity::solve_for_phi() 10 0.2667 0.2667 0.2667 30.20% MLMG::oneIter() 82 0.2643 0.2643 0.2643 29.93% MLMG::mgVcycle() 82 0.2626 0.2626 0.2626 29.74% Castro::construct_ctu_hydro_source() 10 0.2135 0.2135 0.2135 24.18% VisMF::Write(FabArray) 11 0.2034 0.2034 0.2034 23.03% Amr::checkPoint() 3 0.1774 0.1774 0.1774 20.09% AmrLevel::checkPoint() 3 0.1457 0.1457 0.1457 16.51% StateData::checkPoint() 12 0.1457 0.1457 0.1457 16.50% MLCellLinOp::smooth() 1640 0.1351 0.1351 0.1351 15.30% Amr::init() 1 0.1341 0.1341 0.1341 15.19% MLCellLinOp::applyBC() 4433 0.09455 0.09455 0.09455 10.71% MLMG::mgVcycle_bottom 82 0.08018 0.08018 0.08018 9.08% MLMG::actualBottomSolve() 82 0.08015 0.08015 0.08015 9.08% MLCGSolver::bicgstab 82 0.07936 0.07936 0.07936 8.99% MLPoisson::Fsmooth() 3280 0.06358 0.06358 0.06358 7.20% Amr::writePlotFile() 2 0.06067 0.06067 0.06067 6.87% Amr::initialInit() 1 0.0511 0.0511 0.0511 5.79% Amr::FinalizeInit() 1 0.04678 0.04678 0.04678 5.30% Castro::post_init() 1 0.04543 0.04543 0.04543 5.15% Castro::clean_state() 62 0.04504 0.04504 0.04504 5.10% Gravity::multilevel_solve_for_new_phi() 1 0.04361 0.04361 0.04361 4.94% Gravity::actual_multilevel_solve() 1 0.04359 0.04359 0.04359 4.94% FillPatchIterator::Initialize 41 0.04074 0.04074 0.04074 4.61% FillPatchSingleLevel 41 0.03915 0.03915 0.03915 4.43% MLCellLinOp::apply() 1142 0.03578 
0.03578 0.03578 4.05% StateDataPhysBCFunct::() 41 0.03518 0.03518 0.03518 3.98% MLMG::mgVcycle_down::0 82 0.03503 0.03503 0.03503 3.97% MLMG::mgVcycle_up::0 82 0.03012 0.03012 0.03012 3.41% StateData::FillBoundary(geom) 328 0.02225 0.02225 0.02225 2.52% MultiFab::Dot() 1114 0.02186 0.02186 0.02186 2.48% MLCellLinOp::correctionResidual() 492 0.02096 0.02096 0.02096 2.37% Castro::computeTemp() 63 0.02048 0.02048 0.02048 2.32% Castro::initialize_do_advance() 10 0.01952 0.01952 0.01952 2.21% MLMG:computeResOfCorrection() 410 0.01813 0.01813 0.01813 2.05% MLPoisson::define() 11 0.01776 0.01776 0.01776 2.01% MLMG::mgVcycle_down::1 82 0.01753 0.01753 0.01753 1.99% MLMG::mgVcycle_down::2 82 0.01701 0.01701 0.01701 1.93% Gravity::get_new_grav_vector() 11 0.01621 0.01621 0.01621 1.84% MLMG::mgVcycle_down::3 82 0.0162 0.0162 0.0162 1.83% Castro::normalize_species() 62 0.01587 0.01587 0.01587 1.80% FabArray::FillBoundary() 4023 0.01553 0.01553 0.01553 1.76% MLMG::mgVcycle_down::4 82 0.01535 0.01535 0.01535 1.74% FillBoundary_nowait() 4023 0.01469 0.01469 0.01469 1.66% CGSolver::sxay() 1586 0.01437 0.01437 0.01437 1.63% Castro::construct_old_gravity() 10 0.01424 0.01424 0.01424 1.61% Gravity::get_old_grav_vector() 10 0.01423 0.01423 0.01423 1.61% MultiFab::LinComb() 1586 0.01403 0.01403 0.01403 1.59% FabArray::ParallelCopy() 861 0.01395 0.01395 0.01395 1.58% FabArray::setVal() 1144 0.01392 0.01392 0.01392 1.58% FabArray::ParallelCopy_nowait() 861 0.01368 0.01368 0.01368 1.55% MLMG::mgVcycle_up::2 82 0.01332 0.01332 0.01332 1.51% MLCGSolver::ParallelAllReduce 1514 0.01304 0.01304 0.01304 1.48% MLMG::mgVcycle_up::1 82 0.01297 0.01297 0.01297 1.47% MLMG::addInterpCorrection() 410 0.01257 0.01257 0.01257 1.42% MLCellLinOp::defineAuxData() 11 0.01256 0.01256 0.01256 1.42% MLMG::mgVcycle_up::3 82 0.01241 0.01241 0.01241 1.41% MLMG::mgVcycle_up::4 82 0.01224 0.01224 0.01224 1.39% Castro::initialize_advance() 10 0.01205 0.01205 0.01205 1.37% Castro::expand_state() 10 0.01173 0.01173 
0.01173 1.33% amrex::average_down 410 0.01167 0.01167 0.01167 1.32% Castro::do_new_sources() 10 0.01165 0.01165 0.01165 1.32% MLPoisson::Fapply() 1142 0.01163 0.01163 0.01163 1.32% Castro::do_old_sources() 10 0.01028 0.01028 0.01028 1.16% Gravity::fill_multipole_BCs() 11 0.008494 0.008494 0.008494 0.96% Castro::enforce_min_density() 62 0.008298 0.008298 0.008298 0.94% Castro::post_timestep() 10 0.007478 0.007478 0.007478 0.85% MLCellLinOp::solutionResidual() 93 0.007079 0.007079 0.007079 0.80% MultiFab::Xpay() 585 0.006432 0.006432 0.006432 0.73% Castro::reset_internal_energy(MultiFab) 63 0.006225 0.006225 0.006225 0.71% MLMG::computeResidual() 82 0.006117 0.006117 0.006117 0.69% MLMG::prepareForSolve() 11 0.005271 0.005271 0.005271 0.60% MLCellLinOp::defineBC() 11 0.004934 0.004934 0.004934 0.56% BndryData::define() 11 0.004663 0.004663 0.004663 0.53% Castro::estTimeStep() 21 0.004562 0.004562 0.004562 0.52% Amr::InitializeInit() 1 0.004314 0.004314 0.004314 0.49% Amr::defBaseLevel() 1 0.004309 0.004309 0.004309 0.49% Castro::initData() 1 0.00379 0.00379 0.00379 0.43% Castro::construct_new_source() 50 0.003335 0.003335 0.003335 0.38% Castro::construct_new_gravity_source() 10 0.003318 0.003318 0.003318 0.38% Castro::construct_old_source() 50 0.002663 0.002663 0.002663 0.30% Castro::construct_old_gravity_source() 10 0.002646 0.002646 0.002646 0.30% MLMG::ResNormInf() 93 0.002038 0.002038 0.002038 0.23% Castro::computeNewDt() 9 0.002005 0.002005 0.002005 0.23% Castro::apply_source_to_state() 20 0.001809 0.001809 0.001809 0.20% MultiFab::Saxpy() 20 0.001798 0.001798 0.001798 0.20% MultiFab::Add() 82 0.001623 0.001623 0.001623 0.18% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.0016 0.0016 0.0016 0.18% Castro::reset_internal_energy(Fab) 504 0.001578 0.001578 0.001578 0.18% MLCellLinOp::setLevelBC() 11 0.00151 0.00151 0.00151 0.17% MLMG::getGradSolution() 11 0.001413 0.001413 0.001413 0.16% FabArrayBase::getCPC() 1323 0.001409 0.001409 0.001409 0.16% 
MLCellLinOp::compGrad() 11 0.001406 0.001406 0.001406 0.16% FabArray::setDomainBndry() 41 0.001307 0.001307 0.001307 0.15% FabArray::mult() 43 0.001305 0.001305 0.001305 0.15% Castro::enforce_speed_limit() 62 0.001206 0.001206 0.001206 0.14% Castro::check_for_nan() 20 0.001185 0.001185 0.001185 0.13% MultiFab::contains_nan() 20 0.001174 0.001174 0.001174 0.13% Castro::post_regrid() 1 0.001164 0.001164 0.001164 0.13% MLPoisson::prepareForSolve() 11 0.001145 0.001145 0.001145 0.13% MLCellLinOp::prepareForSolve() 11 0.001137 0.001137 0.001137 0.13% MLMG::computeMLResidual() 11 0.001013 0.001013 0.001013 0.11% Castro::computeInitialDt() 2 0.0008866 0.0008866 0.0008866 0.10% Gravity::update_max_rhs() 11 0.0008169 0.0008169 0.0008169 0.09% FabArrayBase::CPC::define() 454 0.000666 0.000666 0.000666 0.08% FabArrayBase::getFB() 4023 0.0006652 0.0006652 0.0006652 0.08% Castro::finalize_advance() 10 0.0006568 0.0006568 0.0006568 0.07% Amr::InitAmr() 1 0.0005058 0.0005058 0.0005058 0.06% Castro::Castro() 1 0.000436 0.000436 0.000436 0.05% Gravity::swapTimeLevels() 10 0.0004339 0.0004339 0.0004339 0.05% MultiFab::Copy() 11 0.0003149 0.0003149 0.0003149 0.04% MLMG::MLResNormInf() 11 0.0002778 0.0002778 0.0002778 0.03% MultiFab::max() 11 0.000251 0.000251 0.000251 0.03% MLLinOp::define() 11 0.0002279 0.0002279 0.0002279 0.03% MLMG::MLRhsNormInf() 11 0.0002193 0.0002193 0.0002193 0.02% MLLinOp::defineGrids() 11 0.0002049 0.0002049 0.0002049 0.02% Castro::buildMetrics() 1 0.0001591 0.0001591 0.0001591 0.02% FabArrayBase::FB::FB() 56 8.631e-05 8.631e-05 8.631e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.734e-05 5.734e-05 5.734e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.272e-05 5.272e-05 5.272e-05 0.01% StateData::define() 4 4.407e-05 4.407e-05 4.407e-05 0.00% Castro::swap_state_time_levels() 10 4.087e-05 4.087e-05 4.087e-05 0.00% makeSFC 55 3.977e-05 3.977e-05 3.977e-05 0.00% Castro::finalize_do_advance() 10 3.508e-05 3.508e-05 3.508e-05 0.00% Castro::enforce_consistent_e() 1 3.38e-05 
3.38e-05 3.38e-05 0.00%
Castro::initMFs() 1 2.595e-05 2.595e-05 2.595e-05 0.00%
Amr::writeSmallPlotFile() 1 2.575e-05 2.575e-05 2.575e-05 0.00%
DistributionMapping::Distribute() 56 1.456e-05 1.456e-05 1.456e-05 0.00%
Amr::initSubcycle() 1 9.382e-06 9.382e-06 9.382e-06 0.00%
AmrLevel::checkPointPost() 3 6.045e-06 6.045e-06 6.045e-06 0.00%
DistributionMapping::SFCProcessorMapDoIt() 1 4.053e-06 4.053e-06 4.053e-06 0.00%
Castro::retry_advance_ctu() 10 3.985e-06 3.985e-06 3.985e-06 0.00%
Castro::create_source_corrector() 10 3.601e-06 3.601e-06 3.601e-06 0.00%
Gravity::set_mass_offset() 11 3.572e-06 3.572e-06 3.572e-06 0.00%
Castro::FluxRegCrseInit 10 3.227e-06 3.227e-06 3.227e-06 0.00%
AmrLevel::checkPointPre() 3 2.709e-06 2.709e-06 2.709e-06 0.00%
Castro::FluxRegFineAdd() 10 1.959e-06 1.959e-06 1.959e-06 0.00%
MLLinOp::makeSubCommunicator() 11 1.907e-06 1.907e-06 1.907e-06 0.00%
--------------------------------------------------------------------------------------------
Unused ParmParse Variables:
[TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
[TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]
Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2545
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.11-12-g81e0635ce832) finalized
Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.11-12-g81e0635ce832) initialized
Starting run at 09:51:50 UTC on 2022-11-08.
Successfully read inputs file ...
Castro git describe: 22.11-3-g71af3f92d
AMReX git describe: 22.11-12-g81e0635ce
Microphysics git describe: 22.11-8-g0d57093f
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.462693098
Restart time = 0.047903376 seconds.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.054192493
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.059159985
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.057244393
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.072900323
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.077049637
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.031816403 seconds
Ending run at 09:51:50 UTC on 2022-11-08.
Run time = 0.401269204
Run time without initialization = 0.352793376
Average number of zones advanced per microsecond: 3.715
Average number of zones advanced per microsecond per rank: 3.715
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.4013 ... 0.4013 ... 0.4013
--------------------------------------------------------------------------------------------
Name NCalls Excl. Min Excl. Avg Excl.
Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1242 0.1242 0.1242 30.95% VisMF::Read() 3 0.04003 0.04003 0.04003 9.98% MLCellLinOp::applyBC() 1946 0.03431 0.03431 0.03431 8.55% VisMF::Write(FabArray) 1 0.03019 0.03019 0.03019 7.52% MLPoisson::Fsmooth() 1440 0.02669 0.02669 0.02669 6.65% StateData::FillBoundary(geom) 160 0.01097 0.01097 0.01097 2.73% Castro::normalize_species() 30 0.01014 0.01014 0.01014 2.53% MLCGSolver::bicgstab 36 0.01004 0.01004 0.01004 2.50% MultiFab::Dot() 484 0.009298 0.009298 0.009298 2.32% Castro::computeTemp() 30 0.008937 0.008937 0.008937 2.23% FabArray::setVal() 537 0.006588 0.006588 0.006588 1.64% FillBoundary_nowait() 1766 0.006215 0.006215 0.006215 1.55% MLCellLinOp::defineAuxData() 6 0.005981 0.005981 0.005981 1.49% MultiFab::LinComb() 690 0.005968 0.005968 0.005968 1.49% FabArray::ParallelCopy_nowait() 380 0.005845 0.005845 0.005845 1.46% StateDataPhysBCFunct::() 20 0.005321 0.005321 0.005321 1.33% MLPoisson::Fapply() 500 0.004988 0.004988 0.004988 1.24% Castro::enforce_min_density() 30 0.0046 0.0046 0.0046 1.15% Gravity::fill_multipole_BCs() 6 0.004599 0.004599 0.004599 1.15% Amr::restart() 1 0.003613 0.003613 0.003613 0.90% MLMG::addInterpCorrection() 180 0.003272 0.003272 0.003272 0.82% amrex::average_down 180 0.002873 0.002873 0.002873 0.72% MultiFab::Xpay() 258 0.002801 0.002801 0.002801 0.70% Castro::estTimeStep() 10 0.002649 0.002649 0.002649 0.66% Castro::do_advance_ctu() 5 0.002438 0.002438 0.002438 0.61% BndryData::define() 6 0.002021 0.002021 0.002021 0.50% Castro::construct_new_gravity_source() 5 0.001833 0.001833 0.001833 0.46% Castro::reset_internal_energy(MultiFab) 30 0.001769 0.001769 0.001769 0.44% Amr::writePlotFile() 1 0.001733 0.001733 0.001733 0.43% Castro::construct_old_gravity_source() 5 0.001428 0.001428 0.001428 0.36% Gravity::get_old_grav_vector() 5 0.001023 0.001023 0.001023 0.25% Gravity::get_new_grav_vector() 
5 0.0009508 0.0009508 0.0009508 0.24% MultiFab::Saxpy() 10 0.000915 0.000915 0.000915 0.23% MLMG::ResNormInf() 42 0.0009015 0.0009015 0.0009015 0.22% Castro::expand_state() 5 0.0008644 0.0008644 0.0008644 0.22% Castro::reset_internal_energy(Fab) 240 0.0008519 0.0008519 0.0008519 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008501 0.0008501 0.0008501 0.21% MLCellLinOp::setLevelBC() 6 0.0007896 0.0007896 0.0007896 0.20% Gravity::actual_solve_with_mlmg() 6 0.0007799 0.0007799 0.0007799 0.19% MultiFab::Add() 36 0.000703 0.000703 0.000703 0.18% MLMG::prepareForSolve() 6 0.0006503 0.0006503 0.0006503 0.16% FabArray::mult() 22 0.000646 0.000646 0.000646 0.16% FabArray::setDomainBndry() 20 0.0006263 0.0006263 0.0006263 0.16% MLCellLinOp::prepareForSolve() 6 0.0006145 0.0006145 0.0006145 0.15% MultiFab::contains_nan() 10 0.0005819 0.0005819 0.0005819 0.15% MLCellLinOp::smooth() 720 0.0005104 0.0005104 0.0005104 0.13% MLCellLinOp::compGrad() 6 0.0004816 0.0004816 0.0004816 0.12% Castro::enforce_speed_limit() 30 0.0004186 0.0004186 0.0004186 0.10% FabArray::FillBoundary() 1766 0.0004005 0.0004005 0.0004005 0.10% Amr::InitAmr() 1 0.0003963 0.0003963 0.0003963 0.10% FabArrayBase::CPC::define() 244 0.0003904 0.0003904 0.0003904 0.10% FabArrayBase::getCPC() 632 0.0003625 0.0003625 0.0003625 0.09% Castro::finalize_advance() 5 0.0003263 0.0003263 0.0003263 0.08% FabArrayBase::getFB() 1766 0.0002566 0.0002566 0.0002566 0.06% Gravity::update_max_rhs() 6 0.0002493 0.0002493 0.0002493 0.06% main() 1 0.0002462 0.0002462 0.0002462 0.06% Gravity::solve_for_phi() 5 0.0002132 0.0002132 0.0002132 0.05% MLCellLinOp::apply() 500 0.0001987 0.0001987 0.0001987 0.05% Castro::subcycle_advance_ctu() 5 0.0001789 0.0001789 0.0001789 0.04% Amr::coarseTimeStep() 5 0.0001751 0.0001751 0.0001751 0.04% Castro::construct_new_gravity() 5 0.0001734 0.0001734 0.0001734 0.04% CGSolver::sxay() 690 0.0001716 0.0001716 0.0001716 0.04% MultiFab::Copy() 6 0.0001713 0.0001713 0.0001713 0.04% 
MLCellLinOp::defineBC()  6  0.0001445  0.0001445  0.0001445  0.04%
MultiFab::max()  6  0.0001341  0.0001341  0.0001341  0.03%
FillPatchIterator::Initialize  20  0.0001329  0.0001329  0.0001329  0.03%
MLCGSolver::ParallelAllReduce  659  0.0001256  0.0001256  0.0001256  0.03%
Castro::construct_new_source()  25  0.0001243  0.0001243  0.0001243  0.03%
FabArray::ParallelCopy()  380  0.0001235  0.0001235  0.0001235  0.03%
MLMG::MLRhsNormInf()  6  0.0001131  0.0001131  0.0001131  0.03%
Castro::advance()  5  0.0001045  0.0001045  0.0001045  0.03%
MLCellLinOp::correctionResidual()  216  9.878e-05  9.878e-05  9.878e-05  0.02%
MLMG::mgVcycle()  36  8.801e-05  8.801e-05  8.801e-05  0.02%
Amr::timeStep()  5  8.569e-05  8.569e-05  8.569e-05  0.02%
AmrLevel::restart()  1  7.957e-05  7.957e-05  7.957e-05  0.02%
Castro::post_timestep()  5  7.499e-05  7.499e-05  7.499e-05  0.02%
StateData::restartDoit()  4  7.153e-05  7.153e-05  7.153e-05  0.02%
MLLinOp::defineGrids()  6  6.959e-05  6.959e-05  6.959e-05  0.02%
MLMG:computeResOfCorrection()  180  6.905e-05  6.905e-05  6.905e-05  0.02%
Castro::initialize_do_advance()  5  6.191e-05  6.191e-05  6.191e-05  0.02%
FabArrayBase::FB::FB()  26  5.951e-05  5.951e-05  5.951e-05  0.01%
MLMG::mgVcycle_down::0  36  5.164e-05  5.164e-05  5.164e-05  0.01%
MLMG::mgVcycle_down::1  36  4.609e-05  4.609e-05  4.609e-05  0.01%
MLMG::mgVcycle_down::2  36  4.222e-05  4.222e-05  4.222e-05  0.01%
Castro::clean_state()  30  4.109e-05  4.109e-05  4.109e-05  0.01%
MLMG::mgVcycle_down::4  36  4.041e-05  4.041e-05  4.041e-05  0.01%
Castro::initialize_advance()  5  3.995e-05  3.995e-05  3.995e-05  0.01%
Castro::construct_old_source()  25  3.852e-05  3.852e-05  3.852e-05  0.01%
MLMG::mgVcycle_down::3  36  3.795e-05  3.795e-05  3.795e-05  0.01%
MLMG::actualBottomSolve()  36  3.76e-05  3.76e-05  3.76e-05  0.01%
Castro::create_source_corrector()  5  3.348e-05  3.348e-05  3.348e-05  0.01%
MLMG::mgVcycle_up::4  36  3.262e-05  3.262e-05  3.262e-05  0.01%
MLMG::solve()  6  3.237e-05  3.237e-05  3.237e-05  0.01%
Castro::buildMetrics()  1  3.065e-05  3.065e-05  3.065e-05  0.01%
Castro::finalize_do_advance()  5  2.973e-05  2.973e-05  2.973e-05  0.01%
Castro::post_restart()  1  2.958e-05  2.958e-05  2.958e-05  0.01%
Gravity::actual_multilevel_solve()  1  2.947e-05  2.947e-05  2.947e-05  0.01%
MLMG::oneIter()  36  2.826e-05  2.826e-05  2.826e-05  0.01%
MLMG::mgVcycle_up::0  36  2.803e-05  2.803e-05  2.803e-05  0.01%
Amr::writeSmallPlotFile()  1  2.797e-05  2.797e-05  2.797e-05  0.01%
Castro::swap_state_time_levels()  5  2.753e-05  2.753e-05  2.753e-05  0.01%
Castro::do_old_sources()  5  2.7e-05  2.7e-05  2.7e-05  0.01%
MLMG::mgVcycle_up::3  36  2.677e-05  2.677e-05  2.677e-05  0.01%
Castro::initMFs()  1  2.647e-05  2.647e-05  2.647e-05  0.01%
MLMG::mgVcycle_up::2  36  2.555e-05  2.555e-05  2.555e-05  0.01%
MLMG::mgVcycle_up::1  36  2.464e-05  2.464e-05  2.464e-05  0.01%
MLCellLinOp::solutionResidual()  42  2.246e-05  2.246e-05  2.246e-05  0.01%
MLPoisson::define()  6  2.218e-05  2.218e-05  2.218e-05  0.01%
Castro::construct_old_gravity()  5  1.974e-05  1.974e-05  1.974e-05  0.00%
MLLinOp::define()  6  1.862e-05  1.862e-05  1.862e-05  0.00%
MLMG::computeResidual()  36  1.782e-05  1.782e-05  1.782e-05  0.00%
Gravity::multilevel_solve_for_new_phi()  1  1.778e-05  1.778e-05  1.778e-05  0.00%
makeSFC  30  1.522e-05  1.522e-05  1.522e-05  0.00%
MLMG::mgVcycle_bottom  36  1.502e-05  1.502e-05  1.502e-05  0.00%
FillPatchSingleLevel  20  1.376e-05  1.376e-05  1.376e-05  0.00%
Gravity::solve_phi_with_mlmg()  6  1.295e-05  1.295e-05  1.295e-05  0.00%
Castro::do_new_sources()  5  9.315e-06  9.315e-06  9.315e-06  0.00%
Amr::initSubcycle()  1  8.446e-06  8.446e-06  8.446e-06  0.00%
DistributionMapping::Distribute()  31  8.303e-06  8.303e-06  8.303e-06  0.00%
MLLinOp::makeAgglomeratedDMap  6  6.93e-06  6.93e-06  6.93e-06  0.00%
Castro::check_for_nan()  10  6.29e-06  6.29e-06  6.29e-06  0.00%
Castro::apply_source_to_state()  10  5.699e-06  5.699e-06  5.699e-06  0.00%
MLMG::computeMLResidual()  6  5.181e-06  5.181e-06  5.181e-06  0.00%
Gravity::swapTimeLevels()  5  4.339e-06  4.339e-06  4.339e-06  0.00%
MLPoisson::prepareForSolve()  6  4.311e-06  4.311e-06  4.311e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.523e-06  3.523e-06  3.523e-06  0.00%
MLMG::getGradSolution()  6  3.287e-06  3.287e-06  3.287e-06  0.00%
Castro::computeNewDt()  5  2.895e-06  2.895e-06  2.895e-06  0.00%
Gravity::set_mass_offset()  6  2.234e-06  2.234e-06  2.234e-06  0.00%
MLMG::MLResNormInf()  6  2.192e-06  2.192e-06  2.192e-06  0.00%
Castro::retry_advance_ctu()  5  1.791e-06  1.791e-06  1.791e-06  0.00%
Castro::FluxRegCrseInit  5  1.677e-06  1.677e-06  1.677e-06  0.00%
AmrLevel::AmrLevel()  1  1.34e-06  1.34e-06  1.34e-06  0.00%
Castro::FluxRegFineAdd()  5  1.185e-06  1.185e-06  1.185e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.147e-06  1.147e-06  1.147e-06  0.00%
Amr::init()  1  1.147e-06  1.147e-06  1.147e-06  0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.4013  0.4013  0.4013  100.00%
Amr::coarseTimeStep()  5  0.3207  0.3207  0.3207  79.92%
Amr::timeStep()  5  0.3191  0.3191  0.3191  79.52%
Castro::advance()  5  0.3137  0.3137  0.3137  78.18%
Castro::subcycle_advance_ctu()  5  0.3076  0.3076  0.3076  76.64%
Castro::do_advance_ctu()  5  0.3074  0.3074  0.3074  76.60%
Castro::construct_new_gravity()  5  0.1408  0.1408  0.1408  35.09%
Gravity::solve_phi_with_mlmg()  6  0.137  0.137  0.137  34.14%
Gravity::solve_for_phi()  5  0.1335  0.1335  0.1335  33.26%
Gravity::actual_solve_with_mlmg()  6  0.1323  0.1323  0.1323  32.97%
Castro::construct_ctu_hydro_source()  5  0.1242  0.1242  0.1242  30.95%
MLMG::solve()  6  0.1205  0.1205  0.1205  30.02%
MLMG::oneIter()  36  0.1134  0.1134  0.1134  28.25%
MLMG::mgVcycle()  36  0.1127  0.1127  0.1127  28.07%
MLCellLinOp::smooth()  720  0.05777  0.05777  0.05777  14.40%
Amr::init()  1  0.04796  0.04796  0.04796  11.95%
Amr::restart()  1  0.04796  0.04796  0.04796  11.95%
MLCellLinOp::applyBC()  1946  0.04125  0.04125  0.04125  10.28%
AmrLevel::restart()  1  0.04024  0.04024  0.04024  10.03%
StateData::restartDoit()  4  0.04016  0.04016  0.04016  10.01%
VisMF::Read()  3  0.04003  0.04003  0.04003  9.98%
MLMG::mgVcycle_bottom  36  0.03434  0.03434  0.03434  8.56%
MLMG::actualBottomSolve()  36  0.03432  0.03432  0.03432  8.55%
MLCGSolver::bicgstab  36  0.03398  0.03398  0.03398  8.47%
Amr::writePlotFile()  1  0.03192  0.03192  0.03192  7.96%
VisMF::Write(FabArray)  1  0.03019  0.03019  0.03019  7.52%
Castro::clean_state()  30  0.02676  0.02676  0.02676  6.67%
MLPoisson::Fsmooth()  1440  0.02669  0.02669  0.02669  6.65%
FillPatchIterator::Initialize  20  0.01905  0.01905  0.01905  4.75%
FillPatchSingleLevel  20  0.01829  0.01829  0.01829  4.56%
StateDataPhysBCFunct::()  20  0.01629  0.01629  0.01629  4.06%
MLCellLinOp::apply()  500  0.01559  0.01559  0.01559  3.89%
MLMG::mgVcycle_down::0  36  0.01515  0.01515  0.01515  3.78%
MLMG::mgVcycle_up::0  36  0.01298  0.01298  0.01298  3.23%
Castro::computeTemp()  30  0.01156  0.01156  0.01156  2.88%
StateData::FillBoundary(geom)  160  0.01097  0.01097  0.01097  2.73%
Castro::initialize_do_advance()  5  0.01034  0.01034  0.01034  2.58%
Castro::normalize_species()  30  0.01014  0.01014  0.01014  2.53%
MLPoisson::define()  6  0.009516  0.009516  0.009516  2.37%
MultiFab::Dot()  484  0.009298  0.009298  0.009298  2.32%
MLCellLinOp::correctionResidual()  216  0.009111  0.009111  0.009111  2.27%
Castro::do_new_sources()  5  0.008033  0.008033  0.008033  2.00%
MLMG:computeResOfCorrection()  180  0.007873  0.007873  0.007873  1.96%
MLMG::mgVcycle_down::1  36  0.007538  0.007538  0.007538  1.88%
MLMG::mgVcycle_down::2  36  0.007278  0.007278  0.007278  1.81%
Gravity::get_new_grav_vector()  5  0.00719  0.00719  0.00719  1.79%
Castro::construct_old_gravity()  5  0.007138  0.007138  0.007138  1.78%
Gravity::get_old_grav_vector()  5  0.007118  0.007118  0.007118  1.77%
MLMG::mgVcycle_down::3  36  0.006933  0.006933  0.006933  1.73%
FabArray::FillBoundary()  1766  0.006932  0.006932  0.006932  1.73%
MLCellLinOp::defineAuxData()  6  0.006696  0.006696  0.006696  1.67%
MLMG::mgVcycle_down::4  36  0.006612  0.006612  0.006612  1.65%
FabArray::setVal()  537  0.006588  0.006588  0.006588  1.64%
FillBoundary_nowait()  1766  0.006531  0.006531  0.006531  1.63%
FabArray::ParallelCopy()  380  0.006344  0.006344  0.006344  1.58%
Castro::do_old_sources()  5  0.006332  0.006332  0.006332  1.58%
FabArray::ParallelCopy_nowait()  380  0.00622  0.00622  0.00622  1.55%
CGSolver::sxay()  690  0.00614  0.00614  0.00614  1.53%
MultiFab::LinComb()  690  0.005968  0.005968  0.005968  1.49%
Castro::initialize_advance()  5  0.005715  0.005715  0.005715  1.42%
Castro::expand_state()  5  0.005684  0.005684  0.005684  1.42%
MLMG::mgVcycle_up::2  36  0.005614  0.005614  0.005614  1.40%
MLCGSolver::ParallelAllReduce  659  0.005578  0.005578  0.005578  1.39%
MLMG::mgVcycle_up::1  36  0.005545  0.005545  0.005545  1.38%
MLMG::addInterpCorrection()  180  0.005419  0.005419  0.005419  1.35%
MLMG::mgVcycle_up::3  36  0.005303  0.005303  0.005303  1.32%
Castro::post_timestep()  5  0.005286  0.005286  0.005286  1.32%
MLMG::mgVcycle_up::4  36  0.005275  0.005275  0.005275  1.31%
amrex::average_down  180  0.005086  0.005086  0.005086  1.27%
MLPoisson::Fapply()  500  0.004988  0.004988  0.004988  1.24%
Castro::enforce_min_density()  30  0.0046  0.0046  0.0046  1.15%
Gravity::fill_multipole_BCs()  6  0.004599  0.004599  0.004599  1.15%
Castro::post_restart()  1  0.003926  0.003926  0.003926  0.98%
Gravity::multilevel_solve_for_new_phi()  1  0.003805  0.003805  0.003805  0.95%
Gravity::actual_multilevel_solve()  1  0.003787  0.003787  0.003787  0.94%
MLCellLinOp::solutionResidual()  42  0.003185  0.003185  0.003185  0.79%
MLMG::prepareForSolve()  6  0.002814  0.002814  0.002814  0.70%
MultiFab::Xpay()  258  0.002801  0.002801  0.002801  0.70%
MLCellLinOp::defineBC()  6  0.002679  0.002679  0.002679  0.67%
Castro::estTimeStep()  10  0.002649  0.002649  0.002649  0.66%
MLMG::computeResidual()  36  0.002641  0.002641  0.002641  0.66%
Castro::reset_internal_energy(MultiFab)  30  0.002621  0.002621  0.002621  0.65%
BndryData::define()  6  0.002535  0.002535  0.002535  0.63%
Castro::construct_new_source()  25  0.001957  0.001957  0.001957  0.49%
Castro::construct_new_gravity_source()  5  0.001833  0.001833  0.001833  0.46%
Castro::construct_old_source()  25  0.001467  0.001467  0.001467  0.37%
Castro::computeNewDt()  5  0.001446  0.001446  0.001446  0.36%
Castro::construct_old_gravity_source()  5  0.001428  0.001428  0.001428  0.36%
Castro::apply_source_to_state()  10  0.0009207  0.0009207  0.0009207  0.23%
MultiFab::Saxpy()  10  0.000915  0.000915  0.000915  0.23%
MLMG::ResNormInf()  42  0.0009015  0.0009015  0.0009015  0.22%
Castro::reset_internal_energy(Fab)  240  0.0008519  0.0008519  0.0008519  0.21%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0008501  0.0008501  0.0008501  0.21%
MLCellLinOp::setLevelBC()  6  0.0007896  0.0007896  0.0007896  0.20%
FabArrayBase::getCPC()  632  0.000753  0.000753  0.000753  0.19%
MLMG::getGradSolution()  6  0.0007527  0.0007527  0.0007527  0.19%
MLCellLinOp::compGrad()  6  0.0007494  0.0007494  0.0007494  0.19%
MultiFab::Add()  36  0.000703  0.000703  0.000703  0.18%
FabArray::mult()  22  0.000646  0.000646  0.000646  0.16%
FabArray::setDomainBndry()  20  0.0006263  0.0006263  0.0006263  0.16%
MLPoisson::prepareForSolve()  6  0.0006188  0.0006188  0.0006188  0.15%
MLCellLinOp::prepareForSolve()  6  0.0006145  0.0006145  0.0006145  0.15%
Castro::check_for_nan()  10  0.0005882  0.0005882  0.0005882  0.15%
MultiFab::contains_nan()  10  0.0005819  0.0005819  0.0005819  0.15%
MLMG::computeMLResidual()  6  0.0005667  0.0005667  0.0005667  0.14%
Gravity::update_max_rhs()  6  0.0004617  0.0004617  0.0004617  0.12%
Castro::enforce_speed_limit()  30  0.0004186  0.0004186  0.0004186  0.10%
Amr::InitAmr()  1  0.0004047  0.0004047  0.0004047  0.10%
FabArrayBase::CPC::define()  244  0.0003904  0.0003904  0.0003904  0.10%
Castro::finalize_advance()  5  0.0003292  0.0003292  0.0003292  0.08%
FabArrayBase::getFB()  1766  0.0003162  0.0003162  0.0003162  0.08%
Gravity::swapTimeLevels()  5  0.0002238  0.0002238  0.0002238  0.06%
MultiFab::Copy()  6  0.0001713  0.0001713  0.0001713  0.04%
MLMG::MLResNormInf()  6  0.0001543  0.0001543  0.0001543  0.04%
Castro::buildMetrics()  1  0.0001493  0.0001493  0.0001493  0.04%
MultiFab::max()  6  0.0001341  0.0001341  0.0001341  0.03%
MLLinOp::define()  6  0.0001186  0.0001186  0.0001186  0.03%
MLMG::MLRhsNormInf()  6  0.0001131  0.0001131  0.0001131  0.03%
MLLinOp::defineGrids()  6  0.0001  0.0001  0.0001  0.02%
FabArrayBase::FB::FB()  26  5.951e-05  5.951e-05  5.951e-05  0.01%
Castro::create_source_corrector()  5  3.348e-05  3.348e-05  3.348e-05  0.01%
Castro::finalize_do_advance()  5  2.973e-05  2.973e-05  2.973e-05  0.01%
MLLinOp::makeAgglomeratedDMap  6  2.928e-05  2.928e-05  2.928e-05  0.01%
Amr::writeSmallPlotFile()  1  2.797e-05  2.797e-05  2.797e-05  0.01%
Castro::swap_state_time_levels()  5  2.753e-05  2.753e-05  2.753e-05  0.01%
Castro::initMFs()  1  2.647e-05  2.647e-05  2.647e-05  0.01%
makeSFC  30  2.236e-05  2.236e-05  2.236e-05  0.01%
Amr::initSubcycle()  1  8.446e-06  8.446e-06  8.446e-06  0.00%
DistributionMapping::Distribute()  31  8.303e-06  8.303e-06  8.303e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  4.692e-06  4.692e-06  4.692e-06  0.00%
Gravity::set_mass_offset()  6  2.234e-06  2.234e-06  2.234e-06  0.00%
Castro::retry_advance_ctu()  5  1.791e-06  1.791e-06  1.791e-06  0.00%
Castro::FluxRegCrseInit  5  1.677e-06  1.677e-06  1.677e-06  0.00%
AmrLevel::AmrLevel()  1  1.34e-06  1.34e-06  1.34e-06  0.00%
Castro::FluxRegFineAdd()  5  1.185e-06  1.185e-06  1.185e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.147e-06  1.147e-06  1.147e-06  0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2545
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.11-12-g81e0635ce832) finalized