Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.10-17-g56b6402d2389) initialized Starting run at 08:37:17 UTC on 2022-10-17. Successfully read inputs file ... Castro git describe: 22.09-1-g65b273ad0 AMReX git describe: 22.10-17-g56b6402d2 Microphysics git describe: 22.10-4-g1dbcf8c2 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.053401329 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.03061614 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.048952758 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.051480171 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.050290966 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.050835335 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.073993361 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.049353987 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.07202545 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.061544314 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.060703557 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.04999279 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.056398908 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.066385831 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.030973466 seconds Ending run at 08:37:18 UTC on 2022-10-17. Run time = 0.860809431 Run time without initialization = 0.723542133 Average number of zones advanced per microsecond: 3.623 Average number of zones advanced per microsecond per rank: 3.623 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.8608 ... 0.8608 ... 0.8608 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2226 0.2226 0.2226 25.86% Castro::construct_ctu_hydro_source() 10 0.1863 0.1863 0.1863 21.64% MLCellLinOp::applyBC() 4433 0.08151 0.08151 0.08151 9.47% MLPoisson::Fsmooth() 3280 0.06554 0.06554 0.06554 7.61% MLCGSolver::bicgstab 82 0.02446 0.02446 0.02446 2.84% StateData::FillBoundary(geom) 328 0.0238 0.0238 0.0238 2.77% MultiFab::Dot() 1114 0.02287 0.02287 0.02287 2.66% Castro::computeTemp() 63 0.01516 0.01516 0.01516 1.76% MultiFab::LinComb() 1586 0.01477 0.01477 0.01477 1.72% FabArray::setVal() 1144 0.01454 0.01454 0.01454 1.69% FillBoundary_nowait() 4023 0.01426 0.01426 0.01426 1.66% FabArray::ParallelCopy_nowait() 861 0.01346 0.01346 0.01346 1.56% StateDataPhysBCFunct::() 41 0.01284 0.01284 0.01284 1.49% Castro::normalize_species() 62 0.01253 0.01253 0.01253 1.46% MLPoisson::Fapply() 1142 0.01205 0.01205 0.01205 1.40% MLCellLinOp::defineAuxData() 11 0.01158 0.01158 0.01158 1.34% Castro::enforce_min_density() 62 0.01061 0.01061 0.01061 1.23% Gravity::fill_multipole_BCs() 11 0.008663 0.008663 0.008663 1.01% MLMG::addInterpCorrection() 410 0.007835 0.007835 0.007835 0.91% amrex::average_down 410 0.006971 0.006971 0.006971 0.81% MultiFab::Xpay() 585 0.006751 0.006751 0.006751 0.78% Amr::checkPoint() 3 0.005308 0.005308 0.005308 0.62% Castro::do_advance_ctu() 10 0.005166 0.005166 0.005166 0.60% Castro::estTimeStep() 21 0.004904 0.004904 0.004904 0.57% Castro::reset_internal_energy(MultiFab) 63 0.004125 0.004125 0.004125 0.48% BndryData::define() 11 0.003947 0.003947 0.003947 0.46% Castro::construct_new_gravity_source() 10 0.003255 0.003255 0.003255 0.38% Amr::writePlotFile() 2 0.002858 0.002858 0.002858 0.33% Castro::construct_old_gravity_source() 10 0.002553 0.002553 0.002553 0.30% MLMG::ResNormInf() 93 0.002106 0.002106 0.002106 0.24% Gravity::get_new_grav_vector() 11 0.001936 0.001936 0.001936 0.22% MultiFab::Saxpy() 20 0.001816 0.001816 0.001816 0.21% Gravity::get_old_grav_vector() 10 0.001745 0.001745 0.001745 0.20% Castro::expand_state() 10 0.001735 0.001735 0.001735 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001685 0.001685 0.001685 0.20% MultiFab::Add() 82 0.001682 0.001682 0.001682 0.20% MLCellLinOp::setLevelBC() 11 0.001559 0.001559 0.001559 0.18% Castro::reset_internal_energy(Fab) 504 0.00155 0.00155 0.00155 0.18% Castro::enforce_speed_limit() 62 0.001472 0.001472 0.001472 0.17% Gravity::actual_solve_with_mlmg() 11 0.001409 0.001409 0.001409 0.16% FabArray::mult() 43 0.001342 0.001342 0.001342 0.16% FabArray::setDomainBndry() 41 0.001338 0.001338 0.001338 0.16% MLMG::prepareForSolve() 11 0.001206 0.001206 0.001206 0.14% MultiFab::contains_nan() 20 0.001197 0.001197 0.001197 0.14% MLCellLinOp::prepareForSolve() 11 0.001192 0.001192 0.001192 0.14% Castro::initData() 1 0.001179 0.001179 0.001179 0.14% MLCellLinOp::smooth() 1640 0.001035 0.001035 0.001035 0.12% FabArray::FillBoundary() 4023 0.0009658 0.0009658 0.0009658 0.11% MLCellLinOp::compGrad() 11 0.0009409 0.0009409 0.0009409 0.11% FabArrayBase::getCPC() 1323 0.0007644 0.0007644 0.0007644 0.09% FabArrayBase::CPC::define() 454 0.0006698 0.0006698 0.0006698 0.08% FabArrayBase::getFB() 4023 0.0006046 0.0006046 0.0006046 0.07% Amr::InitAmr() 1 0.0004777 0.0004777 0.0004777 0.06% Gravity::solve_for_phi() 10 0.0004452 0.0004452 0.0004452 0.05% MLCellLinOp::apply() 1142 0.0004153 0.0004153 0.0004153 0.05% Gravity::update_max_rhs() 11 0.00041 0.00041 0.00041 0.05% CGSolver::sxay() 1586 0.0003469 0.0003469 0.0003469 0.04% MultiFab::Copy() 11 0.0003346 0.0003346 0.0003346 0.04% Amr::coarseTimeStep() 10 0.00032 0.00032 0.00032 0.04% FillPatchIterator::Initialize 41 0.0002889 0.0002889 0.0002889 0.03% MLCGSolver::ParallelAllReduce 1514 0.0002826 0.0002826 0.0002826 0.03% FabArray::ParallelCopy() 861 0.0002747 0.0002747 0.0002747 0.03% MLCellLinOp::defineBC() 11 0.0002746 0.0002746 0.0002746 0.03% main() 1 0.0002599 0.0002599 0.0002599 0.03% MultiFab::max() 11 0.0002594 0.0002594 0.0002594 0.03% Castro::subcycle_advance_ctu() 10 0.0002511 0.0002511 0.0002511 0.03% MLCellLinOp::correctionResidual() 492 0.0002341 0.0002341 0.0002341 0.03% MLMG::MLRhsNormInf() 11 0.0002209 0.0002209 0.0002209 0.03% MLMG::mgVcycle() 82 0.0002204 0.0002204 0.0002204 0.03% Castro::construct_new_gravity() 10 0.0002068 0.0002068 0.0002068 0.02% MLLinOp::defineGrids() 11 0.0001618 0.0001618 0.0001618 0.02% Amr::timeStep() 10 0.0001536 0.0001536 0.0001536 0.02% MLMG:computeResOfCorrection() 410 0.0001387 0.0001387 0.0001387 0.02% StateData::checkPoint() 12 0.0001249 0.0001249 0.0001249 0.01% MLMG::mgVcycle_down::0 82 0.0001067 0.0001067 0.0001067 0.01% Castro::initialize_advance() 10 0.0001003 0.0001003 0.0001003 0.01% MLMG::mgVcycle_down::1 82 9.536e-05 9.536e-05 9.536e-05 0.01% Castro::Castro() 1 9.422e-05 9.422e-05 9.422e-05 0.01% MLMG::mgVcycle_down::2 82 8.996e-05 8.996e-05 8.996e-05 0.01% MLMG::mgVcycle_down::3 82 8.404e-05 8.404e-05 8.404e-05 0.01% FabArrayBase::FB::FB() 56 8.394e-05 8.394e-05 8.394e-05 0.01% MLMG::mgVcycle_down::4 82 8.258e-05 8.258e-05 8.258e-05 0.01% Castro::clean_state() 62 7.968e-05 7.968e-05 7.968e-05 0.01% MLMG::actualBottomSolve() 82 7.917e-05 7.917e-05 7.917e-05 0.01% AmrLevel::checkPoint() 3 7.157e-05 7.157e-05 7.157e-05 0.01% MLMG::mgVcycle_up::4 82 7.026e-05 7.026e-05 7.026e-05 0.01% MLMG::solve() 11 6.658e-05 6.658e-05 6.658e-05 0.01% Castro::finalize_advance() 10 6.41e-05 6.41e-05 6.41e-05 0.01% Castro::initialize_do_advance() 10 6.312e-05 6.312e-05 6.312e-05 0.01% MLMG::mgVcycle_up::1 82 5.763e-05 5.763e-05 5.763e-05 0.01% MLMG::mgVcycle_up::3 82 5.732e-05 5.732e-05 5.732e-05 0.01% MLMG::oneIter() 82 5.67e-05 5.67e-05 5.67e-05 0.01% MLMG::mgVcycle_up::2 82 5.659e-05 5.659e-05 5.659e-05 0.01% MLMG::mgVcycle_up::0 82 5.541e-05 5.541e-05 5.541e-05 0.01% MLCellLinOp::solutionResidual() 93 5.324e-05 5.324e-05 5.324e-05 0.01% Castro::swap_state_time_levels() 10 4.353e-05 4.353e-05 4.353e-05 0.01% StateData::define() 4 4.248e-05 4.248e-05 4.248e-05 0.00% Castro::advance() 10 4.213e-05 4.213e-05 4.213e-05 0.00% MLMG::computeResidual() 82 3.88e-05 3.88e-05 3.88e-05 0.00% Castro::enforce_consistent_e() 1 3.585e-05 3.585e-05 3.585e-05 0.00% MLPoisson::define() 11 3.329e-05 3.329e-05 3.329e-05 0.00% Castro::finalize_do_advance() 10 3.275e-05 3.275e-05 3.275e-05 0.00% MLMG::mgVcycle_bottom 82 3.244e-05 3.244e-05 3.244e-05 0.00% Gravity::actual_multilevel_solve() 1 3.188e-05 3.188e-05 3.188e-05 0.00% FillPatchSingleLevel 41 2.905e-05 2.905e-05 2.905e-05 0.00% Castro::initMFs() 1 2.629e-05 2.629e-05 2.629e-05 0.00% makeSFC 55 2.599e-05 2.599e-05 2.599e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.593e-05 2.593e-05 2.593e-05 0.00% Amr::writeSmallPlotFile() 1 2.47e-05 2.47e-05 2.47e-05 0.00% Amr::defBaseLevel() 1 2.174e-05 2.174e-05 2.174e-05 0.00% Castro::buildMetrics() 1 2.167e-05 2.167e-05 2.167e-05 0.00% MLLinOp::define() 11 2.152e-05 2.152e-05 2.152e-05 0.00% Amr::FinalizeInit() 1 2.009e-05 2.009e-05 2.009e-05 0.00% Castro::construct_new_source() 50 1.839e-05 1.839e-05 1.839e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.758e-05 1.758e-05 1.758e-05 0.00% Castro::do_new_sources() 10 1.694e-05 1.694e-05 1.694e-05 0.00% Castro::construct_old_source() 50 1.672e-05 1.672e-05 1.672e-05 0.00% Castro::do_old_sources() 10 1.634e-05 1.634e-05 1.634e-05 0.00% DistributionMapping::Distribute() 56 1.409e-05 1.409e-05 1.409e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.324e-05 1.324e-05 1.324e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.272e-05 1.272e-05 1.272e-05 0.00% Castro::check_for_nan() 20 1.201e-05 1.201e-05 1.201e-05 0.00% Castro::apply_source_to_state() 20 1.025e-05 1.025e-05 1.025e-05 0.00% Castro::construct_old_gravity() 10 1.012e-05 1.012e-05 1.012e-05 0.00% MLMG::computeMLResidual() 11 9.886e-06 9.886e-06 9.886e-06 0.00% Castro::post_timestep() 10 8.923e-06 8.923e-06 8.923e-06 0.00% Amr::initSubcycle() 1 8.922e-06 8.922e-06 8.922e-06 0.00% Gravity::swapTimeLevels() 10 8.77e-06 8.77e-06 8.77e-06 0.00% MLPoisson::prepareForSolve() 11 8.544e-06 8.544e-06 8.544e-06 0.00% Castro::computeNewDt() 9 6.57e-06 6.57e-06 6.57e-06 0.00% MLMG::getGradSolution() 11 6.502e-06 6.502e-06 6.502e-06 0.00% Amr::InitializeInit() 1 5.331e-06 5.331e-06 5.331e-06 0.00% AmrLevel::checkPointPost() 3 5.043e-06 5.043e-06 5.043e-06 0.00% Castro::create_source_corrector() 10 4.417e-06 4.417e-06 4.417e-06 0.00% Gravity::set_mass_offset() 11 3.952e-06 3.952e-06 3.952e-06 0.00% Castro::post_init() 1 3.523e-06 3.523e-06 3.523e-06 0.00% Castro::retry_advance_ctu() 10 3.422e-06 3.422e-06 3.422e-06 0.00% MLMG::MLResNormInf() 11 3.264e-06 3.264e-06 3.264e-06 0.00% Castro::FluxRegCrseInit 10 3.21e-06 3.21e-06 3.21e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.952e-06 2.952e-06 2.952e-06 0.00% Castro::computeInitialDt() 2 2.703e-06 2.703e-06 2.703e-06 0.00% Amr::init() 1 2.536e-06 2.536e-06 2.536e-06 0.00% Castro::FluxRegFineAdd() 10 2.414e-06 2.414e-06 2.414e-06 0.00% AmrLevel::checkPointPre() 3 2.035e-06 2.035e-06 2.035e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.819e-06 1.819e-06 1.819e-06 0.00% Amr::initialInit() 1 1.55e-06 1.55e-06 1.55e-06 0.00% Castro::post_regrid() 1 1.54e-06 1.54e-06 1.54e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8608 0.8608 0.8608 100.00% Amr::coarseTimeStep() 10 0.6924 0.6924 0.6924 80.43% Amr::timeStep() 10 0.5736 0.5736 0.5736 66.64% Castro::advance() 10 0.5675 0.5675 0.5675 65.92% Castro::subcycle_advance_ctu() 10 0.5563 0.5563 0.5563 64.62% Castro::do_advance_ctu() 10 0.556 0.556 0.556 64.59% Gravity::solve_phi_with_mlmg() 11 0.3206 0.3206 0.3206 37.25% Gravity::actual_solve_with_mlmg() 11 0.3118 0.3118 0.3118 36.22% Castro::construct_new_gravity() 10 0.2915 0.2915 0.2915 33.87% MLMG::solve() 11 0.289 0.289 0.289 33.57% Gravity::solve_for_phi() 10 0.2761 0.2761 0.2761 32.07% MLMG::oneIter() 82 0.2739 0.2739 0.2739 31.81% MLMG::mgVcycle() 82 0.2721 0.2721 0.2721 31.61% VisMF::Write(FabArray) 11 0.2226 0.2226 0.2226 25.86% Castro::construct_ctu_hydro_source() 10 0.1863 0.1863 0.1863 21.64% Amr::checkPoint() 3 0.1693 0.1693 0.1693 19.66% AmrLevel::checkPoint() 3 0.164 0.164 0.164 19.05% StateData::checkPoint() 12 0.1639 0.1639 0.1639 19.04% MLCellLinOp::smooth() 1640 0.139 0.139 0.139 16.15% Amr::init() 1 0.1367 0.1367 0.1367 15.88% MLCellLinOp::applyBC() 4433 0.09742 0.09742 0.09742 11.32% MLMG::mgVcycle_bottom 82 0.08384 0.08384 0.08384 9.74% MLMG::actualBottomSolve() 82 0.08381 0.08381 0.08381 9.74% MLCGSolver::bicgstab 82 0.08299 0.08299 0.08299 9.64% MLPoisson::Fsmooth() 3280 0.06554 0.06554 0.06554 7.61% Amr::writePlotFile() 2 0.0617 0.0617 0.0617 7.17% Amr::initialInit() 1 0.05254 0.05254 0.05254 6.10% Amr::FinalizeInit() 1 0.04834 0.04834 0.04834 5.62% Castro::post_init() 1 0.04691 0.04691 0.04691 5.45% Gravity::multilevel_solve_for_new_phi() 1 0.04506 0.04506 0.04506 5.23% Gravity::actual_multilevel_solve() 1 0.04504 0.04504 0.04504 5.23% Castro::clean_state() 62 0.04459 0.04459 0.04459 5.18% FillPatchIterator::Initialize 41 0.0423 0.0423 0.0423 4.91% FillPatchSingleLevel 41 0.04067 0.04067 0.04067 4.72% MLCellLinOp::apply() 1142 0.03696 0.03696 0.03696 4.29% StateDataPhysBCFunct::() 41 0.03665 0.03665 0.03665 4.26% MLMG::mgVcycle_down::0 82 0.03609 0.03609 0.03609 4.19% MLMG::mgVcycle_up::0 82 0.03087 0.03087 0.03087 3.59% StateData::FillBoundary(geom) 328 0.0238 0.0238 0.0238 2.77% MultiFab::Dot() 1114 0.02287 0.02287 0.02287 2.66% MLCellLinOp::correctionResidual() 492 0.02171 0.02171 0.02171 2.52% Castro::computeTemp() 63 0.02083 0.02083 0.02083 2.42% Castro::initialize_do_advance() 10 0.01954 0.01954 0.01954 2.27% MLMG:computeResOfCorrection() 410 0.01873 0.01873 0.01873 2.18% MLPoisson::define() 11 0.01837 0.01837 0.01837 2.13% MLMG::mgVcycle_down::1 82 0.01812 0.01812 0.01812 2.10% MLMG::mgVcycle_down::2 82 0.01767 0.01767 0.01767 2.05% Gravity::get_new_grav_vector() 11 0.01701 0.01701 0.01701 1.98% MLMG::mgVcycle_down::3 82 0.01678 0.01678 0.01678 1.95% MLMG::mgVcycle_down::4 82 0.01599 0.01599 0.01599 1.86% FabArray::FillBoundary() 4023 0.01592 0.01592 0.01592 1.85% CGSolver::sxay() 1586 0.01511 0.01511 0.01511 1.76% Castro::construct_old_gravity() 10 0.01497 0.01497 0.01497 1.74% Gravity::get_old_grav_vector() 10 0.01496 0.01496 0.01496 1.74% FillBoundary_nowait() 4023 0.01495 0.01495 0.01495 1.74% MultiFab::LinComb() 1586 0.01477 0.01477 0.01477 1.72% FabArray::ParallelCopy() 861 0.01456 0.01456 0.01456 1.69% FabArray::setVal() 1144 0.01454 0.01454 0.01454 1.69% FabArray::ParallelCopy_nowait() 861 0.01428 0.01428 0.01428 1.66% MLCGSolver::ParallelAllReduce 1514 0.01361 0.01361 0.01361 1.58% MLMG::mgVcycle_up::2 82 0.01359 0.01359 0.01359 1.58% MLMG::mgVcycle_up::1 82 0.01333 0.01333 0.01333 1.55% MLMG::addInterpCorrection() 410 0.01309 0.01309 0.01309 1.52% MLCellLinOp::defineAuxData() 11 0.01292 0.01292 0.01292 1.50% MLMG::mgVcycle_up::3 82 0.01287 0.01287 0.01287 1.50% MLMG::mgVcycle_up::4 82 0.01275 0.01275 0.01275 1.48% Castro::normalize_species() 62 0.01253 0.01253 0.01253 1.46% Castro::do_new_sources() 10 0.01237 0.01237 0.01237 1.44% amrex::average_down 410 0.01228 0.01228 0.01228 1.43% MLPoisson::Fapply() 1142 0.01205 0.01205 0.01205 1.40% Castro::expand_state() 10 0.01141 0.01141 0.01141 1.33% Castro::initialize_advance() 10 0.01111 0.01111 0.01111 1.29% Castro::do_old_sources() 10 0.01088 0.01088 0.01088 1.26% Castro::enforce_min_density() 62 0.01061 0.01061 0.01061 1.23% Gravity::fill_multipole_BCs() 11 0.008663 0.008663 0.008663 1.01% MLCellLinOp::solutionResidual() 93 0.007217 0.007217 0.007217 0.84% MultiFab::Xpay() 585 0.006751 0.006751 0.006751 0.78% MLMG::computeResidual() 82 0.006237 0.006237 0.006237 0.72% Castro::post_timestep() 10 0.006005 0.006005 0.006005 0.70% Castro::reset_internal_energy(MultiFab) 63 0.005674 0.005674 0.005674 0.66% MLMG::prepareForSolve() 11 0.005443 0.005443 0.005443 0.63% MLCellLinOp::defineBC() 11 0.005176 0.005176 0.005176 0.60% Castro::estTimeStep() 21 0.004904 0.004904 0.004904 0.57% BndryData::define() 11 0.004901 0.004901 0.004901 0.57% Amr::InitializeInit() 1 0.0042 0.0042 0.0042 0.49% Amr::defBaseLevel() 1 0.004194 0.004194 0.004194 0.49% Castro::initData() 1 0.00365 0.00365 0.00365 0.42% Castro::construct_new_source() 50 0.003273 0.003273 0.003273 0.38% Castro::construct_new_gravity_source() 10 0.003255 0.003255 0.003255 0.38% Castro::construct_old_source() 50 0.00257 0.00257 0.00257 0.30% Castro::construct_old_gravity_source() 10 0.002553 0.002553 0.002553 0.30% MLMG::ResNormInf() 93 0.002106 0.002106 0.002106 0.24% Castro::computeNewDt() 9 0.002076 0.002076 0.002076 0.24% Castro::apply_source_to_state() 20 0.001826 0.001826 0.001826 0.21% MultiFab::Saxpy() 20 0.001816 0.001816 0.001816 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001685 0.001685 0.001685 0.20% MultiFab::Add() 82 0.001682 0.001682 0.001682 0.20% MLCellLinOp::setLevelBC() 11 0.001559 0.001559 0.001559 0.18% Castro::reset_internal_energy(Fab) 504 0.00155 0.00155 0.00155 0.18% Castro::enforce_speed_limit() 62 0.001472 0.001472 0.001472 0.17% MLMG::getGradSolution() 11 0.001453 0.001453 0.001453 0.17% MLCellLinOp::compGrad() 11 0.001446 0.001446 0.001446 0.17% FabArrayBase::getCPC() 1323 0.001434 0.001434 0.001434 0.17% FabArray::mult() 43 0.001342 0.001342 0.001342 0.16% FabArray::setDomainBndry() 41 0.001338 0.001338 0.001338 0.16% Castro::check_for_nan() 20 0.001209 0.001209 0.001209 0.14% MLPoisson::prepareForSolve() 11 0.001201 0.001201 0.001201 0.14% MultiFab::contains_nan() 20 0.001197 0.001197 0.001197 0.14% MLCellLinOp::prepareForSolve() 11 0.001192 0.001192 0.001192 0.14% Castro::post_regrid() 1 0.001156 0.001156 0.001156 0.13% MLMG::computeMLResidual() 11 0.001029 0.001029 0.001029 0.12% Gravity::update_max_rhs() 11 0.0008225 0.0008225 0.0008225 0.10% Castro::computeInitialDt() 2 0.0007351 0.0007351 0.0007351 0.09% FabArrayBase::getFB() 4023 0.0006885 0.0006885 0.0006885 0.08% FabArrayBase::CPC::define() 454 0.0006698 0.0006698 0.0006698 0.08% Amr::InitAmr() 1 0.0004866 0.0004866 0.0004866 0.06% Castro::Castro() 1 0.000463 0.000463 0.000463 0.05% Gravity::swapTimeLevels() 10 0.0004454 0.0004454 0.0004454 0.05% MultiFab::Copy() 11 0.0003346 0.0003346 0.0003346 0.04% MLMG::MLResNormInf() 11 0.0002812 0.0002812 0.0002812 0.03% MultiFab::max() 11 0.0002594 0.0002594 0.0002594 0.03% MLLinOp::define() 11 0.0002369 0.0002369 0.0002369 0.03% MLMG::MLRhsNormInf() 11 0.0002209 0.0002209 0.0002209 0.03% MLLinOp::defineGrids() 11 0.0002154 0.0002154 0.0002154 0.03% Castro::buildMetrics() 1 0.00016 0.00016 0.00016 0.02% FabArrayBase::FB::FB() 56 8.394e-05 8.394e-05 8.394e-05 0.01% Castro::finalize_advance() 10 6.972e-05 6.972e-05 6.972e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.573e-05 5.573e-05 5.573e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.181e-05 5.181e-05 5.181e-05 0.01% Castro::swap_state_time_levels() 10 4.353e-05 4.353e-05 4.353e-05 0.01% StateData::define() 4 4.248e-05 4.248e-05 4.248e-05 0.00% makeSFC 55 3.908e-05 3.908e-05 3.908e-05 0.00% Castro::enforce_consistent_e() 1 3.585e-05 3.585e-05 3.585e-05 0.00% Castro::finalize_do_advance() 10 3.275e-05 3.275e-05 3.275e-05 0.00% Castro::initMFs() 1 2.629e-05 2.629e-05 2.629e-05 0.00% Amr::writeSmallPlotFile() 1 2.47e-05 2.47e-05 2.47e-05 0.00% DistributionMapping::Distribute() 56 1.409e-05 1.409e-05 1.409e-05 0.00% Amr::initSubcycle() 1 8.922e-06 8.922e-06 8.922e-06 0.00% AmrLevel::checkPointPost() 3 5.043e-06 5.043e-06 5.043e-06 0.00% Castro::create_source_corrector() 10 4.417e-06 4.417e-06 4.417e-06 0.00% Gravity::set_mass_offset() 11 3.952e-06 3.952e-06 3.952e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.949e-06 3.949e-06 3.949e-06 0.00% Castro::retry_advance_ctu() 10 3.422e-06 3.422e-06 3.422e-06 0.00% Castro::FluxRegCrseInit 10 3.21e-06 3.21e-06 3.21e-06 0.00% Castro::FluxRegFineAdd() 10 2.414e-06 2.414e-06 2.414e-06 0.00% AmrLevel::checkPointPre() 3 2.035e-06 2.035e-06 2.035e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.819e-06 1.819e-06 1.819e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.10-17-g56b6402d2389) finalized Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.10-17-g56b6402d2389) initialized Starting run at 08:37:19 UTC on 2022-10-17. Successfully read inputs file ... Castro git describe: 22.09-1-g65b273ad0 AMReX git describe: 22.10-17-g56b6402d2 Microphysics git describe: 22.10-4-g1dbcf8c2 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.462195363 Restart time = 0.048966442 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.051672346 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.051031357 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.061255571 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.063292794 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.081009972 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.032917722 seconds Ending run at 08:37:19 UTC on 2022-10-17. Run time = 0.391167907 Run time without initialization = 0.341636852 Average number of zones advanced per microsecond: 3.837 Average number of zones advanced per microsecond per rank: 3.837 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.3912 ... 0.3912 ... 0.3912 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1053 0.1053 0.1053 26.91% VisMF::Read() 3 0.04096 0.04096 0.04096 10.47% MLCellLinOp::applyBC() 1946 0.03463 0.03463 0.03463 8.85% VisMF::Write(FabArray) 1 0.03124 0.03124 0.03124 7.99% MLPoisson::Fsmooth() 1440 0.02724 0.02724 0.02724 6.96% StateData::FillBoundary(geom) 160 0.0117 0.0117 0.0117 2.99% MLCGSolver::bicgstab 36 0.01025 0.01025 0.01025 2.62% MultiFab::Dot() 484 0.009543 0.009543 0.009543 2.44% Castro::computeTemp() 30 0.009241 0.009241 0.009241 2.36% Castro::normalize_species() 30 0.008415 0.008415 0.008415 2.15% FabArray::setVal() 537 0.006808 0.006808 0.006808 1.74% MLCellLinOp::defineAuxData() 6 0.006373 0.006373 0.006373 1.63% Castro::enforce_min_density() 30 0.006296 0.006296 0.006296 1.61% FillBoundary_nowait() 1766 0.006224 0.006224 0.006224 1.59% MultiFab::LinComb() 690 0.006158 0.006158 0.006158 1.57% FabArray::ParallelCopy_nowait() 380 0.005956 0.005956 0.005956 1.52% StateDataPhysBCFunct::() 20 0.005372 0.005372 0.005372 1.37% MLPoisson::Fapply() 500 0.005081 0.005081 0.005081 1.30% Gravity::fill_multipole_BCs() 6 0.00443 0.00443 0.00443 1.13% Amr::restart() 1 0.003631 0.003631 0.003631 0.93% MLMG::addInterpCorrection() 180 0.003352 0.003352 0.003352 0.86% amrex::average_down 180 0.002995 0.002995 0.002995 0.77% MultiFab::Xpay() 258 0.002881 0.002881 0.002881 0.74% Castro::estTimeStep() 10 0.002716 0.002716 0.002716 0.69% Castro::do_advance_ctu() 5 0.00241 0.00241 0.00241 0.62% BndryData::define() 6 0.00217 0.00217 0.00217 0.55% Castro::enforce_speed_limit() 30 0.001964 0.001964 0.001964 0.50% Amr::writePlotFile() 1 0.001794 0.001794 0.001794 0.46% Castro::reset_internal_energy(MultiFab) 30 0.001723 0.001723 0.001723 0.44% Castro::construct_new_gravity_source() 5 0.001657 0.001657 0.001657 0.42% Castro::subcycle_advance_ctu() 5 0.001615 0.001615 0.001615 0.41% Castro::construct_old_gravity_source() 5 0.001392 0.001392 0.001392 0.36% Gravity::get_old_grav_vector() 5 0.000947 0.000947 0.000947 0.24% MultiFab::Saxpy() 10 0.0009251 0.0009251 0.0009251 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009227 0.0009227 0.0009227 0.24% MLMG::ResNormInf() 42 0.0009206 0.0009206 0.0009206 0.24% Gravity::get_new_grav_vector() 5 0.0008762 0.0008762 0.0008762 0.22% Castro::expand_state() 5 0.0008728 0.0008728 0.0008728 0.22% Castro::reset_internal_energy(Fab) 240 0.0008714 0.0008714 0.0008714 0.22% MLCellLinOp::setLevelBC() 6 0.0008382 0.0008382 0.0008382 0.21% Gravity::actual_solve_with_mlmg() 6 0.0008153 0.0008153 0.0008153 0.21% MultiFab::Add() 36 0.0007169 0.0007169 0.0007169 0.18% MLMG::prepareForSolve() 6 0.0007014 0.0007014 0.0007014 0.18% MLCellLinOp::prepareForSolve() 6 0.0006596 0.0006596 0.0006596 0.17% FabArray::mult() 22 0.0006509 0.0006509 0.0006509 0.17% FabArray::setDomainBndry() 20 0.0006479 0.0006479 0.0006479 0.17% MultiFab::contains_nan() 10 0.0005881 0.0005881 0.0005881 0.15% MLCellLinOp::compGrad() 6 0.0004911 0.0004911 0.0004911 0.13% MLCellLinOp::smooth() 720 0.0004652 0.0004652 0.0004652 0.12% FabArrayBase::CPC::define() 244 0.0004053 0.0004053 0.0004053 0.10% FabArray::FillBoundary() 1766 0.0003947 0.0003947 0.0003947 0.10% Amr::InitAmr() 1 0.0003947 0.0003947 0.0003947 0.10% FabArrayBase::getCPC() 632 0.0003726 0.0003726 0.0003726 0.10% FabArrayBase::getFB() 1766 0.0002622 0.0002622 0.0002622 0.07% main() 1 0.0002415 0.0002415 0.0002415 0.06% Gravity::update_max_rhs() 6 0.0002294 0.0002294 0.0002294 0.06% Gravity::solve_for_phi() 5 0.0002201 0.0002201 0.0002201 0.06% MLCellLinOp::apply() 500 0.0002032 0.0002032 0.0002032 0.05% MultiFab::Copy() 6 0.0001808 0.0001808 0.0001808 0.05% Amr::coarseTimeStep() 5 0.0001793 0.0001793 0.0001793 0.05% CGSolver::sxay() 690 0.0001659 0.0001659 0.0001659 0.04% Castro::construct_new_gravity() 5 0.000164 0.000164 0.000164 0.04% Castro::create_source_corrector() 5 0.000159 0.000159 0.000159 0.04% MLCellLinOp::defineBC() 6 0.000156 0.000156 0.000156 0.04% Castro::construct_new_source() 25 0.0001515 0.0001515 0.0001515 0.04% FillPatchIterator::Initialize 20 0.0001367 0.0001367 0.0001367 0.03% MultiFab::max() 6 0.0001344 0.0001344 0.0001344 0.03% FabArray::ParallelCopy() 380 0.0001236 0.0001236 0.0001236 0.03% MLCGSolver::ParallelAllReduce 659 0.0001178 0.0001178 0.0001178 0.03% MLMG::MLRhsNormInf() 6 0.0001133 0.0001133 0.0001133 0.03% Castro::post_timestep() 5 0.0001091 0.0001091 0.0001091 0.03% MLMG::mgVcycle() 36 0.0001085 0.0001085 0.0001085 0.03% MLCellLinOp::correctionResidual() 216 0.0001044 0.0001044 0.0001044 0.03% Castro::construct_old_source() 25 0.0001028 0.0001028 0.0001028 0.03% Castro::initialize_do_advance() 5 9.648e-05 9.648e-05 9.648e-05 0.02% Amr::timeStep() 5 8.91e-05 8.91e-05 8.91e-05 0.02% Castro::initialize_advance() 5 8.805e-05 8.805e-05 8.805e-05 0.02% MLLinOp::defineGrids() 6 8.679e-05 8.679e-05 8.679e-05 0.02% AmrLevel::restart() 1 8.595e-05 8.595e-05 8.595e-05 0.02% Castro::advance() 5 8.471e-05 8.471e-05 8.471e-05 0.02% Castro::computeNewDt() 5 7.815e-05 7.815e-05 7.815e-05 0.02% StateData::restartDoit() 4 7.488e-05 7.488e-05 7.488e-05 0.02% Castro::finalize_advance() 5 6.965e-05 6.965e-05 6.965e-05 0.02% MLMG:computeResOfCorrection() 180 6.239e-05 6.239e-05 6.239e-05 0.02% FabArrayBase::FB::FB() 26 5.848e-05 5.848e-05 5.848e-05 0.01% MLMG::mgVcycle_down::0 36 4.656e-05 4.656e-05 4.656e-05 0.01% Castro::construct_old_gravity() 5 4.398e-05 4.398e-05 4.398e-05 0.01% MLMG::mgVcycle_down::1 36 4.261e-05 4.261e-05 4.261e-05 0.01% Amr::writeSmallPlotFile() 1 3.892e-05 3.892e-05 3.892e-05 0.01% MLMG::mgVcycle_down::2 36 3.887e-05 3.887e-05 3.887e-05 0.01% Castro::clean_state() 30 3.88e-05 3.88e-05 3.88e-05 0.01% MLMG::mgVcycle_down::4 36 3.713e-05 3.713e-05 3.713e-05 0.01% MLMG::mgVcycle_down::3 36 3.632e-05 3.632e-05 3.632e-05 0.01% MLMG::actualBottomSolve() 36 3.413e-05 3.413e-05 3.413e-05 0.01% MLMG::solve() 6 3.338e-05 3.338e-05 3.338e-05 0.01% Castro::buildMetrics() 1 3.204e-05 3.204e-05 3.204e-05 0.01% MLMG::mgVcycle_up::4 36 3.123e-05 3.123e-05 3.123e-05 0.01% Castro::post_restart() 1 3.099e-05 3.099e-05 3.099e-05 0.01% MLMG::oneIter() 36 3.065e-05 3.065e-05 3.065e-05 0.01% Gravity::actual_multilevel_solve() 1 3.005e-05 3.005e-05 3.005e-05 0.01% Castro::swap_state_time_levels() 5 2.775e-05 2.775e-05 2.775e-05 0.01% Castro::initMFs() 1 2.708e-05 2.708e-05 2.708e-05 0.01% MLMG::mgVcycle_up::3 36 2.625e-05 2.625e-05 2.625e-05 0.01% MLMG::mgVcycle_up::0 36 2.506e-05 2.506e-05 2.506e-05 0.01% MLMG::mgVcycle_up::2 36 2.483e-05 2.483e-05 2.483e-05 0.01% MLCellLinOp::solutionResidual() 42 2.375e-05 2.375e-05 2.375e-05 0.01% MLMG::mgVcycle_up::1 36 2.33e-05 2.33e-05 2.33e-05 0.01% MLPoisson::define() 6 2.24e-05 2.24e-05 2.24e-05 0.01% MLLinOp::define() 6 2.089e-05 2.089e-05 2.089e-05 0.01% Castro::finalize_do_advance() 5 1.874e-05 1.874e-05 1.874e-05 0.00% MLMG::computeResidual() 36 1.827e-05 1.827e-05 1.827e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.727e-05 1.727e-05 1.727e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.563e-05 1.563e-05 1.563e-05 0.00% MLMG::mgVcycle_bottom 36 1.548e-05 1.548e-05 1.548e-05 0.00% FillPatchSingleLevel 20 1.439e-05 1.439e-05 1.439e-05 0.00% makeSFC 30 1.367e-05 1.367e-05 1.367e-05 0.00% Castro::do_new_sources() 5 1.027e-05 1.027e-05 1.027e-05 0.00% Amr::initSubcycle() 1 9.108e-06 9.108e-06 9.108e-06 0.00% DistributionMapping::Distribute() 31 8.642e-06 8.642e-06 8.642e-06 0.00% Castro::do_old_sources() 5 8.333e-06 8.333e-06 8.333e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.257e-06 7.257e-06 7.257e-06 0.00% Castro::check_for_nan() 10 6.002e-06 6.002e-06 6.002e-06 0.00% Castro::apply_source_to_state() 10 5.908e-06 5.908e-06 5.908e-06 0.00% MLMG::computeMLResidual() 6 5.179e-06 5.179e-06 5.179e-06 0.00% MLPoisson::prepareForSolve() 6 4.885e-06 4.885e-06 4.885e-06 0.00% Gravity::swapTimeLevels() 5 4.104e-06 4.104e-06 4.104e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.26e-06 3.26e-06 3.26e-06 0.00% MLMG::getGradSolution() 6 3.178e-06 3.178e-06 3.178e-06 0.00% MLMG::MLResNormInf() 6 2.094e-06 2.094e-06 2.094e-06 0.00% Gravity::set_mass_offset() 6 2.055e-06 2.055e-06 2.055e-06 0.00% Castro::retry_advance_ctu() 5 1.837e-06 1.837e-06 1.837e-06 0.00% Castro::FluxRegCrseInit 5 1.795e-06 1.795e-06 1.795e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.201e-06 1.201e-06 1.201e-06 0.00% Amr::init() 1 1.124e-06 1.124e-06 1.124e-06 0.00% Castro::FluxRegFineAdd() 5 1.115e-06 1.115e-06 1.115e-06 0.00% AmrLevel::AmrLevel() 1 8.12e-07 8.12e-07 8.12e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.3912 0.3912 0.3912 100.00% Amr::coarseTimeStep() 5 0.3084 0.3084 0.3084 78.84% Amr::timeStep() 5 0.3066 0.3066 0.3066 78.38% Castro::advance() 5 0.3022 0.3022 0.3022 77.25% Castro::subcycle_advance_ctu() 5 0.2946 0.2946 0.2946 75.30% Castro::do_advance_ctu() 5 0.2929 0.2929 0.2929 74.88% Castro::construct_new_gravity() 5 0.1439 0.1439 0.1439 36.79% Gravity::solve_phi_with_mlmg() 6 0.1399 0.1399 0.1399 35.75% Gravity::solve_for_phi() 5 0.1362 0.1362 0.1362 34.82% Gravity::actual_solve_with_mlmg() 6 0.1353 0.1353 0.1353 34.59% MLMG::solve() 6 0.1227 0.1227 0.1227 31.37% MLMG::oneIter() 36 0.1154 0.1154 0.1154 29.49% MLMG::mgVcycle() 36 0.1146 0.1146 0.1146 29.30% Castro::construct_ctu_hydro_source() 5 0.1053 0.1053 0.1053 26.91% MLCellLinOp::smooth() 720 0.05849 0.05849 0.05849 14.95% Amr::init() 1 0.04902 0.04902 0.04902 12.53% Amr::restart() 1 0.04902 0.04902 0.04902 12.53% MLCellLinOp::applyBC() 1946 0.04157 0.04157 0.04157 10.63% AmrLevel::restart() 1 0.04117 0.04117 0.04117 10.52% StateData::restartDoit() 4 0.04108 0.04108 0.04108 10.50% VisMF::Read() 3 0.04096 0.04096 0.04096 10.47% MLMG::mgVcycle_bottom 36 0.03511 0.03511 0.03511 8.98% MLMG::actualBottomSolve() 36 0.0351 0.0351 0.0351 8.97% MLCGSolver::bicgstab 36 0.03475 0.03475 0.03475 8.88% Amr::writePlotFile() 1 0.03304 0.03304 0.03304 8.44% VisMF::Write(FabArray) 1 0.03124 0.03124 0.03124 7.99% Castro::clean_state() 30 0.02855 0.02855 0.02855 7.30% MLPoisson::Fsmooth() 1440 0.02724 0.02724 0.02724 6.96% FillPatchIterator::Initialize 20 0.01987 0.01987 0.01987 5.08% FillPatchSingleLevel 20 0.01908 0.01908 0.01908 4.88% StateDataPhysBCFunct::() 20 0.01707 0.01707 0.01707 4.36% MLCellLinOp::apply() 500 0.01579 0.01579 0.01579 4.04% MLMG::mgVcycle_down::0 36 0.01534 0.01534 0.01534 3.92% MLMG::mgVcycle_up::0 36 0.0131 0.0131 0.0131 3.35% Castro::initialize_do_advance() 5 0.01202 0.01202 0.01202 3.07% Castro::computeTemp() 30 0.01184 0.01184 0.01184 3.03% StateData::FillBoundary(geom) 160 0.0117 0.0117 0.0117 2.99% MLPoisson::define() 6 0.01018 0.01018 0.01018 2.60% MultiFab::Dot() 484 0.009543 0.009543 0.009543 2.44% MLCellLinOp::correctionResidual() 216 0.009228 0.009228 0.009228 2.36% Castro::normalize_species() 30 0.008415 0.008415 0.008415 2.15% MLMG:computeResOfCorrection() 180 0.007964 0.007964 0.007964 2.04% MLMG::mgVcycle_down::1 36 0.007658 0.007658 0.007658 1.96% Castro::construct_old_gravity() 5 0.007581 0.007581 0.007581 1.94% Gravity::get_new_grav_vector() 5 0.007543 0.007543 0.007543 1.93% Gravity::get_old_grav_vector() 5 0.007537 0.007537 0.007537 1.93% Castro::initialize_advance() 5 0.00747 0.00747 0.00747 1.91% MLMG::mgVcycle_down::2 36 0.007405 0.007405 0.007405 1.89% MLCellLinOp::defineAuxData() 6 0.007147 0.007147 0.007147 1.83% Castro::do_new_sources() 5 0.007093 0.007093 0.007093 1.81% MLMG::mgVcycle_down::3 36 0.007074 0.007074 0.007074 1.81% FabArray::FillBoundary() 1766 0.00694 0.00694 0.00694 1.77% FabArray::setVal() 537 0.006808 0.006808 0.006808 1.74% MLMG::mgVcycle_down::4 36 0.006723 0.006723 0.006723 1.72% FillBoundary_nowait() 1766 0.006545 0.006545 0.006545 1.67% FabArray::ParallelCopy() 380 0.006455 0.006455 0.006455 1.65% Castro::do_old_sources() 5 0.006349 0.006349 0.006349 1.62% FabArray::ParallelCopy_nowait() 380 0.006332 0.006332 0.006332 1.62% CGSolver::sxay() 690 0.006324 0.006324 0.006324 1.62% Castro::enforce_min_density() 30 0.006296 0.006296 0.006296 1.61% MultiFab::LinComb() 690 0.006158 0.006158 0.006158 1.57% MLCGSolver::ParallelAllReduce 659 0.005733 0.005733 0.005733 1.47% MLMG::mgVcycle_up::2 36 0.005702 0.005702 0.005702 1.46% MLMG::mgVcycle_up::1 36 0.005615 0.005615 0.005615 1.44% MLMG::addInterpCorrection() 180 0.005571 0.005571 0.005571 1.42% Castro::expand_state() 5 0.005523 0.005523 0.005523 1.41% MLMG::mgVcycle_up::4 36 0.005393 0.005393 0.005393 1.38% MLMG::mgVcycle_up::3 36 0.005391 0.005391 0.005391 1.38% amrex::average_down 180 0.005233 0.005233 0.005233 1.34% MLPoisson::Fapply() 500 0.005081 0.005081 0.005081 1.30% Gravity::fill_multipole_BCs() 6 0.00443 0.00443 0.00443 1.13% Castro::post_timestep() 5 0.004356 0.004356 0.004356 1.11% Castro::post_restart() 1 0.00404 0.00404 0.00404 1.03% Gravity::multilevel_solve_for_new_phi() 1 0.003915 0.003915 0.003915 1.00% Gravity::actual_multilevel_solve() 1 0.003898 0.003898 0.003898 1.00% MLCellLinOp::solutionResidual() 42 0.003249 0.003249 0.003249 0.83% MLMG::prepareForSolve() 6 0.002993 0.002993 0.002993 0.77% MultiFab::Xpay() 258 0.002881 0.002881 0.002881 0.74% MLCellLinOp::defineBC() 6 0.002876 0.002876 0.002876 0.74% BndryData::define() 6 0.00272 0.00272 0.00272 0.70% Castro::estTimeStep() 10 0.002716 0.002716 0.002716 0.69% MLMG::computeResidual() 36 0.002695 0.002695 0.002695 0.69% Castro::reset_internal_energy(MultiFab) 30 0.002594 0.002594 0.002594 0.66% Castro::enforce_speed_limit() 30 0.001964 0.001964 0.001964 0.50% Castro::construct_new_source() 25 0.001808 0.001808 0.001808 0.46% Castro::construct_new_gravity_source() 5 0.001657 0.001657 0.001657 0.42% Castro::computeNewDt() 5 0.001629 0.001629 0.001629 0.42% Castro::construct_old_source() 25 0.001495 0.001495 0.001495 0.38% Castro::construct_old_gravity_source() 5 0.001392 0.001392 0.001392 0.36% Castro::apply_source_to_state() 10 0.000931 0.000931 0.000931 0.24% MultiFab::Saxpy() 10 0.0009251 0.0009251 0.0009251 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009227 0.0009227 0.0009227 0.24% MLMG::ResNormInf() 42 0.0009206 0.0009206 0.0009206 0.24% Castro::reset_internal_energy(Fab) 240 0.0008714 0.0008714 0.0008714 0.22% MLCellLinOp::setLevelBC() 6 0.0008382 0.0008382 0.0008382 0.21% FabArrayBase::getCPC() 632 0.0007778 0.0007778 0.0007778 0.20% MLMG::getGradSolution() 6 0.0007686 0.0007686 0.0007686 0.20% MLCellLinOp::compGrad() 6 0.0007654 0.0007654 0.0007654 0.20% MultiFab::Add() 36 0.0007169 0.0007169 0.0007169 0.18% MLPoisson::prepareForSolve() 6 0.0006645 0.0006645 0.0006645 0.17% MLCellLinOp::prepareForSolve() 6 0.0006596 0.0006596 0.0006596 0.17% FabArray::mult() 22 0.0006509 0.0006509 0.0006509 0.17% FabArray::setDomainBndry() 20 0.0006479 0.0006479 0.0006479 0.17% Castro::check_for_nan() 10 0.0005941 0.0005941 0.0005941 0.15% MultiFab::contains_nan() 10 0.0005881 0.0005881 0.0005881 0.15% MLMG::computeMLResidual() 6 0.0005783 0.0005783 0.0005783 0.15% Gravity::update_max_rhs() 6 0.0004425 0.0004425 0.0004425 0.11% FabArrayBase::CPC::define() 244 0.0004053 0.0004053 0.0004053 0.10% Amr::InitAmr() 1 0.0004038 0.0004038 0.0004038 0.10% FabArrayBase::getFB() 1766 0.0003207 0.0003207 0.0003207 0.08% Gravity::swapTimeLevels() 5 0.0002274 0.0002274 0.0002274 0.06% MultiFab::Copy() 6 0.0001808 0.0001808 0.0001808 0.05% Castro::create_source_corrector() 5 0.000159 0.000159 0.000159 0.04% Castro::buildMetrics() 1 0.0001525 0.0001525 0.0001525 0.04% MLMG::MLResNormInf() 6 0.0001469 0.0001469 0.0001469 0.04% MLLinOp::define() 6 0.0001371 0.0001371 0.0001371 0.04% MultiFab::max() 6 0.0001344 0.0001344 0.0001344 0.03% MLLinOp::defineGrids() 6 0.0001162 0.0001162 0.0001162 0.03% MLMG::MLRhsNormInf() 6 0.0001133 0.0001133 0.0001133 0.03% Castro::finalize_advance() 5 7.256e-05 7.256e-05 7.256e-05 0.02% FabArrayBase::FB::FB() 26 5.848e-05 5.848e-05 5.848e-05 0.01% Amr::writeSmallPlotFile() 1 3.892e-05 3.892e-05 3.892e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.82e-05 2.82e-05 2.82e-05 0.01% Castro::swap_state_time_levels() 5 2.775e-05 2.775e-05 2.775e-05 0.01% Castro::initMFs() 1 2.708e-05 2.708e-05 2.708e-05 0.01% makeSFC 30 2.095e-05 2.095e-05 2.095e-05 0.01% Castro::finalize_do_advance() 5 1.874e-05 1.874e-05 1.874e-05 0.00% Amr::initSubcycle() 1 9.108e-06 9.108e-06 9.108e-06 0.00% DistributionMapping::Distribute() 31 8.642e-06 8.642e-06 8.642e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.625e-06 4.625e-06 4.625e-06 0.00% Gravity::set_mass_offset() 6 2.055e-06 2.055e-06 2.055e-06 0.00% Castro::retry_advance_ctu() 5 1.837e-06 1.837e-06 1.837e-06 0.00% Castro::FluxRegCrseInit 5 1.795e-06 1.795e-06 1.795e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.201e-06 1.201e-06 1.201e-06 0.00% Castro::FluxRegFineAdd() 5 1.115e-06 1.115e-06 1.115e-06 0.00% AmrLevel::AmrLevel() 1 8.12e-07 8.12e-07 8.12e-07 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.10-17-g56b6402d2389) finalized