Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.10-20-g3082028e4287) initialized
Starting run at 08:37:31 UTC on 2022-10-20.
Successfully read inputs file ...
Castro git describe: 22.09-1-g65b273ad0
AMReX git describe: 22.10-20-g3082028e4
Microphysics git describe: 22.10-4-g1dbcf8c2
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
Level 0 8 grids 262144 cells 100 % of domain
smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32
CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.051845868 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.029588338 seconds
[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.049971741
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05
[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.051036512
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05
[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.049485695
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05
[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.061200673
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05
[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.06441032
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05
CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.050523297 secs.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.069273208
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.068751624
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.059052118
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.05874323
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.063915871
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.047470558 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.029475098 seconds
Ending run at 08:37:32 UTC on 2022-10-20.
Run time = 0.85704784
Run time without initialization = 0.723945848
Average number of zones advanced per microsecond: 3.621
Average number of zones advanced per microsecond per rank: 3.621
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.8571 ... 0.8571 ...
0.8571 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.2149 0.2149 0.2149 25.08% VisMF::Write(FabArray) 11 0.2 0.2 0.2 23.33% MLCellLinOp::applyBC() 4433 0.07909 0.07909 0.07909 9.23% MLPoisson::Fsmooth() 3280 0.06324 0.06324 0.06324 7.38% MLCGSolver::bicgstab 82 0.02342 0.02342 0.02342 2.73% StateData::FillBoundary(geom) 328 0.0234 0.0234 0.0234 2.73% MultiFab::Dot() 1114 0.02191 0.02191 0.02191 2.56% Castro::computeTemp() 63 0.0141 0.0141 0.0141 1.65% StateDataPhysBCFunct::() 41 0.01408 0.01408 0.01408 1.64% MultiFab::LinComb() 1586 0.01406 0.01406 0.01406 1.64% FabArray::setVal() 1144 0.01401 0.01401 0.01401 1.63% FillBoundary_nowait() 4023 0.01401 0.01401 0.01401 1.63% Castro::normalize_species() 62 0.01359 0.01359 0.01359 1.59% FabArray::ParallelCopy_nowait() 861 0.01298 0.01298 0.01298 1.51% MLPoisson::Fapply() 1142 0.01158 0.01158 0.01158 1.35% MLCellLinOp::defineAuxData() 11 0.01143 0.01143 0.01143 1.33% Castro::enforce_min_density() 62 0.009049 0.009049 0.009049 1.06% Gravity::fill_multipole_BCs() 11 0.00828 0.00828 0.00828 0.97% MLMG::addInterpCorrection() 410 0.007667 0.007667 0.007667 0.89% amrex::average_down 410 0.006795 0.006795 0.006795 0.79% MultiFab::Xpay() 585 0.0065 0.0065 0.0065 0.76% Amr::checkPoint() 3 0.006175 0.006175 0.006175 0.72% Castro::do_advance_ctu() 10 0.005398 0.005398 0.005398 0.63% Castro::estTimeStep() 21 0.005286 0.005286 0.005286 0.62% Castro::reset_internal_energy(MultiFab) 63 0.003959 0.003959 0.003959 0.46% BndryData::define() 11 0.00372 0.00372 0.00372 0.43% Castro::construct_new_gravity_source() 10 0.00331 0.00331 0.00331 0.39% Amr::writePlotFile() 2 0.002829 0.002829 0.002829 0.33% Castro::construct_old_gravity_source() 10 0.002619 0.002619 0.002619 0.31% MLMG::ResNormInf() 93 0.002029 0.002029 
0.002029 0.24% Gravity::get_new_grav_vector() 11 0.001925 0.001925 0.001925 0.22% MultiFab::Saxpy() 20 0.001819 0.001819 0.001819 0.21% Gravity::get_old_grav_vector() 10 0.001754 0.001754 0.001754 0.20% Castro::expand_state() 10 0.001729 0.001729 0.001729 0.20% MultiFab::Add() 82 0.001661 0.001661 0.001661 0.19% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001632 0.001632 0.001632 0.19% Castro::reset_internal_energy(Fab) 504 0.001538 0.001538 0.001538 0.18% MLCellLinOp::setLevelBC() 11 0.001509 0.001509 0.001509 0.18% Gravity::actual_solve_with_mlmg() 11 0.001455 0.001455 0.001455 0.17% FabArray::mult() 43 0.001323 0.001323 0.001323 0.15% FabArray::setDomainBndry() 41 0.001309 0.001309 0.001309 0.15% Castro::initData() 1 0.001295 0.001295 0.001295 0.15% MLMG::prepareForSolve() 11 0.001224 0.001224 0.001224 0.14% MultiFab::contains_nan() 20 0.001189 0.001189 0.001189 0.14% MLCellLinOp::prepareForSolve() 11 0.001149 0.001149 0.001149 0.13% Castro::enforce_speed_limit() 62 0.001123 0.001123 0.001123 0.13% MLCellLinOp::smooth() 1640 0.0009999 0.0009999 0.0009999 0.12% MLCellLinOp::compGrad() 11 0.0009135 0.0009135 0.0009135 0.11% FabArray::FillBoundary() 4023 0.0008733 0.0008733 0.0008733 0.10% FabArrayBase::getCPC() 1323 0.0007797 0.0007797 0.0007797 0.09% Castro::subcycle_advance_ctu() 10 0.0007345 0.0007345 0.0007345 0.09% FabArrayBase::CPC::define() 454 0.0006527 0.0006527 0.0006527 0.08% FabArrayBase::getFB() 4023 0.0005969 0.0005969 0.0005969 0.07% Gravity::solve_for_phi() 10 0.0004798 0.0004798 0.0004798 0.06% Amr::InitAmr() 1 0.000476 0.000476 0.000476 0.06% MLCellLinOp::apply() 1142 0.0004254 0.0004254 0.0004254 0.05% Gravity::update_max_rhs() 11 0.0004031 0.0004031 0.0004031 0.05% CGSolver::sxay() 1586 0.0003499 0.0003499 0.0003499 0.04% Amr::coarseTimeStep() 10 0.000327 0.000327 0.000327 0.04% MultiFab::Copy() 11 0.0003166 0.0003166 0.0003166 0.04% FillPatchIterator::Initialize 41 0.0003005 0.0003005 0.0003005 0.04% MLCGSolver::ParallelAllReduce 1514 
0.0002872 0.0002872 0.0002872 0.03% MLCellLinOp::defineBC() 11 0.0002802 0.0002802 0.0002802 0.03% main() 1 0.0002703 0.0002703 0.0002703 0.03% FabArray::ParallelCopy() 861 0.0002675 0.0002675 0.0002675 0.03% MultiFab::max() 11 0.0002632 0.0002632 0.0002632 0.03% MLCellLinOp::correctionResidual() 492 0.0002297 0.0002297 0.0002297 0.03% MLMG::MLRhsNormInf() 11 0.0002164 0.0002164 0.0002164 0.03% Castro::construct_new_gravity() 10 0.000216 0.000216 0.000216 0.03% MLMG::mgVcycle() 82 0.0002111 0.0002111 0.0002111 0.02% Amr::timeStep() 10 0.000176 0.000176 0.000176 0.02% MLMG:computeResOfCorrection() 410 0.0001581 0.0001581 0.0001581 0.02% MLLinOp::defineGrids() 11 0.0001499 0.0001499 0.0001499 0.02% StateData::checkPoint() 12 0.0001313 0.0001313 0.0001313 0.02% MLMG::mgVcycle_down::0 82 0.0001161 0.0001161 0.0001161 0.01% Castro::finalize_advance() 10 0.0001031 0.0001031 0.0001031 0.01% MLMG::mgVcycle_down::1 82 9.813e-05 9.813e-05 9.813e-05 0.01% Castro::advance() 10 9.447e-05 9.447e-05 9.447e-05 0.01% MLMG::mgVcycle_down::2 82 9.272e-05 9.272e-05 9.272e-05 0.01% MLMG::mgVcycle_down::3 82 8.946e-05 8.946e-05 8.946e-05 0.01% Castro::clean_state() 62 8.826e-05 8.826e-05 8.826e-05 0.01% FabArrayBase::FB::FB() 56 8.823e-05 8.823e-05 8.823e-05 0.01% MLMG::actualBottomSolve() 82 8.734e-05 8.734e-05 8.734e-05 0.01% Castro::Castro() 1 8.52e-05 8.52e-05 8.52e-05 0.01% Castro::initialize_advance() 10 8.361e-05 8.361e-05 8.361e-05 0.01% MLMG::mgVcycle_down::4 82 8.265e-05 8.265e-05 8.265e-05 0.01% AmrLevel::checkPoint() 3 7.307e-05 7.307e-05 7.307e-05 0.01% MLMG::solve() 11 7.017e-05 7.017e-05 7.017e-05 0.01% MLMG::mgVcycle_up::4 82 6.974e-05 6.974e-05 6.974e-05 0.01% Castro::initialize_do_advance() 10 6.503e-05 6.503e-05 6.503e-05 0.01% MLMG::oneIter() 82 6.482e-05 6.482e-05 6.482e-05 0.01% MLMG::mgVcycle_up::0 82 6.138e-05 6.138e-05 6.138e-05 0.01% MLMG::mgVcycle_up::3 82 5.783e-05 5.783e-05 5.783e-05 0.01% MLMG::mgVcycle_up::1 82 5.659e-05 5.659e-05 5.659e-05 0.01% 
MLMG::mgVcycle_up::2 82 5.595e-05 5.595e-05 5.595e-05 0.01% MLCellLinOp::solutionResidual() 93 5.38e-05 5.38e-05 5.38e-05 0.01% StateData::define() 4 4.403e-05 4.403e-05 4.403e-05 0.01% MLMG::computeResidual() 82 4.185e-05 4.185e-05 4.185e-05 0.00% Castro::swap_state_time_levels() 10 3.838e-05 3.838e-05 3.838e-05 0.00% Castro::enforce_consistent_e() 1 3.433e-05 3.433e-05 3.433e-05 0.00% Castro::finalize_do_advance() 10 3.403e-05 3.403e-05 3.403e-05 0.00% MLMG::mgVcycle_bottom 82 3.207e-05 3.207e-05 3.207e-05 0.00% MLPoisson::define() 11 3.141e-05 3.141e-05 3.141e-05 0.00% Gravity::actual_multilevel_solve() 1 3.136e-05 3.136e-05 3.136e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.836e-05 2.836e-05 2.836e-05 0.00% FillPatchSingleLevel 41 2.786e-05 2.786e-05 2.786e-05 0.00% makeSFC 55 2.699e-05 2.699e-05 2.699e-05 0.00% Castro::initMFs() 1 2.59e-05 2.59e-05 2.59e-05 0.00% Amr::writeSmallPlotFile() 1 2.552e-05 2.552e-05 2.552e-05 0.00% Amr::defBaseLevel() 1 2.369e-05 2.369e-05 2.369e-05 0.00% MLLinOp::define() 11 2.288e-05 2.288e-05 2.288e-05 0.00% Castro::buildMetrics() 1 2.263e-05 2.263e-05 2.263e-05 0.00% Amr::FinalizeInit() 1 2.107e-05 2.107e-05 2.107e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.917e-05 1.917e-05 1.917e-05 0.00% Castro::construct_new_source() 50 1.826e-05 1.826e-05 1.826e-05 0.00% Castro::construct_old_source() 50 1.776e-05 1.776e-05 1.776e-05 0.00% Castro::do_new_sources() 10 1.708e-05 1.708e-05 1.708e-05 0.00% Castro::do_old_sources() 10 1.615e-05 1.615e-05 1.615e-05 0.00% DistributionMapping::Distribute() 56 1.489e-05 1.489e-05 1.489e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.431e-05 1.431e-05 1.431e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.33e-05 1.33e-05 1.33e-05 0.00% Castro::check_for_nan() 20 1.239e-05 1.239e-05 1.239e-05 0.00% Castro::apply_source_to_state() 20 1.18e-05 1.18e-05 1.18e-05 0.00% MLMG::computeMLResidual() 11 1.017e-05 1.017e-05 1.017e-05 0.00% Castro::construct_old_gravity() 10 9.671e-06 9.671e-06 9.671e-06 0.00% 
Castro::post_timestep() 10 9.134e-06 9.134e-06 9.134e-06 0.00% Gravity::swapTimeLevels() 10 8.548e-06 8.548e-06 8.548e-06 0.00% MLPoisson::prepareForSolve() 11 8.39e-06 8.39e-06 8.39e-06 0.00% Amr::initSubcycle() 1 8.137e-06 8.137e-06 8.137e-06 0.00% MLMG::getGradSolution() 11 6.854e-06 6.854e-06 6.854e-06 0.00% AmrLevel::checkPointPost() 3 6.376e-06 6.376e-06 6.376e-06 0.00% Castro::computeNewDt() 9 5.974e-06 5.974e-06 5.974e-06 0.00% Amr::InitializeInit() 1 5.454e-06 5.454e-06 5.454e-06 0.00% Castro::post_init() 1 4.717e-06 4.717e-06 4.717e-06 0.00% Castro::retry_advance_ctu() 10 4.475e-06 4.475e-06 4.475e-06 0.00% Gravity::set_mass_offset() 11 4.063e-06 4.063e-06 4.063e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.882e-06 3.882e-06 3.882e-06 0.00% Castro::computeInitialDt() 2 3.705e-06 3.705e-06 3.705e-06 0.00% Castro::FluxRegCrseInit 10 3.464e-06 3.464e-06 3.464e-06 0.00% Castro::create_source_corrector() 10 3.381e-06 3.381e-06 3.381e-06 0.00% MLMG::MLResNormInf() 11 3.328e-06 3.328e-06 3.328e-06 0.00% Amr::init() 1 2.916e-06 2.916e-06 2.916e-06 0.00% Castro::FluxRegFineAdd() 10 2.628e-06 2.628e-06 2.628e-06 0.00% AmrLevel::checkPointPre() 3 2.179e-06 2.179e-06 2.179e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.974e-06 1.974e-06 1.974e-06 0.00% Castro::post_regrid() 1 1.312e-06 1.312e-06 1.312e-06 0.00% Amr::initialInit() 1 9.96e-07 9.96e-07 9.96e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. 
Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8571 0.8571 0.8571 100.00% Amr::coarseTimeStep() 10 0.6942 0.6942 0.6942 81.00% Amr::timeStep() 10 0.5929 0.5929 0.5929 69.18% Castro::advance() 10 0.586 0.586 0.586 68.37% Castro::subcycle_advance_ctu() 10 0.5745 0.5745 0.5745 67.03% Castro::do_advance_ctu() 10 0.5737 0.5737 0.5737 66.94% Gravity::solve_phi_with_mlmg() 11 0.3099 0.3099 0.3099 36.16% Gravity::actual_solve_with_mlmg() 11 0.3014 0.3014 0.3014 35.17% Castro::construct_new_gravity() 10 0.2819 0.2819 0.2819 32.89% MLMG::solve() 11 0.2791 0.2791 0.2791 32.57% Gravity::solve_for_phi() 10 0.2669 0.2669 0.2669 31.14% MLMG::oneIter() 82 0.2644 0.2644 0.2644 30.84% MLMG::mgVcycle() 82 0.2626 0.2626 0.2626 30.64% Castro::construct_ctu_hydro_source() 10 0.2149 0.2149 0.2149 25.08% VisMF::Write(FabArray) 11 0.2 0.2 0.2 23.33% Amr::checkPoint() 3 0.15 0.15 0.15 17.50% AmrLevel::checkPoint() 3 0.1438 0.1438 0.1438 16.78% StateData::checkPoint() 12 0.1437 0.1437 0.1437 16.77% MLCellLinOp::smooth() 1640 0.1346 0.1346 0.1346 15.71% Amr::init() 1 0.1325 0.1325 0.1325 15.46% MLCellLinOp::applyBC() 4433 0.09466 0.09466 0.09466 11.04% MLMG::mgVcycle_bottom 82 0.08033 0.08033 0.08033 9.37% MLMG::actualBottomSolve() 82 0.0803 0.0803 0.0803 9.37% MLCGSolver::bicgstab 82 0.0795 0.0795 0.0795 9.28% MLPoisson::Fsmooth() 3280 0.06324 0.06324 0.06324 7.38% Amr::writePlotFile() 2 0.05919 0.05919 0.05919 6.91% Amr::initialInit() 1 0.05094 0.05094 0.05094 5.94% Amr::FinalizeInit() 1 0.04683 0.04683 0.04683 5.46% Castro::post_init() 1 0.04546 0.04546 0.04546 5.30% Gravity::multilevel_solve_for_new_phi() 1 0.0436 0.0436 0.0436 5.09% Gravity::actual_multilevel_solve() 1 0.04358 0.04358 0.04358 5.09% FillPatchIterator::Initialize 41 0.04311 0.04311 0.04311 5.03% Castro::clean_state() 62 0.04269 0.04269 0.04269 4.98% FillPatchSingleLevel 41 0.0415 0.0415 0.0415 4.84% StateDataPhysBCFunct::() 41 0.03747 0.03747 0.03747 4.37% 
MLCellLinOp::apply() 1142 0.03579 0.03579 0.03579 4.18% MLMG::mgVcycle_down::0 82 0.03509 0.03509 0.03509 4.09% MLMG::mgVcycle_up::0 82 0.03012 0.03012 0.03012 3.51% StateData::FillBoundary(geom) 328 0.0234 0.0234 0.0234 2.73% MultiFab::Dot() 1114 0.02191 0.02191 0.02191 2.56% MLCellLinOp::correctionResidual() 492 0.02098 0.02098 0.02098 2.45% Castro::initialize_do_advance() 10 0.02056 0.02056 0.02056 2.40% Castro::computeTemp() 63 0.0196 0.0196 0.0196 2.29% MLMG:computeResOfCorrection() 410 0.01813 0.01813 0.01813 2.12% MLPoisson::define() 11 0.01793 0.01793 0.01793 2.09% MLMG::mgVcycle_down::1 82 0.01755 0.01755 0.01755 2.05% MLMG::mgVcycle_down::2 82 0.01703 0.01703 0.01703 1.99% Gravity::get_new_grav_vector() 11 0.01656 0.01656 0.01656 1.93% MLMG::mgVcycle_down::3 82 0.01615 0.01615 0.01615 1.88% FabArray::FillBoundary() 4023 0.01557 0.01557 0.01557 1.82% MLMG::mgVcycle_down::4 82 0.01538 0.01538 0.01538 1.79% Castro::construct_old_gravity() 10 0.01475 0.01475 0.01475 1.72% Gravity::get_old_grav_vector() 10 0.01474 0.01474 0.01474 1.72% FillBoundary_nowait() 4023 0.01469 0.01469 0.01469 1.71% CGSolver::sxay() 1586 0.01441 0.01441 0.01441 1.68% FabArray::ParallelCopy() 861 0.01407 0.01407 0.01407 1.64% MultiFab::LinComb() 1586 0.01406 0.01406 0.01406 1.64% FabArray::setVal() 1144 0.01401 0.01401 0.01401 1.63% FabArray::ParallelCopy_nowait() 861 0.0138 0.0138 0.0138 1.61% Castro::normalize_species() 62 0.01359 0.01359 0.01359 1.59% MLMG::mgVcycle_up::2 82 0.01311 0.01311 0.01311 1.53% MLCGSolver::ParallelAllReduce 1514 0.01308 0.01308 0.01308 1.53% Castro::expand_state() 10 0.01304 0.01304 0.01304 1.52% MLMG::mgVcycle_up::1 82 0.01293 0.01293 0.01293 1.51% MLCellLinOp::defineAuxData() 11 0.01275 0.01275 0.01275 1.49% MLMG::addInterpCorrection() 410 0.01269 0.01269 0.01269 1.48% MLMG::mgVcycle_up::3 82 0.01246 0.01246 0.01246 1.45% MLMG::mgVcycle_up::4 82 0.01225 0.01225 0.01225 1.43% Castro::do_new_sources() 10 0.01192 0.01192 0.01192 1.39% amrex::average_down 
410 0.01185 0.01185 0.01185 1.38% MLPoisson::Fapply() 1142 0.01158 0.01158 0.01158 1.35% Castro::initialize_advance() 10 0.01134 0.01134 0.01134 1.32% Castro::do_old_sources() 10 0.009762 0.009762 0.009762 1.14% Castro::enforce_min_density() 62 0.009049 0.009049 0.009049 1.06% Gravity::fill_multipole_BCs() 11 0.00828 0.00828 0.00828 0.97% MLCellLinOp::solutionResidual() 93 0.0071 0.0071 0.0071 0.83% Castro::post_timestep() 10 0.006737 0.006737 0.006737 0.79% MultiFab::Xpay() 585 0.0065 0.0065 0.0065 0.76% MLMG::computeResidual() 82 0.006101 0.006101 0.006101 0.71% Castro::reset_internal_energy(MultiFab) 63 0.005497 0.005497 0.005497 0.64% MLMG::prepareForSolve() 11 0.005298 0.005298 0.005298 0.62% Castro::estTimeStep() 21 0.005286 0.005286 0.005286 0.62% MLCellLinOp::defineBC() 11 0.004924 0.004924 0.004924 0.57% BndryData::define() 11 0.004644 0.004644 0.004644 0.54% Amr::InitializeInit() 1 0.004106 0.004106 0.004106 0.48% Amr::defBaseLevel() 1 0.004101 0.004101 0.004101 0.48% Castro::initData() 1 0.003577 0.003577 0.003577 0.42% Castro::construct_new_source() 50 0.003328 0.003328 0.003328 0.39% Castro::construct_new_gravity_source() 10 0.00331 0.00331 0.00331 0.39% Castro::construct_old_source() 50 0.002637 0.002637 0.002637 0.31% Castro::construct_old_gravity_source() 10 0.002619 0.002619 0.002619 0.31% Castro::computeNewDt() 9 0.002357 0.002357 0.002357 0.28% MLMG::ResNormInf() 93 0.002029 0.002029 0.002029 0.24% Castro::apply_source_to_state() 20 0.001831 0.001831 0.001831 0.21% MultiFab::Saxpy() 20 0.001819 0.001819 0.001819 0.21% MultiFab::Add() 82 0.001661 0.001661 0.001661 0.19% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001632 0.001632 0.001632 0.19% Castro::reset_internal_energy(Fab) 504 0.001538 0.001538 0.001538 0.18% MLCellLinOp::setLevelBC() 11 0.001509 0.001509 0.001509 0.18% FabArrayBase::getCPC() 1323 0.001432 0.001432 0.001432 0.17% MLMG::getGradSolution() 11 0.001414 0.001414 0.001414 0.16% MLCellLinOp::compGrad() 11 0.001407 0.001407 
0.001407 0.16% FabArray::mult() 43 0.001323 0.001323 0.001323 0.15% FabArray::setDomainBndry() 41 0.001309 0.001309 0.001309 0.15% Castro::check_for_nan() 20 0.001201 0.001201 0.001201 0.14% MultiFab::contains_nan() 20 0.001189 0.001189 0.001189 0.14% MLPoisson::prepareForSolve() 11 0.001157 0.001157 0.001157 0.14% MLCellLinOp::prepareForSolve() 11 0.001149 0.001149 0.001149 0.13% Castro::enforce_speed_limit() 62 0.001123 0.001123 0.001123 0.13% Castro::post_regrid() 1 0.001117 0.001117 0.001117 0.13% MLMG::computeMLResidual() 11 0.001051 0.001051 0.001051 0.12% Gravity::update_max_rhs() 11 0.0008121 0.0008121 0.0008121 0.09% Castro::computeInitialDt() 2 0.0007678 0.0007678 0.0007678 0.09% FabArrayBase::getFB() 4023 0.0006851 0.0006851 0.0006851 0.08% FabArrayBase::CPC::define() 454 0.0006527 0.0006527 0.0006527 0.08% Amr::InitAmr() 1 0.0004841 0.0004841 0.0004841 0.06% Gravity::swapTimeLevels() 10 0.0004424 0.0004424 0.0004424 0.05% Castro::Castro() 1 0.0004365 0.0004365 0.0004365 0.05% MultiFab::Copy() 11 0.0003166 0.0003166 0.0003166 0.04% MLMG::MLResNormInf() 11 0.0002837 0.0002837 0.0002837 0.03% MultiFab::max() 11 0.0002632 0.0002632 0.0002632 0.03% MLLinOp::define() 11 0.0002287 0.0002287 0.0002287 0.03% MLMG::MLRhsNormInf() 11 0.0002164 0.0002164 0.0002164 0.03% MLLinOp::defineGrids() 11 0.0002058 0.0002058 0.0002058 0.02% Castro::buildMetrics() 1 0.0001598 0.0001598 0.0001598 0.02% Castro::finalize_advance() 10 0.0001092 0.0001092 0.0001092 0.01% FabArrayBase::FB::FB() 56 8.823e-05 8.823e-05 8.823e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.834e-05 5.834e-05 5.834e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.393e-05 5.393e-05 5.393e-05 0.01% StateData::define() 4 4.403e-05 4.403e-05 4.403e-05 0.01% makeSFC 55 4.062e-05 4.062e-05 4.062e-05 0.00% Castro::swap_state_time_levels() 10 3.838e-05 3.838e-05 3.838e-05 0.00% Castro::enforce_consistent_e() 1 3.433e-05 3.433e-05 3.433e-05 0.00% Castro::finalize_do_advance() 10 3.403e-05 3.403e-05 3.403e-05 0.00% 
Castro::initMFs() 1 2.59e-05 2.59e-05 2.59e-05 0.00%
Amr::writeSmallPlotFile() 1 2.552e-05 2.552e-05 2.552e-05 0.00%
DistributionMapping::Distribute() 56 1.489e-05 1.489e-05 1.489e-05 0.00%
Amr::initSubcycle() 1 8.137e-06 8.137e-06 8.137e-06 0.00%
AmrLevel::checkPointPost() 3 6.376e-06 6.376e-06 6.376e-06 0.00%
DistributionMapping::SFCProcessorMapDoIt() 1 5.135e-06 5.135e-06 5.135e-06 0.00%
Castro::retry_advance_ctu() 10 4.475e-06 4.475e-06 4.475e-06 0.00%
Gravity::set_mass_offset() 11 4.063e-06 4.063e-06 4.063e-06 0.00%
Castro::FluxRegCrseInit 10 3.464e-06 3.464e-06 3.464e-06 0.00%
Castro::create_source_corrector() 10 3.381e-06 3.381e-06 3.381e-06 0.00%
Castro::FluxRegFineAdd() 10 2.628e-06 2.628e-06 2.628e-06 0.00%
AmrLevel::checkPointPre() 3 2.179e-06 2.179e-06 2.179e-06 0.00%
MLLinOp::makeSubCommunicator() 11 1.974e-06 1.974e-06 1.974e-06 0.00%
--------------------------------------------------------------------------------------------
Unused ParmParse Variables:
[TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
[TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]
Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2545
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.10-20-g3082028e4287) finalized
Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.10-20-g3082028e4287) initialized
Starting run at 08:37:33 UTC on 2022-10-20.
Successfully read inputs file ...
Castro git describe: 22.09-1-g65b273ad0
AMReX git describe: 22.10-20-g3082028e4
Microphysics git describe: 22.10-4-g1dbcf8c2
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.459759991
Restart time = 0.047570317 seconds.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.053229711
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.049995791
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.056274232
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.059993152
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.063667866
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.031234195 seconds
Ending run at 08:37:33 UTC on 2022-10-20.
Run time = 0.362901318
Run time without initialization = 0.314789072
Average number of zones advanced per microsecond: 4.164
Average number of zones advanced per microsecond per rank: 4.164
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728
TinyProfiler total time across processes [min...avg...max]: 0.3629 ... 0.3629 ... 0.3629
--------------------------------------------------------------------------------------------
Name NCalls Excl. Min Excl. Avg Excl.
Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0887 0.0887 0.0887 24.43% VisMF::Read() 3 0.03978 0.03978 0.03978 10.96% MLCellLinOp::applyBC() 1946 0.03417 0.03417 0.03417 9.42% VisMF::Write(FabArray) 1 0.02962 0.02962 0.02962 8.16% MLPoisson::Fsmooth() 1440 0.02668 0.02668 0.02668 7.35% StateData::FillBoundary(geom) 160 0.01173 0.01173 0.01173 3.23% MLCGSolver::bicgstab 36 0.009986 0.009986 0.009986 2.75% Castro::normalize_species() 30 0.009655 0.009655 0.009655 2.66% MultiFab::Dot() 484 0.009287 0.009287 0.009287 2.56% Castro::computeTemp() 30 0.007162 0.007162 0.007162 1.97% FabArray::setVal() 537 0.006608 0.006608 0.006608 1.82% FillBoundary_nowait() 1766 0.006158 0.006158 0.006158 1.70% MLCellLinOp::defineAuxData() 6 0.006144 0.006144 0.006144 1.69% MultiFab::LinComb() 690 0.005935 0.005935 0.005935 1.64% FabArray::ParallelCopy_nowait() 380 0.005892 0.005892 0.005892 1.62% StateDataPhysBCFunct::() 20 0.005397 0.005397 0.005397 1.49% MLPoisson::Fapply() 500 0.004922 0.004922 0.004922 1.36% Castro::enforce_min_density() 30 0.00423 0.00423 0.00423 1.17% Gravity::fill_multipole_BCs() 6 0.004131 0.004131 0.004131 1.14% Amr::restart() 1 0.003605 0.003605 0.003605 0.99% MLMG::addInterpCorrection() 180 0.003297 0.003297 0.003297 0.91% Castro::do_advance_ctu() 5 0.003016 0.003016 0.003016 0.83% amrex::average_down 180 0.002924 0.002924 0.002924 0.81% MultiFab::Xpay() 258 0.002832 0.002832 0.002832 0.78% Castro::estTimeStep() 10 0.002438 0.002438 0.002438 0.67% BndryData::define() 6 0.002025 0.002025 0.002025 0.56% Castro::construct_new_gravity_source() 5 0.00175 0.00175 0.00175 0.48% Castro::reset_internal_energy(MultiFab) 30 0.00174 0.00174 0.00174 0.48% Amr::writePlotFile() 1 0.0017 0.0017 0.0017 0.47% Castro::construct_old_gravity_source() 5 0.001497 0.001497 0.001497 0.41% Gravity::get_old_grav_vector() 5 0.0009991 0.0009991 0.0009991 0.28% 
Castro::reset_internal_energy(Fab) 240 0.0009578 0.0009578 0.0009578 0.26% Gravity::get_new_grav_vector() 5 0.0009506 0.0009506 0.0009506 0.26% MultiFab::Saxpy() 10 0.0009166 0.0009166 0.0009166 0.25% MLMG::ResNormInf() 42 0.0008879 0.0008879 0.0008879 0.24% Castro::expand_state() 5 0.0008697 0.0008697 0.0008697 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0008671 0.0008671 0.0008671 0.24% MLCellLinOp::setLevelBC() 6 0.0008075 0.0008075 0.0008075 0.22% Gravity::actual_solve_with_mlmg() 6 0.0007924 0.0007924 0.0007924 0.22% MultiFab::Add() 36 0.0007209 0.0007209 0.0007209 0.20% MLMG::prepareForSolve() 6 0.0006495 0.0006495 0.0006495 0.18% FabArray::mult() 22 0.0006481 0.0006481 0.0006481 0.18% FabArray::setDomainBndry() 20 0.0006426 0.0006426 0.0006426 0.18% MLCellLinOp::prepareForSolve() 6 0.0006235 0.0006235 0.0006235 0.17% MultiFab::contains_nan() 10 0.000601 0.000601 0.000601 0.17% Castro::enforce_speed_limit() 30 0.0005436 0.0005436 0.0005436 0.15% MLCellLinOp::compGrad() 6 0.0004869 0.0004869 0.0004869 0.13% MLCellLinOp::smooth() 720 0.0004467 0.0004467 0.0004467 0.12% Amr::InitAmr() 1 0.000394 0.000394 0.000394 0.11% FabArray::FillBoundary() 1766 0.0003817 0.0003817 0.0003817 0.11% FabArrayBase::CPC::define() 244 0.0003799 0.0003799 0.0003799 0.10% FabArrayBase::getCPC() 632 0.0003606 0.0003606 0.0003606 0.10% FabArrayBase::getFB() 1766 0.0002549 0.0002549 0.0002549 0.07% main() 1 0.0002498 0.0002498 0.0002498 0.07% Gravity::update_max_rhs() 6 0.0002309 0.0002309 0.0002309 0.06% Gravity::solve_for_phi() 5 0.0002143 0.0002143 0.0002143 0.06% MLCellLinOp::apply() 500 0.0001825 0.0001825 0.0001825 0.05% Castro::subcycle_advance_ctu() 5 0.0001752 0.0001752 0.0001752 0.05% MultiFab::Copy() 6 0.0001746 0.0001746 0.0001746 0.05% Castro::construct_new_gravity() 5 0.0001642 0.0001642 0.0001642 0.05% Amr::coarseTimeStep() 5 0.0001626 0.0001626 0.0001626 0.04% CGSolver::sxay() 690 0.0001626 0.0001626 0.0001626 0.04% MLCellLinOp::defineBC() 6 0.0001446 
0.0001446  0.0001446  0.04%
FillPatchIterator::Initialize  20  0.0001367  0.0001367  0.0001367  0.04%
MultiFab::max()  6  0.0001347  0.0001347  0.0001347  0.04%
Castro::post_timestep()  5  0.0001193  0.0001193  0.0001193  0.03%
FabArray::ParallelCopy()  380  0.0001173  0.0001173  0.0001173  0.03%
MLCGSolver::ParallelAllReduce  659  0.0001153  0.0001153  0.0001153  0.03%
MLMG::MLRhsNormInf()  6  0.0001127  0.0001127  0.0001127  0.03%
Castro::construct_new_source()  25  0.0001106  0.0001106  0.0001106  0.03%
Castro::advance()  5  0.0001037  0.0001037  0.0001037  0.03%
MLLinOp::defineGrids()  6  0.0001023  0.0001023  0.0001023  0.03%
MLCellLinOp::correctionResidual()  216  0.000102  0.000102  0.000102  0.03%
MLMG::mgVcycle()  36  9.314e-05  9.314e-05  9.314e-05  0.03%
AmrLevel::restart()  1  8.365e-05  8.365e-05  8.365e-05  0.02%
Amr::timeStep()  5  8.092e-05  8.092e-05  8.092e-05  0.02%
StateData::restartDoit()  4  7.394e-05  7.394e-05  7.394e-05  0.02%
MLMG:computeResOfCorrection()  180  7.274e-05  7.274e-05  7.274e-05  0.02%
Castro::finalize_advance()  5  6.959e-05  6.959e-05  6.959e-05  0.02%
Castro::create_source_corrector()  5  6.446e-05  6.446e-05  6.446e-05  0.02%
FabArrayBase::FB::FB()  26  5.499e-05  5.499e-05  5.499e-05  0.02%
Castro::construct_old_source()  25  5.203e-05  5.203e-05  5.203e-05  0.01%
MLMG::mgVcycle_down::0  36  5.201e-05  5.201e-05  5.201e-05  0.01%
MLMG::mgVcycle_down::1  36  4.631e-05  4.631e-05  4.631e-05  0.01%
Castro::initialize_do_advance()  5  4.569e-05  4.569e-05  4.569e-05  0.01%
Castro::clean_state()  30  4.254e-05  4.254e-05  4.254e-05  0.01%
MLMG::mgVcycle_down::2  36  4.208e-05  4.208e-05  4.208e-05  0.01%
MLMG::actualBottomSolve()  36  4.077e-05  4.077e-05  4.077e-05  0.01%
Castro::initialize_advance()  5  4.01e-05  4.01e-05  4.01e-05  0.01%
MLMG::mgVcycle_down::4  36  3.964e-05  3.964e-05  3.964e-05  0.01%
MLMG::mgVcycle_down::3  36  3.87e-05  3.87e-05  3.87e-05  0.01%
Castro::computeNewDt()  5  3.533e-05  3.533e-05  3.533e-05  0.01%
MLMG::mgVcycle_up::4  36  3.36e-05  3.36e-05  3.36e-05  0.01%
Castro::buildMetrics()  1  3.213e-05  3.213e-05  3.213e-05  0.01%
MLMG::solve()  6  3.167e-05  3.167e-05  3.167e-05  0.01%
Gravity::actual_multilevel_solve()  1  2.929e-05  2.929e-05  2.929e-05  0.01%
Castro::construct_old_gravity()  5  2.903e-05  2.903e-05  2.903e-05  0.01%
MLMG::oneIter()  36  2.882e-05  2.882e-05  2.882e-05  0.01%
Castro::post_restart()  1  2.794e-05  2.794e-05  2.794e-05  0.01%
Castro::swap_state_time_levels()  5  2.754e-05  2.754e-05  2.754e-05  0.01%
MLMG::mgVcycle_up::0  36  2.733e-05  2.733e-05  2.733e-05  0.01%
MLMG::mgVcycle_up::3  36  2.711e-05  2.711e-05  2.711e-05  0.01%
Amr::writeSmallPlotFile()  1  2.641e-05  2.641e-05  2.641e-05  0.01%
Castro::initMFs()  1  2.522e-05  2.522e-05  2.522e-05  0.01%
Castro::do_old_sources()  5  2.507e-05  2.507e-05  2.507e-05  0.01%
MLMG::mgVcycle_up::2  36  2.503e-05  2.503e-05  2.503e-05  0.01%
MLMG::mgVcycle_up::1  36  2.394e-05  2.394e-05  2.394e-05  0.01%
MLCellLinOp::solutionResidual()  42  2.345e-05  2.345e-05  2.345e-05  0.01%
MLPoisson::define()  6  2.134e-05  2.134e-05  2.134e-05  0.01%
MLLinOp::define()  6  1.975e-05  1.975e-05  1.975e-05  0.01%
MLMG::computeResidual()  36  1.907e-05  1.907e-05  1.907e-05  0.01%
Castro::finalize_do_advance()  5  1.763e-05  1.763e-05  1.763e-05  0.00%
Gravity::multilevel_solve_for_new_phi()  1  1.754e-05  1.754e-05  1.754e-05  0.00%
FillPatchSingleLevel  20  1.741e-05  1.741e-05  1.741e-05  0.00%
MLMG::mgVcycle_bottom  36  1.523e-05  1.523e-05  1.523e-05  0.00%
makeSFC  30  1.511e-05  1.511e-05  1.511e-05  0.00%
Gravity::solve_phi_with_mlmg()  6  1.473e-05  1.473e-05  1.473e-05  0.00%
Castro::do_new_sources()  5  9.493e-06  9.493e-06  9.493e-06  0.00%
Amr::initSubcycle()  1  9.077e-06  9.077e-06  9.077e-06  0.00%
DistributionMapping::Distribute()  31  9.057e-06  9.057e-06  9.057e-06  0.00%
MLLinOp::makeAgglomeratedDMap  6  7.546e-06  7.546e-06  7.546e-06  0.00%
Castro::check_for_nan()  10  6.291e-06  6.291e-06  6.291e-06  0.00%
Castro::apply_source_to_state()  10  6.017e-06  6.017e-06  6.017e-06  0.00%
MLMG::computeMLResidual()  6  4.709e-06  4.709e-06  4.709e-06  0.00%
MLPoisson::prepareForSolve()  6  4.517e-06  4.517e-06  4.517e-06  0.00%
Gravity::swapTimeLevels()  5  4.463e-06  4.463e-06  4.463e-06  0.00%
MLMG::getGradSolution()  6  3.32e-06  3.32e-06  3.32e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.07e-06  3.07e-06  3.07e-06  0.00%
MLMG::MLResNormInf()  6  2.08e-06  2.08e-06  2.08e-06  0.00%
Gravity::set_mass_offset()  6  2.045e-06  2.045e-06  2.045e-06  0.00%
Castro::FluxRegCrseInit  5  1.831e-06  1.831e-06  1.831e-06  0.00%
Castro::retry_advance_ctu()  5  1.709e-06  1.709e-06  1.709e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.203e-06  1.203e-06  1.203e-06  0.00%
Amr::init()  1  1.141e-06  1.141e-06  1.141e-06  0.00%
Castro::FluxRegFineAdd()  5  1.126e-06  1.126e-06  1.126e-06  0.00%
AmrLevel::AmrLevel()  1  8.37e-07  8.37e-07  8.37e-07  0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.3629  0.3629  0.3629  100.00%
Amr::coarseTimeStep()  5  0.2833  0.2833  0.2833  78.06%
Amr::timeStep()  5  0.2819  0.2819  0.2819  77.67%
Castro::advance()  5  0.2769  0.2769  0.2769  76.28%
Castro::subcycle_advance_ctu()  5  0.2715  0.2715  0.2715  74.82%
Castro::do_advance_ctu()  5  0.2714  0.2714  0.2714  74.77%
Castro::construct_new_gravity()  5  0.1407  0.1407  0.1407  38.77%
Gravity::solve_phi_with_mlmg()  6  0.1365  0.1365  0.1365  37.61%
Gravity::solve_for_phi()  5  0.133  0.133  0.133  36.65%
Gravity::actual_solve_with_mlmg()  6  0.1322  0.1322  0.1322  36.43%
MLMG::solve()  6  0.1202  0.1202  0.1202  33.11%
MLMG::oneIter()  36  0.1131  0.1131  0.1131  31.15%
MLMG::mgVcycle()  36  0.1123  0.1123  0.1123  30.94%
Castro::construct_ctu_hydro_source()  5  0.08867  0.08867  0.08867  24.43%
MLCellLinOp::smooth()  720  0.05753  0.05753  0.05753  15.85%
Amr::init()  1  0.04761  0.04761  0.04761  13.12%
Amr::restart()  1  0.04761  0.04761  0.04761  13.12%
MLCellLinOp::applyBC()  1946  0.04102  0.04102  0.04102  11.30%
AmrLevel::restart()  1  0.03999  0.03999  0.03999  11.02%
StateData::restartDoit()  4  0.0399  0.0399  0.0399  10.99%
VisMF::Read()  3  0.03978  0.03978  0.03978  10.96%
MLMG::mgVcycle_bottom  36  0.03416  0.03416  0.03416  9.41%
MLMG::actualBottomSolve()  36  0.03414  0.03414  0.03414  9.41%
MLCGSolver::bicgstab  36  0.0338  0.0338  0.0338  9.31%
Amr::writePlotFile()  1  0.03132  0.03132  0.03132  8.63%
VisMF::Write(FabArray)  1  0.02962  0.02962  0.02962  8.16%
MLPoisson::Fsmooth()  1440  0.02668  0.02668  0.02668  7.35%
Castro::clean_state()  30  0.02433  0.02433  0.02433  6.70%
FillPatchIterator::Initialize  20  0.01991  0.01991  0.01991  5.49%
FillPatchSingleLevel  20  0.01913  0.01913  0.01913  5.27%
StateDataPhysBCFunct::()  20  0.01712  0.01712  0.01712  4.72%
MLCellLinOp::apply()  500  0.01546  0.01546  0.01546  4.26%
MLMG::mgVcycle_down::0  36  0.01511  0.01511  0.01511  4.16%
MLMG::mgVcycle_up::0  36  0.01294  0.01294  0.01294  3.57%
StateData::FillBoundary(geom)  160  0.01173  0.01173  0.01173  3.23%
Castro::initialize_do_advance()  5  0.01083  0.01083  0.01083  2.98%
Castro::computeTemp()  30  0.00986  0.00986  0.00986  2.72%
MLPoisson::define()  6  0.009725  0.009725  0.009725  2.68%
Castro::normalize_species()  30  0.009655  0.009655  0.009655  2.66%
MultiFab::Dot()  484  0.009287  0.009287  0.009287  2.56%
MLCellLinOp::correctionResidual()  216  0.009055  0.009055  0.009055  2.49%
MLMG:computeResOfCorrection()  180  0.007827  0.007827  0.007827  2.16%
Castro::do_new_sources()  5  0.007623  0.007623  0.007623  2.10%
Castro::construct_old_gravity()  5  0.007596  0.007596  0.007596  2.09%
Gravity::get_old_grav_vector()  5  0.007566  0.007566  0.007566  2.08%
Gravity::get_new_grav_vector()  5  0.007534  0.007534  0.007534  2.08%
MLMG::mgVcycle_down::1  36  0.00751  0.00751  0.00751  2.07%
MLMG::mgVcycle_down::2  36  0.007283  0.007283  0.007283  2.01%
MLMG::mgVcycle_down::3  36  0.006898  0.006898  0.006898  1.90%
MLCellLinOp::defineAuxData()  6  0.006861  0.006861  0.006861  1.89%
FabArray::FillBoundary()  1766  0.00685  0.00685  0.00685  1.89%
FabArray::setVal()  537  0.006608  0.006608  0.006608  1.82%
MLMG::mgVcycle_down::4  36  0.006595  0.006595  0.006595  1.82%
FillBoundary_nowait()  1766  0.006468  0.006468  0.006468  1.78%
FabArray::ParallelCopy()  380  0.00638  0.00638  0.00638  1.76%
FabArray::ParallelCopy_nowait()  380  0.006263  0.006263  0.006263  1.73%
CGSolver::sxay()  690  0.006097  0.006097  0.006097  1.68%
MultiFab::LinComb()  690  0.005935  0.005935  0.005935  1.64%
MLMG::mgVcycle_up::2  36  0.00561  0.00561  0.00561  1.55%
Castro::expand_state()  5  0.005559  0.005559  0.005559  1.53%
MLCGSolver::ParallelAllReduce  659  0.00555  0.00555  0.00555  1.53%
MLMG::mgVcycle_up::1  36  0.005523  0.005523  0.005523  1.52%
MLMG::addInterpCorrection()  180  0.005489  0.005489  0.005489  1.51%
Castro::do_old_sources()  5  0.005364  0.005364  0.005364  1.48%
MLMG::mgVcycle_up::3  36  0.00532  0.00532  0.00532  1.47%
MLMG::mgVcycle_up::4  36  0.005267  0.005267  0.005267  1.45%
Castro::initialize_advance()  5  0.005136  0.005136  0.005136  1.42%
amrex::average_down  180  0.005122  0.005122  0.005122  1.41%
Castro::post_timestep()  5  0.004951  0.004951  0.004951  1.36%
MLPoisson::Fapply()  500  0.004922  0.004922  0.004922  1.36%
Castro::enforce_min_density()  30  0.00423  0.00423  0.00423  1.17%
Gravity::fill_multipole_BCs()  6  0.004131  0.004131  0.004131  1.14%
Castro::post_restart()  1  0.003841  0.003841  0.003841  1.06%
Gravity::multilevel_solve_for_new_phi()  1  0.00372  0.00372  0.00372  1.02%
Gravity::actual_multilevel_solve()  1  0.003703  0.003703  0.003703  1.02%
MLCellLinOp::solutionResidual()  42  0.003194  0.003194  0.003194  0.88%
MLMG::prepareForSolve()  6  0.002844  0.002844  0.002844  0.78%
MultiFab::Xpay()  258  0.002832  0.002832  0.002832  0.78%
Castro::reset_internal_energy(MultiFab)  30  0.002698  0.002698  0.002698  0.74%
MLCellLinOp::defineBC()  6  0.00269  0.00269  0.00269  0.74%
MLMG::computeResidual()  36  0.002654  0.002654  0.002654  0.73%
BndryData::define()  6  0.002545  0.002545  0.002545  0.70%
Castro::estTimeStep()  10  0.002438  0.002438  0.002438  0.67%
Castro::construct_new_source()  25  0.001861  0.001861  0.001861  0.51%
Castro::construct_new_gravity_source()  5  0.00175  0.00175  0.00175  0.48%
Castro::construct_old_source()  25  0.001549  0.001549  0.001549  0.43%
Castro::construct_old_gravity_source()  5  0.001497  0.001497  0.001497  0.41%
Castro::computeNewDt()  5  0.001264  0.001264  0.001264  0.35%
Castro::reset_internal_energy(Fab)  240  0.0009578  0.0009578  0.0009578  0.26%
Castro::apply_source_to_state()  10  0.0009226  0.0009226  0.0009226  0.25%
MultiFab::Saxpy()  10  0.0009166  0.0009166  0.0009166  0.25%
MLMG::ResNormInf()  42  0.0008879  0.0008879  0.0008879  0.24%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0008671  0.0008671  0.0008671  0.24%
MLCellLinOp::setLevelBC()  6  0.0008075  0.0008075  0.0008075  0.22%
MLMG::getGradSolution()  6  0.000759  0.000759  0.000759  0.21%
MLCellLinOp::compGrad()  6  0.0007557  0.0007557  0.0007557  0.21%
FabArrayBase::getCPC()  632  0.0007405  0.0007405  0.0007405  0.20%
MultiFab::Add()  36  0.0007209  0.0007209  0.0007209  0.20%
FabArray::mult()  22  0.0006481  0.0006481  0.0006481  0.18%
FabArray::setDomainBndry()  20  0.0006426  0.0006426  0.0006426  0.18%
MLPoisson::prepareForSolve()  6  0.000628  0.000628  0.000628  0.17%
MLCellLinOp::prepareForSolve()  6  0.0006235  0.0006235  0.0006235  0.17%
Castro::check_for_nan()  10  0.0006073  0.0006073  0.0006073  0.17%
MultiFab::contains_nan()  10  0.000601  0.000601  0.000601  0.17%
MLMG::computeMLResidual()  6  0.0005636  0.0005636  0.0005636  0.16%
Castro::enforce_speed_limit()  30  0.0005436  0.0005436  0.0005436  0.15%
Gravity::update_max_rhs()  6  0.0004427  0.0004427  0.0004427  0.12%
Amr::InitAmr()  1  0.0004031  0.0004031  0.0004031  0.11%
FabArrayBase::CPC::define()  244  0.0003799  0.0003799  0.0003799  0.10%
FabArrayBase::getFB()  1766  0.0003099  0.0003099  0.0003099  0.09%
Gravity::swapTimeLevels()  5  0.000224  0.000224  0.000224  0.06%
MultiFab::Copy()  6  0.0001746  0.0001746  0.0001746  0.05%
MLLinOp::define()  6  0.0001534  0.0001534  0.0001534  0.04%
Castro::buildMetrics()  1  0.0001477  0.0001477  0.0001477  0.04%
MLMG::MLResNormInf()  6  0.000147  0.000147  0.000147  0.04%
MultiFab::max()  6  0.0001347  0.0001347  0.0001347  0.04%
MLLinOp::defineGrids()  6  0.0001336  0.0001336  0.0001336  0.04%
MLMG::MLRhsNormInf()  6  0.0001127  0.0001127  0.0001127  0.03%
Castro::finalize_advance()  5  7.255e-05  7.255e-05  7.255e-05  0.02%
Castro::create_source_corrector()  5  6.446e-05  6.446e-05  6.446e-05  0.02%
FabArrayBase::FB::FB()  26  5.499e-05  5.499e-05  5.499e-05  0.02%
MLLinOp::makeAgglomeratedDMap  6  3.011e-05  3.011e-05  3.011e-05  0.01%
Castro::swap_state_time_levels()  5  2.754e-05  2.754e-05  2.754e-05  0.01%
Amr::writeSmallPlotFile()  1  2.641e-05  2.641e-05  2.641e-05  0.01%
Castro::initMFs()  1  2.522e-05  2.522e-05  2.522e-05  0.01%
makeSFC  30  2.257e-05  2.257e-05  2.257e-05  0.01%
Castro::finalize_do_advance()  5  1.763e-05  1.763e-05  1.763e-05  0.00%
Amr::initSubcycle()  1  9.077e-06  9.077e-06  9.077e-06  0.00%
DistributionMapping::Distribute()  31  9.057e-06  9.057e-06  9.057e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  4.669e-06  4.669e-06  4.669e-06  0.00%
Gravity::set_mass_offset()  6  2.045e-06  2.045e-06  2.045e-06  0.00%
Castro::FluxRegCrseInit  5  1.831e-06  1.831e-06  1.831e-06  0.00%
Castro::retry_advance_ctu()  5  1.709e-06  1.709e-06  1.709e-06  0.00%
MLLinOp::makeSubCommunicator()  6  1.203e-06  1.203e-06  1.203e-06  0.00%
Castro::FluxRegFineAdd()  5  1.126e-06  1.126e-06  1.126e-06  0.00%
AmrLevel::AmrLevel()  1  8.37e-07  8.37e-07  8.37e-07  0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2545
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.10-20-g3082028e4287) finalized