Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.05-29-g1305eb3d364d) initialized
Starting run at 08:25:50 UTC on 2022-05-25.
Successfully read inputs file ...
Castro git describe: 22.05-32-g1070f4487
AMReX git describe: 22.05-29-g1305eb3d3
Microphysics git describe: 22.05-1-g39742967
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
  Level 0 8 grids 262144 cells 100 % of domain
  smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32
CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.043777241 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.025529601 seconds
[Level 0 step 1] ADVANCE with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.049667208
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05
[Level 0 step 2] ADVANCE with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.050841887
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05
[Level 0 step 3] ADVANCE with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.063447791
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05
[Level 0 step 4] ADVANCE with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.073201596
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05
[Level 0 step 5] ADVANCE with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.057927759
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05
CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.040930289 secs.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.059313117
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.053387874
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.063450893
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.063167137
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.064841205
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.040842135 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.024936584 seconds
Ending run at 08:25:51 UTC on 2022-05-25.
Run time = 0.828690311
Run time without initialization = 0.706566909
Average number of zones advanced per microsecond: 3.710
Average number of zones advanced per microsecond per rank: 3.710
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728

TinyProfiler total time across processes [min...avg...max]: 0.8287 ... 0.8287 ... 0.8287

--------------------------------------------------------------------------------------------
Name  NCalls  Excl. Min  Excl. Avg  Excl. Max  Max %
--------------------------------------------------------------------------------------------
Castro::construct_ctu_hydro_source()  10  0.1857  0.1857  0.1857  22.41%
VisMF::Write(FabArray)  11  0.1693  0.1693  0.1693  20.43%
MLCellLinOp::applyBC()  4433  0.08169  0.08169  0.08169  9.86%
MLPoisson::Fsmooth()  3280  0.06457  0.06457  0.06457  7.79%
StateData::FillBoundary(geom)  328  0.02487  0.02487  0.02487  3.00%
StateDataPhysBCFunct::()  41  0.02474  0.02474  0.02474  2.99%
MLCGSolver::bicgstab  82  0.02428  0.02428  0.02428  2.93%
MultiFab::Dot()  1114  0.02252  0.02252  0.02252  2.72%
Castro::normalize_species()  62  0.01857  0.01857  0.01857  2.24%
Castro::computeTemp()  63  0.01599  0.01599  0.01599  1.93%
MultiFab::LinComb()  1586  0.0146  0.0146  0.0146  1.76%
FabArray::setVal()  1144  0.01436  0.01436  0.01436  1.73%
FillBoundary_nowait()  4023  0.01413  0.01413  0.01413  1.71%
FabArray::ParallelCopy_nowait()  861  0.01326  0.01326  0.01326  1.60%
MLPoisson::Fapply()  1142  0.01187  0.01187  0.01187  1.43%
Castro::enforce_min_density()  62  0.0118  0.0118  0.0118  1.42%
MLCellLinOp::defineAuxData()  11  0.01163  0.01163  0.01163  1.40%
Gravity::fill_multipole_BCs()  11  0.008516  0.008516  0.008516  1.03%
MLMG::addInterpCorrection()  410  0.00757  0.00757  0.00757  0.91%
Castro::estTimeStep()  21  0.00753  0.00753  0.00753  0.91%
amrex::average_down  410  0.006912  0.006912  0.006912  0.83%
MultiFab::Xpay()  585  0.006649  0.006649  0.006649  0.80%
Castro::do_advance_ctu()  10  0.005347  0.005347  0.005347  0.65%
Castro::reset_internal_energy(MultiFab)  63  0.004744  0.004744  0.004744  0.57%
Amr::checkPoint()  3  0.004317  0.004317  0.004317  0.52%
BndryData::define()  11  0.003933  0.003933  0.003933  0.47%
Castro::enforce_speed_limit()  62  0.003864  0.003864  0.003864  0.47%
Castro::construct_new_gravity_source()  10  0.003145  0.003145  0.003145  0.38%
Castro::construct_old_gravity_source()  10  0.002485  0.002485  0.002485  0.30%
Amr::writePlotFile()  2  0.002476  0.002476  0.002476  0.30%
Gravity::get_new_grav_vector()  11  0.001943  0.001943  0.001943  0.23%
MLMG::ResNormInf()  93  0.001897  0.001897  0.001897  0.23%
MultiFab::Saxpy()  20  0.001808  0.001808  0.001808  0.22%
Gravity::get_old_grav_vector()  10  0.001754  0.001754  0.001754  0.21%
Castro::expand_state()  10  0.001729  0.001729  0.001729  0.21%
FabArray::setVal(val, thecmd, scomp, ncomp)  462  0.001665  0.001665  0.001665  0.20%
MLMG::oneIter()  82  0.001658  0.001658  0.001658  0.20%
Castro::reset_internal_energy(Fab)  504  0.001559  0.001559  0.001559  0.19%
MLCellLinOp::setLevelBC()  11  0.001545  0.001545  0.001545  0.19%
FabArray::mult()  43  0.00134  0.00134  0.00134  0.16%
Gravity::actual_solve_with_mlmg()  11  0.001334  0.001334  0.001334  0.16%
FabArray::setDomainBndry()  41  0.001319  0.001319  0.001319  0.16%
Castro::initData()  1  0.001276  0.001276  0.001276  0.15%
MLCellLinOp::prepareForSolve()  11  0.001175  0.001175  0.001175  0.14%
MultiFab::contains_nan()  20  0.001169  0.001169  0.001169  0.14%
MLCellLinOp::smooth()  1640  0.00109  0.00109  0.00109  0.13%
MLMG::prepareForSolve()  11  0.001037  0.001037  0.001037  0.13%
MLCellLinOp::compGrad()  11  0.0009327  0.0009327  0.0009327  0.11%
FabArrayBase::getCPC()  1323  0.0007564  0.0007564  0.0007564  0.09%
FabArray::FillBoundary()  4023  0.0007502  0.0007502  0.0007502  0.09%
FabArrayBase::CPC::define()  454  0.0006876  0.0006876  0.0006876  0.08%
FabArrayBase::getFB()  4023  0.0006157  0.0006157  0.0006157  0.07%
Amr::InitAmr()  1  0.0004822  0.0004822  0.0004822  0.06%
MLCellLinOp::apply()  1142  0.0004616  0.0004616  0.0004616  0.06%
Gravity::solve_for_phi()  10  0.0004121  0.0004121  0.0004121  0.05%
Gravity::update_max_rhs()  11  0.0004078  0.0004078  0.0004078  0.05%
CGSolver::sxay()  1586  0.0003698  0.0003698  0.0003698  0.04%
Amr::coarseTimeStep()  10  0.0003214  0.0003214  0.0003214  0.04%
FillPatchIterator::Initialize  41  0.0002824  0.0002824  0.0002824  0.03%
MLCellLinOp::defineBC()  11  0.0002782  0.0002782  0.0002782  0.03%
MLCGSolver::ParallelAllReduce  1514  0.0002702  0.0002702  0.0002702  0.03%
FabArray::ParallelCopy()  861  0.0002652  0.0002652  0.0002652  0.03%
MultiFab::Copy()  11  0.0002616  0.0002616  0.0002616  0.03%
main()  1  0.000257  0.000257  0.000257  0.03%
MultiFab::max()  11  0.0002542  0.0002542  0.0002542  0.03%
MLLinOp::defineGrids()  11  0.0002203  0.0002203  0.0002203  0.03%
MLCellLinOp::correctionResidual()  492  0.0002175  0.0002175  0.0002175  0.03%
Castro::construct_new_gravity()  10  0.0002106  0.0002106  0.0002106  0.03%
MLMG::MLRhsNormInf()  11  0.0002076  0.0002076  0.0002076  0.03%
Castro::subcycle_advance_ctu()  10  0.0002002  0.0002002  0.0002002  0.02%
MLMG::mgVcycle()  82  0.000196  0.000196  0.000196  0.02%
Amr::timeStep()  10  0.0001898  0.0001898  0.0001898  0.02%
StateData::checkPoint()  12  0.0001306  0.0001306  0.0001306  0.02%
MLMG:computeResOfCorrection()  410  0.0001256  0.0001256  0.0001256  0.02%
MLMG::actualBottomSolve()  82  9.55e-05  9.55e-05  9.55e-05  0.01%
MLMG::mgVcycle_down::0  82  9.119e-05  9.119e-05  9.119e-05  0.01%
Castro::Castro()  1  8.869e-05  8.869e-05  8.869e-05  0.01%
FabArrayBase::FB::FB()  56  8.72e-05  8.72e-05  8.72e-05  0.01%
Castro::initialize_advance()  10  8.082e-05  8.082e-05  8.082e-05  0.01%
MLMG::mgVcycle_down::1  82  7.857e-05  7.857e-05  7.857e-05  0.01%
MLMG::mgVcycle_down::2  82  7.682e-05  7.682e-05  7.682e-05  0.01%
Castro::clean_state()  62  7.48e-05  7.48e-05  7.48e-05  0.01%
MLMG::mgVcycle_down::4  82  7.413e-05  7.413e-05  7.413e-05  0.01%
MLMG::solve()  11  7.259e-05  7.259e-05  7.259e-05  0.01%
MLMG::mgVcycle_down::3  82  7.223e-05  7.223e-05  7.223e-05  0.01%
AmrLevel::checkPoint()  3  7.222e-05  7.222e-05  7.222e-05  0.01%
Castro::finalize_advance()  10  6.671e-05  6.671e-05  6.671e-05  0.01%
Castro::initialize_do_advance()  10  6.355e-05  6.355e-05  6.355e-05  0.01%
Castro::advance()  10  6.177e-05  6.177e-05  6.177e-05  0.01%
MLMG::mgVcycle_up::4  82  5.644e-05  5.644e-05  5.644e-05  0.01%
Castro::post_timestep()  10  5.391e-05  5.391e-05  5.391e-05  0.01%
MLMG::mgVcycle_up::0  82  5.124e-05  5.124e-05  5.124e-05  0.01%
MLCellLinOp::solutionResidual()  93  5.043e-05  5.043e-05  5.043e-05  0.01%
MLMG::mgVcycle_up::1  82  4.849e-05  4.849e-05  4.849e-05  0.01%
MLMG::mgVcycle_up::3  82  4.764e-05  4.764e-05  4.764e-05  0.01%
MLMG::mgVcycle_up::2  82  4.624e-05  4.624e-05  4.624e-05  0.01%
StateData::define()  4  3.884e-05  3.884e-05  3.884e-05  0.00%
Castro::swap_state_time_levels()  10  3.869e-05  3.869e-05  3.869e-05  0.00%
Castro::construct_new_source()  50  3.635e-05  3.635e-05  3.635e-05  0.00%
Castro::finalize_do_advance()  10  3.423e-05  3.423e-05  3.423e-05  0.00%
Castro::enforce_consistent_e()  1  3.392e-05  3.392e-05  3.392e-05  0.00%
MLMG::mgVcycle_bottom  82  3.288e-05  3.288e-05  3.288e-05  0.00%
Gravity::actual_multilevel_solve()  1  3.224e-05  3.224e-05  3.224e-05  0.00%
MLMG::computeResidual()  82  3.114e-05  3.114e-05  3.114e-05  0.00%
Castro::initMFs()  1  2.859e-05  2.859e-05  2.859e-05  0.00%
Gravity::solve_phi_with_mlmg()  11  2.744e-05  2.744e-05  2.744e-05  0.00%
FillPatchSingleLevel  41  2.619e-05  2.619e-05  2.619e-05  0.00%
Amr::writeSmallPlotFile()  1  2.538e-05  2.538e-05  2.538e-05  0.00%
makeSFC  55  2.464e-05  2.464e-05  2.464e-05  0.00%
Castro::buildMetrics()  1  2.385e-05  2.385e-05  2.385e-05  0.00%
MLLinOp::define()  11  2.278e-05  2.278e-05  2.278e-05  0.00%
MLPoisson::define()  11  2.184e-05  2.184e-05  2.184e-05  0.00%
Amr::FinalizeInit()  1  2.123e-05  2.123e-05  2.123e-05  0.00%
Amr::defBaseLevel()  1  2.006e-05  2.006e-05  2.006e-05  0.00%
Gravity::multilevel_solve_for_new_phi()  1  1.906e-05  1.906e-05  1.906e-05  0.00%
Castro::do_new_sources()  10  1.61e-05  1.61e-05  1.61e-05  0.00%
Castro::construct_old_source()  50  1.575e-05  1.575e-05  1.575e-05  0.00%
Castro::do_old_sources()  10  1.549e-05  1.549e-05  1.549e-05  0.00%
DistributionMapping::Distribute()  56  1.444e-05  1.444e-05  1.444e-05  0.00%
MLLinOp::makeAgglomeratedDMap  11  1.307e-05  1.307e-05  1.307e-05  0.00%
Castro::check_for_nan()  20  1.175e-05  1.175e-05  1.175e-05  0.00%
Castro::apply_source_to_state()  20  1.128e-05  1.128e-05  1.128e-05  0.00%
Castro::construct_old_gravity()  10  1.045e-05  1.045e-05  1.045e-05  0.00%
Castro::post_init()  1  8.897e-06  8.897e-06  8.897e-06  0.00%
Amr::initSubcycle()  1  8.812e-06  8.812e-06  8.812e-06  0.00%
Gravity::swapTimeLevels()  10  8.313e-06  8.313e-06  8.313e-06  0.00%
MLPoisson::prepareForSolve()  11  8.097e-06  8.097e-06  8.097e-06  0.00%
MLMG::computeMLResidual()  11  7.19e-06  7.19e-06  7.19e-06  0.00%
AmrLevel::AmrLevel(dm)  1  6.924e-06  6.924e-06  6.924e-06  0.00%
Amr::InitializeInit()  1  6.723e-06  6.723e-06  6.723e-06  0.00%
Castro::computeNewDt()  9  6.62e-06  6.62e-06  6.62e-06  0.00%
MLMG::getGradSolution()  11  5.757e-06  5.757e-06  5.757e-06  0.00%
MLMG::buildFineMask()  11  5.559e-06  5.559e-06  5.559e-06  0.00%
MLMG::MLResNormInf()  11  4.352e-06  4.352e-06  4.352e-06  0.00%
Castro::retry_advance_ctu()  10  4.2e-06  4.2e-06  4.2e-06  0.00%
AmrLevel::checkPointPost()  3  4.095e-06  4.095e-06  4.095e-06  0.00%
Castro::create_source_corrector()  10  4.006e-06  4.006e-06  4.006e-06  0.00%
Gravity::set_mass_offset()  11  3.486e-06  3.486e-06  3.486e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.323e-06  3.323e-06  3.323e-06  0.00%
Castro::FluxRegCrseInit  10  3.013e-06  3.013e-06  3.013e-06  0.00%
Castro::FluxRegFineAdd()  10  2.473e-06  2.473e-06  2.473e-06  0.00%
Castro::computeInitialDt()  2  2.425e-06  2.425e-06  2.425e-06  0.00%
Amr::init()  1  2.048e-06  2.048e-06  2.048e-06  0.00%
AmrLevel::checkPointPre()  3  1.986e-06  1.986e-06  1.986e-06  0.00%
MLLinOp::makeSubCommunicator()  11  1.857e-06  1.857e-06  1.857e-06  0.00%
Amr::initialInit()  1  1.311e-06  1.311e-06  1.311e-06  0.00%
Castro::post_regrid()  1  1.004e-06  1.004e-06  1.004e-06  0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name  NCalls  Incl. Min  Incl. Avg  Incl. Max  Max %
--------------------------------------------------------------------------------------------
main()  1  0.8287  0.8287  0.8287  100.00%
Amr::coarseTimeStep()  10  0.6814  0.6814  0.6814  82.23%
Amr::timeStep()  10  0.5954  0.5954  0.5954  71.85%
Castro::advance()  10  0.585  0.585  0.585  70.59%
Castro::subcycle_advance_ctu()  10  0.5717  0.5717  0.5717  68.99%
Castro::do_advance_ctu()  10  0.5715  0.5715  0.5715  68.96%
Gravity::solve_phi_with_mlmg()  11  0.3171  0.3171  0.3171  38.27%
Gravity::actual_solve_with_mlmg()  11  0.3084  0.3084  0.3084  37.21%
Castro::construct_new_gravity()  10  0.2884  0.2884  0.2884  34.80%
MLMG::solve()  11  0.2856  0.2856  0.2856  34.47%
Gravity::solve_for_phi()  10  0.2729  0.2729  0.2729  32.93%
MLMG::oneIter()  82  0.271  0.271  0.271  32.70%
MLMG::mgVcycle()  82  0.2693  0.2693  0.2693  32.50%
Castro::construct_ctu_hydro_source()  10  0.1857  0.1857  0.1857  22.41%
VisMF::Write(FabArray)  11  0.1693  0.1693  0.1693  20.43%
MLCellLinOp::smooth()  1640  0.1381  0.1381  0.1381  16.66%
Amr::checkPoint()  3  0.1257  0.1257  0.1257  15.17%
Amr::init()  1  0.1215  0.1215  0.1215  14.66%
AmrLevel::checkPoint()  3  0.1214  0.1214  0.1214  14.65%
StateData::checkPoint()  12  0.1213  0.1213  0.1213  14.64%
MLCellLinOp::applyBC()  4433  0.09728  0.09728  0.09728  11.74%
MLMG::mgVcycle_bottom  82  0.08284  0.08284  0.08284  10.00%
MLMG::actualBottomSolve()  82  0.08281  0.08281  0.08281  9.99%
MLCGSolver::bicgstab  82  0.08198  0.08198  0.08198  9.89%
MLPoisson::Fsmooth()  3280  0.06457  0.06457  0.06457  7.79%
Castro::clean_state()  62  0.0559  0.0559  0.0559  6.74%
FillPatchIterator::Initialize  41  0.05524  0.05524  0.05524  6.67%
FillPatchSingleLevel  41  0.05364  0.05364  0.05364  6.47%
Amr::initialInit()  1  0.05207  0.05207  0.05207  6.28%
Amr::writePlotFile()  2  0.05059  0.05059  0.05059  6.10%
StateDataPhysBCFunct::()  41  0.04961  0.04961  0.04961  5.99%
Amr::FinalizeInit()  1  0.04803  0.04803  0.04803  5.80%
Castro::post_init()  1  0.0467  0.0467  0.0467  5.63%
Gravity::multilevel_solve_for_new_phi()  1  0.04474  0.04474  0.04474  5.40%
Gravity::actual_multilevel_solve()  1  0.04472  0.04472  0.04472  5.40%
MLCellLinOp::apply()  1142  0.03669  0.03669  0.03669  4.43%
MLMG::mgVcycle_down::0  82  0.0359  0.0359  0.0359  4.33%
MLMG::mgVcycle_up::0  82  0.03087  0.03087  0.03087  3.73%
Castro::initialize_do_advance()  10  0.02612  0.02612  0.02612  3.15%
StateData::FillBoundary(geom)  328  0.02487  0.02487  0.02487  3.00%
MultiFab::Dot()  1114  0.02252  0.02252  0.02252  2.72%
Castro::computeTemp()  63  0.0223  0.0223  0.0223  2.69%
Castro::construct_old_gravity()  10  0.02223  0.02223  0.02223  2.68%
Gravity::get_old_grav_vector()  10  0.02222  0.02222  0.02222  2.68%
MLCellLinOp::correctionResidual()  492  0.0215  0.0215  0.0215  2.59%
MLMG:computeResOfCorrection()  410  0.01857  0.01857  0.01857  2.24%
Castro::normalize_species()  62  0.01857  0.01857  0.01857  2.24%
MLPoisson::define()  11  0.01846  0.01846  0.01846  2.23%
MLMG::mgVcycle_down::1  82  0.018  0.018  0.018  2.17%
MLMG::mgVcycle_down::2  82  0.01746  0.01746  0.01746  2.11%
Castro::expand_state()  10  0.01719  0.01719  0.01719  2.07%
Gravity::get_new_grav_vector()  11  0.01718  0.01718  0.01718  2.07%
MLMG::mgVcycle_down::3  82  0.01658  0.01658  0.01658  2.00%
MLMG::mgVcycle_down::4  82  0.01576  0.01576  0.01576  1.90%
FabArray::FillBoundary()  4023  0.01558  0.01558  0.01558  1.88%
CGSolver::sxay()  1586  0.01497  0.01497  0.01497  1.81%
FillBoundary_nowait()  4023  0.01483  0.01483  0.01483  1.79%
MultiFab::LinComb()  1586  0.0146  0.0146  0.0146  1.76%
FabArray::setVal()  1144  0.01436  0.01436  0.01436  1.73%
FabArray::ParallelCopy()  861  0.01434  0.01434  0.01434  1.73%
FabArray::ParallelCopy_nowait()  861  0.01408  0.01408  0.01408  1.70%
MLMG::mgVcycle_up::2  82  0.0134  0.0134  0.0134  1.62%
MLCGSolver::ParallelAllReduce  1514  0.01337  0.01337  0.01337  1.61%
Castro::initialize_advance()  10  0.01317  0.01317  0.01317  1.59%
MLMG::mgVcycle_up::1  82  0.01316  0.01316  0.01316  1.59%
MLCellLinOp::defineAuxData()  11  0.01298  0.01298  0.01298  1.57%
MLMG::addInterpCorrection()  410  0.01273  0.01273  0.01273  1.54%
MLMG::mgVcycle_up::3  82  0.01268  0.01268  0.01268  1.53%
MLMG::mgVcycle_up::4  82  0.01247  0.01247  0.01247  1.51%
Castro::do_old_sources()  10  0.01221  0.01221  0.01221  1.47%
amrex::average_down  410  0.01209  0.01209  0.01209  1.46%
Castro::do_new_sources()  10  0.01208  0.01208  0.01208  1.46%
MLPoisson::Fapply()  1142  0.01187  0.01187  0.01187  1.43%
Castro::enforce_min_density()  62  0.0118  0.0118  0.0118  1.42%
Castro::post_timestep()  10  0.01023  0.01023  0.01023  1.23%
Gravity::fill_multipole_BCs()  11  0.008516  0.008516  0.008516  1.03%
Castro::estTimeStep()  21  0.00753  0.00753  0.00753  0.91%
MLCellLinOp::solutionResidual()  93  0.007278  0.007278  0.007278  0.88%
MultiFab::Xpay()  585  0.006649  0.006649  0.006649  0.80%
Castro::reset_internal_energy(MultiFab)  63  0.006304  0.006304  0.006304  0.76%
MLMG::computeResidual()  82  0.006287  0.006287  0.006287  0.76%
MLCellLinOp::defineBC()  11  0.005161  0.005161  0.005161  0.62%
MLMG::prepareForSolve()  11  0.005151  0.005151  0.005151  0.62%
BndryData::define()  11  0.004883  0.004883  0.004883  0.59%
Amr::InitializeInit()  1  0.004042  0.004042  0.004042  0.49%
Amr::defBaseLevel()  1  0.004035  0.004035  0.004035  0.49%
Castro::enforce_speed_limit()  62  0.003864  0.003864  0.003864  0.47%
Castro::initData()  1  0.003519  0.003519  0.003519  0.42%
Castro::computeNewDt()  9  0.00329  0.00329  0.00329  0.40%
Castro::construct_new_source()  50  0.003181  0.003181  0.003181  0.38%
Castro::construct_new_gravity_source()  10  0.003145  0.003145  0.003145  0.38%
Castro::construct_old_source()  50  0.002501  0.002501  0.002501  0.30%
Castro::construct_old_gravity_source()  10  0.002485  0.002485  0.002485  0.30%
MLMG::ResNormInf()  93  0.001897  0.001897  0.001897  0.23%
Castro::apply_source_to_state()  20  0.00182  0.00182  0.00182  0.22%
MultiFab::Saxpy()  20  0.001808  0.001808  0.001808  0.22%
FabArray::setVal(val, thecmd, scomp, ncomp)  462  0.001665  0.001665  0.001665  0.20%
Castro::reset_internal_energy(Fab)  504  0.001559  0.001559  0.001559  0.19%
MLCellLinOp::setLevelBC()  11  0.001545  0.001545  0.001545  0.19%
FabArrayBase::getCPC()  1323  0.001444  0.001444  0.001444  0.17%
MLMG::getGradSolution()  11  0.00142  0.00142  0.00142  0.17%
MLCellLinOp::compGrad()  11  0.001414  0.001414  0.001414  0.17%
FabArray::mult()  43  0.00134  0.00134  0.00134  0.16%
FabArray::setDomainBndry()  41  0.001319  0.001319  0.001319  0.16%
MLPoisson::prepareForSolve()  11  0.001183  0.001183  0.001183  0.14%
Castro::check_for_nan()  20  0.001181  0.001181  0.001181  0.14%
MLCellLinOp::prepareForSolve()  11  0.001175  0.001175  0.001175  0.14%
MultiFab::contains_nan()  20  0.001169  0.001169  0.001169  0.14%
Castro::post_regrid()  1  0.001095  0.001095  0.001095  0.13%
MLMG::computeMLResidual()  11  0.001029  0.001029  0.001029  0.12%
Gravity::update_max_rhs()  11  0.0008101  0.0008101  0.0008101  0.10%
Castro::computeInitialDt()  2  0.0007113  0.0007113  0.0007113  0.09%
FabArrayBase::getFB()  4023  0.0007029  0.0007029  0.0007029  0.08%
FabArrayBase::CPC::define()  454  0.0006876  0.0006876  0.0006876  0.08%
Amr::InitAmr()  1  0.0004911  0.0004911  0.0004911  0.06%
Castro::Castro()  1  0.0004451  0.0004451  0.0004451  0.05%
Gravity::swapTimeLevels()  10  0.000444  0.000444  0.000444  0.05%
MLLinOp::define()  11  0.000296  0.000296  0.000296  0.04%
MLLinOp::defineGrids()  11  0.0002732  0.0002732  0.0002732  0.03%
MultiFab::Copy()  11  0.0002616  0.0002616  0.0002616  0.03%
MLMG::MLResNormInf()  11  0.0002584  0.0002584  0.0002584  0.03%
MultiFab::max()  11  0.0002542  0.0002542  0.0002542  0.03%
MLMG::MLRhsNormInf()  11  0.0002076  0.0002076  0.0002076  0.03%
Castro::buildMetrics()  1  0.0001613  0.0001613  0.0001613  0.02%
FabArrayBase::FB::FB()  56  8.72e-05  8.72e-05  8.72e-05  0.01%
Castro::finalize_advance()  10  7.22e-05  7.22e-05  7.22e-05  0.01%
MLLinOp::makeAgglomeratedDMap  11  5.103e-05  5.103e-05  5.103e-05  0.01%
AmrLevel::AmrLevel(dm)  1  4.577e-05  4.577e-05  4.577e-05  0.01%
StateData::define()  4  3.884e-05  3.884e-05  3.884e-05  0.00%
Castro::swap_state_time_levels()  10  3.869e-05  3.869e-05  3.869e-05  0.00%
makeSFC  55  3.796e-05  3.796e-05  3.796e-05  0.00%
Castro::finalize_do_advance()  10  3.423e-05  3.423e-05  3.423e-05  0.00%
Castro::enforce_consistent_e()  1  3.392e-05  3.392e-05  3.392e-05  0.00%
Castro::initMFs()  1  2.859e-05  2.859e-05  2.859e-05  0.00%
Amr::writeSmallPlotFile()  1  2.538e-05  2.538e-05  2.538e-05  0.00%
DistributionMapping::Distribute()  56  1.444e-05  1.444e-05  1.444e-05  0.00%
Amr::initSubcycle()  1  8.812e-06  8.812e-06  8.812e-06  0.00%
MLMG::buildFineMask()  11  5.559e-06  5.559e-06  5.559e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  4.44e-06  4.44e-06  4.44e-06  0.00%
Castro::retry_advance_ctu()  10  4.2e-06  4.2e-06  4.2e-06  0.00%
AmrLevel::checkPointPost()  3  4.095e-06  4.095e-06  4.095e-06  0.00%
Castro::create_source_corrector()  10  4.006e-06  4.006e-06  4.006e-06  0.00%
Gravity::set_mass_offset()  11  3.486e-06  3.486e-06  3.486e-06  0.00%
Castro::FluxRegCrseInit  10  3.013e-06  3.013e-06  3.013e-06  0.00%
Castro::FluxRegFineAdd()  10  2.473e-06  2.473e-06  2.473e-06  0.00%
AmrLevel::checkPointPre()  3  1.986e-06  1.986e-06  1.986e-06  0.00%
MLLinOp::makeSubCommunicator()  11  1.857e-06  1.857e-06  1.857e-06  0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free GPU global memory (MB): 2465
[The Arena] space allocated (MB): 9049
[The Arena] space used (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used (MB): 0
[The Pinned Arena] space allocated (MB): 8
[The Pinned Arena] space used (MB): 0
AMReX (22.05-29-g1305eb3d364d) finalized

Initializing CUDA...
CUDA initialized with 1 GPU
AMReX (22.05-29-g1305eb3d364d) initialized
Starting run at 08:25:51 UTC on 2022-05-25.
Successfully read inputs file ...
Castro git describe: 22.05-32-g1070f4487
AMReX git describe: 22.05-29-g1305eb3d3
Microphysics git describe: 22.05-1-g39742967
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.458174617
Restart time = 0.045859765 seconds.
[Level 0 step 6] ADVANCE with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.053260387
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.049623977
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.059864569
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.06549993
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.066088639
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.026433775 seconds
Ending run at 08:25:52 UTC on 2022-05-25.
Run time = 0.367591516
Run time without initialization = 0.321198365
Average number of zones advanced per microsecond: 4.081
Average number of zones advanced per microsecond per rank: 4.081
CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728

TinyProfiler total time across processes [min...avg...max]: 0.3676 ... 0.3676 ... 0.3676

--------------------------------------------------------------------------------------------
Name  NCalls  Excl. Min  Excl. Avg  Excl. Max  Max %
--------------------------------------------------------------------------------------------
Castro::construct_ctu_hydro_source()  5  0.0903  0.0903  0.0903  24.57%
VisMF::Read()  3  0.03854  0.03854  0.03854  10.48%
MLCellLinOp::applyBC()  1946  0.03527  0.03527  0.03527  9.59%
MLPoisson::Fsmooth()  1440  0.02757  0.02757  0.02757  7.50%
VisMF::Write(FabArray)  1  0.02494  0.02494  0.02494  6.79%
StateData::FillBoundary(geom)  160  0.01179  0.01179  0.01179  3.21%
MLCGSolver::bicgstab  36  0.01033  0.01033  0.01033  2.81%
Castro::computeTemp()  30  0.009938  0.009938  0.009938  2.70%
MultiFab::Dot()  484  0.009619  0.009619  0.009619  2.62%
Castro::normalize_species()  30  0.008349  0.008349  0.008349  2.27%
FabArray::setVal()  537  0.006821  0.006821  0.006821  1.86%
Castro::enforce_min_density()  30  0.006306  0.006306  0.006306  1.72%
MultiFab::LinComb()  690  0.006193  0.006193  0.006193  1.68%
FillBoundary_nowait()  1766  0.006174  0.006174  0.006174  1.68%
MLCellLinOp::defineAuxData()  6  0.006167  0.006167  0.006167  1.68%
FabArray::ParallelCopy_nowait()  380  0.006069  0.006069  0.006069  1.65%
StateDataPhysBCFunct::()  20  0.005605  0.005605  0.005605  1.52%
MLPoisson::Fapply()  500  0.005119  0.005119  0.005119  1.39%
Gravity::fill_multipole_BCs()  6  0.004758  0.004758  0.004758  1.29%
Castro::estTimeStep()  10  0.003709  0.003709  0.003709  1.01%
MLMG::addInterpCorrection()  180  0.003242  0.003242  0.003242  0.88%
Amr::restart()  1  0.003059  0.003059  0.003059  0.83%
amrex::average_down  180  0.003004  0.003004  0.003004  0.82%
MultiFab::Xpay()  258  0.002911  0.002911  0.002911  0.79%
Castro::do_advance_ctu()  5  0.002358  0.002358  0.002358  0.64%
BndryData::define()  6  0.002162  0.002162  0.002162  0.59%
Castro::construct_new_gravity_source()  5  0.001719  0.001719  0.001719  0.47%
Castro::reset_internal_energy(Fab)  240  0.001709  0.001709  0.001709  0.46%
Amr::writePlotFile()  1  0.001601  0.001601  0.001601  0.44%
Castro::reset_internal_energy(MultiFab)  30  0.00159  0.00159  0.00159  0.43%
Castro::construct_old_gravity_source()  5  0.001337  0.001337  0.001337  0.36%
Castro::enforce_speed_limit()  30  0.001092  0.001092  0.001092  0.30%
MultiFab::Saxpy()  10  0.0009206  0.0009206  0.0009206  0.25%
Gravity::get_old_grav_vector()  5  0.0008862  0.0008862  0.0008862  0.24%
FabArray::setVal(val, thecmd, scomp, ncomp)  252  0.0008825  0.0008825  0.0008825  0.24%
Castro::expand_state()  5  0.0008713  0.0008713  0.0008713  0.24%
Gravity::get_new_grav_vector()  5  0.0008703  0.0008703  0.0008703  0.24%
MLMG::ResNormInf()  42  0.0008359  0.0008359  0.0008359  0.23%
MLCellLinOp::setLevelBC()  6  0.0008294  0.0008294  0.0008294  0.23%
MLMG::oneIter()  36  0.0007377  0.0007377  0.0007377  0.20%
Gravity::actual_solve_with_mlmg()  6  0.0006932  0.0006932  0.0006932  0.19%
FabArray::setDomainBndry()  20  0.00068  0.00068  0.00068  0.18%
FabArray::mult()  22  0.0006434  0.0006434  0.0006434  0.18%
MLCellLinOp::prepareForSolve()  6  0.0006338  0.0006338  0.0006338  0.17%
MultiFab::contains_nan()  10  0.000598  0.000598  0.000598  0.16%
MLMG::prepareForSolve()  6  0.0005645  0.0005645  0.0005645  0.15%
MLCellLinOp::compGrad()  6  0.0004906  0.0004906  0.0004906  0.13%
MLCellLinOp::smooth()  720  0.0004521  0.0004521  0.0004521  0.12%
FabArrayBase::CPC::define()  244  0.0003901  0.0003901  0.0003901  0.11%
Amr::InitAmr()  1  0.0003778  0.0003778  0.0003778  0.10%
FabArrayBase::getCPC()  632  0.0003632  0.0003632  0.0003632  0.10%
FabArray::FillBoundary()  1766  0.0003492  0.0003492  0.0003492  0.09%
FabArrayBase::getFB()  1766  0.0002669  0.0002669  0.0002669  0.07%
main()  1  0.0002444  0.0002444  0.0002444  0.07%
Gravity::update_max_rhs()  6  0.000241  0.000241  0.000241  0.07%
Castro::subcycle_advance_ctu()  5  0.0002235  0.0002235  0.0002235  0.06%
Gravity::solve_for_phi()  5  0.0002199  0.0002199  0.0002199  0.06%
MLCellLinOp::apply()  500  0.0001881  0.0001881  0.0001881  0.05%
Amr::coarseTimeStep()  5  0.0001662  0.0001662  0.0001662  0.05%
CGSolver::sxay()  690  0.0001636  0.0001636  0.0001636  0.04%
FillPatchIterator::Initialize  20  0.0001536  0.0001536  0.0001536  0.04%
MLLinOp::defineGrids()  6  0.000151  0.000151  0.000151  0.04%
Castro::construct_new_source()  25  0.0001477  0.0001477  0.0001477  0.04%
MLCellLinOp::defineBC()  6  0.0001475  0.0001475  0.0001475  0.04%
MultiFab::Copy()  6  0.0001431  0.0001431  0.0001431  0.04%
Castro::create_source_corrector()  5  0.0001372  0.0001372  0.0001372  0.04%
MultiFab::max()  6  0.000135  0.000135  0.000135  0.04%
Castro::construct_new_gravity()  5  0.0001217  0.0001217  0.0001217  0.03%
FabArray::ParallelCopy()  380  0.0001208  0.0001208  0.0001208  0.03%
MLCGSolver::ParallelAllReduce  659  0.000117  0.000117  0.000117  0.03%
MLMG::MLRhsNormInf()  6  0.0001081  0.0001081  0.0001081  0.03%
Amr::timeStep()  5  0.0001048  0.0001048  0.0001048  0.03%
Castro::advance()  5  0.0001038  0.0001038  0.0001038  0.03%
MLCellLinOp::correctionResidual()  216  9.521e-05  9.521e-05  9.521e-05  0.03%
Castro::finalize_advance()  5  9.426e-05  9.426e-05  9.426e-05  0.03%
Castro::post_timestep()  5  9.062e-05  9.062e-05  9.062e-05  0.02%
MLMG::mgVcycle()  36  8.8e-05  8.8e-05  8.8e-05  0.02%
Castro::construct_old_source()  25  7.767e-05  7.767e-05  7.767e-05  0.02%
AmrLevel::restart()  1  7.07e-05  7.07e-05  7.07e-05  0.02%
Castro::initialize_advance()  5  6.787e-05  6.787e-05  6.787e-05  0.02%
StateData::restartDoit()  4  6.784e-05  6.784e-05  6.784e-05  0.02%
FabArrayBase::FB::FB()  26  5.819e-05  5.819e-05  5.819e-05  0.02%
Castro::initialize_do_advance()  5  5.695e-05  5.695e-05  5.695e-05  0.02%
MLMG:computeResOfCorrection()  180  5.224e-05  5.224e-05  5.224e-05  0.01%
Castro::construct_old_gravity()  5  4.983e-05  4.983e-05  4.983e-05  0.01%
MLMG::actualBottomSolve()  36  4.259e-05  4.259e-05  4.259e-05  0.01%
Castro::computeNewDt()  5  3.856e-05  3.856e-05  3.856e-05  0.01%
MLMG::mgVcycle_down::0  36  3.74e-05  3.74e-05  3.74e-05  0.01%
MLMG::mgVcycle_down::1  36  3.692e-05  3.692e-05  3.692e-05  0.01%
MLMG::mgVcycle_down::2  36  3.528e-05  3.528e-05  3.528e-05  0.01%
MLMG::mgVcycle_down::4  36  3.513e-05  3.513e-05  3.513e-05  0.01%
MLMG::solve()  6  3.474e-05  3.474e-05  3.474e-05  0.01%
Castro::clean_state()  30  3.469e-05  3.469e-05  3.469e-05  0.01%
Castro::post_restart()  1  3.459e-05  3.459e-05  3.459e-05  0.01%
Castro::buildMetrics()  1  3.223e-05  3.223e-05  3.223e-05  0.01%
MLMG::mgVcycle_down::3  36  3.136e-05  3.136e-05  3.136e-05  0.01%
Gravity::actual_multilevel_solve()  1  3.066e-05  3.066e-05  3.066e-05  0.01%
Castro::initMFs()  1  2.78e-05  2.78e-05  2.78e-05  0.01%
MLMG::mgVcycle_up::4  36  2.697e-05  2.697e-05  2.697e-05  0.01%
Amr::writeSmallPlotFile()  1  2.672e-05  2.672e-05  2.672e-05  0.01%
Castro::swap_state_time_levels()  5  2.61e-05  2.61e-05  2.61e-05  0.01%
MLMG::mgVcycle_up::0  36  2.269e-05  2.269e-05  2.269e-05  0.01%
MLCellLinOp::solutionResidual()  42  2.226e-05  2.226e-05  2.226e-05  0.01%
MLMG::mgVcycle_up::3  36  2.193e-05  2.193e-05  2.193e-05  0.01%
MLMG::mgVcycle_up::2  36  2.177e-05  2.177e-05  2.177e-05  0.01%
MLLinOp::define()  6  2.057e-05  2.057e-05  2.057e-05  0.01%
MLMG::mgVcycle_up::1  36  1.994e-05  1.994e-05  1.994e-05  0.01%
Castro::finalize_do_advance()  5  1.886e-05  1.886e-05  1.886e-05  0.01%
Gravity::multilevel_solve_for_new_phi()  1  1.73e-05  1.73e-05  1.73e-05  0.00%
MLMG::computeResidual()  36  1.491e-05  1.491e-05  1.491e-05  0.00%
MLMG::mgVcycle_bottom  36  1.465e-05  1.465e-05  1.465e-05  0.00%
Gravity::solve_phi_with_mlmg()  6  1.404e-05  1.404e-05  1.404e-05  0.00%
FillPatchSingleLevel  20  1.392e-05  1.392e-05  1.392e-05  0.00%
makeSFC  30  1.386e-05  1.386e-05  1.386e-05  0.00%
MLPoisson::define()  6  1.376e-05  1.376e-05  1.376e-05  0.00%
Castro::do_new_sources()  5  9.168e-06  9.168e-06  9.168e-06  0.00%
DistributionMapping::Distribute()  31  9.014e-06  9.014e-06  9.014e-06  0.00%
Amr::initSubcycle()  1  8.404e-06  8.404e-06  8.404e-06  0.00%
Castro::check_for_nan()  10  8.125e-06  8.125e-06  8.125e-06  0.00%
Castro::do_old_sources()  5  8.027e-06  8.027e-06  8.027e-06  0.00%
MLLinOp::makeAgglomeratedDMap  6  7.328e-06  7.328e-06  7.328e-06  0.00%
Castro::apply_source_to_state()  10  5.455e-06  5.455e-06  5.455e-06  0.00%
MLPoisson::prepareForSolve()  6  4.352e-06  4.352e-06  4.352e-06  0.00%
Gravity::swapTimeLevels()  5  4.2e-06  4.2e-06  4.2e-06  0.00%
MLMG::computeMLResidual()  6  3.266e-06  3.266e-06  3.266e-06  0.00%
DistributionMapping::SFCProcessorMapDoIt()  1  3.112e-06  3.112e-06  3.112e-06  0.00%
MLMG::buildFineMask()                        6      3.058e-06   3.058e-06   3.058e-06   0.00%
MLMG::getGradSolution()                      6      2.871e-06   2.871e-06   2.871e-06   0.00%
MLMG::MLResNormInf()                         6      2.338e-06   2.338e-06   2.338e-06   0.00%
Gravity::set_mass_offset()                   6      2.056e-06   2.056e-06   2.056e-06   0.00%
Castro::retry_advance_ctu()                  5      1.956e-06   1.956e-06   1.956e-06   0.00%
Castro::FluxRegCrseInit                      5      1.885e-06   1.885e-06   1.885e-06   0.00%
AmrLevel::AmrLevel()                         1      1.151e-06   1.151e-06   1.151e-06   0.00%
Castro::FluxRegFineAdd()                     5      1.099e-06   1.099e-06   1.099e-06   0.00%
MLLinOp::makeSubCommunicator()               6      1.048e-06   1.048e-06   1.048e-06   0.00%
Amr::init()                                  1      1.03e-06    1.03e-06    1.03e-06    0.00%
--------------------------------------------------------------------------------------------

--------------------------------------------------------------------------------------------
Name                                         NCalls Incl. Min   Incl. Avg   Incl. Max   Max %
--------------------------------------------------------------------------------------------
main()                                       1      0.3676      0.3676      0.3676      100.00%
Amr::coarseTimeStep()                        5      0.2945      0.2945      0.2945      80.11%
Amr::timeStep()                              5      0.2924      0.2924      0.2924      79.54%
Castro::advance()                            5      0.2872      0.2872      0.2872      78.14%
Castro::subcycle_advance_ctu()               5      0.2804      0.2804      0.2804      76.26%
Castro::do_advance_ctu()                     5      0.2801      0.2801      0.2801      76.20%
Castro::construct_new_gravity()              5      0.1448      0.1448      0.1448      39.40%
Gravity::solve_phi_with_mlmg()               6      0.1406      0.1406      0.1406      38.24%
Gravity::solve_for_phi()                     5      0.1371      0.1371      0.1371      37.28%
Gravity::actual_solve_with_mlmg()            6      0.1357      0.1357      0.1357      36.91%
MLMG::solve()                                6      0.1235      0.1235      0.1235      33.59%
MLMG::oneIter()                              36     0.1164      0.1164      0.1164      31.67%
MLMG::mgVcycle()                             36     0.1157      0.1157      0.1157      31.47%
Castro::construct_ctu_hydro_source()         5      0.09032     0.09032     0.09032     24.57%
MLCellLinOp::smooth()                        720    0.05925     0.05925     0.05925     16.12%
Amr::init()                                  1      0.04591     0.04591     0.04591     12.49%
Amr::restart()                               1      0.0459      0.0459      0.0459      12.49%
MLCellLinOp::applyBC()                       1946   0.04212     0.04212     0.04212     11.46%
AmrLevel::restart()                          1      0.03875     0.03875     0.03875     10.54%
StateData::restartDoit()                     4      0.03867     0.03867     0.03867     10.52%
VisMF::Read()                                3      0.03854     0.03854     0.03854     10.48%
MLMG::mgVcycle_bottom                        36     0.03545     0.03545     0.03545     9.64%
MLMG::actualBottomSolve()                    36     0.03544     0.03544     0.03544     9.64%
MLCGSolver::bicgstab                         36     0.03508     0.03508     0.03508     9.54%
Castro::clean_state()                        30     0.02902     0.02902     0.02902     7.89%
MLPoisson::Fsmooth()                         1440   0.02757     0.02757     0.02757     7.50%
Amr::writePlotFile()                         1      0.02654     0.02654     0.02654     7.22%
VisMF::Write(FabArray)                       1      0.02494     0.02494     0.02494     6.79%
FillPatchIterator::Initialize                20     0.02026     0.02026     0.02026     5.51%
FillPatchSingleLevel                         20     0.01942     0.01942     0.01942     5.28%
StateDataPhysBCFunct::()                     20     0.0174      0.0174      0.0174      4.73%
MLCellLinOp::apply()                         500    0.01594     0.01594     0.01594     4.34%
MLMG::mgVcycle_down::0                       36     0.01549     0.01549     0.01549     4.21%
MLMG::mgVcycle_up::0                         36     0.01327     0.01327     0.01327     3.61%
Castro::computeTemp()                        30     0.01324     0.01324     0.01324     3.60%
StateData::FillBoundary(geom)                160    0.01179     0.01179     0.01179     3.21%
Castro::initialize_do_advance()              5      0.01092     0.01092     0.01092     2.97%
MLPoisson::define()                          6      0.009948    0.009948    0.009948    2.71%
MultiFab::Dot()                              484    0.009619    0.009619    0.009619    2.62%
MLCellLinOp::correctionResidual()            216    0.009316    0.009316    0.009316    2.53%
Castro::normalize_species()                  30     0.008349    0.008349    0.008349    2.27%
MLMG:computeResOfCorrection()                180    0.008029    0.008029    0.008029    2.18%
MLMG::mgVcycle_down::1                       36     0.007761    0.007761    0.007761    2.11%
Gravity::get_new_grav_vector()               5      0.007664    0.007664    0.007664    2.08%
MLMG::mgVcycle_down::2                       36     0.00752     0.00752     0.00752     2.05%
Castro::construct_old_gravity()              5      0.007442    0.007442    0.007442    2.02%
Gravity::get_old_grav_vector()               5      0.007393    0.007393    0.007393    2.01%
MLMG::mgVcycle_down::3                       36     0.007134    0.007134    0.007134    1.94%
MLCellLinOp::defineAuxData()                 6      0.006889    0.006889    0.006889    1.87%
FabArray::FillBoundary()                     1766   0.006849    0.006849    0.006849    1.86%
FabArray::setVal()                           537    0.006821    0.006821    0.006821    1.86%
MLMG::mgVcycle_down::4                       36     0.0068      0.0068      0.0068      1.85%
Castro::do_old_sources()                     5      0.00674     0.00674     0.00674     1.83%
Castro::initialize_advance()                 5      0.00668     0.00668     0.00668     1.82%
FabArray::ParallelCopy()                     380    0.006568    0.006568    0.006568    1.79%
FillBoundary_nowait()                        1766   0.006499    0.006499    0.006499    1.77%
Castro::do_new_sources()                     5      0.006482    0.006482    0.006482    1.76%
FabArray::ParallelCopy_nowait()              380    0.006447    0.006447    0.006447    1.75%
CGSolver::sxay()                             690    0.006357    0.006357    0.006357    1.73%
Castro::enforce_min_density()                30     0.006306    0.006306    0.006306    1.72%
MultiFab::LinComb()                          690    0.006193    0.006193    0.006193    1.68%
MLCGSolver::ParallelAllReduce                659    0.005745    0.005745    0.005745    1.56%
MLMG::mgVcycle_up::2                         36     0.005738    0.005738    0.005738    1.56%
Castro::expand_state()                       5      0.005724    0.005724    0.005724    1.56%
MLMG::mgVcycle_up::1                         36     0.005648    0.005648    0.005648    1.54%
MLMG::addInterpCorrection()                  180    0.005506    0.005506    0.005506    1.50%
MLMG::mgVcycle_up::3                         36     0.005416    0.005416    0.005416    1.47%
MLMG::mgVcycle_up::4                         36     0.005384    0.005384    0.005384    1.46%
amrex::average_down                          180    0.005296    0.005296    0.005296    1.44%
MLPoisson::Fapply()                          500    0.005119    0.005119    0.005119    1.39%
Castro::post_timestep()                      5      0.005057    0.005057    0.005057    1.38%
Gravity::fill_multipole_BCs()                6      0.004758    0.004758    0.004758    1.29%
Castro::post_restart()                       1      0.003915    0.003915    0.003915    1.07%
Gravity::multilevel_solve_for_new_phi()      1      0.003785    0.003785    0.003785    1.03%
Gravity::actual_multilevel_solve()           1      0.003768    0.003768    0.003768    1.02%
Castro::estTimeStep()                        10     0.003709    0.003709    0.003709    1.01%
Castro::reset_internal_energy(MultiFab)      30     0.003299    0.003299    0.003299    0.90%
MLCellLinOp::solutionResidual()              42     0.003234    0.003234    0.003234    0.88%
MultiFab::Xpay()                             258    0.002911    0.002911    0.002911    0.79%
MLCellLinOp::defineBC()                      6      0.002845    0.002845    0.002845    0.77%
MLMG::prepareForSolve()                      6      0.002801    0.002801    0.002801    0.76%
BndryData::define()                          6      0.002698    0.002698    0.002698    0.73%
MLMG::computeResidual()                      36     0.00269     0.00269     0.00269     0.73%
Castro::computeNewDt()                       5      0.001929    0.001929    0.001929    0.52%
Castro::construct_new_source()               25     0.001867    0.001867    0.001867    0.51%
Castro::construct_new_gravity_source()       5      0.001719    0.001719    0.001719    0.47%
Castro::reset_internal_energy(Fab)           240    0.001709    0.001709    0.001709    0.46%
Castro::construct_old_source()               25     0.001415    0.001415    0.001415    0.38%
Castro::construct_old_gravity_source()       5      0.001337    0.001337    0.001337    0.36%
Castro::enforce_speed_limit()                30     0.001092    0.001092    0.001092    0.30%
Castro::apply_source_to_state()              10     0.000926    0.000926    0.000926    0.25%
MultiFab::Saxpy()                            10     0.0009206   0.0009206   0.0009206   0.25%
FabArray::setVal(val, thecmd, scomp, ncomp)  252    0.0008825   0.0008825   0.0008825   0.24%
MLMG::ResNormInf()                           42     0.0008359   0.0008359   0.0008359   0.23%
MLCellLinOp::setLevelBC()                    6      0.0008294   0.0008294   0.0008294   0.23%
MLMG::getGradSolution()                      6      0.0007611   0.0007611   0.0007611   0.21%
MLCellLinOp::compGrad()                      6      0.0007582   0.0007582   0.0007582   0.21%
FabArrayBase::getCPC()                       632    0.0007533   0.0007533   0.0007533   0.20%
FabArray::setDomainBndry()                   20     0.00068     0.00068     0.00068     0.18%
FabArray::mult()                             22     0.0006434   0.0006434   0.0006434   0.18%
MLPoisson::prepareForSolve()                 6      0.0006382   0.0006382   0.0006382   0.17%
MLCellLinOp::prepareForSolve()               6      0.0006338   0.0006338   0.0006338   0.17%
Castro::check_for_nan()                      10     0.0006062   0.0006062   0.0006062   0.16%
MultiFab::contains_nan()                     10     0.000598    0.000598    0.000598    0.16%
MLMG::computeMLResidual()                    6      0.0005624   0.0005624   0.0005624   0.15%
Gravity::update_max_rhs()                    6      0.0004555   0.0004555   0.0004555   0.12%
FabArrayBase::CPC::define()                  244    0.0003901   0.0003901   0.0003901   0.11%
Amr::InitAmr()                               1      0.0003862   0.0003862   0.0003862   0.11%
FabArrayBase::getFB()                        1766   0.0003251   0.0003251   0.0003251   0.09%
Gravity::swapTimeLevels()                    5      0.0002338   0.0002338   0.0002338   0.06%
MLLinOp::define()                            6      0.0002008   0.0002008   0.0002008   0.05%
MLLinOp::defineGrids()                       6      0.0001803   0.0001803   0.0001803   0.05%
Castro::buildMetrics()                       1      0.0001539   0.0001539   0.0001539   0.04%
MultiFab::Copy()                             6      0.0001431   0.0001431   0.0001431   0.04%
Castro::create_source_corrector()            5      0.0001372   0.0001372   0.0001372   0.04%
MultiFab::max()                              6      0.000135    0.000135    0.000135    0.04%
MLMG::MLResNormInf()                         6      0.0001339   0.0001339   0.0001339   0.04%
MLMG::MLRhsNormInf()                         6      0.0001081   0.0001081   0.0001081   0.03%
Castro::finalize_advance()                   5      9.724e-05   9.724e-05   9.724e-05   0.03%
FabArrayBase::FB::FB()                       26     5.819e-05   5.819e-05   5.819e-05   0.02%
MLLinOp::makeAgglomeratedDMap                6      2.826e-05   2.826e-05   2.826e-05   0.01%
Castro::initMFs()                            1      2.78e-05    2.78e-05    2.78e-05    0.01%
Amr::writeSmallPlotFile()                    1      2.672e-05   2.672e-05   2.672e-05   0.01%
Castro::swap_state_time_levels()             5      2.61e-05    2.61e-05    2.61e-05    0.01%
makeSFC                                      30     2.093e-05   2.093e-05   2.093e-05   0.01%
Castro::finalize_do_advance()                5      1.886e-05   1.886e-05   1.886e-05   0.01%
DistributionMapping::Distribute()            31     9.014e-06   9.014e-06   9.014e-06   0.00%
Amr::initSubcycle()                          1      8.404e-06   8.404e-06   8.404e-06   0.00%
DistributionMapping::SFCProcessorMapDoIt()   1      5.054e-06   5.054e-06   5.054e-06   0.00%
MLMG::buildFineMask()                        6      3.058e-06   3.058e-06   3.058e-06   0.00%
Gravity::set_mass_offset()                   6      2.056e-06   2.056e-06   2.056e-06   0.00%
Castro::retry_advance_ctu()                  5      1.956e-06   1.956e-06   1.956e-06   0.00%
Castro::FluxRegCrseInit                      5      1.885e-06   1.885e-06   1.885e-06   0.00%
AmrLevel::AmrLevel()                         1      1.151e-06   1.151e-06   1.151e-06   0.00%
Castro::FluxRegFineAdd()                     5      1.099e-06   1.099e-06   1.099e-06   0.00%
MLLinOp::makeSubCommunicator()               6      1.048e-06   1.048e-06   1.048e-06   0.00%
--------------------------------------------------------------------------------------------

Unused ParmParse Variables:
  [TOP]::amr.ref_ratio(nvals = 4)  :: [2, 2, 2, 2]
  [TOP]::amr.regrid_int(nvals = 4)  :: [2, 2, 2, 2]

Total GPU global memory (MB): 12066
Free  GPU global memory (MB): 2465
[The         Arena] space allocated (MB): 9049
[The         Arena] space used      (MB): 0
[The Managed Arena] space allocated (MB): 8
[The Managed Arena] space used      (MB): 0
[The  Pinned Arena] space allocated (MB): 8
[The  Pinned Arena] space used      (MB): 0
AMReX (22.05-29-g1305eb3d364d) finalized