Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-17-gd5ddf3b22e94) initialized Starting run at 10:08:11 UTC on 2023-01-13. Successfully read inputs file ... Castro git describe: 23.01-14-g78178b78e AMReX git describe: 23.01-17-gd5ddf3b22 Microphysics git describe: 23.01-3-g1e475055 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.058597441 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.033763061 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.04611614 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.047635654 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.049061639 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.053170126 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.059345974 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.05852022 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.07381944 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.068812803 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.065738264 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.059069951 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.061207575 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.05735355 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.033472884 seconds Ending run at 10:08:12 UTC on 2023-01-13. Run time = 0.874658464 Run time without initialization = 0.733964741 Average number of zones advanced per microsecond: 3.572 Average number of zones advanced per microsecond per rank: 3.572 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.8747 ... 0.8747 ... 0.8747 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- VisMF::Write(FabArray) 11 0.2349 0.2349 0.2349 26.85% Castro::construct_ctu_hydro_source() 10 0.2103 0.2103 0.2103 24.04% MLCellLinOp::applyBC() 4433 0.07536 0.07536 0.07536 8.62% MLPoisson::Fsmooth() 3280 0.03233 0.03233 0.03233 3.70% FillBoundary_nowait() 4023 0.03121 0.03121 0.03121 3.57% StateDataPhysBCFunct::() 41 0.02434 0.02434 0.02434 2.78% StateData::FillBoundary(geom) 328 0.02352 0.02352 0.02352 2.69% amrex::Dot() 1114 0.02043 0.02043 0.02043 2.34% Castro::normalize_species() 62 0.01541 0.01541 0.01541 1.76% Castro::enforce_min_density() 62 0.01504 0.01504 0.01504 1.72% amrex::Copy() 1029 0.01475 0.01475 0.01475 1.69% Castro::computeTemp() 63 0.01465 0.01465 0.01465 1.67% FabArray::norminf() 743 0.01431 0.01431 0.01431 1.64% FabArray::ParallelCopy_nowait() 861 0.01308 0.01308 0.01308 1.50% FabArray::setVal() 1144 0.01308 0.01308 0.01308 1.49% MLPoisson::Fapply() 1142 0.01041 0.01041 0.01041 1.19% MLCellLinOp::defineAuxData() 11 0.009579 0.009579 0.009579 1.10% FabArray::Saxpy() 813 0.008091 0.008091 0.008091 0.92% FabArray::Xpay() 821 0.008065 0.008065 0.008065 0.92% Gravity::fill_multipole_BCs() 11 0.006675 0.006675 0.006675 0.76% MLMG::addInterpCorrection() 410 0.00657 0.00657 0.00657 0.75% amrex::average_down 410 0.005848 0.005848 0.005848 0.67% Castro::estTimeStep() 21 0.004851 0.004851 0.004851 0.55% FabArray::LinComb() 557 0.004478 0.004478 0.004478 0.51% Castro::reset_internal_energy(MultiFab) 63 0.004412 0.004412 0.004412 0.50% amrex::Add() 164 0.004301 0.004301 0.004301 0.49% Amr::checkPoint() 3 0.003976 0.003976 0.003976 0.45% BndryData::define() 11 0.003719 0.003719 0.003719 0.43% Castro::do_advance_ctu() 10 0.003655 0.003655 0.003655 0.42% Castro::construct_new_gravity_source() 10 0.00328 0.00328 0.00328 0.37% Castro::construct_old_gravity_source() 10 0.002718 0.002718 0.002718 0.31% Amr::writePlotFile() 2 0.00237 0.00237 0.00237 0.27% MLCGSolver::bicgstab 82 0.002095 0.002095 0.002095 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001643 0.001643 0.001643 0.19% Castro::reset_internal_energy(Fab) 504 0.001478 0.001478 0.001478 0.17% Gravity::actual_solve_with_mlmg() 11 0.001397 0.001397 0.001397 0.16% MLCellLinOp::setLevelBC() 11 0.001383 0.001383 0.001383 0.16% FabArray::mult() 43 0.001337 0.001337 0.001337 0.15% FabArray::setDomainBndry() 41 0.001306 0.001306 0.001306 0.15% Castro::initData() 1 0.001275 0.001275 0.001275 0.15% MultiFab::contains_nan() 20 0.001171 0.001171 0.001171 0.13% MLCellLinOp::prepareForSolve() 11 0.001101 0.001101 0.001101 0.13% MLCellLinOp::smooth() 1640 0.001094 0.001094 0.001094 0.13% Castro::enforce_speed_limit() 62 0.001077 0.001077 0.001077 0.12% MLCellLinOp::compGrad() 11 0.0009098 0.0009098 0.0009098 0.10% MLMG::prepareForSolve() 11 0.0008325 0.0008325 0.0008325 0.10% FabArray::FillBoundary() 4023 0.0008088 0.0008088 0.0008088 0.09% FabArrayBase::getCPC() 1323 0.0007291 0.0007291 0.0007291 0.08% FabArrayBase::CPC::define() 454 0.0006786 0.0006786 0.0006786 0.08% FabArrayBase::getFB() 4023 0.0006106 0.0006106 0.0006106 0.07% Gravity::get_new_grav_vector() 11 0.0006039 0.0006039 0.0006039 0.07% Gravity::get_old_grav_vector() 10 0.0005417 0.0005417 0.0005417 0.06% Amr::InitAmr() 1 0.0005211 0.0005211 0.0005211 0.06% MLCellLinOp::apply() 1142 0.0004331 0.0004331 0.0004331 0.05% MLMG::mgVcycle() 82 0.0004042 0.0004042 0.0004042 0.05% Amr::coarseTimeStep() 10 0.0003427 0.0003427 0.0003427 0.04% main() 1 0.0003003 0.0003003 0.0003003 0.03% MLCGSolver::ParallelAllReduce 1514 0.0002803 0.0002803 0.0002803 0.03% MultiFab::max() 11 0.0002561 0.0002561 0.0002561 0.03% FabArray::ParallelCopy() 861 0.0002401 0.0002401 0.0002401 0.03% Castro::construct_new_source() 50 0.0002332 0.0002332 0.0002332 0.03% MLCellLinOp::correctionResidual() 492 0.0002303 0.0002303 0.0002303 0.03% MLCellLinOp::defineBC() 11 0.0002057 0.0002057 0.0002057 0.02% FillPatchIterator::Initialize 41 0.0002047 0.0002047 0.0002047 0.02% MLLinOp::defineGrids() 11 0.0001699 0.0001699 0.0001699 0.02% Amr::timeStep() 10 0.000159 0.000159 0.000159 0.02% Castro::subcycle_advance_ctu() 10 0.0001505 0.0001505 0.0001505 0.02% Gravity::solve_for_phi() 10 0.0001356 0.0001356 0.0001356 0.02% StateData::checkPoint() 12 0.0001346 0.0001346 0.0001346 0.02% MLMG:computeResOfCorrection() 410 0.0001138 0.0001138 0.0001138 0.01% Gravity::update_max_rhs() 11 0.0001095 0.0001095 0.0001095 0.01% MLMG::mgVcycle_down::0 82 9.754e-05 9.754e-05 9.754e-05 0.01% AmrLevel::checkPoint() 3 9.13e-05 9.13e-05 9.13e-05 0.01% MLMG::actualBottomSolve() 82 8.883e-05 8.883e-05 8.883e-05 0.01% FabArrayBase::FB::FB() 56 8.557e-05 8.557e-05 8.557e-05 0.01% Castro::Castro() 1 8.293e-05 8.293e-05 8.293e-05 0.01% MLMG::mgVcycle_down::1 82 8.189e-05 8.189e-05 8.189e-05 0.01% Castro::clean_state() 62 8.164e-05 8.164e-05 8.164e-05 0.01% MLMG::mgVcycle_down::2 82 7.903e-05 7.903e-05 7.903e-05 0.01% Castro::expand_state() 10 7.519e-05 7.519e-05 7.519e-05 0.01% MLMG::mgVcycle_down::4 82 7.399e-05 7.399e-05 7.399e-05 0.01% MLMG::mgVcycle_down::3 82 7.387e-05 7.387e-05 7.387e-05 0.01% MLMG::solve() 11 7.323e-05 7.323e-05 7.323e-05 0.01% Castro::initialize_advance() 10 6.32e-05 6.32e-05 6.32e-05 0.01% Castro::finalize_advance() 10 5.983e-05 5.983e-05 5.983e-05 0.01% MLMG::mgVcycle_up::4 82 5.853e-05 5.853e-05 5.853e-05 0.01% MLMG::oneIter() 82 5.529e-05 5.529e-05 5.529e-05 0.01% MLMG::mgVcycle_up::0 82 4.991e-05 4.991e-05 4.991e-05 0.01% MLCellLinOp::solutionResidual() 93 4.874e-05 4.874e-05 4.874e-05 0.01% Amr::InitializeInit() 1 4.87e-05 4.87e-05 4.87e-05 0.01% MLMG::mgVcycle_up::1 82 4.716e-05 4.716e-05 4.716e-05 0.01% MLMG::mgVcycle_up::3 82 4.715e-05 4.715e-05 4.715e-05 0.01% MLMG::mgVcycle_up::2 82 4.579e-05 4.579e-05 4.579e-05 0.01% Castro::initialize_do_advance() 10 4.232e-05 4.232e-05 4.232e-05 0.00% Castro::advance() 10 4.002e-05 4.002e-05 4.002e-05 0.00% Castro::finalize_do_advance() 10 3.478e-05 3.478e-05 3.478e-05 0.00% Castro::enforce_consistent_e() 1 3.448e-05 3.448e-05 3.448e-05 0.00% Castro::swap_state_time_levels() 10 3.284e-05 3.284e-05 3.284e-05 0.00% MLMG::ResNormInf() 93 3.162e-05 3.162e-05 3.162e-05 0.00% MLMG::mgVcycle_bottom 82 2.957e-05 2.957e-05 2.957e-05 0.00% StateData::define() 4 2.878e-05 2.878e-05 2.878e-05 0.00% FillPatchSingleLevel 41 2.872e-05 2.872e-05 2.872e-05 0.00% MLMG::computeResidual() 82 2.757e-05 2.757e-05 2.757e-05 0.00% Amr::writeSmallPlotFile() 1 2.504e-05 2.504e-05 2.504e-05 0.00% makeSFC 55 2.476e-05 2.476e-05 2.476e-05 0.00% Castro::construct_new_gravity() 10 2.336e-05 2.336e-05 2.336e-05 0.00% MLPoisson::define() 11 2.221e-05 2.221e-05 2.221e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.204e-05 2.204e-05 2.204e-05 0.00% Amr::defBaseLevel() 1 2.005e-05 2.005e-05 2.005e-05 0.00% Castro::initMFs() 1 1.989e-05 1.989e-05 1.989e-05 0.00% Amr::FinalizeInit() 1 1.964e-05 1.964e-05 1.964e-05 0.00% Castro::do_new_sources() 10 1.796e-05 1.796e-05 1.796e-05 0.00% Castro::construct_old_source() 50 1.775e-05 1.775e-05 1.775e-05 0.00% Castro::buildMetrics() 1 1.694e-05 1.694e-05 1.694e-05 0.00% DistributionMapping::Distribute() 56 1.576e-05 1.576e-05 1.576e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.541e-05 1.541e-05 1.541e-05 0.00% Castro::do_old_sources() 10 1.521e-05 1.521e-05 1.521e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.521e-05 1.521e-05 1.521e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.379e-05 1.379e-05 1.379e-05 0.00% Castro::check_for_nan() 20 1.118e-05 1.118e-05 1.118e-05 0.00% Castro::construct_old_gravity() 10 1.058e-05 1.058e-05 1.058e-05 0.00% Castro::apply_source_to_state() 20 9.893e-06 9.893e-06 9.893e-06 0.00% Castro::post_init() 1 9.748e-06 9.748e-06 9.748e-06 0.00% MLLinOp::define() 11 9.152e-06 9.152e-06 9.152e-06 0.00% Castro::post_timestep() 10 9.052e-06 9.052e-06 9.052e-06 0.00% Gravity::swapTimeLevels() 10 8.979e-06 8.979e-06 8.979e-06 0.00% Amr::initSubcycle() 1 8.888e-06 8.888e-06 8.888e-06 0.00% MLPoisson::prepareForSolve() 11 7.852e-06 7.852e-06 7.852e-06 0.00% MLMG::computeMLResidual() 11 7.565e-06 7.565e-06 7.565e-06 0.00% Gravity::actual_multilevel_solve() 1 7.052e-06 7.052e-06 7.052e-06 0.00% Castro::computeNewDt() 9 6.621e-06 6.621e-06 6.621e-06 0.00% MLMG::getGradSolution() 11 5.853e-06 5.853e-06 5.853e-06 0.00% AmrLevel::checkPointPost() 3 5.592e-06 5.592e-06 5.592e-06 0.00% Castro::create_source_corrector() 10 5.209e-06 5.209e-06 5.209e-06 0.00% MLMG::MLResNormInf() 11 3.936e-06 3.936e-06 3.936e-06 0.00% MLMG::MLRhsNormInf() 11 3.914e-06 3.914e-06 3.914e-06 0.00% Castro::retry_advance_ctu() 10 3.729e-06 3.729e-06 3.729e-06 0.00% Gravity::set_mass_offset() 11 3.536e-06 3.536e-06 3.536e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.837e-06 2.837e-06 2.837e-06 0.00% Castro::FluxRegFineAdd() 10 2.735e-06 2.735e-06 2.735e-06 0.00% Castro::computeInitialDt() 2 2.714e-06 2.714e-06 2.714e-06 0.00% Amr::init() 1 2.606e-06 2.606e-06 2.606e-06 0.00% AmrLevel::checkPointPre() 3 2.529e-06 2.529e-06 2.529e-06 0.00% Castro::FluxRegCrseInit 10 2.524e-06 2.524e-06 2.524e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.979e-06 1.979e-06 1.979e-06 0.00% Castro::post_regrid() 1 1.136e-06 1.136e-06 1.136e-06 0.00% Amr::initialInit() 1 9.33e-07 9.33e-07 9.33e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.8747 0.8747 0.8747 100.00% Amr::coarseTimeStep() 10 0.7003 0.7003 0.7003 80.06% Amr::timeStep() 10 0.5809 0.5809 0.5809 66.41% Castro::advance() 10 0.5717 0.5717 0.5717 65.36% Castro::subcycle_advance_ctu() 10 0.559 0.559 0.559 63.91% Castro::do_advance_ctu() 10 0.5589 0.5589 0.5589 63.89% Gravity::solve_phi_with_mlmg() 11 0.2807 0.2807 0.2807 32.09% Gravity::actual_solve_with_mlmg() 11 0.2736 0.2736 0.2736 31.28% Castro::construct_new_gravity() 10 0.2559 0.2559 0.2559 29.26% MLMG::solve() 11 0.2534 0.2534 0.2534 28.97% Gravity::solve_for_phi() 10 0.2407 0.2407 0.2407 27.52% MLMG::oneIter() 82 0.2392 0.2392 0.2392 27.35% MLMG::mgVcycle() 82 0.2357 0.2357 0.2357 26.94% VisMF::Write(FabArray) 11 0.2349 0.2349 0.2349 26.85% Castro::construct_ctu_hydro_source() 10 0.2103 0.2103 0.2103 24.04% Amr::checkPoint() 3 0.1746 0.1746 0.1746 19.96% AmrLevel::checkPoint() 3 0.1706 0.1706 0.1706 19.51% StateData::checkPoint() 12 0.1705 0.1705 0.1705 19.49% Amr::init() 1 0.14 0.14 0.14 16.01% MLCellLinOp::smooth() 1640 0.1159 0.1159 0.1159 13.25% MLCellLinOp::applyBC() 4433 0.1081 0.1081 0.1081 12.36% MLMG::mgVcycle_bottom 82 0.07304 0.07304 0.07304 8.35% MLMG::actualBottomSolve() 82 0.07301 0.07301 0.07301 8.35% MLCGSolver::bicgstab 82 0.07231 0.07231 0.07231 8.27% Amr::writePlotFile() 2 0.06737 0.06737 0.06737 7.70% FillPatchIterator::Initialize 41 0.05337 0.05337 0.05337 6.10% FillPatchSingleLevel 41 0.05186 0.05186 0.05186 5.93% Castro::clean_state() 62 0.05139 0.05139 0.05139 5.88% StateDataPhysBCFunct::() 41 0.04786 0.04786 0.04786 5.47% Amr::initialInit() 1 0.04754 0.04754 0.04754 5.43% Amr::FinalizeInit() 1 0.0436 0.0436 0.0436 4.98% Castro::post_init() 1 0.04222 0.04222 0.04222 4.83% Gravity::multilevel_solve_for_new_phi() 1 0.04042 0.04042 0.04042 4.62% Gravity::actual_multilevel_solve() 1 0.04041 0.04041 0.04041 4.62% MLCellLinOp::apply() 1142 0.03596 0.03596 0.03596 4.11% MLMG::mgVcycle_down::0 82 0.03358 0.03358 0.03358 3.84% FabArray::FillBoundary() 4023 0.03272 0.03272 0.03272 3.74% Castro::initialize_do_advance() 10 0.03247 0.03247 0.03247 3.71% MLPoisson::Fsmooth() 3280 0.03233 0.03233 0.03233 3.70% FillBoundary_nowait() 4023 0.03191 0.03191 0.03191 3.65% MLMG::mgVcycle_up::0 82 0.02551 0.02551 0.02551 2.92% StateData::FillBoundary(geom) 328 0.02352 0.02352 0.02352 2.69% Castro::expand_state() 10 0.02318 0.02318 0.02318 2.65% MLCellLinOp::correctionResidual() 492 0.02207 0.02207 0.02207 2.52% Castro::computeTemp() 63 0.02054 0.02054 0.02054 2.35% amrex::Dot() 1114 0.02043 0.02043 0.02043 2.34% MLMG:computeResOfCorrection() 410 0.01948 0.01948 0.01948 2.23% Gravity::get_new_grav_vector() 11 0.01669 0.01669 0.01669 1.91% MLPoisson::define() 11 0.01601 0.01601 0.01601 1.83% MLMG::mgVcycle_down::1 82 0.01564 0.01564 0.01564 1.79% Castro::normalize_species() 62 0.01541 0.01541 0.01541 1.76% Castro::enforce_min_density() 62 0.01504 0.01504 0.01504 1.72% Castro::construct_old_gravity() 10 0.01484 0.01484 0.01484 1.70% Gravity::get_old_grav_vector() 10 0.01483 0.01483 0.01483 1.69% amrex::Copy() 1029 0.01475 0.01475 0.01475 1.69% MLMG::mgVcycle_down::2 82 0.01459 0.01459 0.01459 1.67% FabArray::norminf() 743 0.01431 0.01431 0.01431 1.64% MLMG::mgVcycle_down::3 82 0.01424 0.01424 0.01424 1.63% FabArray::ParallelCopy() 861 0.01412 0.01412 0.01412 1.61% MLMG::mgVcycle_down::4 82 0.01401 0.01401 0.01401 1.60% FabArray::ParallelCopy_nowait() 861 0.01388 0.01388 0.01388 1.59% FabArray::setVal() 1144 0.01308 0.01308 0.01308 1.49% Castro::do_new_sources() 10 0.0127 0.0127 0.0127 1.45% MLCGSolver::ParallelAllReduce 1514 0.01225 0.01225 0.01225 1.40% Castro::initialize_advance() 10 0.01204 0.01204 0.01204 1.38% Castro::do_old_sources() 10 0.01204 0.01204 0.01204 1.38% MLMG::addInterpCorrection() 410 0.01167 0.01167 0.01167 1.33% MLMG::mgVcycle_up::4 82 0.01147 0.01147 0.01147 1.31% MLMG::mgVcycle_up::1 82 0.01128 0.01128 0.01128 1.29% MLMG::mgVcycle_up::2 82 0.01105 0.01105 0.01105 1.26% amrex::average_down 410 0.01091 0.01091 0.01091 1.25% MLCellLinOp::defineAuxData() 11 0.01089 0.01089 0.01089 1.24% MLMG::mgVcycle_up::3 82 0.01084 0.01084 0.01084 1.24% MLPoisson::Fapply() 1142 0.01041 0.01041 0.01041 1.19% Castro::post_timestep() 10 0.009034 0.009034 0.009034 1.03% FabArray::Saxpy() 813 0.008091 0.008091 0.008091 0.92% FabArray::Xpay() 821 0.008065 0.008065 0.008065 0.92% MLCellLinOp::solutionResidual() 93 0.007155 0.007155 0.007155 0.82% Gravity::fill_multipole_BCs() 11 0.006916 0.006916 0.006916 0.79% MLMG::computeResidual() 82 0.006183 0.006183 0.006183 0.71% Castro::reset_internal_energy(MultiFab) 63 0.00589 0.00589 0.00589 0.67% MLCellLinOp::defineBC() 11 0.004865 0.004865 0.004865 0.56% Castro::estTimeStep() 21 0.004851 0.004851 0.004851 0.55% BndryData::define() 11 0.004659 0.004659 0.004659 0.53% MLMG::prepareForSolve() 11 0.004575 0.004575 0.004575 0.52% FabArray::LinComb() 557 0.004478 0.004478 0.004478 0.51% amrex::Add() 164 0.004301 0.004301 0.004301 0.49% Amr::InitializeInit() 1 0.003937 0.003937 0.003937 0.45% Amr::defBaseLevel() 1 0.003889 0.003889 0.003889 0.44% Castro::construct_new_source() 50 0.003513 0.003513 0.003513 0.40% Castro::initData() 1 0.003394 0.003394 0.003394 0.39% Castro::construct_new_gravity_source() 10 0.00328 0.00328 0.00328 0.37% Castro::construct_old_source() 50 0.002736 0.002736 0.002736 0.31% Castro::construct_old_gravity_source() 10 0.002718 0.002718 0.002718 0.31% Castro::computeNewDt() 9 0.002363 0.002363 0.002363 0.27% MLMG::ResNormInf() 93 0.002106 0.002106 0.002106 0.24% Castro::apply_source_to_state() 20 0.001823 0.001823 0.001823 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001643 0.001643 0.001643 0.19% Castro::reset_internal_energy(Fab) 504 0.001478 0.001478 0.001478 0.17% FabArrayBase::getCPC() 1323 0.001408 0.001408 0.001408 0.16% MLCellLinOp::setLevelBC() 11 0.001383 0.001383 0.001383 0.16% MLMG::getGradSolution() 11 0.001378 0.001378 0.001378 0.16% MLCellLinOp::compGrad() 11 0.001372 0.001372 0.001372 0.16% FabArray::mult() 43 0.001337 0.001337 0.001337 0.15% FabArray::setDomainBndry() 41 0.001306 0.001306 0.001306 0.15% Castro::check_for_nan() 20 0.001182 0.001182 0.001182 0.14% MultiFab::contains_nan() 20 0.001171 0.001171 0.001171 0.13% Castro::post_regrid() 1 0.001143 0.001143 0.001143 0.13% MLPoisson::prepareForSolve() 11 0.001109 0.001109 0.001109 0.13% MLCellLinOp::prepareForSolve() 11 0.001101 0.001101 0.001101 0.13% Castro::enforce_speed_limit() 62 0.001077 0.001077 0.001077 0.12% MLMG::computeMLResidual() 11 0.001007 0.001007 0.001007 0.12% Castro::computeInitialDt() 2 0.0009284 0.0009284 0.0009284 0.11% Gravity::update_max_rhs() 11 0.0008083 0.0008083 0.0008083 0.09% FabArrayBase::getFB() 4023 0.0006962 0.0006962 0.0006962 0.08% FabArrayBase::CPC::define() 454 0.0006786 0.0006786 0.0006786 0.08% Castro::finalize_advance() 10 0.0005737 0.0005737 0.0005737 0.07% Amr::InitAmr() 1 0.00053 0.00053 0.00053 0.06% Gravity::swapTimeLevels() 10 0.0004358 0.0004358 0.0004358 0.05% Castro::Castro() 1 0.0004268 0.0004268 0.0004268 0.05% MLMG::MLResNormInf() 11 0.000283 0.000283 0.000283 0.03% MultiFab::max() 11 0.0002561 0.0002561 0.0002561 0.03% MLLinOp::define() 11 0.0002358 0.0002358 0.0002358 0.03% MLLinOp::defineGrids() 11 0.0002266 0.0002266 0.0002266 0.03% MLMG::MLRhsNormInf() 11 0.0002167 0.0002167 0.0002167 0.02% Castro::buildMetrics() 1 0.0001618 0.0001618 0.0001618 0.02% FabArrayBase::FB::FB() 56 8.557e-05 8.557e-05 8.557e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.472e-05 5.472e-05 5.472e-05 0.01% AmrLevel::AmrLevel(dm) 1 4.399e-05 4.399e-05 4.399e-05 0.01% makeSFC 55 3.932e-05 3.932e-05 3.932e-05 0.00% Castro::finalize_do_advance() 10 3.478e-05 3.478e-05 3.478e-05 0.00% Castro::enforce_consistent_e() 1 3.448e-05 3.448e-05 3.448e-05 0.00% Castro::swap_state_time_levels() 10 3.284e-05 3.284e-05 3.284e-05 0.00% StateData::define() 4 2.878e-05 2.878e-05 2.878e-05 0.00% Amr::writeSmallPlotFile() 1 2.504e-05 2.504e-05 2.504e-05 0.00% Castro::initMFs() 1 1.989e-05 1.989e-05 1.989e-05 0.00% DistributionMapping::Distribute() 56 1.576e-05 1.576e-05 1.576e-05 0.00% Amr::initSubcycle() 1 8.888e-06 8.888e-06 8.888e-06 0.00% AmrLevel::checkPointPost() 3 5.592e-06 5.592e-06 5.592e-06 0.00% Castro::create_source_corrector() 10 5.209e-06 5.209e-06 5.209e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.041e-06 4.041e-06 4.041e-06 0.00% Castro::retry_advance_ctu() 10 3.729e-06 3.729e-06 3.729e-06 0.00% Gravity::set_mass_offset() 11 3.536e-06 3.536e-06 3.536e-06 0.00% Castro::FluxRegFineAdd() 10 2.735e-06 2.735e-06 2.735e-06 0.00% AmrLevel::checkPointPre() 3 2.529e-06 2.529e-06 2.529e-06 0.00% Castro::FluxRegCrseInit 10 2.524e-06 2.524e-06 2.524e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.979e-06 1.979e-06 1.979e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-17-gd5ddf3b22e94) finalized Initializing CUDA... CUDA initialized with 1 device. AMReX (23.01-17-gd5ddf3b22e94) initialized Starting run at 10:08:13 UTC on 2023-01-13. Successfully read inputs file ... Castro git describe: 23.01-14-g78178b78e AMReX git describe: 23.01-17-gd5ddf3b22 Microphysics git describe: 23.01-3-g1e475055 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.454532534 Restart time = 0.10952652 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.049465936 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.05046919 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.058905748 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.059205259 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.073128013 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.03824676 seconds Ending run at 10:08:13 UTC on 2023-01-13. Run time = 0.439985345 Run time without initialization = 0.329882132 Average number of zones advanced per microsecond: 3.973 Average number of zones advanced per microsecond per rank: 3.973 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.44 ... 0.44 ... 0.44 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1049 0.1049 0.1049 23.83% Amr::restart() 1 0.06465 0.06465 0.06465 14.69% VisMF::Read() 3 0.04114 0.04114 0.04114 9.35% VisMF::Write(FabArray) 1 0.03659 0.03659 0.03659 8.32% MLCellLinOp::applyBC() 1946 0.03266 0.03266 0.03266 7.42% MLPoisson::Fsmooth() 1440 0.01406 0.01406 0.01406 3.19% FillBoundary_nowait() 1766 0.01277 0.01277 0.01277 2.90% StateData::FillBoundary(geom) 160 0.0113 0.0113 0.0113 2.57% Castro::normalize_species() 30 0.009358 0.009358 0.009358 2.13% amrex::Dot() 484 0.00874 0.00874 0.00874 1.99% Castro::enforce_min_density() 30 0.008064 0.008064 0.008064 1.83% amrex::Copy() 463 0.006939 0.006939 0.006939 1.58% Castro::computeTemp() 30 0.00637 0.00637 0.00637 1.45% FabArray::setVal() 537 0.006254 0.006254 0.006254 1.42% FabArray::norminf() 326 0.006184 0.006184 0.006184 1.41% FabArray::ParallelCopy_nowait() 380 0.005941 0.005941 0.005941 1.35% StateDataPhysBCFunct::() 20 0.005795 0.005795 0.005795 1.32% MLCellLinOp::defineAuxData() 6 0.005238 0.005238 0.005238 1.19% MLPoisson::Fapply() 500 0.004498 0.004498 0.004498 1.02% FabArray::Saxpy() 355 0.003627 0.003627 0.003627 0.82% FabArray::Xpay() 361 0.003502 0.003502 0.003502 0.80% Gravity::fill_multipole_BCs() 6 0.00329 0.00329 0.00329 0.75% MLMG::addInterpCorrection() 180 0.002846 0.002846 0.002846 0.65% Castro::do_advance_ctu() 5 0.002698 0.002698 0.002698 0.61% amrex::average_down 180 0.002547 0.002547 0.002547 0.58% Castro::estTimeStep() 10 0.002285 0.002285 0.002285 0.52% BndryData::define() 6 0.00201 0.00201 0.00201 0.46% Castro::reset_internal_energy(MultiFab) 30 0.00196 0.00196 0.00196 0.45% FabArray::LinComb() 242 0.00192 0.00192 0.00192 0.44% amrex::Add() 72 0.001833 0.001833 0.001833 0.42% Castro::construct_new_gravity_source() 5 0.001757 0.001757 0.001757 0.40% Castro::construct_old_gravity_source() 5 0.001528 0.001528 0.001528 0.35% Amr::writePlotFile() 1 0.001494 0.001494 0.001494 0.34% MLCGSolver::bicgstab 36 0.0009317 0.0009317 0.0009317 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009115 0.0009115 0.0009115 0.21% Castro::reset_internal_energy(Fab) 240 0.0008929 0.0008929 0.0008929 0.20% Castro::enforce_speed_limit() 30 0.0007616 0.0007616 0.0007616 0.17% MLCellLinOp::setLevelBC() 6 0.0007356 0.0007356 0.0007356 0.17% Gravity::actual_solve_with_mlmg() 6 0.0007087 0.0007087 0.0007087 0.16% FabArray::mult() 22 0.0006576 0.0006576 0.0006576 0.15% FabArray::setDomainBndry() 20 0.0006421 0.0006421 0.0006421 0.15% MLCellLinOp::prepareForSolve() 6 0.0005932 0.0005932 0.0005932 0.13% MultiFab::contains_nan() 10 0.000582 0.000582 0.000582 0.13% MLCellLinOp::smooth() 720 0.0005405 0.0005405 0.0005405 0.12% MLCellLinOp::compGrad() 6 0.0004939 0.0004939 0.0004939 0.11% MLMG::prepareForSolve() 6 0.0004555 0.0004555 0.0004555 0.10% FabArrayBase::CPC::define() 244 0.0004116 0.0004116 0.0004116 0.09% Amr::InitAmr() 1 0.0003957 0.0003957 0.0003957 0.09% FabArray::FillBoundary() 1766 0.0003572 0.0003572 0.0003572 0.08% FabArrayBase::getCPC() 632 0.0003487 0.0003487 0.0003487 0.08% main() 1 0.0003232 0.0003232 0.0003232 0.07% Gravity::get_new_grav_vector() 5 0.000312 0.000312 0.000312 0.07% Gravity::get_old_grav_vector() 5 0.0003025 0.0003025 0.0003025 0.07% FabArrayBase::getFB() 1766 0.0002632 0.0002632 0.0002632 0.06% MLCellLinOp::apply() 500 0.000211 0.000211 0.000211 0.05% Castro::construct_new_source() 25 0.000193 0.000193 0.000193 0.04% MLMG::mgVcycle() 36 0.0001709 0.0001709 0.0001709 0.04% Amr::coarseTimeStep() 5 0.0001639 0.0001639 0.0001639 0.04% Castro::advance() 5 0.0001492 0.0001492 0.0001492 0.03% Castro::subcycle_advance_ctu() 5 0.0001374 0.0001374 0.0001374 0.03% MLCGSolver::ParallelAllReduce 659 0.0001369 0.0001369 0.0001369 0.03% MultiFab::max() 6 0.0001349 0.0001349 0.0001349 0.03% MLLinOp::defineGrids() 6 0.0001178 0.0001178 0.0001178 0.03% FabArray::ParallelCopy() 380 0.0001134 0.0001134 0.0001134 0.03% Castro::post_timestep() 5 0.0001128 0.0001128 0.0001128 0.03% MLCellLinOp::defineBC() 6 0.0001049 0.0001049 0.0001049 0.02% AmrLevel::restart() 1 0.0001045 0.0001045 0.0001045 0.02% FillPatchIterator::Initialize 20 0.0001016 0.0001016 0.0001016 0.02% MLCellLinOp::correctionResidual() 216 0.0001014 0.0001014 0.0001014 0.02% Castro::create_source_corrector() 5 9.42e-05 9.42e-05 9.42e-05 0.02% Castro::initialize_do_advance() 5 8.579e-05 8.579e-05 8.579e-05 0.02% Amr::timeStep() 5 7.366e-05 7.366e-05 7.366e-05 0.02% Castro::construct_old_source() 25 7.349e-05 7.349e-05 7.349e-05 0.02% Gravity::solve_for_phi() 5 6.579e-05 6.579e-05 6.579e-05 0.01% FabArrayBase::FB::FB() 26 5.841e-05 5.841e-05 5.841e-05 0.01% Gravity::update_max_rhs() 6 5.623e-05 5.623e-05 5.623e-05 0.01% StateData::restartDoit() 4 5.589e-05 5.589e-05 5.589e-05 0.01% Castro::initialize_advance() 5 5.288e-05 5.288e-05 5.288e-05 0.01% MLMG:computeResOfCorrection() 180 4.943e-05 4.943e-05 4.943e-05 0.01% Castro::clean_state() 30 4.239e-05 4.239e-05 4.239e-05 0.01% Castro::construct_old_gravity() 5 4.06e-05 4.06e-05 4.06e-05 0.01% MLMG::mgVcycle_down::0 36 4.038e-05 4.038e-05 4.038e-05 0.01% Castro::expand_state() 5 3.786e-05 3.786e-05 3.786e-05 0.01% MLMG::actualBottomSolve() 36 3.782e-05 3.782e-05 3.782e-05 0.01% MLMG::mgVcycle_down::1 36 3.586e-05 3.586e-05 3.586e-05 0.01% MLMG::solve() 6 3.387e-05 3.387e-05 3.387e-05 0.01% Castro::finalize_advance() 5 3.315e-05 3.315e-05 3.315e-05 0.01% MLMG::mgVcycle_down::4 36 3.301e-05 3.301e-05 3.301e-05 0.01% MLMG::mgVcycle_down::2 36 3.271e-05 3.271e-05 3.271e-05 0.01% MLMG::mgVcycle_down::3 36 3.181e-05 3.181e-05 3.181e-05 0.01% Amr::writeSmallPlotFile() 1 3.01e-05 3.01e-05 3.01e-05 0.01% MLMG::mgVcycle_up::4 36 2.794e-05 2.794e-05 2.794e-05 0.01% Castro::buildMetrics() 1 2.516e-05 2.516e-05 2.516e-05 0.01% MLMG::oneIter() 36 2.374e-05 2.374e-05 2.374e-05 0.01% MLMG::mgVcycle_up::0 36 2.284e-05 2.284e-05 2.284e-05 0.01% MLCellLinOp::solutionResidual() 42 2.196e-05 2.196e-05 2.196e-05 0.00% Castro::post_restart() 1 2.152e-05 2.152e-05 2.152e-05 0.00% MLMG::mgVcycle_up::3 36 2.083e-05 2.083e-05 2.083e-05 0.00% MLMG::mgVcycle_up::2 36 2.07e-05 2.07e-05 2.07e-05 0.00% MLMG::mgVcycle_up::1 36 1.985e-05 1.985e-05 1.985e-05 0.00% Castro::swap_state_time_levels() 5 1.979e-05 1.979e-05 1.979e-05 0.00% Castro::initMFs() 1 1.927e-05 1.927e-05 1.927e-05 0.00% FillPatchSingleLevel 20 1.869e-05 1.869e-05 1.869e-05 0.00% Castro::finalize_do_advance() 5 1.79e-05 1.79e-05 1.79e-05 0.00% Castro::construct_new_gravity() 5 1.752e-05 1.752e-05 1.752e-05 0.00% MLMG::ResNormInf() 42 1.726e-05 1.726e-05 1.726e-05 0.00% MLMG::mgVcycle_bottom 36 1.458e-05 1.458e-05 1.458e-05 0.00% makeSFC 30 1.39e-05 1.39e-05 1.39e-05 0.00% MLPoisson::define() 6 1.362e-05 1.362e-05 1.362e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.302e-05 1.302e-05 1.302e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.245e-05 1.245e-05 1.245e-05 0.00% MLMG::computeResidual() 36 1.233e-05 1.233e-05 1.233e-05 0.00% Castro::do_new_sources() 5 1.018e-05 1.018e-05 1.018e-05 0.00% DistributionMapping::Distribute() 31 9.007e-06 9.007e-06 9.007e-06 0.00% Castro::do_old_sources() 5 8.978e-06 8.978e-06 8.978e-06 0.00% Amr::initSubcycle() 1 8.232e-06 8.232e-06 8.232e-06 0.00% Castro::check_for_nan() 10 6.804e-06 6.804e-06 6.804e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 6.778e-06 6.778e-06 6.778e-06 0.00% Gravity::actual_multilevel_solve() 1 6.552e-06 6.552e-06 6.552e-06 0.00% Castro::apply_source_to_state() 10 5.936e-06 5.936e-06 5.936e-06 0.00% MLLinOp::define() 6 5.682e-06 5.682e-06 5.682e-06 0.00% Gravity::swapTimeLevels() 5 4.258e-06 4.258e-06 4.258e-06 0.00% MLPoisson::prepareForSolve() 6 4.06e-06 4.06e-06 4.06e-06 0.00% MLMG::computeMLResidual() 6 3.433e-06 3.433e-06 3.433e-06 0.00% Castro::computeNewDt() 5 3.43e-06 3.43e-06 3.43e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.352e-06 3.352e-06 3.352e-06 0.00% AmrLevel::AmrLevel() 1 3.082e-06 3.082e-06 3.082e-06 0.00% MLMG::getGradSolution() 6 2.963e-06 2.963e-06 2.963e-06 0.00% MLMG::MLResNormInf() 6 2.309e-06 2.309e-06 2.309e-06 0.00% MLMG::MLRhsNormInf() 6 2.189e-06 2.189e-06 2.189e-06 0.00% Gravity::set_mass_offset() 6 2.069e-06 2.069e-06 2.069e-06 0.00% Castro::retry_advance_ctu() 5 1.999e-06 1.999e-06 1.999e-06 0.00% Amr::init() 1 1.787e-06 1.787e-06 1.787e-06 0.00% Castro::FluxRegCrseInit 5 1.251e-06 1.251e-06 1.251e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.115e-06 1.115e-06 1.115e-06 0.00% Castro::FluxRegFineAdd() 5 1.096e-06 1.096e-06 1.096e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.44 0.44 0.44 100.00% Amr::coarseTimeStep() 5 0.2913 0.2913 0.2913 66.21% Amr::timeStep() 5 0.2898 0.2898 0.2898 65.85% Castro::advance() 5 0.2856 0.2856 0.2856 64.90% Castro::subcycle_advance_ctu() 5 0.2787 0.2787 0.2787 63.33% Castro::do_advance_ctu() 5 0.2785 0.2785 0.2785 63.30% Castro::construct_new_gravity() 5 0.1287 0.1287 0.1287 29.25% Gravity::solve_phi_with_mlmg() 6 0.1241 0.1241 0.1241 28.21% Gravity::solve_for_phi() 5 0.1211 0.1211 0.1211 27.52% Gravity::actual_solve_with_mlmg() 6 0.1206 0.1206 0.1206 27.40% Amr::init() 1 0.1096 0.1096 0.1096 24.90% Amr::restart() 1 0.1096 0.1096 0.1096 24.90% MLMG::solve() 6 0.1096 0.1096 0.1096 24.90% Castro::construct_ctu_hydro_source() 5 0.1049 0.1049 0.1049 23.83% MLMG::oneIter() 36 0.1027 0.1027 0.1027 23.35% MLMG::mgVcycle() 36 0.1012 0.1012 0.1012 23.00% MLCellLinOp::smooth() 720 0.04959 0.04959 0.04959 11.27% MLCellLinOp::applyBC() 1946 0.04611 0.04611 0.04611 10.48% AmrLevel::restart() 1 0.04136 0.04136 0.04136 9.40% StateData::restartDoit() 4 0.04125 0.04125 0.04125 9.38% VisMF::Read() 3 0.04114 0.04114 0.04114 9.35% Amr::writePlotFile() 1 0.03834 0.03834 0.03834 8.71% VisMF::Write(FabArray) 1 0.03659 0.03659 0.03659 8.32% MLMG::mgVcycle_bottom 36 0.03148 0.03148 0.03148 7.15% MLMG::actualBottomSolve() 36 0.03146 0.03146 0.03146 7.15% MLCGSolver::bicgstab 36 0.03116 0.03116 0.03116 7.08% Castro::clean_state() 30 0.02745 0.02745 0.02745 6.24% FillPatchIterator::Initialize 20 0.01985 0.01985 0.01985 4.51% FillPatchSingleLevel 20 0.0191 0.0191 0.0191 4.34% StateDataPhysBCFunct::() 20 0.0171 0.0171 0.0171 3.89% MLCellLinOp::apply() 500 0.01556 0.01556 0.01556 3.54% MLMG::mgVcycle_down::0 36 0.01416 0.01416 0.01416 3.22% MLPoisson::Fsmooth() 1440 0.01406 0.01406 0.01406 3.19% FabArray::FillBoundary() 1766 0.01345 0.01345 0.01345 3.06% FillBoundary_nowait() 1766 0.01309 0.01309 0.01309 2.97% StateData::FillBoundary(geom) 160 0.0113 0.0113 0.0113 2.57% Castro::initialize_do_advance() 5 0.011 0.011 0.011 2.50% MLMG::mgVcycle_up::0 36 0.01072 0.01072 0.01072 2.44% MLCellLinOp::correctionResidual() 216 0.009454 0.009454 0.009454 2.15% Castro::normalize_species() 30 0.009358 0.009358 0.009358 2.13% Castro::computeTemp() 30 0.009223 0.009223 0.009223 2.10% MLPoisson::define() 6 0.008822 0.008822 0.008822 2.00% amrex::Dot() 484 0.00874 0.00874 0.00874 1.99% Castro::do_new_sources() 5 0.008718 0.008718 0.008718 1.98% MLMG:computeResOfCorrection() 180 0.008286 0.008286 0.008286 1.88% Castro::enforce_min_density() 30 0.008064 0.008064 0.008064 1.83% Gravity::get_new_grav_vector() 5 0.007519 0.007519 0.007519 1.71% Castro::construct_old_gravity() 5 0.007278 0.007278 0.007278 1.65% Gravity::get_old_grav_vector() 5 0.007237 0.007237 0.007237 1.64% amrex::Copy() 463 0.006939 0.006939 0.006939 1.58% MLMG::mgVcycle_down::1 36 0.006763 0.006763 0.006763 1.54% Castro::initialize_advance() 5 0.006473 0.006473 0.006473 1.47% FabArray::ParallelCopy() 380 0.006423 0.006423 0.006423 1.46% Castro::do_old_sources() 5 0.006398 0.006398 0.006398 1.45% FabArray::ParallelCopy_nowait() 380 0.00631 0.00631 0.00631 1.43% MLMG::mgVcycle_down::2 36 0.006276 0.006276 0.006276 1.43% FabArray::setVal() 537 0.006254 0.006254 0.006254 1.42% FabArray::norminf() 326 0.006184 0.006184 0.006184 1.41% MLMG::mgVcycle_down::3 36 0.006159 0.006159 0.006159 1.40% MLMG::mgVcycle_down::4 36 0.00608 0.00608 0.00608 1.38% MLCellLinOp::defineAuxData() 6 0.005992 0.005992 0.005992 1.36% Castro::expand_state() 5 0.005667 0.005667 0.005667 1.29% MLCGSolver::ParallelAllReduce 659 0.005291 0.005291 0.005291 1.20% MLMG::addInterpCorrection() 180 0.005069 0.005069 0.005069 1.15% MLMG::mgVcycle_up::4 36 0.004946 0.004946 0.004946 1.12% MLMG::mgVcycle_up::1 36 0.004908 0.004908 0.004908 1.12% MLMG::mgVcycle_up::2 36 0.004809 0.004809 0.004809 1.09% amrex::average_down 180 0.004756 0.004756 0.004756 1.08% MLMG::mgVcycle_up::3 36 0.004721 0.004721 0.004721 1.07% MLPoisson::Fapply() 500 0.004498 0.004498 0.004498 1.02% Castro::post_timestep() 5 0.004113 0.004113 0.004113 0.93% FabArray::Saxpy() 355 0.003627 0.003627 0.003627 0.82% FabArray::Xpay() 361 0.003502 0.003502 0.003502 0.80% Gravity::fill_multipole_BCs() 6 0.003411 0.003411 0.003411 0.78% Castro::post_restart() 1 0.003384 0.003384 0.003384 0.77% Gravity::multilevel_solve_for_new_phi() 1 0.00327 0.00327 0.00327 0.74% Gravity::actual_multilevel_solve() 1 0.003257 0.003257 0.003257 0.74% MLCellLinOp::solutionResidual() 42 0.003225 0.003225 0.003225 0.73% Castro::reset_internal_energy(MultiFab) 30 0.002853 0.002853 0.002853 0.65% MLCellLinOp::defineBC() 6 0.002665 0.002665 0.002665 0.61% MLMG::computeResidual() 36 0.002658 0.002658 0.002658 0.60% BndryData::define() 6 0.00256 0.00256 0.00256 0.58% MLMG::prepareForSolve() 6 0.002502 0.002502 0.002502 0.57% Castro::estTimeStep() 10 0.002285 0.002285 0.002285 0.52% Castro::construct_new_source() 25 0.00195 0.00195 0.00195 0.44% FabArray::LinComb() 242 0.00192 0.00192 0.00192 0.44% amrex::Add() 72 0.001833 0.001833 0.001833 0.42% Castro::construct_new_gravity_source() 5 0.001757 0.001757 0.001757 0.40% Castro::construct_old_source() 25 0.001601 0.001601 0.001601 0.36% Castro::construct_old_gravity_source() 5 0.001528 0.001528 0.001528 0.35% Castro::computeNewDt() 5 0.001404 0.001404 0.001404 0.32% MLMG::ResNormInf() 42 0.0009366 0.0009366 0.0009366 0.21% Castro::apply_source_to_state() 10 0.0009229 0.0009229 0.0009229 0.21% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009115 0.0009115 0.0009115 0.21% Castro::reset_internal_energy(Fab) 240 0.0008929 0.0008929 0.0008929 0.20% Castro::enforce_speed_limit() 30 0.0007616 0.0007616 0.0007616 0.17% FabArrayBase::getCPC() 632 0.0007603 0.0007603 0.0007603 0.17% MLMG::getGradSolution() 6 0.0007599 0.0007599 0.0007599 0.17% MLCellLinOp::compGrad() 6 0.0007569 0.0007569 0.0007569 0.17% MLCellLinOp::setLevelBC() 6 0.0007356 0.0007356 0.0007356 0.17% FabArray::mult() 22 0.0006576 0.0006576 0.0006576 0.15% FabArray::setDomainBndry() 20 0.0006421 0.0006421 0.0006421 0.15% MLPoisson::prepareForSolve() 6 0.0005972 0.0005972 0.0005972 0.14% MLCellLinOp::prepareForSolve() 6 0.0005932 0.0005932 0.0005932 0.13% Castro::check_for_nan() 10 0.0005888 0.0005888 0.0005888 0.13% MLMG::computeMLResidual() 6 0.0005825 0.0005825 0.0005825 0.13% MultiFab::contains_nan() 10 0.000582 0.000582 0.000582 0.13% Gravity::update_max_rhs() 6 0.0004439 0.0004439 0.0004439 0.10% FabArrayBase::CPC::define() 244 0.0004116 0.0004116 0.0004116 0.09% Amr::InitAmr() 1 0.0004039 0.0004039 0.0004039 0.09% FabArrayBase::getFB() 1766 0.0003216 0.0003216 0.0003216 0.07% Castro::finalize_advance() 5 0.0002925 0.0002925 0.0002925 0.07% Gravity::swapTimeLevels() 5 0.0002229 0.0002229 0.0002229 0.05% MLMG::MLResNormInf() 6 0.0001526 0.0001526 0.0001526 0.03% MLLinOp::define() 6 0.0001524 0.0001524 0.0001524 0.03% MLLinOp::defineGrids() 6 0.0001468 0.0001468 0.0001468 0.03% Castro::buildMetrics() 1 0.0001467 0.0001467 0.0001467 0.03% MultiFab::max() 6 0.0001349 0.0001349 0.0001349 0.03% MLMG::MLRhsNormInf() 6 0.0001177 0.0001177 0.0001177 0.03% Castro::create_source_corrector() 5 9.42e-05 9.42e-05 9.42e-05 0.02% FabArrayBase::FB::FB() 26 5.841e-05 5.841e-05 5.841e-05 0.01% Amr::writeSmallPlotFile() 1 3.01e-05 3.01e-05 3.01e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.782e-05 2.782e-05 2.782e-05 0.01% makeSFC 30 2.104e-05 2.104e-05 2.104e-05 0.00% Castro::swap_state_time_levels() 5 1.979e-05 1.979e-05 1.979e-05 0.00% Castro::initMFs() 1 1.927e-05 1.927e-05 1.927e-05 0.00% Castro::finalize_do_advance() 5 1.79e-05 1.79e-05 1.79e-05 0.00% DistributionMapping::Distribute() 31 9.007e-06 9.007e-06 9.007e-06 0.00% Amr::initSubcycle() 1 8.232e-06 8.232e-06 8.232e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 5.218e-06 5.218e-06 5.218e-06 0.00% AmrLevel::AmrLevel() 1 3.082e-06 3.082e-06 3.082e-06 0.00% Gravity::set_mass_offset() 6 2.069e-06 2.069e-06 2.069e-06 0.00% Castro::retry_advance_ctu() 5 1.999e-06 1.999e-06 1.999e-06 0.00% Castro::FluxRegCrseInit 5 1.251e-06 1.251e-06 1.251e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.115e-06 1.115e-06 1.115e-06 0.00% Castro::FluxRegFineAdd() 5 1.096e-06 1.096e-06 1.096e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2459 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.01-17-gd5ddf3b22e94) finalized