Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.04-38-g3d344ec19655) initialized Starting run at 08:24:05 UTC on 2022-04-29. Successfully read inputs file ... Castro git describe: 22.04-44-gef991073e AMReX git describe: 22.04-38-g3d344ec19 Microphysics git describe: 22.04-3-g3c498521 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 Successfully read inputs file ... INITIAL GRIDS Level 0 8 grids 262144 cells 100 % of domain smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32 CHECKPOINT: file = dustcollapse-restart_chk00000 checkPoint() time = 0.041468003 secs. PLOTFILE: file = dustcollapse-restart_plt00000 Write plotfile time = 0.024231229 seconds [Level 0 step 1] ADVANCE with dt = 4.541742215e-05 [Level 0 step 1] Advanced 262144 cells [STEP 1] Coarse TimeStep time: 0.045976675 [STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05 [Level 0 step 2] ADVANCE with dt = 4.768829326e-05 [Level 0 step 2] Advanced 262144 cells [STEP 2] Coarse TimeStep time: 0.049128668 [STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05 [Level 0 step 3] ADVANCE with dt = 5.007270792e-05 [Level 0 step 3] Advanced 262144 cells [STEP 3] Coarse TimeStep time: 0.0602044 [STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05 [Level 0 step 4] ADVANCE with dt = 5.257634331e-05 [Level 0 step 4] Advanced 262144 cells [STEP 4] Coarse TimeStep time: 0.062572255 [STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05 [Level 0 step 5] ADVANCE with dt = 5.520516048e-05 [Level 0 step 5] Advanced 262144 cells [STEP 5] Coarse TimeStep time: 0.060115715 [STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05 CHECKPOINT: file = dustcollapse-restart_chk00005 checkPoint() time = 0.038695067 secs. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.048177954 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.056019808 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.062963852 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.061076236 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.052792331 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 CHECKPOINT: file = dustcollapse-restart_chk00010 checkPoint() time = 0.038264782 secs. PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.024440734 seconds Ending run at 08:24:06 UTC on 2022-04-29. Run time = 0.774144173 Run time without initialization = 0.660889804 Average number of zones advanced per microsecond: 3.967 Average number of zones advanced per microsecond per rank: 3.967 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.7742 ... 0.7742 ... 0.7742 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.1833 0.1833 0.1833 23.67% VisMF::Write(FabArray) 11 0.1607 0.1607 0.1607 20.75% MLCellLinOp::applyBC() 4379 0.07272 0.07272 0.07272 9.39% MLPoisson::Fsmooth() 3240 0.05988 0.05988 0.05988 7.73% StateData::FillBoundary(geom) 328 0.02471 0.02471 0.02471 3.19% MLCGSolver::bicgstab 81 0.02149 0.02149 0.02149 2.78% MultiFab::Dot() 1100 0.02099 0.02099 0.02099 2.71% Castro::normalize_species() 62 0.0183 0.0183 0.0183 2.36% Castro::computeTemp() 63 0.01459 0.01459 0.01459 1.88% FillBoundary_nowait() 3974 0.01416 0.01416 0.01416 1.83% FabArray::setVal() 1135 0.01335 0.01335 0.01335 1.72% FabArray::ParallelCopy_nowait() 851 0.01318 0.01318 0.01318 1.70% Castro::enforce_min_density() 62 0.01276 0.01276 0.01276 1.65% MultiFab::LinComb() 1566 0.01259 0.01259 0.01259 1.63% StateDataPhysBCFunct::() 41 0.01135 0.01135 0.01135 1.47% MLPoisson::Fapply() 1128 0.01063 0.01063 0.01063 1.37% MLCellLinOp::defineAuxData() 11 0.01009 0.01009 0.01009 1.30% Gravity::fill_multipole_BCs() 11 0.008296 0.008296 0.008296 1.07% MLMG::addInterpCorrection() 405 0.006607 0.006607 0.006607 0.85% Castro::estTimeStep() 21 0.006472 0.006472 0.006472 0.84% amrex::average_down 405 0.006178 0.006178 0.006178 0.80% MultiFab::Xpay() 578 0.006094 0.006094 0.006094 0.79% Castro::reset_internal_energy(MultiFab) 63 0.005493 0.005493 0.005493 0.71% Castro::do_advance_ctu() 10 0.005078 0.005078 0.005078 0.66% Amr::checkPoint() 3 0.004037 0.004037 0.004037 0.52% BndryData::define() 11 0.003805 0.003805 0.003805 0.49% Castro::enforce_speed_limit() 62 0.003043 0.003043 0.003043 0.39% Castro::construct_new_gravity_source() 10 0.002705 0.002705 0.002705 0.35% Amr::writePlotFile() 2 0.0024 0.0024 0.0024 0.31% Gravity::get_new_grav_vector() 11 0.001939 0.001939 0.001939 0.25% MLMG::ResNormInf() 92 0.001916 0.001916 0.001916 0.25% Castro::construct_old_gravity_source() 10 0.001882 0.001882 0.001882 0.24% MultiFab::Saxpy() 20 0.001817 0.001817 0.001817 0.23% Gravity::get_old_grav_vector() 10 0.00174 0.00174 0.00174 0.22% Castro::expand_state() 10 0.001737 0.001737 0.001737 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001671 0.001671 0.001671 0.22% MLMG::oneIter() 81 0.001669 0.001669 0.001669 0.22% Castro::reset_internal_energy(Fab) 504 0.001569 0.001569 0.001569 0.20% Gravity::actual_solve_with_mlmg() 11 0.001436 0.001436 0.001436 0.19% MLCellLinOp::setLevelBC() 11 0.001344 0.001344 0.001344 0.17% FabArray::mult() 43 0.001328 0.001328 0.001328 0.17% FabArray::setDomainBndry() 41 0.001315 0.001315 0.001315 0.17% MultiFab::contains_nan() 20 0.001174 0.001174 0.001174 0.15% MLCellLinOp::smooth() 1620 0.001157 0.001157 0.001157 0.15% Castro::initData() 1 0.001127 0.001127 0.001127 0.15% MLCellLinOp::prepareForSolve() 11 0.001083 0.001083 0.001083 0.14% MLCellLinOp::compGrad() 11 0.0009071 0.0009071 0.0009071 0.12% FabArrayBase::getCPC() 1313 0.0008486 0.0008486 0.0008486 0.11% FabArray::FillBoundary() 3974 0.0008318 0.0008318 0.0008318 0.11% MLMG::prepareForSolve() 11 0.0007746 0.0007746 0.0007746 0.10% FabArrayBase::getFB() 3974 0.0007166 0.0007166 0.0007166 0.09% FabArrayBase::CPC::define() 454 0.0006868 0.0006868 0.0006868 0.09% MLCellLinOp::apply() 1128 0.0005125 0.0005125 0.0005125 0.07% Amr::InitAmr() 1 0.0004752 0.0004752 0.0004752 0.06% CGSolver::sxay() 1566 0.0004194 0.0004194 0.0004194 0.05% Gravity::update_max_rhs() 11 0.0004073 0.0004073 0.0004073 0.05% Gravity::solve_for_phi() 10 0.0003752 0.0003752 0.0003752 0.05% MLCGSolver::ParallelAllReduce 1495 0.0003605 0.0003605 0.0003605 0.05% MLLinOp::defineGrids() 11 0.000358 0.000358 0.000358 0.05% MLMG::mgVcycle() 81 0.0003533 0.0003533 0.0003533 0.05% FabArray::ParallelCopy() 851 0.0002948 0.0002948 0.0002948 0.04% main() 1 0.000278 0.000278 0.000278 0.04% MultiFab::Copy() 11 0.0002718 0.0002718 0.0002718 0.04% MultiFab::max() 11 0.0002543 0.0002543 0.0002543 0.03% FillPatchIterator::Initialize 41 0.0002466 0.0002466 0.0002466 0.03% MLCellLinOp::correctionResidual() 486 0.0002185 0.0002185 0.0002185 0.03% Castro::construct_new_gravity() 10 0.0002082 0.0002082 0.0002082 0.03% MLMG::MLRhsNormInf() 11 0.0001994 0.0001994 0.0001994 0.03% Amr::coarseTimeStep() 10 0.0001976 0.0001976 0.0001976 0.03% Amr::timeStep() 10 0.0001949 0.0001949 0.0001949 0.03% MLCellLinOp::defineBC() 11 0.0001901 0.0001901 0.0001901 0.02% Castro::subcycle_advance_ctu() 10 0.0001872 0.0001872 0.0001872 0.02% MLMG:computeResOfCorrection() 405 0.000151 0.000151 0.000151 0.02% StateData::checkPoint() 12 0.0001277 0.0001277 0.0001277 0.02% MLMG::actualBottomSolve() 81 0.0001057 0.0001057 0.0001057 0.01% FabArrayBase::FB::FB() 56 8.759e-05 8.759e-05 8.759e-05 0.01% MLMG::mgVcycle_down::0 81 8.34e-05 8.34e-05 8.34e-05 0.01% Castro::advance() 10 8.168e-05 8.168e-05 8.168e-05 0.01% MLMG::mgVcycle_down::1 81 8.161e-05 8.161e-05 8.161e-05 0.01% Castro::Castro() 1 7.861e-05 7.861e-05 7.861e-05 0.01% MLMG::mgVcycle_down::2 81 7.368e-05 7.368e-05 7.368e-05 0.01% Castro::initialize_advance() 10 7.367e-05 7.367e-05 7.367e-05 0.01% MLMG::solve() 11 7.347e-05 7.347e-05 7.347e-05 0.01% Castro::clean_state() 62 7.34e-05 7.34e-05 7.34e-05 0.01% AmrLevel::checkPoint() 3 7.048e-05 7.048e-05 7.048e-05 0.01% MLMG::mgVcycle_down::4 81 7.025e-05 7.025e-05 7.025e-05 0.01% MLMG::mgVcycle_down::3 81 6.929e-05 6.929e-05 6.929e-05 0.01% MLMG::mgVcycle_up::4 81 6.308e-05 6.308e-05 6.308e-05 0.01% Castro::initialize_do_advance() 10 5.625e-05 5.625e-05 5.625e-05 0.01% Castro::finalize_advance() 10 5.385e-05 5.385e-05 5.385e-05 0.01% MLMG::mgVcycle_up::0 81 5.052e-05 5.052e-05 5.052e-05 0.01% MLCellLinOp::solutionResidual() 92 4.822e-05 4.822e-05 4.822e-05 0.01% MLMG::mgVcycle_up::3 81 4.767e-05 4.767e-05 4.767e-05 0.01% MLMG::mgVcycle_up::1 81 4.758e-05 4.758e-05 4.758e-05 0.01% MLMG::mgVcycle_up::2 81 4.728e-05 4.728e-05 4.728e-05 0.01% StateData::define() 4 4.39e-05 4.39e-05 4.39e-05 0.01% Castro::construct_new_source() 50 3.966e-05 3.966e-05 3.966e-05 0.01% Castro::swap_state_time_levels() 10 3.796e-05 3.796e-05 3.796e-05 0.00% Castro::finalize_do_advance() 10 3.687e-05 3.687e-05 3.687e-05 0.00% MLMG::mgVcycle_bottom 81 3.603e-05 3.603e-05 3.603e-05 0.00% Castro::enforce_consistent_e() 1 3.563e-05 3.563e-05 3.563e-05 0.00% MLMG::computeResidual() 81 3.455e-05 3.455e-05 3.455e-05 0.00% Gravity::actual_multilevel_solve() 1 3.129e-05 3.129e-05 3.129e-05 0.00% Castro::post_timestep() 10 3.061e-05 3.061e-05 3.061e-05 0.00% FillPatchSingleLevel 41 2.94e-05 2.94e-05 2.94e-05 0.00% Gravity::solve_phi_with_mlmg() 11 2.596e-05 2.596e-05 2.596e-05 0.00% makeSFC 55 2.561e-05 2.561e-05 2.561e-05 0.00% MLLinOp::define() 11 2.499e-05 2.499e-05 2.499e-05 0.00% Castro::initMFs() 1 2.482e-05 2.482e-05 2.482e-05 0.00% Amr::writeSmallPlotFile() 1 2.376e-05 2.376e-05 2.376e-05 0.00% MLPoisson::define() 11 2.078e-05 2.078e-05 2.078e-05 0.00% Castro::buildMetrics() 1 2.066e-05 2.066e-05 2.066e-05 0.00% Amr::FinalizeInit() 1 2.035e-05 2.035e-05 2.035e-05 0.00% Castro::construct_old_source() 50 1.97e-05 1.97e-05 1.97e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.753e-05 1.753e-05 1.753e-05 0.00% Castro::do_new_sources() 10 1.726e-05 1.726e-05 1.726e-05 0.00% Castro::do_old_sources() 10 1.661e-05 1.661e-05 1.661e-05 0.00% DistributionMapping::Distribute() 56 1.545e-05 1.545e-05 1.545e-05 0.00% Amr::defBaseLevel() 1 1.501e-05 1.501e-05 1.501e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.472e-05 1.472e-05 1.472e-05 0.00% Castro::apply_source_to_state() 20 1.122e-05 1.122e-05 1.122e-05 0.00% Castro::construct_old_gravity() 10 1.067e-05 1.067e-05 1.067e-05 0.00% Castro::check_for_nan() 20 1.032e-05 1.032e-05 1.032e-05 0.00% AmrLevel::AmrLevel(dm) 1 9.953e-06 9.953e-06 9.953e-06 0.00% Amr::initSubcycle() 1 9.755e-06 9.755e-06 9.755e-06 0.00% Gravity::swapTimeLevels() 10 8.899e-06 8.899e-06 8.899e-06 0.00% MLPoisson::prepareForSolve() 11 8.749e-06 8.749e-06 8.749e-06 0.00% Castro::post_init() 1 8.616e-06 8.616e-06 8.616e-06 0.00% Amr::InitializeInit() 1 6.566e-06 6.566e-06 6.566e-06 0.00% Castro::computeNewDt() 9 6.455e-06 6.455e-06 6.455e-06 0.00% MLMG::computeMLResidual() 11 6.32e-06 6.32e-06 6.32e-06 0.00% MLMG::getGradSolution() 11 6.099e-06 6.099e-06 6.099e-06 0.00% MLMG::buildFineMask() 11 5.275e-06 5.275e-06 5.275e-06 0.00% MLMG::MLResNormInf() 11 4.873e-06 4.873e-06 4.873e-06 0.00% Gravity::set_mass_offset() 11 4.565e-06 4.565e-06 4.565e-06 0.00% AmrLevel::checkPointPost() 3 4.325e-06 4.325e-06 4.325e-06 0.00% Castro::retry_advance_ctu() 10 4.069e-06 4.069e-06 4.069e-06 0.00% Castro::create_source_corrector() 10 4.049e-06 4.049e-06 4.049e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.48e-06 3.48e-06 3.48e-06 0.00% Castro::FluxRegCrseInit 10 3.223e-06 3.223e-06 3.223e-06 0.00% Amr::init() 1 3.047e-06 3.047e-06 3.047e-06 0.00% Castro::FluxRegFineAdd() 10 2.67e-06 2.67e-06 2.67e-06 0.00% Castro::computeInitialDt() 2 2.364e-06 2.364e-06 2.364e-06 0.00% AmrLevel::checkPointPre() 3 2.206e-06 2.206e-06 2.206e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.899e-06 1.899e-06 1.899e-06 0.00% Castro::post_regrid() 1 1.427e-06 1.427e-06 1.427e-06 0.00% Amr::initialInit() 1 1.048e-06 1.048e-06 1.048e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.7742 0.7742 0.7742 100.00% Amr::coarseTimeStep() 10 0.6362 0.6362 0.6362 82.18% Amr::timeStep() 10 0.5556 0.5556 0.5556 71.76% Castro::advance() 10 0.5446 0.5446 0.5446 70.34% Castro::subcycle_advance_ctu() 10 0.5311 0.5311 0.5311 68.60% Castro::do_advance_ctu() 10 0.5309 0.5309 0.5309 68.57% Gravity::solve_phi_with_mlmg() 11 0.2911 0.2911 0.2911 37.60% Gravity::actual_solve_with_mlmg() 11 0.2826 0.2826 0.2826 36.50% Castro::construct_new_gravity() 10 0.2671 0.2671 0.2671 34.50% MLMG::solve() 11 0.2615 0.2615 0.2615 33.78% Gravity::solve_for_phi() 10 0.2518 0.2518 0.2518 32.52% MLMG::oneIter() 81 0.2478 0.2478 0.2478 32.00% MLMG::mgVcycle() 81 0.2461 0.2461 0.2461 31.79% Castro::construct_ctu_hydro_source() 10 0.1833 0.1833 0.1833 23.67% VisMF::Write(FabArray) 11 0.1607 0.1607 0.1607 20.75% MLCellLinOp::smooth() 1620 0.1266 0.1266 0.1266 16.35% Amr::checkPoint() 3 0.1185 0.1185 0.1185 15.31% AmrLevel::checkPoint() 3 0.1145 0.1145 0.1145 14.79% StateData::checkPoint() 12 0.1144 0.1144 0.1144 14.78% Amr::init() 1 0.1126 0.1126 0.1126 14.55% MLCellLinOp::applyBC() 4379 0.08852 0.08852 0.08852 11.43% MLMG::mgVcycle_bottom 81 0.07447 0.07447 0.07447 9.62% MLMG::actualBottomSolve() 81 0.07443 0.07443 0.07443 9.61% MLCGSolver::bicgstab 81 0.07369 0.07369 0.07369 9.52% MLPoisson::Fsmooth() 3240 0.05988 0.05988 0.05988 7.73% Castro::clean_state() 62 0.05516 0.05516 0.05516 7.13% Amr::writePlotFile() 2 0.04878 0.04878 0.04878 6.30% Amr::initialInit() 1 0.04683 0.04683 0.04683 6.05% Amr::FinalizeInit() 1 0.04299 0.04299 0.04299 5.55% Castro::post_init() 1 0.04166 0.04166 0.04166 5.38% FillPatchIterator::Initialize 41 0.04159 0.04159 0.04159 5.37% FillPatchSingleLevel 41 0.04003 0.04003 0.04003 5.17% Gravity::multilevel_solve_for_new_phi() 1 0.03972 0.03972 0.03972 5.13% Gravity::actual_multilevel_solve() 1 0.0397 0.0397 0.0397 5.13% StateDataPhysBCFunct::() 41 0.03605 0.03605 0.03605 4.66% MLMG::mgVcycle_down::0 81 0.0342 0.0342 0.0342 4.42% MLCellLinOp::apply() 1128 0.03364 0.03364 0.03364 4.35% MLMG::mgVcycle_up::0 81 0.02923 0.02923 0.02923 3.78% StateData::FillBoundary(geom) 328 0.02471 0.02471 0.02471 3.19% Castro::computeTemp() 63 0.02165 0.02165 0.02165 2.80% MultiFab::Dot() 1100 0.02099 0.02099 0.02099 2.71% Castro::initialize_do_advance() 10 0.02004 0.02004 0.02004 2.59% MLCellLinOp::correctionResidual() 486 0.01966 0.01966 0.01966 2.54% Castro::normalize_species() 62 0.0183 0.0183 0.0183 2.36% MLMG:computeResOfCorrection() 405 0.01708 0.01708 0.01708 2.21% Gravity::get_new_grav_vector() 11 0.01696 0.01696 0.01696 2.19% MLPoisson::define() 11 0.01688 0.01688 0.01688 2.18% MLMG::mgVcycle_down::1 81 0.01638 0.01638 0.01638 2.12% FabArray::FillBoundary() 3974 0.01579 0.01579 0.01579 2.04% MLMG::mgVcycle_down::2 81 0.01575 0.01575 0.01575 2.03% FillBoundary_nowait() 3974 0.01496 0.01496 0.01496 1.93% MLMG::mgVcycle_down::3 81 0.0149 0.0149 0.0149 1.93% Castro::construct_old_gravity() 10 0.01487 0.01487 0.01487 1.92% Gravity::get_old_grav_vector() 10 0.01486 0.01486 0.01486 1.92% FabArray::ParallelCopy() 851 0.01435 0.01435 0.01435 1.85% MLMG::mgVcycle_down::4 81 0.0141 0.0141 0.0141 1.82% FabArray::ParallelCopy_nowait() 851 0.01406 0.01406 0.01406 1.82% Castro::initialize_advance() 10 0.01336 0.01336 0.01336 1.73% FabArray::setVal() 1135 0.01335 0.01335 0.01335 1.72% CGSolver::sxay() 1566 0.01301 0.01301 0.01301 1.68% Castro::enforce_min_density() 62 0.01276 0.01276 0.01276 1.65% MLCGSolver::ParallelAllReduce 1495 0.01262 0.01262 0.01262 1.63% MultiFab::LinComb() 1566 0.01259 0.01259 0.01259 1.63% MLMG::mgVcycle_up::2 81 0.01213 0.01213 0.01213 1.57% MLMG::mgVcycle_up::1 81 0.01194 0.01194 0.01194 1.54% Castro::do_new_sources() 10 0.01191 0.01191 0.01191 1.54% MLMG::addInterpCorrection() 405 0.0118 0.0118 0.0118 1.52% MLCellLinOp::defineAuxData() 11 0.01147 0.01147 0.01147 1.48% amrex::average_down 405 0.01139 0.01139 0.01139 1.47% MLMG::mgVcycle_up::3 81 0.01138 0.01138 0.01138 1.47% Castro::do_old_sources() 10 0.01127 0.01127 0.01127 1.46% MLMG::mgVcycle_up::4 81 0.01125 0.01125 0.01125 1.45% Castro::expand_state() 10 0.01117 0.01117 0.01117 1.44% Castro::post_timestep() 10 0.01079 0.01079 0.01079 1.39% MLPoisson::Fapply() 1128 0.01063 0.01063 0.01063 1.37% Gravity::fill_multipole_BCs() 11 0.008296 0.008296 0.008296 1.07% Castro::reset_internal_energy(MultiFab) 63 0.007062 0.007062 0.007062 0.91% MLCellLinOp::solutionResidual() 92 0.006974 0.006974 0.006974 0.90% Castro::estTimeStep() 21 0.006472 0.006472 0.006472 0.84% MultiFab::Xpay() 578 0.006094 0.006094 0.006094 0.79% MLMG::computeResidual() 81 0.005998 0.005998 0.005998 0.77% MLCellLinOp::defineBC() 11 0.004944 0.004944 0.004944 0.64% BndryData::define() 11 0.004754 0.004754 0.004754 0.61% MLMG::prepareForSolve() 11 0.004564 0.004564 0.004564 0.59% Amr::InitializeInit() 1 0.003842 0.003842 0.003842 0.50% Amr::defBaseLevel() 1 0.003835 0.003835 0.003835 0.50% Castro::initData() 1 0.003333 0.003333 0.003333 0.43% Castro::computeNewDt() 9 0.003079 0.003079 0.003079 0.40% Castro::enforce_speed_limit() 62 0.003043 0.003043 0.003043 0.39% Castro::construct_new_source() 50 0.002745 0.002745 0.002745 0.35% Castro::construct_new_gravity_source() 10 0.002705 0.002705 0.002705 0.35% MLMG::ResNormInf() 92 0.001916 0.001916 0.001916 0.25% Castro::construct_old_source() 50 0.001902 0.001902 0.001902 0.25% Castro::construct_old_gravity_source() 10 0.001882 0.001882 0.001882 0.24% Castro::apply_source_to_state() 20 0.001829 0.001829 0.001829 0.24% MultiFab::Saxpy() 20 0.001817 0.001817 0.001817 0.23% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001671 0.001671 0.001671 0.22% Castro::reset_internal_energy(Fab) 504 0.001569 0.001569 0.001569 0.20% FabArrayBase::getCPC() 1313 0.001535 0.001535 0.001535 0.20% MLMG::getGradSolution() 11 0.001371 0.001371 0.001371 0.18% MLCellLinOp::compGrad() 11 0.001365 0.001365 0.001365 0.18% MLCellLinOp::setLevelBC() 11 0.001344 0.001344 0.001344 0.17% FabArray::mult() 43 0.001328 0.001328 0.001328 0.17% FabArray::setDomainBndry() 41 0.001315 0.001315 0.001315 0.17% Castro::check_for_nan() 20 0.001184 0.001184 0.001184 0.15% MultiFab::contains_nan() 20 0.001174 0.001174 0.001174 0.15% Castro::post_regrid() 1 0.0011 0.0011 0.0011 0.14% MLPoisson::prepareForSolve() 11 0.001092 0.001092 0.001092 0.14% MLCellLinOp::prepareForSolve() 11 0.001083 0.001083 0.001083 0.14% MLMG::computeMLResidual() 11 0.001016 0.001016 0.001016 0.13% Gravity::update_max_rhs() 11 0.0008091 0.0008091 0.0008091 0.10% FabArrayBase::getFB() 3974 0.0008042 0.0008042 0.0008042 0.10% FabArrayBase::CPC::define() 454 0.0006868 0.0006868 0.0006868 0.09% Castro::computeInitialDt() 2 0.0005843 0.0005843 0.0005843 0.08% Amr::InitAmr() 1 0.0004849 0.0004849 0.0004849 0.06% Gravity::swapTimeLevels() 10 0.0004433 0.0004433 0.0004433 0.06% MLLinOp::define() 11 0.0004394 0.0004394 0.0004394 0.06% Castro::Castro() 1 0.0004287 0.0004287 0.0004287 0.06% MLLinOp::defineGrids() 11 0.0004145 0.0004145 0.0004145 0.05% MultiFab::Copy() 11 0.0002718 0.0002718 0.0002718 0.04% MLMG::MLResNormInf() 11 0.0002587 0.0002587 0.0002587 0.03% MultiFab::max() 11 0.0002543 0.0002543 0.0002543 0.03% MLMG::MLRhsNormInf() 11 0.0001994 0.0001994 0.0001994 0.03% Castro::buildMetrics() 1 0.0001597 0.0001597 0.0001597 0.02% FabArrayBase::FB::FB() 56 8.759e-05 8.759e-05 8.759e-05 0.01% Castro::finalize_advance() 10 5.974e-05 5.974e-05 5.974e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.452e-05 5.452e-05 5.452e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.385e-05 5.385e-05 5.385e-05 0.01% StateData::define() 4 4.39e-05 4.39e-05 4.39e-05 0.01% makeSFC 55 3.98e-05 3.98e-05 3.98e-05 0.01% Castro::swap_state_time_levels() 10 3.796e-05 3.796e-05 3.796e-05 0.00% Castro::finalize_do_advance() 10 3.687e-05 3.687e-05 3.687e-05 0.00% Castro::enforce_consistent_e() 1 3.563e-05 3.563e-05 3.563e-05 0.00% Castro::initMFs() 1 2.482e-05 2.482e-05 2.482e-05 0.00% Amr::writeSmallPlotFile() 1 2.376e-05 2.376e-05 2.376e-05 0.00% DistributionMapping::Distribute() 56 1.545e-05 1.545e-05 1.545e-05 0.00% Amr::initSubcycle() 1 9.755e-06 9.755e-06 9.755e-06 0.00% MLMG::buildFineMask() 11 5.275e-06 5.275e-06 5.275e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.741e-06 4.741e-06 4.741e-06 0.00% Gravity::set_mass_offset() 11 4.565e-06 4.565e-06 4.565e-06 0.00% AmrLevel::checkPointPost() 3 4.325e-06 4.325e-06 4.325e-06 0.00% Castro::retry_advance_ctu() 10 4.069e-06 4.069e-06 4.069e-06 0.00% Castro::create_source_corrector() 10 4.049e-06 4.049e-06 4.049e-06 0.00% Castro::FluxRegCrseInit 10 3.223e-06 3.223e-06 3.223e-06 0.00% Castro::FluxRegFineAdd() 10 2.67e-06 2.67e-06 2.67e-06 0.00% AmrLevel::checkPointPre() 3 2.206e-06 2.206e-06 2.206e-06 0.00% MLLinOp::makeSubCommunicator() 11 1.899e-06 1.899e-06 1.899e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.04-38-g3d344ec19655) finalized Initializing CUDA... CUDA initialized with 1 GPU AMReX (22.04-38-g3d344ec19655) initialized Starting run at 08:24:07 UTC on 2022-04-29. Successfully read inputs file ... Castro git describe: 22.04-44-gef991073e AMReX git describe: 22.04-38-g3d344ec19 Microphysics git describe: 22.04-3-g3c498521 reading extern runtime parameters ... 3 Species: C12 O16 Mg24 restarting calculation from file: dustcollapse-restart_chk00005 Successfully read inputs file ... read CPU time: 0.429881256 Restart time = 0.045525787 seconds. [Level 0 step 6] ADVANCE with dt = 5.79654185e-05 [Level 0 step 6] Advanced 262144 cells [STEP 6] Coarse TimeStep time: 0.050472356 [STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05 [Level 0 step 7] ADVANCE with dt = 6.086368943e-05 [Level 0 step 7] Advanced 262144 cells [STEP 7] Coarse TimeStep time: 0.048635202 [STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05 [Level 0 step 8] ADVANCE with dt = 6.39068739e-05 [Level 0 step 8] Advanced 262144 cells [STEP 8] Coarse TimeStep time: 0.059357408 [STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05 [Level 0 step 9] ADVANCE with dt = 6.71022176e-05 [Level 0 step 9] Advanced 262144 cells [STEP 9] Coarse TimeStep time: 0.063353596 [STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05 [Level 0 step 10] ADVANCE with dt = 7.045732848e-05 [Level 0 step 10] Advanced 262144 cells [STEP 10] Coarse TimeStep time: 0.063473733 [STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638] STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05 PLOTFILE: file = dustcollapse-restart_plt00010 Write plotfile time = 0.058237115 seconds Ending run at 08:24:07 UTC on 2022-04-29. Run time = 0.389943432 Run time without initialization = 0.343869226 Average number of zones advanced per microsecond: 3.812 Average number of zones advanced per microsecond per rank: 3.812 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9489481728 TinyProfiler total time across processes [min...avg...max]: 0.39 ... 0.39 ... 0.39 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.0919 0.0919 0.0919 23.56% VisMF::Read() 3 0.03855 0.03855 0.03855 9.89% Amr::writePlotFile() 1 0.03454 0.03454 0.03454 8.86% MLCellLinOp::applyBC() 1946 0.03209 0.03209 0.03209 8.23% MLPoisson::Fsmooth() 1440 0.02616 0.02616 0.02616 6.71% VisMF::Write(FabArray) 1 0.02378 0.02378 0.02378 6.10% StateData::FillBoundary(geom) 160 0.01152 0.01152 0.01152 2.96% Castro::normalize_species() 30 0.009893 0.009893 0.009893 2.54% MLCGSolver::bicgstab 36 0.009323 0.009323 0.009323 2.39% MultiFab::Dot() 484 0.009125 0.009125 0.009125 2.34% Castro::computeTemp() 30 0.008016 0.008016 0.008016 2.06% FabArray::setVal() 537 0.006431 0.006431 0.006431 1.65% FillBoundary_nowait() 1766 0.006308 0.006308 0.006308 1.62% Castro::enforce_min_density() 30 0.006302 0.006302 0.006302 1.62% FabArray::ParallelCopy_nowait() 380 0.006051 0.006051 0.006051 1.55% StateDataPhysBCFunct::() 20 0.005602 0.005602 0.005602 1.44% MLCellLinOp::defineAuxData() 6 0.00554 0.00554 0.00554 1.42% MultiFab::LinComb() 690 0.005477 0.005477 0.005477 1.40% Gravity::fill_multipole_BCs() 6 0.00473 0.00473 0.00473 1.21% MLPoisson::Fapply() 500 0.004662 0.004662 0.004662 1.20% Castro::estTimeStep() 10 0.003623 0.003623 0.003623 0.93% Amr::restart() 1 0.002917 0.002917 0.002917 0.75% MLMG::addInterpCorrection() 180 0.002916 0.002916 0.002916 0.75% Castro::do_advance_ctu() 5 0.002739 0.002739 0.002739 0.70% MultiFab::Xpay() 258 0.002716 0.002716 0.002716 0.70% amrex::average_down 180 0.002707 0.002707 0.002707 0.69% BndryData::define() 6 0.002109 0.002109 0.002109 0.54% Castro::reset_internal_energy(MultiFab) 30 0.001996 0.001996 0.001996 0.51% Castro::construct_new_gravity_source() 5 0.001594 0.001594 0.001594 0.41% Castro::construct_old_gravity_source() 5 0.001305 0.001305 0.001305 0.33% MultiFab::Saxpy() 10 0.0009297 0.0009297 0.0009297 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.000906 0.000906 0.000906 0.23% Gravity::get_old_grav_vector() 5 0.0008825 0.0008825 0.0008825 0.23% Castro::expand_state() 5 0.0008753 0.0008753 0.0008753 0.22% Gravity::get_new_grav_vector() 5 0.0008645 0.0008645 0.0008645 0.22% MLMG::ResNormInf() 42 0.0008523 0.0008523 0.0008523 0.22% Castro::enforce_speed_limit() 30 0.0008332 0.0008332 0.0008332 0.21% Castro::reset_internal_energy(Fab) 240 0.0007933 0.0007933 0.0007933 0.20% Gravity::actual_solve_with_mlmg() 6 0.0007403 0.0007403 0.0007403 0.19% MLMG::oneIter() 36 0.0007401 0.0007401 0.0007401 0.19% MLCellLinOp::setLevelBC() 6 0.0007234 0.0007234 0.0007234 0.19% FabArray::setDomainBndry() 20 0.0006656 0.0006656 0.0006656 0.17% FabArray::mult() 22 0.0006561 0.0006561 0.0006561 0.17% MLCellLinOp::prepareForSolve() 6 0.0005963 0.0005963 0.0005963 0.15% MultiFab::contains_nan() 10 0.0005942 0.0005942 0.0005942 0.15% MLCellLinOp::smooth() 720 0.0005831 0.0005831 0.0005831 0.15% MLCellLinOp::compGrad() 6 0.0004928 0.0004928 0.0004928 0.13% MLMG::prepareForSolve() 6 0.0004231 0.0004231 0.0004231 0.11% FabArrayBase::CPC::define() 244 0.0004119 0.0004119 0.0004119 0.11% FabArrayBase::getCPC() 632 0.0003996 0.0003996 0.0003996 0.10% Amr::InitAmr() 1 0.0003817 0.0003817 0.0003817 0.10% FabArray::FillBoundary() 1766 0.0003746 0.0003746 0.0003746 0.10% FabArrayBase::getFB() 1766 0.000281 0.000281 0.000281 0.07% main() 1 0.0002771 0.0002771 0.0002771 0.07% MLCellLinOp::apply() 500 0.0002324 0.0002324 0.0002324 0.06% Gravity::update_max_rhs() 6 0.0002281 0.0002281 0.0002281 0.06% CGSolver::sxay() 690 0.000203 0.000203 0.000203 0.05% Gravity::solve_for_phi() 5 0.0001891 0.0001891 0.0001891 0.05% MLLinOp::defineGrids() 6 0.000183 0.000183 0.000183 0.05% MLMG::mgVcycle() 36 0.000173 0.000173 0.000173 0.04% MLCGSolver::ParallelAllReduce 659 0.0001479 0.0001479 0.0001479 0.04% MultiFab::Copy() 6 0.0001424 0.0001424 0.0001424 0.04% FabArray::ParallelCopy() 380 0.0001398 0.0001398 0.0001398 0.04% MultiFab::max() 6 0.000137 0.000137 0.000137 0.04% Castro::subcycle_advance_ctu() 5 0.0001255 0.0001255 0.0001255 0.03% FillPatchIterator::Initialize 20 0.0001196 0.0001196 0.0001196 0.03% Castro::construct_new_gravity() 5 0.0001087 0.0001087 0.0001087 0.03% Amr::coarseTimeStep() 5 0.0001076 0.0001076 0.0001076 0.03% MLMG::MLRhsNormInf() 6 0.0001065 0.0001065 0.0001065 0.03% MLCellLinOp::correctionResidual() 216 0.0001061 0.0001061 0.0001061 0.03% MLCellLinOp::defineBC() 6 0.0001046 0.0001046 0.0001046 0.03% Amr::timeStep() 5 9.308e-05 9.308e-05 9.308e-05 0.02% Castro::advance() 5 8.097e-05 8.097e-05 8.097e-05 0.02% StateData::restartDoit() 4 7.291e-05 7.291e-05 7.291e-05 0.02% AmrLevel::restart() 1 6.916e-05 6.916e-05 6.916e-05 0.02% MLMG:computeResOfCorrection() 180 6.122e-05 6.122e-05 6.122e-05 0.02% FabArrayBase::FB::FB() 26 5.826e-05 5.826e-05 5.826e-05 0.01% Castro::initialize_advance() 5 5.533e-05 5.533e-05 5.533e-05 0.01% MLMG::actualBottomSolve() 36 4.697e-05 4.697e-05 4.697e-05 0.01% Castro::construct_new_source() 25 4.668e-05 4.668e-05 4.668e-05 0.01% Castro::initialize_do_advance() 5 4.281e-05 4.281e-05 4.281e-05 0.01% Castro::create_source_corrector() 5 4.244e-05 4.244e-05 4.244e-05 0.01% Castro::clean_state() 30 4.025e-05 4.025e-05 4.025e-05 0.01% MLMG::mgVcycle_down::0 36 3.852e-05 3.852e-05 3.852e-05 0.01% MLMG::mgVcycle_down::1 36 3.691e-05 3.691e-05 3.691e-05 0.01% MLMG::solve() 6 3.628e-05 3.628e-05 3.628e-05 0.01% MLMG::mgVcycle_down::2 36 3.508e-05 3.508e-05 3.508e-05 0.01% Castro::construct_old_source() 25 3.382e-05 3.382e-05 3.382e-05 0.01% MLMG::mgVcycle_down::4 36 3.356e-05 3.356e-05 3.356e-05 0.01% MLMG::mgVcycle_down::3 36 3.229e-05 3.229e-05 3.229e-05 0.01% Gravity::actual_multilevel_solve() 1 2.994e-05 2.994e-05 2.994e-05 0.01% Castro::buildMetrics() 1 2.987e-05 2.987e-05 2.987e-05 0.01% Castro::post_restart() 1 2.796e-05 2.796e-05 2.796e-05 0.01% MLMG::mgVcycle_up::4 36 2.722e-05 2.722e-05 2.722e-05 0.01% Castro::swap_state_time_levels() 5 2.647e-05 2.647e-05 2.647e-05 0.01% Castro::initMFs() 1 2.551e-05 2.551e-05 2.551e-05 0.01% Amr::writeSmallPlotFile() 1 2.472e-05 2.472e-05 2.472e-05 0.01% Castro::finalize_advance() 5 2.436e-05 2.436e-05 2.436e-05 0.01% MLCellLinOp::solutionResidual() 42 2.359e-05 2.359e-05 2.359e-05 0.01% MLMG::mgVcycle_up::0 36 2.293e-05 2.293e-05 2.293e-05 0.01% MLMG::mgVcycle_up::3 36 2.256e-05 2.256e-05 2.256e-05 0.01% MLMG::mgVcycle_up::2 36 2.237e-05 2.237e-05 2.237e-05 0.01% MLMG::mgVcycle_up::1 36 2.151e-05 2.151e-05 2.151e-05 0.01% MLLinOp::define() 6 2.032e-05 2.032e-05 2.032e-05 0.01% Castro::finalize_do_advance() 5 1.9e-05 1.9e-05 1.9e-05 0.00% MLMG::computeResidual() 36 1.674e-05 1.674e-05 1.674e-05 0.00% MLMG::mgVcycle_bottom 36 1.63e-05 1.63e-05 1.63e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.589e-05 1.589e-05 1.589e-05 0.00% FillPatchSingleLevel 20 1.444e-05 1.444e-05 1.444e-05 0.00% MLPoisson::define() 6 1.391e-05 1.391e-05 1.391e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.387e-05 1.387e-05 1.387e-05 0.00% makeSFC 30 1.359e-05 1.359e-05 1.359e-05 0.00% DistributionMapping::Distribute() 31 9.137e-06 9.137e-06 9.137e-06 0.00% Castro::do_new_sources() 5 9.131e-06 9.131e-06 9.131e-06 0.00% Amr::initSubcycle() 1 9.053e-06 9.053e-06 9.053e-06 0.00% Castro::do_old_sources() 5 8.626e-06 8.626e-06 8.626e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.811e-06 7.811e-06 7.811e-06 0.00% Castro::check_for_nan() 10 6.708e-06 6.708e-06 6.708e-06 0.00% Castro::apply_source_to_state() 10 6.39e-06 6.39e-06 6.39e-06 0.00% Castro::post_timestep() 5 5.351e-06 5.351e-06 5.351e-06 0.00% Castro::construct_old_gravity() 5 5.201e-06 5.201e-06 5.201e-06 0.00% Gravity::swapTimeLevels() 5 5.011e-06 5.011e-06 5.011e-06 0.00% MLPoisson::prepareForSolve() 6 4.392e-06 4.392e-06 4.392e-06 0.00% Castro::computeNewDt() 5 3.593e-06 3.593e-06 3.593e-06 0.00% MLMG::computeMLResidual() 6 3.299e-06 3.299e-06 3.299e-06 0.00% MLMG::buildFineMask() 6 3.175e-06 3.175e-06 3.175e-06 0.00% MLMG::getGradSolution() 6 3.101e-06 3.101e-06 3.101e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.015e-06 3.015e-06 3.015e-06 0.00% MLMG::MLResNormInf() 6 2.633e-06 2.633e-06 2.633e-06 0.00% Castro::FluxRegCrseInit 5 2.196e-06 2.196e-06 2.196e-06 0.00% Gravity::set_mass_offset() 6 2.034e-06 2.034e-06 2.034e-06 0.00% Castro::retry_advance_ctu() 5 1.912e-06 1.912e-06 1.912e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.281e-06 1.281e-06 1.281e-06 0.00% Castro::FluxRegFineAdd() 5 1.068e-06 1.068e-06 1.068e-06 0.00% Amr::init() 1 9.94e-07 9.94e-07 9.94e-07 0.00% AmrLevel::AmrLevel() 1 8.87e-07 8.87e-07 8.87e-07 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.39 0.39 0.39 100.00% Amr::coarseTimeStep() 5 0.2854 0.2854 0.2854 73.18% Amr::timeStep() 5 0.2833 0.2833 0.2833 72.64% Castro::advance() 5 0.2787 0.2787 0.2787 71.46% Castro::subcycle_advance_ctu() 5 0.2712 0.2712 0.2712 69.55% Castro::do_advance_ctu() 5 0.2711 0.2711 0.2711 69.51% Castro::construct_new_gravity() 5 0.1359 0.1359 0.1359 34.86% Gravity::solve_phi_with_mlmg() 6 0.1318 0.1318 0.1318 33.79% Gravity::solve_for_phi() 5 0.1284 0.1284 0.1284 32.93% Gravity::actual_solve_with_mlmg() 6 0.1269 0.1269 0.1269 32.54% MLMG::solve() 6 0.1154 0.1154 0.1154 29.58% MLMG::oneIter() 36 0.1087 0.1087 0.1087 27.87% MLMG::mgVcycle() 36 0.1079 0.1079 0.1079 27.68% Castro::construct_ctu_hydro_source() 5 0.09187 0.09187 0.09187 23.56% Amr::writePlotFile() 1 0.05832 0.05832 0.05832 14.95% MLCellLinOp::smooth() 720 0.05561 0.05561 0.05561 14.26% Amr::init() 1 0.04556 0.04556 0.04556 11.68% Amr::restart() 1 0.04556 0.04556 0.04556 11.68% MLCellLinOp::applyBC() 1946 0.03911 0.03911 0.03911 10.03% AmrLevel::restart() 1 0.03876 0.03876 0.03876 9.94% StateData::restartDoit() 4 0.03869 0.03869 0.03869 9.92% VisMF::Read() 3 0.03855 0.03855 0.03855 9.89% MLMG::mgVcycle_bottom 36 0.03245 0.03245 0.03245 8.32% MLMG::actualBottomSolve() 36 0.03243 0.03243 0.03243 8.32% MLCGSolver::bicgstab 36 0.03211 0.03211 0.03211 8.23% Castro::clean_state() 30 0.02787 0.02787 0.02787 7.15% MLPoisson::Fsmooth() 1440 0.02616 0.02616 0.02616 6.71% VisMF::Write(FabArray) 1 0.02378 0.02378 0.02378 6.10% FillPatchIterator::Initialize 20 0.0199 0.0199 0.0199 5.10% FillPatchSingleLevel 20 0.01912 0.01912 0.01912 4.90% StateDataPhysBCFunct::() 20 0.01713 0.01713 0.01713 4.39% MLMG::mgVcycle_down::0 36 0.0151 0.0151 0.0151 3.87% MLCellLinOp::apply() 500 0.01487 0.01487 0.01487 3.81% MLMG::mgVcycle_up::0 36 0.01289 0.01289 0.01289 3.31% StateData::FillBoundary(geom) 160 0.01152 0.01152 0.01152 2.96% Castro::initialize_do_advance() 5 0.01102 0.01102 0.01102 2.83% Castro::computeTemp() 30 0.01081 0.01081 0.01081 2.77% Castro::normalize_species() 30 0.009893 0.009893 0.009893 2.54% MLPoisson::define() 6 0.009318 0.009318 0.009318 2.39% MultiFab::Dot() 484 0.009125 0.009125 0.009125 2.34% MLCellLinOp::correctionResidual() 216 0.00869 0.00869 0.00869 2.23% MLMG:computeResOfCorrection() 180 0.007526 0.007526 0.007526 1.93% Gravity::get_new_grav_vector() 5 0.007411 0.007411 0.007411 1.90% Castro::initialize_advance() 5 0.007343 0.007343 0.007343 1.88% Castro::construct_old_gravity() 5 0.007214 0.007214 0.007214 1.85% Gravity::get_old_grav_vector() 5 0.007209 0.007209 0.007209 1.85% MLMG::mgVcycle_down::1 36 0.007173 0.007173 0.007173 1.84% FabArray::FillBoundary() 1766 0.007021 0.007021 0.007021 1.80% MLMG::mgVcycle_down::2 36 0.006924 0.006924 0.006924 1.78% Castro::do_new_sources() 5 0.006694 0.006694 0.006694 1.72% FillBoundary_nowait() 1766 0.006647 0.006647 0.006647 1.70% FabArray::ParallelCopy() 380 0.006592 0.006592 0.006592 1.69% MLMG::mgVcycle_down::3 36 0.006504 0.006504 0.006504 1.67% FabArray::ParallelCopy_nowait() 380 0.006452 0.006452 0.006452 1.65% FabArray::setVal() 537 0.006431 0.006431 0.006431 1.65% MLCellLinOp::defineAuxData() 6 0.006309 0.006309 0.006309 1.62% Castro::enforce_min_density() 30 0.006302 0.006302 0.006302 1.62% MLMG::mgVcycle_down::4 36 0.006207 0.006207 0.006207 1.59% Castro::expand_state() 5 0.005862 0.005862 0.005862 1.50% CGSolver::sxay() 690 0.00568 0.00568 0.00568 1.46% Castro::do_old_sources() 5 0.00556 0.00556 0.00556 1.43% MLCGSolver::ParallelAllReduce 659 0.005486 0.005486 0.005486 1.41% MultiFab::LinComb() 690 0.005477 0.005477 0.005477 1.40% MLMG::mgVcycle_up::2 36 0.005335 0.005335 0.005335 1.37% MLMG::mgVcycle_up::1 36 0.005232 0.005232 0.005232 1.34% MLMG::addInterpCorrection() 180 0.005224 0.005224 0.005224 1.34% amrex::average_down 180 0.005011 0.005011 0.005011 1.28% MLMG::mgVcycle_up::3 36 0.004994 0.004994 0.004994 1.28% MLMG::mgVcycle_up::4 36 0.004961 0.004961 0.004961 1.27% Gravity::fill_multipole_BCs() 6 0.00473 0.00473 0.00473 1.21% MLPoisson::Fapply() 500 0.004662 0.004662 0.004662 1.20% Castro::post_timestep() 5 0.004529 0.004529 0.004529 1.16% Castro::post_restart() 1 0.00369 0.00369 0.00369 0.95% Castro::estTimeStep() 10 0.003623 0.003623 0.003623 0.93% Gravity::multilevel_solve_for_new_phi() 1 0.003574 0.003574 0.003574 0.92% Gravity::actual_multilevel_solve() 1 0.003558 0.003558 0.003558 0.91% MLCellLinOp::solutionResidual() 42 0.003186 0.003186 0.003186 0.82% Castro::reset_internal_energy(MultiFab) 30 0.00279 0.00279 0.00279 0.72% MLCellLinOp::defineBC() 6 0.002762 0.002762 0.002762 0.71% MultiFab::Xpay() 258 0.002716 0.002716 0.002716 0.70% BndryData::define() 6 0.002657 0.002657 0.002657 0.68% MLMG::computeResidual() 36 0.002638 0.002638 0.002638 0.68% MLMG::prepareForSolve() 6 0.002489 0.002489 0.002489 0.64% Castro::computeNewDt() 5 0.001991 0.001991 0.001991 0.51% Castro::construct_new_source() 25 0.00164 0.00164 0.00164 0.42% Castro::construct_new_gravity_source() 5 0.001594 0.001594 0.001594 0.41% Castro::construct_old_source() 25 0.001339 0.001339 0.001339 0.34% Castro::construct_old_gravity_source() 5 0.001305 0.001305 0.001305 0.33% Castro::apply_source_to_state() 10 0.000936 0.000936 0.000936 0.24% MultiFab::Saxpy() 10 0.0009297 0.0009297 0.0009297 0.24% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.000906 0.000906 0.000906 0.23% MLMG::ResNormInf() 42 0.0008523 0.0008523 0.0008523 0.22% Castro::enforce_speed_limit() 30 0.0008332 0.0008332 0.0008332 0.21% FabArrayBase::getCPC() 632 0.0008115 0.0008115 0.0008115 0.21% Castro::reset_internal_energy(Fab) 240 0.0007933 0.0007933 0.0007933 0.20% MLMG::getGradSolution() 6 0.0007581 0.0007581 0.0007581 0.19% MLCellLinOp::compGrad() 6 0.000755 0.000755 0.000755 0.19% MLCellLinOp::setLevelBC() 6 0.0007234 0.0007234 0.0007234 0.19% FabArray::setDomainBndry() 20 0.0006656 0.0006656 0.0006656 0.17% FabArray::mult() 22 0.0006561 0.0006561 0.0006561 0.17% Castro::check_for_nan() 10 0.0006009 0.0006009 0.0006009 0.15% MLPoisson::prepareForSolve() 6 0.0006007 0.0006007 0.0006007 0.15% MLCellLinOp::prepareForSolve() 6 0.0005963 0.0005963 0.0005963 0.15% MultiFab::contains_nan() 10 0.0005942 0.0005942 0.0005942 0.15% MLMG::computeMLResidual() 6 0.0005679 0.0005679 0.0005679 0.15% Gravity::update_max_rhs() 6 0.0004452 0.0004452 0.0004452 0.11% FabArrayBase::CPC::define() 244 0.0004119 0.0004119 0.0004119 0.11% Amr::InitAmr() 1 0.0003907 0.0003907 0.0003907 0.10% FabArrayBase::getFB() 1766 0.0003393 0.0003393 0.0003393 0.09% MLLinOp::define() 6 0.0002336 0.0002336 0.0002336 0.06% Gravity::swapTimeLevels() 5 0.000233 0.000233 0.000233 0.06% MLLinOp::defineGrids() 6 0.0002133 0.0002133 0.0002133 0.05% Castro::buildMetrics() 1 0.0001623 0.0001623 0.0001623 0.04% MultiFab::Copy() 6 0.0001424 0.0001424 0.0001424 0.04% MultiFab::max() 6 0.000137 0.000137 0.000137 0.04% MLMG::MLResNormInf() 6 0.0001345 0.0001345 0.0001345 0.03% MLMG::MLRhsNormInf() 6 0.0001065 0.0001065 0.0001065 0.03% FabArrayBase::FB::FB() 26 5.826e-05 5.826e-05 5.826e-05 0.01% Castro::create_source_corrector() 5 4.244e-05 4.244e-05 4.244e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.899e-05 2.899e-05 2.899e-05 0.01% Castro::finalize_advance() 5 2.763e-05 2.763e-05 2.763e-05 0.01% Castro::swap_state_time_levels() 5 2.647e-05 2.647e-05 2.647e-05 0.01% Castro::initMFs() 1 2.551e-05 2.551e-05 2.551e-05 0.01% Amr::writeSmallPlotFile() 1 2.472e-05 2.472e-05 2.472e-05 0.01% makeSFC 30 2.118e-05 2.118e-05 2.118e-05 0.01% Castro::finalize_do_advance() 5 1.9e-05 1.9e-05 1.9e-05 0.00% DistributionMapping::Distribute() 31 9.137e-06 9.137e-06 9.137e-06 0.00% Amr::initSubcycle() 1 9.053e-06 9.053e-06 9.053e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.563e-06 4.563e-06 4.563e-06 0.00% MLMG::buildFineMask() 6 3.175e-06 3.175e-06 3.175e-06 0.00% Castro::FluxRegCrseInit 5 2.196e-06 2.196e-06 2.196e-06 0.00% Gravity::set_mass_offset() 6 2.034e-06 2.034e-06 2.034e-06 0.00% Castro::retry_advance_ctu() 5 1.912e-06 1.912e-06 1.912e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.281e-06 1.281e-06 1.281e-06 0.00% Castro::FluxRegFineAdd() 5 1.068e-06 1.068e-06 1.068e-06 0.00% AmrLevel::AmrLevel() 1 8.87e-07 8.87e-07 8.87e-07 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Total GPU global memory (MB): 12066 Free GPU global memory (MB): 2465 [The Arena] space allocated (MB): 9049 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (22.04-38-g3d344ec19655) finalized