Initializing CUDA...
CUDA initialized with 1 device.
AMReX (23.10-4-ge470d3350ed3) initialized
Starting run at 07:56:07 UTC on 2023-10-03.
Successfully read inputs file ...
Castro git describe: 23.10-4-g104a57d85
AMReX git describe: 23.10-4-ge470d3350
Microphysics git describe: 23.10-1-g4803fc8b
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
Successfully read inputs file ...
INITIAL GRIDS
Level 0 8 grids 262144 cells 100 % of domain
smallest grid: 32 x 32 x 32 biggest grid: 32 x 32 x 32
CHECKPOINT: file = dustcollapse-restart_chk00000
checkPoint() time = 0.05111204 secs.
PLOTFILE: file = dustcollapse-restart_plt00000
Write plotfile time = 0.029073953 seconds
[Level 0 step 1] ADVANCE at time 0 with dt = 4.541742215e-05
[Level 0 step 1] Advanced 262144 cells
[STEP 1] Coarse TimeStep time: 0.050574282
[STEP 1] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 1 TIME = 4.541742215e-05 DT = 4.541742215e-05
[Level 0 step 2] ADVANCE at time 4.541742215e-05 with dt = 4.768829326e-05
[Level 0 step 2] Advanced 262144 cells
[STEP 2] Coarse TimeStep time: 0.050854503
[STEP 2] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 2 TIME = 9.31057154e-05 DT = 4.768829326e-05
[Level 0 step 3] ADVANCE at time 9.31057154e-05 with dt = 5.007270792e-05
[Level 0 step 3] Advanced 262144 cells
[STEP 3] Coarse TimeStep time: 0.074520754
[STEP 3] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 3 TIME = 0.0001431784233 DT = 5.007270792e-05
[Level 0 step 4] ADVANCE at time 0.0001431784233 with dt = 5.257634331e-05
[Level 0 step 4] Advanced 262144 cells
[STEP 4] Coarse TimeStep time: 0.076402558
[STEP 4] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 4 TIME = 0.0001957547666 DT = 5.257634331e-05
[Level 0 step 5] ADVANCE at time 0.0001957547666 with dt = 5.520516048e-05
[Level 0 step 5] Advanced 262144 cells
[STEP 5] Coarse TimeStep time: 0.070445248
[STEP 5] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 5 TIME = 0.0002509599271 DT = 5.520516048e-05
CHECKPOINT: file = dustcollapse-restart_chk00005
checkPoint() time = 0.049546989 secs.
[Level 0 step 6] ADVANCE at time 0.0002509599271 with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.069497595
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE at time 0.0003089253456 with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.075453685
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE at time 0.000369789035 with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.073778572
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE at time 0.000433695909 with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.049933661
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE at time 0.0005007981265 with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.051456337
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
CHECKPOINT: file = dustcollapse-restart_chk00010
checkPoint() time = 0.048935076 secs.
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.028319567 seconds
Ending run at 07:56:08 UTC on 2023-10-03.
Run time = 0.905182219 Run time without initialization = 0.770421162 Average number of zones advanced per microsecond: 3.403 Average number of zones advanced per microsecond per rank: 3.403 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.9052 ... 0.9052 ... 0.9052 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 10 0.2652 0.2652 0.2652 29.30% VisMF::Write(FabArray) 11 0.1992 0.1992 0.1992 22.00% MLCellLinOp::applyBC() 4433 0.07805 0.07805 0.07805 8.62% MLPoisson::Fsmooth() 3280 0.03315 0.03315 0.03315 3.66% FillBoundary_nowait() 4023 0.0308 0.0308 0.0308 3.40% StateData::FillBoundary(geom) 328 0.02625 0.02625 0.02625 2.90% Castro::normalize_species() 62 0.02194 0.02194 0.02194 2.42% amrex::Dot() 1114 0.02105 0.02105 0.02105 2.32% Castro::computeTemp() 63 0.0173 0.0173 0.0173 1.91% amrex::Copy() 1029 0.01506 0.01506 0.01506 1.66% FabArray::norminf() 743 0.01471 0.01471 0.01471 1.63% FabArray::setVal() 1144 0.01349 0.01349 0.01349 1.49% FabArray::ParallelCopy_nowait() 861 0.01346 0.01346 0.01346 1.49% Castro::enforce_min_density() 62 0.01201 0.01201 0.01201 1.33% StateDataPhysBCFunct::() 41 0.01077 0.01077 0.01077 1.19% MLPoisson::Fapply() 1142 0.0106 0.0106 0.0106 1.17% MLCellLinOp::defineAuxData() 11 0.01015 0.01015 0.01015 1.12% Gravity::fill_multipole_BCs() 11 0.009103 0.009103 0.009103 1.01% FabArray::Saxpy() 813 0.008402 0.008402 0.008402 0.93% FabArray::Xpay() 821 0.008246 0.008246 0.008246 0.91% MLMG::addInterpCorrection() 410 0.006845 0.006845 0.006845 0.76% Castro::estTimeStep() 21 0.006238 0.006238 0.006238 0.69% amrex::average_down 410 0.006212 0.006212 0.006212 0.69% Castro::reset_internal_energy(MultiFab) 63 0.00523 0.00523 0.00523 0.58% Amr::checkPoint() 
3 0.005134 0.005134 0.005134 0.57% FabArray::LinComb() 557 0.00464 0.00464 0.00464 0.51% amrex::Add() 164 0.004447 0.004447 0.004447 0.49% BndryData::define() 11 0.003873 0.003873 0.003873 0.43% Castro::construct_new_gravity_source() 10 0.003296 0.003296 0.003296 0.36% Castro::enforce_speed_limit() 62 0.003176 0.003176 0.003176 0.35% Castro::construct_old_gravity_source() 10 0.002868 0.002868 0.002868 0.32% MLCGSolver::bicgstab 82 0.002258 0.002258 0.002258 0.25% Amr::writePlotFile() 2 0.002195 0.002195 0.002195 0.24% check_for_negative_density() 10 0.001833 0.001833 0.001833 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001741 0.001741 0.001741 0.19% Castro::reset_internal_energy(Fab) 504 0.001705 0.001705 0.001705 0.19% Gravity::actual_solve_with_mlmg() 11 0.001562 0.001562 0.001562 0.17% Castro::initData() 1 0.001487 0.001487 0.001487 0.16% MLCellLinOp::setLevelBC() 11 0.001474 0.001474 0.001474 0.16% FabArray::setDomainBndry() 41 0.00136 0.00136 0.00136 0.15% FabArray::mult() 43 0.001356 0.001356 0.001356 0.15% MLCellLinOp::prepareForSolve() 11 0.0013 0.0013 0.0013 0.14% MultiFab::contains_nan() 20 0.001281 0.001281 0.001281 0.14% MLCellLinOp::smooth() 1640 0.001089 0.001089 0.001089 0.12% MLCellLinOp::compGrad() 11 0.001051 0.001051 0.001051 0.12% MLMG::prepareForSolve() 11 0.0009417 0.0009417 0.0009417 0.10% FabArray::FillBoundary() 4023 0.0008179 0.0008179 0.0008179 0.09% FabArrayBase::getCPC() 1323 0.0007975 0.0007975 0.0007975 0.09% FabArrayBase::CPC::define() 454 0.0006977 0.0006977 0.0006977 0.08% FabArrayBase::getFB() 4023 0.0006239 0.0006239 0.0006239 0.07% Gravity::get_new_grav_vector() 11 0.0006036 0.0006036 0.0006036 0.07% Amr::InitAmr() 1 0.0004886 0.0004886 0.0004886 0.05% Gravity::get_old_grav_vector() 10 0.0004754 0.0004754 0.0004754 0.05% MLCellLinOp::apply() 1142 0.0004713 0.0004713 0.0004713 0.05% Amr::coarseTimeStep() 10 0.0004148 0.0004148 0.0004148 0.05% AmrLevel::FillPatch() 41 0.0003338 0.0003338 0.0003338 0.04% MultiFab::max() 
11 0.0003274 0.0003274 0.0003274 0.04% MLCGSolver::ParallelAllReduce 1514 0.0002923 0.0002923 0.0002923 0.03% Castro::subcycle_advance_ctu() 10 0.0002639 0.0002639 0.0002639 0.03% main() 1 0.0002594 0.0002594 0.0002594 0.03% FabArray::ParallelCopy() 861 0.0002416 0.0002416 0.0002416 0.03% MLCellLinOp::correctionResidual() 492 0.0002253 0.0002253 0.0002253 0.02% MLCellLinOp::defineBC() 11 0.0002241 0.0002241 0.0002241 0.02% FillPatchIterator::Initialize 41 0.000216 0.000216 0.000216 0.02% MLMG::mgVcycle() 82 0.0002062 0.0002062 0.0002062 0.02% Castro::create_source_corrector() 10 0.000187 0.000187 0.000187 0.02% MLLinOp::defineGrids() 11 0.0001568 0.0001568 0.0001568 0.02% Amr::timeStep() 10 0.0001487 0.0001487 0.0001487 0.02% Gravity::update_max_rhs() 11 0.0001431 0.0001431 0.0001431 0.02% Gravity::solve_for_phi() 10 0.0001365 0.0001365 0.0001365 0.02% MLMG:computeResOfCorrection() 410 0.0001217 0.0001217 0.0001217 0.01% Castro::advance() 10 0.0001146 0.0001146 0.0001146 0.01% StateData::checkPoint() 12 0.0001145 0.0001145 0.0001145 0.01% Castro::do_new_sources() 10 0.0001065 0.0001065 0.0001065 0.01% Castro::Castro() 1 9.676e-05 9.676e-05 9.676e-05 0.01% FabArrayBase::FB::FB() 56 9.511e-05 9.511e-05 9.511e-05 0.01% MLMG::actualBottomSolve() 82 8.896e-05 8.896e-05 8.896e-05 0.01% MLMG::mgVcycle_down::0 82 8.459e-05 8.459e-05 8.459e-05 0.01% Castro::enforce_consistent_e() 1 7.691e-05 7.691e-05 7.691e-05 0.01% MLMG::mgVcycle_down::1 82 7.655e-05 7.655e-05 7.655e-05 0.01% MLMG::mgVcycle_down::2 82 7.556e-05 7.556e-05 7.556e-05 0.01% Castro::initialize_advance() 10 7.524e-05 7.524e-05 7.524e-05 0.01% MLMG::solve() 11 7.447e-05 7.447e-05 7.447e-05 0.01% MLMG::mgVcycle_down::3 82 7.407e-05 7.407e-05 7.407e-05 0.01% MLMG::mgVcycle_down::4 82 7.291e-05 7.291e-05 7.291e-05 0.01% Castro::clean_state() 62 7.039e-05 7.039e-05 7.039e-05 0.01% Castro::construct_new_source() 50 6.692e-05 6.692e-05 6.692e-05 0.01% AmrLevel::checkPoint() 3 6.274e-05 6.274e-05 6.274e-05 0.01% 
Castro::finalize_advance() 10 6.179e-05 6.179e-05 6.179e-05 0.01% MLMG::mgVcycle_up::4 82 6.03e-05 6.03e-05 6.03e-05 0.01% Castro::initialize_do_advance() 10 5.842e-05 5.842e-05 5.842e-05 0.01% MLMG::oneIter() 82 5.459e-05 5.459e-05 5.459e-05 0.01% MLMG::mgVcycle_up::0 82 5.188e-05 5.188e-05 5.188e-05 0.01% MLMG::mgVcycle_up::3 82 4.984e-05 4.984e-05 4.984e-05 0.01% MLCellLinOp::solutionResidual() 93 4.886e-05 4.886e-05 4.886e-05 0.01% Castro::do_advance_ctu() 10 4.858e-05 4.858e-05 4.858e-05 0.01% MLMG::mgVcycle_up::1 82 4.736e-05 4.736e-05 4.736e-05 0.01% FillPatchIterator::FillFromLevel0() 41 4.668e-05 4.668e-05 4.668e-05 0.01% MLMG::mgVcycle_up::2 82 4.663e-05 4.663e-05 4.663e-05 0.01% Castro::finalize_do_advance() 10 4.203e-05 4.203e-05 4.203e-05 0.00% Amr::writeSmallPlotFile() 1 3.665e-05 3.665e-05 3.665e-05 0.00% MLMG::mgVcycle_bottom 82 3.58e-05 3.58e-05 3.58e-05 0.00% StateData::define() 4 3.563e-05 3.563e-05 3.563e-05 0.00% Castro::swap_state_time_levels() 10 3.544e-05 3.544e-05 3.544e-05 0.00% FillPatchSingleLevel 41 3.466e-05 3.466e-05 3.466e-05 0.00% MLMG::computeResidual() 82 3.442e-05 3.442e-05 3.442e-05 0.00% Gravity::solve_phi_with_mlmg() 11 3.341e-05 3.341e-05 3.341e-05 0.00% Amr::defBaseLevel() 1 3.057e-05 3.057e-05 3.057e-05 0.00% MLMG::ResNormInf() 93 3.034e-05 3.034e-05 3.034e-05 0.00% Castro::initMFs() 1 2.688e-05 2.688e-05 2.688e-05 0.00% Castro::buildMetrics() 1 2.632e-05 2.632e-05 2.632e-05 0.00% makeSFC 55 2.475e-05 2.475e-05 2.475e-05 0.00% Castro::construct_new_gravity() 10 2.343e-05 2.343e-05 2.343e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 2.163e-05 2.163e-05 2.163e-05 0.00% Castro::do_old_sources() 10 2.163e-05 2.163e-05 2.163e-05 0.00% MLPoisson::define() 11 2.066e-05 2.066e-05 2.066e-05 0.00% Amr::FinalizeInit() 1 1.862e-05 1.862e-05 1.862e-05 0.00% Castro::construct_old_source() 50 1.791e-05 1.791e-05 1.791e-05 0.00% DistributionMapping::Distribute() 56 1.658e-05 1.658e-05 1.658e-05 0.00% AmrLevel::AmrLevel(dm) 1 1.47e-05 
1.47e-05 1.47e-05 0.00% MLLinOp::define() 11 1.457e-05 1.457e-05 1.457e-05 0.00% MLLinOp::makeAgglomeratedDMap 11 1.313e-05 1.313e-05 1.313e-05 0.00% Castro::check_for_nan() 20 1.268e-05 1.268e-05 1.268e-05 0.00% Castro::apply_source_to_state() 20 1.248e-05 1.248e-05 1.248e-05 0.00% Castro::construct_old_gravity() 10 1.185e-05 1.185e-05 1.185e-05 0.00% Castro::post_init() 1 9.726e-06 9.726e-06 9.726e-06 0.00% Amr::initSubcycle() 1 9.228e-06 9.228e-06 9.228e-06 0.00% Gravity::swapTimeLevels() 10 8.971e-06 8.971e-06 8.971e-06 0.00% MLMG::computeMLResidual() 11 8.758e-06 8.758e-06 8.758e-06 0.00% MLMG::MLRhsNormInf() 11 8.232e-06 8.232e-06 8.232e-06 0.00% Gravity::actual_multilevel_solve() 1 8.037e-06 8.037e-06 8.037e-06 0.00% MLPoisson::prepareForSolve() 11 7.797e-06 7.797e-06 7.797e-06 0.00% Castro::post_timestep() 10 7.634e-06 7.634e-06 7.634e-06 0.00% Castro::computeNewDt() 9 7.141e-06 7.141e-06 7.141e-06 0.00% MLMG::getGradSolution() 11 5.583e-06 5.583e-06 5.583e-06 0.00% Castro::expand_state() 10 5.273e-06 5.273e-06 5.273e-06 0.00% AmrLevel::checkPointPost() 3 4.903e-06 4.903e-06 4.903e-06 0.00% Castro::retry_advance_ctu() 10 4.671e-06 4.671e-06 4.671e-06 0.00% Amr::InitializeInit() 1 4.411e-06 4.411e-06 4.411e-06 0.00% MLMG::MLResNormInf() 11 3.856e-06 3.856e-06 3.856e-06 0.00% Gravity::set_mass_offset() 11 3.728e-06 3.728e-06 3.728e-06 0.00% Castro::computeInitialDt() 2 3.427e-06 3.427e-06 3.427e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 2.946e-06 2.946e-06 2.946e-06 0.00% Castro::FluxRegCrseInit 10 2.845e-06 2.845e-06 2.845e-06 0.00% Amr::init() 1 2.389e-06 2.389e-06 2.389e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.129e-06 2.129e-06 2.129e-06 0.00% Castro::FluxRegFineAdd() 10 2.127e-06 2.127e-06 2.127e-06 0.00% AmrLevel::checkPointPre() 3 2.098e-06 2.098e-06 2.098e-06 0.00% Castro::post_regrid() 1 1.247e-06 1.247e-06 1.247e-06 0.00% Amr::initialInit() 1 1.017e-06 1.017e-06 1.017e-06 0.00% 
-------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. Max Max % -------------------------------------------------------------------------------------------- main() 1 0.9052 0.9052 0.9052 100.00% Amr::coarseTimeStep() 10 0.7419 0.7419 0.7419 81.96% Amr::timeStep() 10 0.6393 0.6393 0.6393 70.62% Castro::advance() 10 0.6267 0.6267 0.6267 69.24% Castro::subcycle_advance_ctu() 10 0.6129 0.6129 0.6129 67.71% Castro::do_advance_ctu() 10 0.6127 0.6127 0.6127 67.68% Gravity::solve_phi_with_mlmg() 11 0.2913 0.2913 0.2913 32.18% Gravity::actual_solve_with_mlmg() 11 0.2817 0.2817 0.2817 31.12% Castro::construct_ctu_hydro_source() 10 0.2763 0.2763 0.2763 30.52% Castro::construct_new_gravity() 10 0.2627 0.2627 0.2627 29.02% MLMG::solve() 11 0.2602 0.2602 0.2602 28.75% Gravity::solve_for_phi() 10 0.2468 0.2468 0.2468 27.27% MLMG::oneIter() 82 0.2452 0.2452 0.2452 27.08% MLMG::mgVcycle() 82 0.2415 0.2415 0.2415 26.68% VisMF::Write(FabArray) 11 0.1992 0.1992 0.1992 22.00% Amr::checkPoint() 3 0.1497 0.1497 0.1497 16.54% AmrLevel::checkPoint() 3 0.1446 0.1446 0.1446 15.97% StateData::checkPoint() 12 0.1445 0.1445 0.1445 15.96% Amr::init() 1 0.1342 0.1342 0.1342 14.82% MLCellLinOp::smooth() 1640 0.1183 0.1183 0.1183 13.07% MLCellLinOp::applyBC() 4433 0.1104 0.1104 0.1104 12.19% MLMG::mgVcycle_bottom 82 0.07542 0.07542 0.07542 8.33% MLMG::actualBottomSolve() 82 0.07538 0.07538 0.07538 8.33% MLCGSolver::bicgstab 82 0.07466 0.07466 0.07466 8.25% Castro::clean_state() 62 0.06043 0.06043 0.06043 6.68% Amr::writePlotFile() 2 0.05751 0.05751 0.05751 6.35% Amr::initialInit() 1 0.05386 0.05386 0.05386 5.95% Amr::FinalizeInit() 1 0.04871 0.04871 0.04871 5.38% Castro::post_init() 1 0.04721 0.04721 0.04721 5.22% AmrLevel::FillPatch() 41 0.04686 0.04686 0.04686 5.18% Gravity::multilevel_solve_for_new_phi() 1 0.0449 0.0449 0.0449 
4.96% Gravity::actual_multilevel_solve() 1 0.04488 0.04488 0.04488 4.96% FillPatchIterator::Initialize 41 0.04268 0.04268 0.04268 4.72% FillPatchIterator::FillFromLevel0() 41 0.04111 0.04111 0.04111 4.54% FillPatchSingleLevel 41 0.04106 0.04106 0.04106 4.54% StateDataPhysBCFunct::() 41 0.03702 0.03702 0.03702 4.09% MLCellLinOp::apply() 1142 0.03693 0.03693 0.03693 4.08% MLMG::mgVcycle_down::0 82 0.03387 0.03387 0.03387 3.74% MLPoisson::Fsmooth() 3280 0.03315 0.03315 0.03315 3.66% FabArray::FillBoundary() 4023 0.03234 0.03234 0.03234 3.57% FillBoundary_nowait() 4023 0.03152 0.03152 0.03152 3.48% StateData::FillBoundary(geom) 328 0.02625 0.02625 0.02625 2.90% MLMG::mgVcycle_up::0 82 0.02552 0.02552 0.02552 2.82% Castro::computeTemp() 63 0.02424 0.02424 0.02424 2.68% MLCellLinOp::correctionResidual() 492 0.02241 0.02241 0.02241 2.48% Castro::normalize_species() 62 0.02194 0.02194 0.02194 2.42% Castro::initialize_do_advance() 10 0.02137 0.02137 0.02137 2.36% amrex::Dot() 1114 0.02105 0.02105 0.02105 2.32% MLMG:computeResOfCorrection() 410 0.01972 0.01972 0.01972 2.18% Castro::do_old_sources() 10 0.01933 0.01933 0.01933 2.14% Gravity::get_new_grav_vector() 11 0.01771 0.01771 0.01771 1.96% MLPoisson::define() 11 0.01689 0.01689 0.01689 1.87% MLMG::mgVcycle_down::1 82 0.01622 0.01622 0.01622 1.79% Castro::construct_old_gravity() 10 0.01515 0.01515 0.01515 1.67% Gravity::get_old_grav_vector() 10 0.01513 0.01513 0.01513 1.67% amrex::Copy() 1029 0.01506 0.01506 0.01506 1.66% MLMG::mgVcycle_down::2 82 0.01501 0.01501 0.01501 1.66% FabArray::norminf() 743 0.01471 0.01471 0.01471 1.63% MLMG::mgVcycle_down::3 82 0.01466 0.01466 0.01466 1.62% FabArray::ParallelCopy() 861 0.01454 0.01454 0.01454 1.61% MLMG::mgVcycle_down::4 82 0.01447 0.01447 0.01447 1.60% Castro::do_new_sources() 10 0.01447 0.01447 0.01447 1.60% FabArray::ParallelCopy_nowait() 861 0.0143 0.0143 0.0143 1.58% FabArray::setVal() 1144 0.01349 0.01349 0.01349 1.49% Castro::initialize_advance() 10 0.01309 0.01309 
0.01309 1.45% MLCGSolver::ParallelAllReduce 1514 0.01262 0.01262 0.01262 1.39% Castro::post_timestep() 10 0.01241 0.01241 0.01241 1.37% MLMG::addInterpCorrection() 410 0.01209 0.01209 0.01209 1.34% Castro::enforce_min_density() 62 0.01201 0.01201 0.01201 1.33% MLMG::mgVcycle_up::1 82 0.01171 0.01171 0.01171 1.29% MLMG::mgVcycle_up::4 82 0.01169 0.01169 0.01169 1.29% MLCellLinOp::defineAuxData() 11 0.01155 0.01155 0.01155 1.28% Castro::expand_state() 10 0.01154 0.01154 0.01154 1.27% amrex::average_down 410 0.01151 0.01151 0.01151 1.27% MLMG::mgVcycle_up::2 82 0.01149 0.01149 0.01149 1.27% MLMG::mgVcycle_up::3 82 0.01125 0.01125 0.01125 1.24% MLPoisson::Fapply() 1142 0.0106 0.0106 0.0106 1.17% Gravity::fill_multipole_BCs() 11 0.009331 0.009331 0.009331 1.03% FabArray::Saxpy() 813 0.008402 0.008402 0.008402 0.93% FabArray::Xpay() 821 0.008246 0.008246 0.008246 0.91% MLCellLinOp::solutionResidual() 93 0.007522 0.007522 0.007522 0.83% Castro::reset_internal_energy(MultiFab) 63 0.006935 0.006935 0.006935 0.77% MLMG::computeResidual() 82 0.006257 0.006257 0.006257 0.69% Castro::estTimeStep() 21 0.006238 0.006238 0.006238 0.69% Amr::InitializeInit() 1 0.005143 0.005143 0.005143 0.57% Amr::defBaseLevel() 1 0.005139 0.005139 0.005139 0.57% MLCellLinOp::defineBC() 11 0.005101 0.005101 0.005101 0.56% MLMG::prepareForSolve() 11 0.00501 0.00501 0.00501 0.55% BndryData::define() 11 0.004876 0.004876 0.004876 0.54% FabArray::LinComb() 557 0.00464 0.00464 0.00464 0.51% Castro::initData() 1 0.004478 0.004478 0.004478 0.49% amrex::Add() 164 0.004447 0.004447 0.004447 0.49% Castro::construct_new_source() 50 0.003363 0.003363 0.003363 0.37% Castro::construct_new_gravity_source() 10 0.003296 0.003296 0.003296 0.36% Castro::enforce_speed_limit() 62 0.003176 0.003176 0.003176 0.35% Castro::construct_old_source() 50 0.002886 0.002886 0.002886 0.32% Castro::construct_old_gravity_source() 10 0.002868 0.002868 0.002868 0.32% Castro::computeNewDt() 9 0.002825 0.002825 0.002825 0.31% 
Castro::finalize_do_advance() 10 0.002418 0.002418 0.002418 0.27% MLMG::ResNormInf() 93 0.002198 0.002198 0.002198 0.24% Castro::apply_source_to_state() 20 0.001861 0.001861 0.001861 0.21% check_for_negative_density() 10 0.001833 0.001833 0.001833 0.20% FabArray::setVal(val, thecmd, scomp, ncomp) 462 0.001741 0.001741 0.001741 0.19% Castro::reset_internal_energy(Fab) 504 0.001705 0.001705 0.001705 0.19% MLMG::getGradSolution() 11 0.001539 0.001539 0.001539 0.17% MLCellLinOp::compGrad() 11 0.001533 0.001533 0.001533 0.17% FabArrayBase::getCPC() 1323 0.001495 0.001495 0.001495 0.17% MLCellLinOp::setLevelBC() 11 0.001474 0.001474 0.001474 0.16% FabArray::setDomainBndry() 41 0.00136 0.00136 0.00136 0.15% FabArray::mult() 43 0.001356 0.001356 0.001356 0.15% MLMG::computeMLResidual() 11 0.001308 0.001308 0.001308 0.14% MLPoisson::prepareForSolve() 11 0.001308 0.001308 0.001308 0.14% MLCellLinOp::prepareForSolve() 11 0.0013 0.0013 0.0013 0.14% Castro::check_for_nan() 20 0.001294 0.001294 0.001294 0.14% MultiFab::contains_nan() 20 0.001281 0.001281 0.001281 0.14% Castro::post_regrid() 1 0.001212 0.001212 0.001212 0.13% Castro::computeInitialDt() 2 0.001047 0.001047 0.001047 0.12% Gravity::update_max_rhs() 11 0.0009926 0.0009926 0.0009926 0.11% FabArrayBase::getFB() 4023 0.000719 0.000719 0.000719 0.08% FabArrayBase::CPC::define() 454 0.0006977 0.0006977 0.0006977 0.08% Castro::finalize_advance() 10 0.0005924 0.0005924 0.0005924 0.07% Castro::Castro() 1 0.0005759 0.0005759 0.0005759 0.06% Amr::InitAmr() 1 0.0004978 0.0004978 0.0004978 0.05% Gravity::swapTimeLevels() 10 0.0004417 0.0004417 0.0004417 0.05% MLMG::MLResNormInf() 11 0.0003462 0.0003462 0.0003462 0.04% MultiFab::max() 11 0.0003274 0.0003274 0.0003274 0.04% Castro::buildMetrics() 1 0.0002835 0.0002835 0.0002835 0.03% MLMG::MLRhsNormInf() 11 0.0002276 0.0002276 0.0002276 0.03% MLLinOp::define() 11 0.0002269 0.0002269 0.0002269 0.03% MLLinOp::defineGrids() 11 0.0002124 0.0002124 0.0002124 0.02% 
Castro::create_source_corrector() 10 0.000187 0.000187 0.000187 0.02% FabArrayBase::FB::FB() 56 9.511e-05 9.511e-05 9.511e-05 0.01% Castro::enforce_consistent_e() 1 7.691e-05 7.691e-05 7.691e-05 0.01% MLLinOp::makeAgglomeratedDMap 11 5.339e-05 5.339e-05 5.339e-05 0.01% AmrLevel::AmrLevel(dm) 1 5.034e-05 5.034e-05 5.034e-05 0.01% makeSFC 55 4.026e-05 4.026e-05 4.026e-05 0.00% Amr::writeSmallPlotFile() 1 3.665e-05 3.665e-05 3.665e-05 0.00% StateData::define() 4 3.563e-05 3.563e-05 3.563e-05 0.00% Castro::swap_state_time_levels() 10 3.544e-05 3.544e-05 3.544e-05 0.00% Castro::initMFs() 1 2.688e-05 2.688e-05 2.688e-05 0.00% DistributionMapping::Distribute() 56 1.658e-05 1.658e-05 1.658e-05 0.00% Amr::initSubcycle() 1 9.228e-06 9.228e-06 9.228e-06 0.00% AmrLevel::checkPointPost() 3 4.903e-06 4.903e-06 4.903e-06 0.00% Castro::retry_advance_ctu() 10 4.671e-06 4.671e-06 4.671e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 4.02e-06 4.02e-06 4.02e-06 0.00% Gravity::set_mass_offset() 11 3.728e-06 3.728e-06 3.728e-06 0.00% Castro::FluxRegCrseInit 10 2.845e-06 2.845e-06 2.845e-06 0.00% MLLinOp::makeSubCommunicator() 11 2.129e-06 2.129e-06 2.129e-06 0.00% Castro::FluxRegFineAdd() 10 2.127e-06 2.127e-06 2.127e-06 0.00% AmrLevel::checkPointPre() 3 2.098e-06 2.098e-06 2.098e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Device Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Arena::Initialize() 1 1 93 MiB 9042 MiB Castro::construct_ctu_hydro_source() 2880 2880 137 MiB 692 MiB Castro::initMFs() 48 48 67 MiB 68 MiB Castro::swap_state_time_levels() 32 32 47 MiB 55 MiB StateData::define() 32 32 55 MiB 55 MiB FillPatchIterator::Initialize 328 
328 1002 KiB 39 MiB Castro::initialize_do_advance() 80 80 26 MiB 39 MiB ResizeRandomSeed 1 1 30 MiB 30 MiB Amr::writePlotFile() 16 16 1781 KiB 28 MiB Castro::initialize_advance() 80 80 16 MiB 23 MiB Castro::buildMetrics() 32 32 15 MiB 15 MiB Castro::Castro() 48 48 7620 KiB 14 MiB MLMG::prepareForSolve() 660 660 3536 KiB 12 MiB Gravity::get_new_grav_vector() 91 91 203 KiB 10 MiB Gravity::get_old_grav_vector() 80 80 171 KiB 10 MiB Gravity::multilevel_solve_for_new_phi() 24 24 7516 KiB 7586 KiB Gravity::fill_multipole_BCs() 154 154 17 KiB 2053 KiB Gravity::update_max_rhs() 88 88 2098 B 2048 KiB Gravity::solve_for_phi() 80 80 557 KiB 2048 KiB Gravity::actual_multilevel_solve() 8 8 101 KiB 2048 KiB BndryData::define() 1056 1056 323 KiB 1095 KiB MLCellLinOp::defineAuxData() 1716 1716 205 KiB 671 KiB Castro::estTimeStep() 21 21 3362 B 480 KiB VisMF::Write(FabArray) 656 656 3379 B 320 KiB Castro::normalize_species() 62 62 7897 B 320 KiB amrex::average_down 1067 1067 1266 B 257 KiB MLMG::addInterpCorrection() 1066 1066 1135 B 257 KiB amrex::Dot() 1360 1360 3422 B 160 KiB FabArray::norminf() 907 907 2403 B 160 KiB check_for_negative_density() 10 10 308 B 160 KiB Castro::initData() 1 1 53 B 160 KiB MultiFab::max() 11 11 56 B 160 KiB FabArray::setVal() 106 106 21 KiB 26 KiB MultiFab::contains_nan() 20 20 27 B 20 KiB MLPoisson::Fsmooth() 132 132 3452 B 12 KiB FabArray::ParallelCopy_nowait() 861 861 43 B 10 KiB FillBoundary_nowait() 760 760 297 B 9648 B MLCellLinOp::applyBC() 8866 8866 221 B 9344 B MLCellLinOp::prepareForSolve() 66 66 4 B 7792 B amrex::Copy() 100 100 3910 B 6144 B StateData::FillBoundary(geom) 1992 1992 41 B 3024 B FabArray::setVal(val, thecmd, scomp, ncomp) 132 132 1 B 1616 B MLCGSolver::bicgstab 738 738 120 B 1472 B MLCellLinOp::defineBC() 66 66 364 B 1248 B ------------------------------------------------------------------------------ Managed Memory Usage: ----------------------------------------------------------------- Name Nalloc Nfree AvgMem MaxMem 
----------------------------------------------------------------- The_Managed_Arena::Initialize() 1 1 532 B 8192 KiB ----------------------------------------------------------------- Pinned Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Pinned_Arena::Initialize() 1 1 40 KiB 8192 KiB VisMF::Write(FabArray) 744 744 502 KiB 3584 KiB FabArray::setVal() 106 106 21 KiB 26 KiB MLPoisson::Fsmooth() 132 132 3452 B 12 KiB FabArray::ParallelCopy_nowait() 861 861 44 B 10 KiB FillBoundary_nowait() 760 760 297 B 9648 B MLCellLinOp::applyBC() 4433 4433 220 B 9328 B MLCellLinOp::prepareForSolve() 66 66 4 B 7792 B amrex::Copy() 100 100 3910 B 6144 B Gravity::get_new_grav_vector() 3 3 2891 B 3072 B StateData::FillBoundary(geom) 1992 1992 41 B 3024 B Gravity::fill_multipole_BCs() 33 33 3 B 2832 B FabArray::setVal(val, thecmd, scomp, ncomp) 132 132 1 B 1616 B amrex::average_down 83 83 271 B 1296 B MLMG::addInterpCorrection() 82 82 2 B 1024 B MLMG::prepareForSolve() 11 11 294 B 1024 B MLCellLinOp::setLevelBC() 66 66 0 B 768 B amrex::Dot() 1360 1360 25 B 400 B FabArray::norminf() 907 907 17 B 272 B Castro::estTimeStep() 21 21 0 B 32 B check_for_negative_density() 10 10 0 B 16 B MultiFab::max() 11 11 0 B 16 B MultiFab::contains_nan() 20 20 0 B 16 B Castro::normalize_species() 62 62 0 B 16 B Castro::initData() 1 1 0 B 16 B ------------------------------------------------------------------------------ Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2541 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.10-4-ge470d3350ed3) finalized Initializing CUDA... CUDA initialized with 1 device. 
AMReX (23.10-4-ge470d3350ed3) initialized
Starting run at 07:56:08 UTC on 2023-10-03.
Successfully read inputs file ...
Castro git describe: 23.10-4-g104a57d85
AMReX git describe: 23.10-4-ge470d3350
Microphysics git describe: 23.10-1-g4803fc8b
reading extern runtime parameters ...
3 Species: C12 O16 Mg24
restarting calculation from file: dustcollapse-restart_chk00005
Successfully read inputs file ...
read CPU time: 0.507170816
Restart time = 0.042588301 seconds.
[Level 0 step 6] ADVANCE at time 0.0002509599271 with dt = 5.79654185e-05
[Level 0 step 6] Advanced 262144 cells
[STEP 6] Coarse TimeStep time: 0.053304654
[STEP 6] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 6 TIME = 0.0003089253456 DT = 5.79654185e-05
[Level 0 step 7] ADVANCE at time 0.0003089253456 with dt = 6.086368943e-05
[Level 0 step 7] Advanced 262144 cells
[STEP 7] Coarse TimeStep time: 0.048638714
[STEP 7] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 7 TIME = 0.000369789035 DT = 6.086368943e-05
[Level 0 step 8] ADVANCE at time 0.000369789035 with dt = 6.39068739e-05
[Level 0 step 8] Advanced 262144 cells
[STEP 8] Coarse TimeStep time: 0.071846291
[STEP 8] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 8 TIME = 0.000433695909 DT = 6.39068739e-05
[Level 0 step 9] ADVANCE at time 0.000433695909 with dt = 6.71022176e-05
[Level 0 step 9] Advanced 262144 cells
[STEP 9] Coarse TimeStep time: 0.072772042
[STEP 9] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 9 TIME = 0.0005007981265 DT = 6.71022176e-05
[Level 0 step 10] ADVANCE at time 0.0005007981265 with dt = 7.045732848e-05
[Level 0 step 10] Advanced 262144 cells
[STEP 10] Coarse TimeStep time: 0.073024448
[STEP 10] FAB kilobyte spread across MPI nodes: [319638 ... 319638]
STEP = 10 TIME = 0.000571255455 DT = 7.045732848e-05
PLOTFILE: file = dustcollapse-restart_plt00010
Write plotfile time = 0.073291725 seconds
Ending run at 07:56:09 UTC on 2023-10-03.
Run time = 0.43647981 Run time without initialization = 0.393326448 Average number of zones advanced per microsecond: 3.332 Average number of zones advanced per microsecond per rank: 3.332 CPU(0): Heap Space (bytes) used by Coalescing FAB Arena: 9481961472 TinyProfiler total time across processes [min...avg...max]: 0.4365 ... 0.4365 ... 0.4365 -------------------------------------------------------------------------------------------- Name NCalls Excl. Min Excl. Avg Excl. Max Max % -------------------------------------------------------------------------------------------- Castro::construct_ctu_hydro_source() 5 0.1299 0.1299 0.1299 29.77% Amr::writePlotFile() 1 0.0444 0.0444 0.0444 10.17% MLCellLinOp::applyBC() 1946 0.03383 0.03383 0.03383 7.75% VisMF::Read() 3 0.03122 0.03122 0.03122 7.15% VisMF::Write(FabArray) 1 0.02868 0.02868 0.02868 6.57% MLPoisson::Fsmooth() 1440 0.01419 0.01419 0.01419 3.25% FillBoundary_nowait() 1766 0.01277 0.01277 0.01277 2.92% StateData::FillBoundary(geom) 160 0.01271 0.01271 0.01271 2.91% Castro::normalize_species() 30 0.009916 0.009916 0.009916 2.27% amrex::Dot() 484 0.009027 0.009027 0.009027 2.07% amrex::Copy() 463 0.007254 0.007254 0.007254 1.66% Castro::computeTemp() 30 0.007122 0.007122 0.007122 1.63% FabArray::setVal() 537 0.00653 0.00653 0.00653 1.50% FabArray::norminf() 326 0.006403 0.006403 0.006403 1.47% Castro::enforce_min_density() 30 0.006268 0.006268 0.006268 1.44% FabArray::ParallelCopy_nowait() 380 0.006075 0.006075 0.006075 1.39% Gravity::fill_multipole_BCs() 6 0.005488 0.005488 0.005488 1.26% MLCellLinOp::defineAuxData() 6 0.005462 0.005462 0.005462 1.25% StateDataPhysBCFunct::() 20 0.004894 0.004894 0.004894 1.12% Amr::restart() 1 0.004701 0.004701 0.004701 1.08% MLPoisson::Fapply() 500 0.004598 0.004598 0.004598 1.05% FabArray::Saxpy() 355 0.003759 0.003759 0.003759 0.86% FabArray::Xpay() 361 0.003611 0.003611 0.003611 0.83% Castro::estTimeStep() 10 0.003195 0.003195 0.003195 0.73% MLMG::addInterpCorrection() 180 
0.003083 0.003083 0.003083 0.71% amrex::average_down 180 0.0027 0.0027 0.0027 0.62% FabArray::LinComb() 242 0.002264 0.002264 0.002264 0.52% Castro::reset_internal_energy(MultiFab) 30 0.002113 0.002113 0.002113 0.48% BndryData::define() 6 0.002081 0.002081 0.002081 0.48% amrex::Add() 72 0.00189 0.00189 0.00189 0.43% Castro::construct_new_gravity_source() 5 0.001787 0.001787 0.001787 0.41% Castro::construct_old_gravity_source() 5 0.001507 0.001507 0.001507 0.35% Castro::enforce_speed_limit() 30 0.001129 0.001129 0.001129 0.26% MLCGSolver::bicgstab 36 0.000987 0.000987 0.000987 0.23% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009535 0.0009535 0.0009535 0.22% check_for_negative_density() 5 0.0009465 0.0009465 0.0009465 0.22% Castro::reset_internal_energy(Fab) 240 0.000859 0.000859 0.000859 0.20% Gravity::actual_solve_with_mlmg() 6 0.0008336 0.0008336 0.0008336 0.19% MLCellLinOp::setLevelBC() 6 0.0008029 0.0008029 0.0008029 0.18% MLCellLinOp::prepareForSolve() 6 0.000716 0.000716 0.000716 0.16% FabArray::mult() 22 0.0006878 0.0006878 0.0006878 0.16% FabArray::setDomainBndry() 20 0.0006671 0.0006671 0.0006671 0.15% MultiFab::contains_nan() 10 0.0006491 0.0006491 0.0006491 0.15% MLCellLinOp::compGrad() 6 0.0005849 0.0005849 0.0005849 0.13% MLMG::prepareForSolve() 6 0.0005328 0.0005328 0.0005328 0.12% MLCellLinOp::smooth() 720 0.0004921 0.0004921 0.0004921 0.11% FabArrayBase::CPC::define() 244 0.0004208 0.0004208 0.0004208 0.10% Amr::InitAmr() 1 0.0003896 0.0003896 0.0003896 0.09% FabArrayBase::getCPC() 632 0.0003823 0.0003823 0.0003823 0.09% FabArray::FillBoundary() 1766 0.0003585 0.0003585 0.0003585 0.08% Gravity::get_old_grav_vector() 5 0.000323 0.000323 0.000323 0.07% Gravity::get_new_grav_vector() 5 0.0002629 0.0002629 0.0002629 0.06% FabArrayBase::getFB() 1766 0.0002628 0.0002628 0.0002628 0.06% main() 1 0.0002583 0.0002583 0.0002583 0.06% MLCellLinOp::apply() 500 0.0002084 0.0002084 0.0002084 0.05% MultiFab::max() 6 0.0002033 0.0002033 0.0002033 0.05% 
Amr::coarseTimeStep() 5 0.0001783 0.0001783 0.0001783 0.04% AmrLevel::FillPatch() 20 0.0001664 0.0001664 0.0001664 0.04% Castro::subcycle_advance_ctu() 5 0.000154 0.000154 0.000154 0.04% MLCGSolver::ParallelAllReduce 659 0.0001225 0.0001225 0.0001225 0.03% MLCellLinOp::defineBC() 6 0.0001162 0.0001162 0.0001162 0.03% MLLinOp::defineGrids() 6 0.0001136 0.0001136 0.0001136 0.03% FabArray::ParallelCopy() 380 0.0001074 0.0001074 0.0001074 0.02% Castro::create_source_corrector() 5 0.0001039 0.0001039 0.0001039 0.02% FillPatchIterator::Initialize 20 0.0001029 0.0001029 0.0001029 0.02% MLCellLinOp::correctionResidual() 216 9.74e-05 9.74e-05 9.74e-05 0.02% MLMG::mgVcycle() 36 8.831e-05 8.831e-05 8.831e-05 0.02% Amr::timeStep() 5 8.116e-05 8.116e-05 8.116e-05 0.02% Castro::advance() 5 7.826e-05 7.826e-05 7.826e-05 0.02% Gravity::update_max_rhs() 6 7.005e-05 7.005e-05 7.005e-05 0.02% FabArrayBase::FB::FB() 26 6.771e-05 6.771e-05 6.771e-05 0.02% StateData::restartDoit() 4 6.683e-05 6.683e-05 6.683e-05 0.02% Gravity::solve_for_phi() 5 6.636e-05 6.636e-05 6.636e-05 0.02% AmrLevel::restart() 1 6.487e-05 6.487e-05 6.487e-05 0.01% Castro::initialize_do_advance() 5 6.3e-05 6.3e-05 6.3e-05 0.01% Castro::do_new_sources() 5 6.227e-05 6.227e-05 6.227e-05 0.01% Castro::initialize_advance() 5 5.843e-05 5.843e-05 5.843e-05 0.01% MLMG:computeResOfCorrection() 180 5.625e-05 5.625e-05 5.625e-05 0.01% Castro::do_advance_ctu() 5 4.539e-05 4.539e-05 4.539e-05 0.01% Castro::construct_new_source() 25 4.345e-05 4.345e-05 4.345e-05 0.01% Castro::finalize_do_advance() 5 4.305e-05 4.305e-05 4.305e-05 0.01% MLMG::actualBottomSolve() 36 4.196e-05 4.196e-05 4.196e-05 0.01% MLMG::mgVcycle_down::0 36 3.852e-05 3.852e-05 3.852e-05 0.01% MLMG::solve() 6 3.586e-05 3.586e-05 3.586e-05 0.01% MLMG::mgVcycle_down::1 36 3.452e-05 3.452e-05 3.452e-05 0.01% MLMG::mgVcycle_down::2 36 3.307e-05 3.307e-05 3.307e-05 0.01% Castro::clean_state() 30 3.307e-05 3.307e-05 3.307e-05 0.01% MLMG::mgVcycle_down::4 36 3.178e-05 
3.178e-05 3.178e-05 0.01% MLMG::mgVcycle_down::3 36 3.048e-05 3.048e-05 3.048e-05 0.01% Castro::buildMetrics() 1 2.882e-05 2.882e-05 2.882e-05 0.01% MLMG::mgVcycle_up::4 36 2.865e-05 2.865e-05 2.865e-05 0.01% Castro::finalize_advance() 5 2.863e-05 2.863e-05 2.863e-05 0.01% Amr::writeSmallPlotFile() 1 2.802e-05 2.802e-05 2.802e-05 0.01% Castro::post_restart() 1 2.741e-05 2.741e-05 2.741e-05 0.01% Castro::construct_old_source() 25 2.706e-05 2.706e-05 2.706e-05 0.01% Castro::initMFs() 1 2.554e-05 2.554e-05 2.554e-05 0.01% MLMG::oneIter() 36 2.483e-05 2.483e-05 2.483e-05 0.01% Castro::swap_state_time_levels() 5 2.382e-05 2.382e-05 2.382e-05 0.01% MLMG::mgVcycle_up::0 36 2.355e-05 2.355e-05 2.355e-05 0.01% MLMG::mgVcycle_up::3 36 2.307e-05 2.307e-05 2.307e-05 0.01% MLMG::mgVcycle_up::2 36 2.203e-05 2.203e-05 2.203e-05 0.01% MLCellLinOp::solutionResidual() 42 2.148e-05 2.148e-05 2.148e-05 0.00% FillPatchIterator::FillFromLevel0() 20 2.123e-05 2.123e-05 2.123e-05 0.00% MLMG::mgVcycle_up::1 36 2.085e-05 2.085e-05 2.085e-05 0.00% MLMG::ResNormInf() 42 1.684e-05 1.684e-05 1.684e-05 0.00% Gravity::multilevel_solve_for_new_phi() 1 1.613e-05 1.613e-05 1.613e-05 0.00% MLMG::mgVcycle_bottom 36 1.598e-05 1.598e-05 1.598e-05 0.00% MLMG::computeResidual() 36 1.541e-05 1.541e-05 1.541e-05 0.00% FillPatchSingleLevel 20 1.501e-05 1.501e-05 1.501e-05 0.00% MLPoisson::define() 6 1.353e-05 1.353e-05 1.353e-05 0.00% Gravity::solve_phi_with_mlmg() 6 1.271e-05 1.271e-05 1.271e-05 0.00% makeSFC 30 1.259e-05 1.259e-05 1.259e-05 0.00% Castro::construct_new_gravity() 5 1.199e-05 1.199e-05 1.199e-05 0.00% Castro::do_old_sources() 5 1.123e-05 1.123e-05 1.123e-05 0.00% MLLinOp::define() 6 9.87e-06 9.87e-06 9.87e-06 0.00% DistributionMapping::Distribute() 31 9.713e-06 9.713e-06 9.713e-06 0.00% Amr::initSubcycle() 1 8.703e-06 8.703e-06 8.703e-06 0.00% Gravity::actual_multilevel_solve() 1 8.093e-06 8.093e-06 8.093e-06 0.00% MLLinOp::makeAgglomeratedDMap 6 7.187e-06 7.187e-06 7.187e-06 0.00% 
Castro::check_for_nan() 10 6.392e-06 6.392e-06 6.392e-06 0.00% Castro::construct_old_gravity() 5 6.187e-06 6.187e-06 6.187e-06 0.00% Castro::apply_source_to_state() 10 6.005e-06 6.005e-06 6.005e-06 0.00% Castro::post_timestep() 5 4.755e-06 4.755e-06 4.755e-06 0.00% MLMG::computeMLResidual() 6 4.198e-06 4.198e-06 4.198e-06 0.00% Castro::computeNewDt() 5 4.049e-06 4.049e-06 4.049e-06 0.00% Gravity::swapTimeLevels() 5 3.987e-06 3.987e-06 3.987e-06 0.00% MLPoisson::prepareForSolve() 6 3.943e-06 3.943e-06 3.943e-06 0.00% DistributionMapping::SFCProcessorMapDoIt() 1 3.212e-06 3.212e-06 3.212e-06 0.00% MLMG::getGradSolution() 6 3.006e-06 3.006e-06 3.006e-06 0.00% Castro::expand_state() 5 2.602e-06 2.602e-06 2.602e-06 0.00% MLMG::MLRhsNormInf() 6 2.32e-06 2.32e-06 2.32e-06 0.00% MLMG::MLResNormInf() 6 2.163e-06 2.163e-06 2.163e-06 0.00% Gravity::set_mass_offset() 6 2.088e-06 2.088e-06 2.088e-06 0.00% Castro::retry_advance_ctu() 5 1.974e-06 1.974e-06 1.974e-06 0.00% Castro::FluxRegCrseInit 5 1.56e-06 1.56e-06 1.56e-06 0.00% Castro::FluxRegFineAdd() 5 1.337e-06 1.337e-06 1.337e-06 0.00% Amr::init() 1 1.308e-06 1.308e-06 1.308e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.079e-06 1.079e-06 1.079e-06 0.00% -------------------------------------------------------------------------------------------- -------------------------------------------------------------------------------------------- Name NCalls Incl. Min Incl. Avg Incl. 
Max Max % -------------------------------------------------------------------------------------------- main() 1 0.4365 0.4365 0.4365 100.00% Amr::coarseTimeStep() 5 0.3198 0.3198 0.3198 73.25% Amr::timeStep() 5 0.3176 0.3176 0.3176 72.75% Castro::advance() 5 0.3125 0.3125 0.3125 71.58% Castro::subcycle_advance_ctu() 5 0.3049 0.3049 0.3049 69.86% Castro::do_advance_ctu() 5 0.3048 0.3048 0.3048 69.82% Castro::construct_ctu_hydro_source() 5 0.1353 0.1353 0.1353 30.99% Castro::construct_new_gravity() 5 0.1326 0.1326 0.1326 30.37% Gravity::solve_phi_with_mlmg() 6 0.1303 0.1303 0.1303 29.85% Gravity::solve_for_phi() 5 0.1247 0.1247 0.1247 28.56% Gravity::actual_solve_with_mlmg() 6 0.1245 0.1245 0.1245 28.53% MLMG::solve() 6 0.1129 0.1129 0.1129 25.86% MLMG::oneIter() 36 0.1055 0.1055 0.1055 24.17% MLMG::mgVcycle() 36 0.104 0.104 0.104 23.82% Amr::writePlotFile() 1 0.07339 0.07339 0.07339 16.81% MLCellLinOp::smooth() 720 0.05041 0.05041 0.05041 11.55% MLCellLinOp::applyBC() 1946 0.04729 0.04729 0.04729 10.83% Amr::init() 1 0.04266 0.04266 0.04266 9.77% Amr::restart() 1 0.04265 0.04265 0.04265 9.77% MLMG::mgVcycle_bottom 36 0.03271 0.03271 0.03271 7.49% MLMG::actualBottomSolve() 36 0.03269 0.03269 0.03269 7.49% MLCGSolver::bicgstab 36 0.03238 0.03238 0.03238 7.42% AmrLevel::restart() 1 0.03157 0.03157 0.03157 7.23% StateData::restartDoit() 4 0.0315 0.0315 0.0315 7.22% VisMF::Read() 3 0.03122 0.03122 0.03122 7.15% VisMF::Write(FabArray) 1 0.02868 0.02868 0.02868 6.57% Castro::clean_state() 30 0.02744 0.02744 0.02744 6.29% AmrLevel::FillPatch() 20 0.02248 0.02248 0.02248 5.15% FillPatchIterator::Initialize 20 0.02041 0.02041 0.02041 4.68% FillPatchIterator::FillFromLevel0() 20 0.01964 0.01964 0.01964 4.50% FillPatchSingleLevel 20 0.01962 0.01962 0.01962 4.50% StateDataPhysBCFunct::() 20 0.01761 0.01761 0.01761 4.03% MLCellLinOp::apply() 500 0.01609 0.01609 0.01609 3.69% MLMG::mgVcycle_down::0 36 0.01442 0.01442 0.01442 3.30% MLPoisson::Fsmooth() 1440 0.01419 0.01419 0.01419 
3.25% FabArray::FillBoundary() 1766 0.01346 0.01346 0.01346 3.08% FillBoundary_nowait() 1766 0.0131 0.0131 0.0131 3.00% StateData::FillBoundary(geom) 160 0.01271 0.01271 0.01271 2.91% Castro::initialize_do_advance() 5 0.0111 0.0111 0.0111 2.54% MLMG::mgVcycle_up::0 36 0.01084 0.01084 0.01084 2.48% Castro::computeTemp() 30 0.01009 0.01009 0.01009 2.31% Castro::normalize_species() 30 0.009916 0.009916 0.009916 2.27% MLCellLinOp::correctionResidual() 216 0.009673 0.009673 0.009673 2.22% Castro::do_old_sources() 5 0.00933 0.00933 0.00933 2.14% MLPoisson::define() 6 0.009168 0.009168 0.009168 2.10% amrex::Dot() 484 0.009027 0.009027 0.009027 2.07% MLMG:computeResOfCorrection() 180 0.008511 0.008511 0.008511 1.95% Gravity::get_new_grav_vector() 5 0.007778 0.007778 0.007778 1.78% Castro::construct_old_gravity() 5 0.007644 0.007644 0.007644 1.75% Gravity::get_old_grav_vector() 5 0.007638 0.007638 0.007638 1.75% amrex::Copy() 463 0.007254 0.007254 0.007254 1.66% Castro::initialize_advance() 5 0.007157 0.007157 0.007157 1.64% Castro::do_new_sources() 5 0.007081 0.007081 0.007081 1.62% MLMG::mgVcycle_down::1 36 0.007051 0.007051 0.007051 1.62% FabArray::ParallelCopy() 380 0.006596 0.006596 0.006596 1.51% FabArray::setVal() 537 0.00653 0.00653 0.00653 1.50% FabArray::ParallelCopy_nowait() 380 0.006489 0.006489 0.006489 1.49% MLMG::mgVcycle_down::2 36 0.006457 0.006457 0.006457 1.48% FabArray::norminf() 326 0.006403 0.006403 0.006403 1.47% MLMG::mgVcycle_down::3 36 0.006269 0.006269 0.006269 1.44% Castro::enforce_min_density() 30 0.006268 0.006268 0.006268 1.44% MLCellLinOp::defineAuxData() 6 0.006233 0.006233 0.006233 1.43% MLMG::mgVcycle_down::4 36 0.00622 0.00622 0.00622 1.42% Castro::post_restart() 1 0.006209 0.006209 0.006209 1.42% Gravity::multilevel_solve_for_new_phi() 1 0.005838 0.005838 0.005838 1.34% Gravity::actual_multilevel_solve() 1 0.005822 0.005822 0.005822 1.33% Castro::expand_state() 5 0.005775 0.005775 0.005775 1.32% Gravity::fill_multipole_BCs() 6 0.005621 
0.005621 0.005621 1.29% MLCGSolver::ParallelAllReduce 659 0.005453 0.005453 0.005453 1.25% MLMG::addInterpCorrection() 180 0.005386 0.005386 0.005386 1.23% MLMG::mgVcycle_up::4 36 0.0051 0.0051 0.0051 1.17% MLMG::mgVcycle_up::1 36 0.005046 0.005046 0.005046 1.16% Castro::post_timestep() 5 0.005025 0.005025 0.005025 1.15% amrex::average_down 180 0.004992 0.004992 0.004992 1.14% MLMG::mgVcycle_up::2 36 0.004938 0.004938 0.004938 1.13% MLMG::mgVcycle_up::3 36 0.004837 0.004837 0.004837 1.11% MLPoisson::Fapply() 500 0.004598 0.004598 0.004598 1.05% FabArray::Saxpy() 355 0.003759 0.003759 0.003759 0.86% FabArray::Xpay() 361 0.003611 0.003611 0.003611 0.83% MLCellLinOp::solutionResidual() 42 0.003458 0.003458 0.003458 0.79% Castro::estTimeStep() 10 0.003195 0.003195 0.003195 0.73% Castro::reset_internal_energy(MultiFab) 30 0.002972 0.002972 0.002972 0.68% MLCellLinOp::defineBC() 6 0.002769 0.002769 0.002769 0.63% MLMG::prepareForSolve() 6 0.00274 0.00274 0.00274 0.63% MLMG::computeResidual() 36 0.002698 0.002698 0.002698 0.62% BndryData::define() 6 0.002653 0.002653 0.002653 0.61% FabArray::LinComb() 242 0.002264 0.002264 0.002264 0.52% Castro::computeNewDt() 5 0.002 0.002 0.002 0.46% amrex::Add() 72 0.00189 0.00189 0.00189 0.43% Castro::construct_new_source() 25 0.001831 0.001831 0.001831 0.42% Castro::construct_new_gravity_source() 5 0.001787 0.001787 0.001787 0.41% Castro::construct_old_source() 25 0.001534 0.001534 0.001534 0.35% Castro::construct_old_gravity_source() 5 0.001507 0.001507 0.001507 0.35% Castro::finalize_do_advance() 5 0.001242 0.001242 0.001242 0.28% Castro::enforce_speed_limit() 30 0.001129 0.001129 0.001129 0.26% MLMG::ResNormInf() 42 0.0009818 0.0009818 0.0009818 0.22% Castro::apply_source_to_state() 10 0.0009617 0.0009617 0.0009617 0.22% FabArray::setVal(val, thecmd, scomp, ncomp) 252 0.0009535 0.0009535 0.0009535 0.22% check_for_negative_density() 5 0.0009465 0.0009465 0.0009465 0.22% MLMG::getGradSolution() 6 0.0008641 0.0008641 0.0008641 0.20% 
MLCellLinOp::compGrad() 6 0.0008611 0.0008611 0.0008611 0.20% Castro::reset_internal_energy(Fab) 240 0.000859 0.000859 0.000859 0.20% FabArrayBase::getCPC() 632 0.000803 0.000803 0.000803 0.18% MLCellLinOp::setLevelBC() 6 0.0008029 0.0008029 0.0008029 0.18% MLMG::computeMLResidual() 6 0.0007796 0.0007796 0.0007796 0.18% MLPoisson::prepareForSolve() 6 0.0007199 0.0007199 0.0007199 0.16% Gravity::update_max_rhs() 6 0.0007178 0.0007178 0.0007178 0.16% MLCellLinOp::prepareForSolve() 6 0.000716 0.000716 0.000716 0.16% FabArray::mult() 22 0.0006878 0.0006878 0.0006878 0.16% FabArray::setDomainBndry() 20 0.0006671 0.0006671 0.0006671 0.15% Castro::check_for_nan() 10 0.0006555 0.0006555 0.0006555 0.15% MultiFab::contains_nan() 10 0.0006491 0.0006491 0.0006491 0.15% FabArrayBase::CPC::define() 244 0.0004208 0.0004208 0.0004208 0.10% Amr::InitAmr() 1 0.0003983 0.0003983 0.0003983 0.09% FabArrayBase::getFB() 1766 0.0003306 0.0003306 0.0003306 0.08% Castro::finalize_advance() 5 0.0002954 0.0002954 0.0002954 0.07% Gravity::swapTimeLevels() 5 0.0002359 0.0002359 0.0002359 0.05% MultiFab::max() 6 0.0002033 0.0002033 0.0002033 0.05% MLMG::MLResNormInf() 6 0.0001861 0.0001861 0.0001861 0.04% MLLinOp::define() 6 0.0001524 0.0001524 0.0001524 0.03% Castro::buildMetrics() 1 0.000152 0.000152 0.000152 0.03% MLLinOp::defineGrids() 6 0.0001425 0.0001425 0.0001425 0.03% MLMG::MLRhsNormInf() 6 0.0001184 0.0001184 0.0001184 0.03% Castro::create_source_corrector() 5 0.0001039 0.0001039 0.0001039 0.02% FabArrayBase::FB::FB() 26 6.771e-05 6.771e-05 6.771e-05 0.02% Amr::writeSmallPlotFile() 1 2.802e-05 2.802e-05 2.802e-05 0.01% MLLinOp::makeAgglomeratedDMap 6 2.788e-05 2.788e-05 2.788e-05 0.01% Castro::initMFs() 1 2.554e-05 2.554e-05 2.554e-05 0.01% Castro::swap_state_time_levels() 5 2.382e-05 2.382e-05 2.382e-05 0.01% makeSFC 30 2.07e-05 2.07e-05 2.07e-05 0.00% DistributionMapping::Distribute() 31 9.713e-06 9.713e-06 9.713e-06 0.00% Amr::initSubcycle() 1 8.703e-06 8.703e-06 8.703e-06 0.00% 
DistributionMapping::SFCProcessorMapDoIt() 1 4.821e-06 4.821e-06 4.821e-06 0.00% Gravity::set_mass_offset() 6 2.088e-06 2.088e-06 2.088e-06 0.00% Castro::retry_advance_ctu() 5 1.974e-06 1.974e-06 1.974e-06 0.00% Castro::FluxRegCrseInit 5 1.56e-06 1.56e-06 1.56e-06 0.00% Castro::FluxRegFineAdd() 5 1.337e-06 1.337e-06 1.337e-06 0.00% MLLinOp::makeSubCommunicator() 6 1.079e-06 1.079e-06 1.079e-06 0.00% -------------------------------------------------------------------------------------------- Unused ParmParse Variables: [TOP]::amr.ref_ratio(nvals = 4) :: [2, 2, 2, 2] [TOP]::amr.regrid_int(nvals = 4) :: [2, 2, 2, 2] Device Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Arena::Initialize() 1 1 203 MiB 9042 MiB Castro::construct_ctu_hydro_source() 1440 1440 139 MiB 692 MiB Castro::initMFs() 48 48 62 MiB 68 MiB Castro::swap_state_time_levels() 32 32 50 MiB 55 MiB StateData::restartDoit() 32 32 53 MiB 55 MiB FillPatchIterator::Initialize 160 160 1017 KiB 39 MiB Castro::initialize_do_advance() 40 40 27 MiB 39 MiB ResizeRandomSeed 1 1 30 MiB 30 MiB Amr::writePlotFile() 8 8 1941 KiB 28 MiB Castro::initialize_advance() 40 40 16 MiB 23 MiB Castro::buildMetrics() 32 32 14 MiB 15 MiB Castro::post_restart() 48 48 6937 KiB 14 MiB MLMG::prepareForSolve() 361 361 3174 KiB 12 MiB Gravity::get_old_grav_vector() 43 43 181 KiB 10 MiB Gravity::get_new_grav_vector() 40 40 182 KiB 10 MiB Gravity::multilevel_solve_for_new_phi() 24 24 6924 KiB 7586 KiB Gravity::fill_multipole_BCs() 84 84 21 KiB 2053 KiB Gravity::update_max_rhs() 48 48 3241 B 2048 KiB Gravity::solve_for_phi() 40 40 583 KiB 2048 KiB Gravity::actual_multilevel_solve() 8 8 27 KiB 2048 KiB BndryData::define() 576 576 292 KiB 1095 KiB MLCellLinOp::defineAuxData() 936 936 187 KiB 671 KiB Castro::estTimeStep() 10 10 3510 B 480 KiB VisMF::Write(FabArray) 112 112 1316 B 320 
KiB Castro::normalize_species() 30 30 7396 B 320 KiB amrex::average_down 469 469 1150 B 257 KiB MLMG::addInterpCorrection() 468 468 1046 B 257 KiB amrex::Dot() 592 592 3046 B 160 KiB FabArray::norminf() 398 398 2166 B 160 KiB check_for_negative_density() 5 5 347 B 160 KiB MultiFab::max() 6 6 72 B 160 KiB FabArray::setVal() 66 66 18 KiB 23 KiB MultiFab::contains_nan() 10 10 29 B 20 KiB MLPoisson::Fsmooth() 60 60 3078 B 12 KiB FabArray::ParallelCopy_nowait() 380 380 44 B 10 KiB FillBoundary_nowait() 336 336 254 B 9648 B MLCellLinOp::applyBC() 3892 3892 199 B 9344 B amrex::Copy() 56 56 5983 B 8464 B MLCellLinOp::prepareForSolve() 36 36 4 B 7792 B StateData::FillBoundary(geom) 960 960 40 B 2688 B FabArray::setVal(val, thecmd, scomp, ncomp) 72 72 1 B 1616 B MLCGSolver::bicgstab 324 324 107 B 1472 B MLCellLinOp::defineBC() 36 36 328 B 1248 B ------------------------------------------------------------------------------ Managed Memory Usage: ------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------ The_Managed_Arena::Initialize() 1 1 1133 B 8192 KiB ------------------------------------------------------------------ Pinned Memory Usage: ------------------------------------------------------------------------------ Name Nalloc Nfree AvgMem MaxMem ------------------------------------------------------------------------------ The_Pinned_Arena::Initialize() 1 1 83 KiB 8192 KiB VisMF::Write(FabArray) 120 120 189 KiB 3584 KiB VisMF::Read() 24 24 126 KiB 3000 KiB FabArray::setVal() 66 66 18 KiB 23 KiB MLPoisson::Fsmooth() 60 60 3078 B 12 KiB FabArray::ParallelCopy_nowait() 380 380 44 B 10 KiB FillBoundary_nowait() 336 336 254 B 9648 B MLCellLinOp::applyBC() 1946 1946 198 B 9328 B amrex::Copy() 56 56 5983 B 8464 B MLCellLinOp::prepareForSolve() 36 36 4 B 7792 B Gravity::get_old_grav_vector() 3 3 2727 B 3072 B Gravity::fill_multipole_BCs() 18 18 4 B 2832 B 
StateData::FillBoundary(geom) 960 960 40 B 2688 B FabArray::setVal(val, thecmd, scomp, ncomp) 72 72 1 B 1616 B MLMG::prepareForSolve() 7 7 510 B 1296 B amrex::average_down 37 37 230 B 1296 B MLMG::addInterpCorrection() 36 36 2 B 1024 B MLCellLinOp::setLevelBC() 36 36 0 B 768 B amrex::Dot() 592 592 22 B 400 B FabArray::norminf() 398 398 15 B 272 B Castro::estTimeStep() 10 10 0 B 32 B check_for_negative_density() 5 5 0 B 16 B MultiFab::max() 6 6 0 B 16 B MultiFab::contains_nan() 10 10 0 B 16 B Castro::normalize_species() 30 30 0 B 16 B ------------------------------------------------------------------------------ Total GPU global memory (MB): 12056 Free GPU global memory (MB): 2541 [The Arena] space allocated (MB): 9042 [The Arena] space used (MB): 0 [The Managed Arena] space allocated (MB): 8 [The Managed Arena] space used (MB): 0 [The Pinned Arena] space allocated (MB): 8 [The Pinned Arena] space used (MB): 0 AMReX (23.10-4-ge470d3350ed3) finalized