Abort criterion oplus by rainbowsend · Pull Request #52 · NCAR/tiegcm

rainbowsend · 2025-09-22T14:32:23Z

The current implementation uses a fixed number of iterations to update O+ number concentration. This number is controlled by the nstep_sub input option. Instead, I propose to use nstep_sub as the maximum number of iterations and break the loop once a certain threshold/epsilon has been reached. This makes the model more versatile, as it automatically adapts the number of iterations to the current situation.

The collision frequency between N2+ and O is misspelled in lamdas. Correct this based on Schunk and Nagy (2009). Thanks to Yizhe Zhang.

…of iterations

Equivalent rewrite of the loop to reduce code complexity

hzfywhn · 2025-09-22T21:50:12Z

This change looks good to me. It might help saving time during spin-up and other situations when O+ does not change drastically. What do you think? @phamkh @jvilaperez

jvilaperez · 2025-09-22T22:32:41Z

It looks good to me too. It can save some iterations in some cases.

As there isn't a thorough test for difference cases, strengthen the criteria for now until we are more confident with the change.

rainbowsend · 2025-11-11T16:02:46Z

calc_terms is called only at the last iteration

tiegcm/src/oplus.F

Lines 105 to 106 in a0d5e33

    
              if (istep == nstep_sub) 
        
           |    call calc_terms(xnmbar,diffj,Fe,Fn,

Thus, when the loop is exited before the last iteration, it is never called.

It may be sufficient to move the call outside the loop. Since calc_terms only updates Fe and Fn, and neither variables occur as an argument in any call within the loop, this should be safe.

Sorry for overlooking this.

The postprocessing terms are calculated at each sub-cycling. Thanks Armin for noticing this.

hzfywhn · 2025-11-11T17:33:24Z

calc_terms is called only at the last iteration

tiegcm/src/oplus.F

Lines 105 to 106 in a0d5e33

if (istep == nstep_sub)

| call calc_terms(xnmbar,diffj,Fe,Fn,

Thus, when the loop is exited before the last iteration, it is never called.
It may be sufficient to move the call outside the loop. Since calc_terms only updates Fe and Fn, and neither variables occur as an argument in any call within the loop, this should be safe.

Sorry for overlooking this.

That is right. We all missed this as well. Just post a quick fix to correct this.

… 25 iterations to reduce communication costs

rainbowsend · 2025-11-12T17:26:36Z

I conducted some tests for the runtime, I will also investigate the differences in O+ later.
For the test, I forwarded a 1 hour IMF run using 60 s integration time and 5° resolution.

The larger the maximal number of iterations, the more runtime is saved when using the abort criterion.

NSTEP_SUB	runtime (fixed, `006bb4e`)	runtime (abort, `674d446`)
200	49 s	46 s
400	79 s	75 s
600	118 s	102 s
800	148 s	117 s
1000	178 s	120 s

Using a less strict abort criterion could further reduce runtime. However, we need to ensure the differences in O+ are reasonable.

hzfywhn · 2025-11-13T00:04:43Z

I conducted some tests for the runtime, I will also investigate the differences in O+ later. For the test, I forwarded a 1 hour IMF run using 60 s integration time and 5° resolution.

The larger the maximal number of iterations, the more runtime is saved when using the abort criterion.

NSTEP_SUB runtime (fixed, 006bb4e) runtime (abort, 674d446)
200 49 s 46 s
400 79 s 75 s
600 118 s 102 s
800 148 s 117 s
1000 178 s 120 s
Using a less strict abort criterion could further reduce runtime. However, we need to ensure the differences in O+ are reasonable.

Thanks a lot! This benchmark is very helpful.

rainbowsend · 2025-11-17T15:27:05Z

I checked how the number of iterations influences O+. For this, I ran the same experiment, varying only the number of iterations (10, 50, 100, 200, 600, 1000, 2000) and not using the abort criterion. The following figure shows the difference in O+ relative to the run with 2000 iterations.
.

However, when using the aboart criterion, O+ differences increased with the number of iterations.
So I conducted an experiment in which the loop always exits just one iteration before the maximum number of iterations. With this setup, the differences to the run without an abort criterion are also unrealistically high (above 1E+6 cm-3 in the case of 1000 iterations. For context: Even when using only 10 iterations without abort, the differences in the figure above do not exceed 1000.)

Therefore, the abort criterion should not be used unless the cause of this behaviour is identified. I will check the code and search for dependencies of nsub.

rainbowsend · 2025-11-18T08:08:36Z

It turned out that nstep_sub is not only used to control the number of iterations but is also used in the calculation of O+:

in the subroutine prep_oplus, which is called before the iteration

tiegcm/src/oplus.F

Lines 568 to 570 in 006bb4e

    
                  q_coeff(k,i,lat) = 
        
           |        diffq(k,i,lat)+driftq(k,i,lat)+windq(k,i,lat)- 
        
           |        dtx2inv*nstep_sub-op_loss(k,i,lat)

in the subroutine smooth_oplus, which is called at the beginning of each iteration

tiegcm/src/oplus.F

Lines 674 to 678 in 006bb4e

    
                  optm1_smooth1(k,i,lat) = optm1(k,i,lat)- 
        
           |        shapiro/nstep_sub* 
        
           |        (optm1(k,i,lat+2)+optm1(k,i,lat-2)- 
        
           |        4.*(optm1(k,i,lat+1)+optm1(k,i,lat-1))+ 
        
           |        6.*optm1(k,i,lat))

tiegcm/src/oplus.F

Lines 688 to 692 in 006bb4e

    
                  optm1_smooth(k,i,lat) = optm1_smooth1(k,i,lat)- 
        
           |        shapiro/nstep_sub* 
        
           |        (optm1_smooth1(k,i+2,lat)+optm1_smooth1(k,i-2,lat)- 
        
           |        4.*(optm1_smooth1(k,i+1,lat)+optm1_smooth1(k,i-1,lat))+ 
        
           |        6.*optm1_smooth1(k,i,lat))

in the subroutine iterate_oplus, which is called in each iteration

tiegcm/src/oplus.F

Lines 902 to 905 in 006bb4e

    
                  explicit(k,i,lat) = 
        
           |        vdotn_h(k,i,lat)+bdotdh_bvel(k,i,lat)- 
        
           |        diffexp(k,i,lat)-op_prod(k,i,lat)- 
        
           |        optm1_smooth(k,i,lat)*dtx2inv*nstep_sub

in the subroutine calc_terms, which is called in the last iteration

tiegcm/src/oplus.F

Lines 1162 to 1163 in 006bb4e

dopdt(k,i,lat) = dtx2inv*nstep_sub*

| (opout(k,i,lat)-optm1_smooth(k,i,lat))

This explains why the results are off when the loop is exited before nstep_sub is reached. To exit the loop, the calculations must be performed without depending on nstep_sub.

AnonNick and others added 2 commits August 15, 2025 09:34

Correct N2+~O collision frequency (NCAR#50)

006bb4e

The collision frequency between N2+ and O is misspelled in lamdas. Correct this based on Schunk and Nagy (2009). Thanks to Yizhe Zhang.

added abort criterion to iteration in oplus, instead of fixed number …

7e95529

…of iterations

AnonNick requested a review from hzfywhn September 22, 2025 15:37

Reduce code complexity

298b718

Equivalent rewrite of the loop to reduce code complexity

hzfywhn requested review from jvilaperez and phamkh September 22, 2025 21:50

Change converging criteria

a0d5e33

As there isn't a thorough test for difference cases, strengthen the criteria for now until we are more confident with the change.

correct exit criteria

8bc5b9e

The postprocessing terms are calculated at each sub-cycling. Thanks Armin for noticing this.

rainbowsend force-pushed the abort_criterion_oplus branch from 59e0c5d to 8bc5b9e Compare November 12, 2025 09:53

compute calc_terms only at last iteration, caclulate error only every…

674d446

… 25 iterations to reduce communication costs

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Abort criterion oplus#52

Abort criterion oplus#52
rainbowsend wants to merge 6 commits intoNCAR:developmentfrom
rainbowsend:abort_criterion_oplus

rainbowsend commented Sep 22, 2025

Uh oh!

hzfywhn commented Sep 22, 2025

Uh oh!

jvilaperez commented Sep 22, 2025

Uh oh!

rainbowsend commented Nov 11, 2025 •

edited

Loading

Uh oh!

hzfywhn commented Nov 11, 2025

Uh oh!

rainbowsend commented Nov 12, 2025 •

edited

Loading

Uh oh!

hzfywhn commented Nov 13, 2025

Uh oh!

rainbowsend commented Nov 17, 2025 •

edited

Loading

Uh oh!

rainbowsend commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

rainbowsend commented Sep 22, 2025

Uh oh!

hzfywhn commented Sep 22, 2025

Uh oh!

jvilaperez commented Sep 22, 2025

Uh oh!

rainbowsend commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hzfywhn commented Nov 11, 2025

Uh oh!

rainbowsend commented Nov 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hzfywhn commented Nov 13, 2025

Uh oh!

rainbowsend commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

rainbowsend commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rainbowsend commented Nov 11, 2025 •

edited

Loading

rainbowsend commented Nov 12, 2025 •

edited

Loading

rainbowsend commented Nov 17, 2025 •

edited

Loading