[WIP]: normalize_shocks and normalize_levels allow user to impose true propositions on sims#1094
Conversation
Codecov Report
@@ Coverage Diff @@
## master #1094 +/- ##
==========================================
- Coverage 73.65% 73.64% -0.01%
==========================================
Files 69 69
Lines 10579 10595 +16
==========================================
+ Hits 7792 7803 +11
- Misses 2787 2792 +5
Continue to review full report at Codecov.
@Mv77, could you review this? And, if you have the right permissions, merge it? It passes all tests and should not affect any existing results, since if the normalize booleans are not set then all it does is divide some things by 1.0. PS. I'm interested to see how much improvement there is from this versus from the Harmenberg thing. And, I can't see any reason they couldn't be combined, to achieve even more improvement. What I really want to do, though, is to get the "reshuffling" technology implemented. @wdu9, you might be interested too. Thanks for your earlier input -- you put me on the right track.
@llorracc I am starting to take a look now. Will get to the code soon, but I have a couple of conceptual questions.
This is exactly right. It goes back to whether we are thinking of our discretizations as defining an exact model that is being solved on its own terms, or whether we are thinking of them as approximations of a "true" model in which the shock is, say, really continuously lognormally distributed. My preference is strongly for the latter, because it is the more general formulation. If two solutions to a model differ because one used a 5 point and the other used a 7 point approximation, then whichever of them is closer to what you get when you have an infinity-point approximation is the one that is defined as being "closer" to the truth. I'd rather do shuffling than dividing, because shuffling has the virtue that the simulated outcomes match numerically identically the computations that went into the calculation of the expectations, and it is deeply attractive to have identical calculations going into the solution and the simulation phases. But implementing shuffling would require considerably more work, and it is possible that dividing by the mean gets 95 percent of the benefits -- that is something I want to figure out.
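The "shuffling vs. dividing" distinction described above can be sketched in a few lines of numpy. Everything here is illustrative: the 5-point equiprobable atoms and the population size are made up for the example and are not HARK's actual discretization.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 5-point equiprobable discretization of a mean-one shock
atoms = np.array([0.80, 0.92, 1.00, 1.08, 1.20])
n_agents = 10_000  # chosen divisible by the number of atoms

# (a) "Dividing": draw iid from the atoms, then rescale by the sample mean,
# which forces the cross-sectional mean to be exactly 1
draws = rng.choice(atoms, size=n_agents)
divided = draws / draws.mean()

# (b) "Shuffling": give each atom to exactly n_agents/5 agents and permute,
# so the simulated cross-section reproduces the discretization identically
shuffled = rng.permutation(np.repeat(atoms, n_agents // len(atoms)))
```

Both cross-sections end up with mean one, but only the shuffled one contains exactly the same values, with exactly the same frequencies, that entered the expectations in the solution step.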
No, PermGroFac is a separate object from PermShk. Throughout, we have always insisted that E[PermShk]=1, and have handled either life cycle patterns or aggregate growth using PermGroFac. So, it's not a problem to impose E[PermShk]=1.
This may be a very good catch: I have not examined the simulation code carefully enough to determine whether, as I had assumed, the draw of the permanent shock occurs before the calculation of b or m. If it does, then we're fine. That was the case in my original Mathematica code, so I had assumed it is the case in our HARK simulations, but if not then this step may need to be moved to some earlier point. (Though, if that is the case, maybe some renaming will be in order, since I think of "transitions" as defining how you get from t-1 to t, and if by the time you get to "transitions" some of that has already been done then our nomenclature may not be ideal).
Fully agree, but you are imposing it here: HARK/ConsumptionSaving/ConsIndShockModel.py, line 1809, at commit 8f0057e.
Here is the relevant code: HARK/ConsumptionSaving/ConsIndShockModel.py, lines 1795 to 1816, at commit 8f0057e. Notice
Oh, yes, you're right about that. When I did that my thought was "the right way to handle this is to have an aggregate PLvl variable that tracks the aggregate movements and an idiosyncratic pLvl whose mean should always be 1 but I have a sneaking suspicion we have not done it that way ... even though it's being done that way in the particular case I'm working with right now (Harmenberg-Aggregation)."
Pull request overview
This work-in-progress PR implements shock and level normalization during simulations to improve efficiency. The approach imposes known population properties (e.g., that mean permanent and transitory shocks equal 1) on stochastic draws.
Changes:
- Added two new boolean parameters, normalize_shocks and normalize_levels, to control normalization behavior
- Modified the transition() method in PerfForesightConsumerType to normalize permanent income levels when enabled
- Modified the get_shocks() method in IndShockConsumerType to normalize permanent and transitory shocks when enabled
    if not hasattr(self, "normalize_levels"):
        self.normalize_levels = False

    if self.normalize_levels == True:
Using '== True' is redundant in Python. The idiomatic way to check boolean values is to use 'if self.normalize_levels:' instead of 'if self.normalize_levels == True:'.
Suggested change:
-   if self.normalize_levels == True:
+   if self.normalize_levels:
    PermShkMeanNow = np.mean(IncShks[0])
    TranShkMeanNow = np.mean(IncShks[1])

    PermShkNow[these] = (
-       IncShks[0, :] * PermGroFacNow
+       (IncShks[0, :] * PermGroFacNow
+        / PermShkMeanNow)  # Divide by 1.0 if normalize_shocks=False
    )  # permanent "shock" includes expected growth
-   TranShkNow[these] = IncShks[1, :]
+   TranShkNow[these] = IncShks[1, :] / TranShkMeanNow
Potential division by zero issue. If the sampled shocks have a mean of exactly zero (or very close to zero), dividing by PermShkMeanNow or TranShkMeanNow will cause division by zero or numerical instability. This should be guarded against, perhaps by only normalizing when the mean is above a certain threshold.
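One way to guard against this is sketched below; the helper function and the tolerance value are illustrative choices for the example, not part of HARK.

```python
import numpy as np

def safe_normalize(shocks, tol=1e-8):
    """Divide shocks by their sample mean, but only when that mean is
    safely away from zero; otherwise return them unchanged.
    The tolerance is an illustrative choice, not a HARK constant."""
    mean = np.mean(shocks)
    if np.abs(mean) < tol:
        return shocks  # skip normalization rather than divide by ~0
    return shocks / mean

normalized = safe_normalize(np.array([0.5, 1.5]))  # mean is 1, so unchanged
```

For the income shocks in this PR the sample mean should never be near zero (the shocks are positive with population mean one), so in practice the guard mainly protects against degenerate or misconfigured distributions.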
    # In the limit, it cannot hurt to impose "true" restrictions,
    # like the fact that the mean value of the shocks should be one
    PermShkMeanNow, TranShkMeanNow = 1.0, 1.0  # Dividing by 1 changes nothing
    if self.normalize_shocks == True:
Using '== True' is redundant in Python. The idiomatic way to check boolean values is to use 'if self.normalize_shocks:' instead of 'if self.normalize_shocks == True:'.
    PermShkMeanNow = np.mean(IncShkDstnNow.X[0][EventDraws])
    TranShkMeanNow = np.mean(IncShkDstnNow.X[1][EventDraws])

    PermShkNow[these] = (
-       IncShkDstnNow.X[0][EventDraws] * PermGroFacNow
+       (IncShkDstnNow.X[0][EventDraws] * PermGroFacNow
+        / PermShkMeanNow)  # Divide by 1.0 if normalize_shocks=False
    )  # permanent "shock" includes expected growth
-   TranShkNow[these] = IncShkDstnNow.X[1][EventDraws]
+   TranShkNow[these] = IncShkDstnNow.X[1][EventDraws] / TranShkMeanNow
Potential division by zero issue. If the sampled shocks have a mean of exactly zero (or very close to zero), dividing by PermShkMeanNow or TranShkMeanNow will cause division by zero or numerical instability. This should be guarded against, perhaps by only normalizing when the mean is above a certain threshold.
    PermShkNow[these] = (
-       IncShkDstnNow.X[0][EventDraws] * PermGroFacNow
+       (IncShkDstnNow.X[0][EventDraws] * PermGroFacNow
+        / PermShkMeanNow)  # Divide by 1.0 if normalize_shocks=False
    )  # permanent "shock" includes expected growth
The normalization logic may be mathematically incorrect. When normalize_shocks is True, the permanent shock is computed as (IncShkDstnNow.X[0][EventDraws] * PermGroFacNow / PermShkMeanNow). However, PermShkMeanNow is calculated from the raw shock values (IncShkDstnNow.X[0][EventDraws]), not from the values after multiplying by PermGroFacNow. This means the normalization is forcing the mean of the raw shocks to be 1, but the actual permanent shocks (which include PermGroFacNow) will have a mean of PermGroFacNow, not 1. If the intent is to normalize the final permanent shocks to have mean PermGroFacNow, then the calculation should be: PermShkMeanNow = np.mean(IncShkDstnNow.X[0][EventDraws] * PermGroFacNow).
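The distinction is easy to verify numerically. The growth factor and lognormal calibration below are illustrative, not HARK defaults. Note that a mean of exactly PermGroFacNow for the combined "shock" may in fact be the intended behavior, since elsewhere in the thread it is stated that PermGroFac is the object meant to carry growth while E[PermShk] = 1.

```python
import numpy as np

rng = np.random.default_rng(1)
PermGroFacNow = 1.02  # hypothetical growth factor
raw = rng.lognormal(mean=-0.005, sigma=0.1, size=1_000)

# As written in the PR: normalize the *raw* shocks by their sample mean,
# then apply growth. The raw shocks now have mean exactly 1 ...
normalized_raw = raw / raw.mean()

# ... so the combined "shock" has mean exactly PermGroFacNow, not 1.
perm_shk = normalized_raw * PermGroFacNow
```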
    PermShkNow[these] = (
-       IncShks[0, :] * PermGroFacNow
+       (IncShks[0, :] * PermGroFacNow
+        / PermShkMeanNow)  # Divide by 1.0 if normalize_shocks=False
    )  # permanent "shock" includes expected growth
The normalization logic may be mathematically incorrect. When normalize_shocks is True, the permanent shock is computed as (IncShks[0, :] * PermGroFacNow / PermShkMeanNow). However, PermShkMeanNow is calculated from the raw shock values (IncShks[0]), not from the values after multiplying by PermGroFacNow. This means the normalization is forcing the mean of the raw shocks to be 1, but the actual permanent shocks (which include PermGroFacNow) will have a mean of PermGroFacNow, not 1. If the intent is to normalize the final permanent shocks to have mean PermGroFacNow, then the calculation should be: PermShkMeanNow = np.mean(IncShks[0] * PermGroFacNow).
    "normalize_shocks": False,  # In sims, normalize mean of collection of shocks to population mean
    "normalize_levels": False,  # In sims, normalize mean of a level variable (like permanent income) to the population mean
The new parameters normalize_shocks and normalize_levels are added to the init_idiosyncratic_shocks dictionary but are not documented in the class docstring. The IndShockConsumerType class docstring (lines 2061-2072) should be updated to document these new parameters, explaining what they do and when they should be used.
    if not hasattr(self, "normalize_shocks"):
        self.normalize_shocks = False

    if not hasattr(self, "normalize_levels"):
        self.normalize_levels = False

    if self.normalize_levels == True:
        pLvlNowMean = np.mean(pLvlNow)

    pLvlNow = pLvlNow / pLvlNowMean  # Divide by 1.0 if normalize_levels=False
The normalize_levels parameter is used in the PerfForesightConsumerType.transition() method but is only added to init_idiosyncratic_shocks, not to init_perfect_foresight. This means that PerfForesightConsumerType agents won't have this parameter by default, and the hasattr check will always set it to False. If normalize_levels is intended to be used by PerfForesightConsumerType, it should also be added to init_perfect_foresight with a default value of False.
    if not hasattr(self, "normalize_levels"):
        self.normalize_levels = False
Inconsistent spacing in comment. There's trailing whitespace after the inline comment, which should be removed for consistency with Python style guidelines.
Suggested change (trailing whitespace removed):
    if not hasattr(self, "normalize_levels"):
        self.normalize_levels = False
    if not hasattr(self, "normalize_shocks"):
        self.normalize_shocks = False

    if not hasattr(self, "normalize_levels"):
        self.normalize_levels = False

    if self.normalize_levels == True:
Using hasattr checks in the transition method is problematic. These checks will only be executed once per simulation step, and if the attributes don't exist initially, they'll be set to False dynamically. This approach is fragile and unconventional. Instead, the parameters should be initialized properly in the agent's initialization or retrieved from the parameter dictionary. Consider using getattr with a default value instead, such as: self.normalize_shocks = getattr(self, 'normalize_shocks', False)
Suggested change:
-   if not hasattr(self, "normalize_shocks"):
-       self.normalize_shocks = False
-   if not hasattr(self, "normalize_levels"):
-       self.normalize_levels = False
-   if self.normalize_levels == True:
+   # Use getattr with a default instead of dynamically creating attributes here
+   if getattr(self, "normalize_levels", False):
This implements a simple version of an old idea from CDC's Mathematica code: simulation efficiency can be substantially improved by imposing on the stochastic draws facts that we know are true in the population, such as that the average values of the permanent and transitory shocks are 1 in each period.
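A toy Monte Carlo illustrates the efficiency gain from this idea. The lognormal calibration and sample sizes below are illustrative, not HARK defaults.

```python
import numpy as np

rng = np.random.default_rng(42)
n_agents, n_reps = 500, 2_000

# Compare the cross-sectional mean of raw vs. mean-normalized shocks
# across many independent simulations
raw_means, norm_means = [], []
for _ in range(n_reps):
    shk = rng.lognormal(mean=-0.02, sigma=0.2, size=n_agents)
    raw_means.append(shk.mean())
    norm_means.append((shk / shk.mean()).mean())  # exactly 1 by construction

# Raw sample means wobble around 1 from simulation to simulation;
# normalized means are exactly 1, so that source of noise is eliminated.
raw_spread, norm_spread = np.std(raw_means), np.std(norm_means)
```

Normalization removes one source of cross-simulation noise entirely, which is why dividing by the sample mean may capture much of the benefit of the more elaborate shuffling approach discussed earlier in the thread.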