andersonfrailey
andersonfrailey
Thanks for pointing this out, @evtedeschi3. The deviations are not intentional. My intuition says this is probably caused by the scripts misidentifying the spouse when creating the record and assigning...
@martinholmer the next biggest step in taxdata development from my perspective is replacing the SAS code to make the CPS file with Python. I put that on hold the last...
We won't be able to impute the variables @codykallen mentioned using just the CPS data. We impute a few other variables in the PUF off of the CPS so we...
@reubenfb, I can't think of anything in the methodology that would result in the gap you've pointed out. I'll do some research to see if I can find a possible...
@MattHJensen said: > We may need to do the opposite: impute to the CPS off of the PUF rather than off of the CPS to the PUF. You're right. My...
Looking through the code, `e00900` in the CPS is the sum of the `semp_val` (Own business self-employment earnings, total value) variable in the CPS files. Unfortunately the CPS documentation doesn't...
Good point @codykallen. So would it be better if we split `e00900` into `e00900` and `e02000` and then used the new `e02000` variable to get at `e26270`?
Thanks for the breakdown, @codykallen. I'm working on a TaxData PR now.
As seen in PR #127, splitting up `e00900` isn't an effective method for getting `e02000` and `e26270`. I'm instead going to try and impute them. Here's a rough outline of...
Sure. The issue with using the same methods used when we impute the various deductions is that method uses the log of all deductions. That doesn't work with `e02000` and...