Junteng (Jason) Li
Junteng (Jason) Li
Previously, the mfu timing measurement was taken before the CPU-GPU sync point at every iter. The resulting `running_mfu`: - would converge correctly when `log_interval = 1`. - could converge to...
## Environment - Platform (select one): - [ ] Anthropic API - [ ] AWS Bedrock - [x] Google Vertex AI - [ ] Other: - Claude CLI version: 1.0.10...
### Describe the bug `ElasticNetCV` internally computes `alpha_max` that is just big enough to push all coef to 0. The existing code (in `_alpha_grid()`) does not take `positive` as an...
#### Reference Issues/PRs Fix #32767. #### What does this implement/fix? Explain your changes. 1. Make alpha_max and the alpha search range correct for `ElasticNetCV(positive=True)` and `LassoCV(positive=True)`. 2. Add test case...