Kevin Hu

Results 14 issues of Kevin Hu

## Checklist

- [x] The PR conforms to DataHub's [Contributing Guideline](https://github.com/linkedin/datahub/blob/master/docs/CONTRIBUTING.md) (particularly [Commit Message Format](https://github.com/linkedin/datahub/blob/master/docs/CONTRIBUTING.md#commit-message-format))
- [ ] Links to related issues (if applicable)
- [x] Tests for the changes...

ingestion

The `concat` method for joining multiple DataFrames appears to be missing several arguments, such as `join`, `keys`, `levels`, and more. https://github.com/predictive-analytics-lab/data-science-types/blob/faebf595b16772d3aa70d56ea179a2eaffdbd565/pandas-stubs/__init__.pyi#L37-L42 Compare to the Pandas docs: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.concat.html
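For context, here is a minimal sketch of a call that uses two of the arguments named above, `join` and `keys`, as documented for `pandas.concat`. The frames and the key labels `"first"` and `"second"` are made up for illustration and are not taken from the issue.

```python
import pandas as pd

# Two illustrative frames that share only column "a" (hypothetical data).
df1 = pd.DataFrame({"a": [1, 2], "b": [3, 4]})
df2 = pd.DataFrame({"a": [5, 6], "c": [7, 8]})

# join="inner" keeps only the columns present in every frame;
# keys=... adds an outer index level labelling each input frame.
result = pd.concat([df1, df2], join="inner", keys=["first", "second"])

print(result)
```

A stub that omits `join` and `keys` would presumably make a type checker reject a call like this even though it is valid at runtime.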

Script used:

```python
import pandas as pd

df: pd.DataFrame = pd.DataFrame([[1, 2], [1, 4]], columns=["a", "b"], index=["c", "d"])
df.drop_duplicates(subset=["a"], inplace=True)
print(df)
```

Script used:

```python
import pandas as pd

x: pd.DataFrame = pd.read_hdf("your_hdf_here.hdf")
```

Script used:

```python
import numpy as np

arr: np.ndarray[np.float32] = np.array([0, 1, np.inf], dtype=np.float32)
print(np.isfinite(arr))
```

Script used:

```python
import pandas as pd

df: pd.DataFrame = pd.DataFrame([[1, 2], [3, 4]], columns=["a", "b"], index=["c", "d"])
grouped = df.groupby("a")["b"]
grouped_list = grouped.apply(list)
print(df)
print(grouped)
print(grouped_list)
print(grouped.groups)
print(grouped.get_group(1))
```

I'm building an app that has some high-resolution textures, and I was exploring storage methods for the textures. Some of my textures have partial opacities, so I can't use JPEGs...

Hi! This is a super useful tool. I'm using this package for a scraper, and I ran into a segfault when running minify_html 0.11.1 (default settings) with this particular website:...

enhancement

Hello! When I install sophia via `pip install Sophia-Optimizer` and run `from Sophia import SophiaG`, I get `ModuleNotFoundError: No module named 'Sophia'`. When I try running `import sophia`, I get...

Is there a recommended way to run data parallel inference (i.e. a copy of the model on each GPU)? It's possible by hacking CUDA_VISIBLE_DEVICES, but I was wondering if there's...
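The `CUDA_VISIBLE_DEVICES` workaround mentioned above could look roughly like the sketch below: one worker process per GPU, each seeing a single device and handling its own slice of the inputs. This is only an illustration under stated assumptions. The worker script name `infer.py`, its command-line interface, and the helper names are hypothetical and are not part of any library.

```python
import os
import subprocess
import sys
from typing import Dict, List, Sequence


def shard(items: Sequence[str], num_shards: int) -> List[List[str]]:
    """Split items round-robin into num_shards lists."""
    return [list(items[i::num_shards]) for i in range(num_shards)]


def worker_env(gpu_id: int) -> Dict[str, str]:
    """Copy the current environment and expose only one GPU to the child."""
    env = dict(os.environ)
    env["CUDA_VISIBLE_DEVICES"] = str(gpu_id)
    return env


def launch_workers(inputs: Sequence[str], num_gpus: int) -> List[subprocess.Popen]:
    """Start one process per GPU; each loads its own copy of the model.

    "infer.py" is a hypothetical script that takes input paths as arguments.
    """
    procs = []
    for gpu_id, chunk in enumerate(shard(inputs, num_gpus)):
        cmd = [sys.executable, "infer.py", *chunk]
        procs.append(subprocess.Popen(cmd, env=worker_env(gpu_id)))
    return procs
```

Inside each worker the single visible GPU appears as device 0, so the inference code itself needs no per-GPU changes.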

feature request