timesfm icon indicating copy to clipboard operation
timesfm copied to clipboard

Is this normal to use 12G GPU memory to load the 200m model with default parameter?

Open keefeleen opened this issue 9 months ago • 2 comments

Our code (following the sample)

model = timesfm.TimesFm(
    context_len=128,
    horizon_len=5,
    input_patch_len=32,
    output_patch_len=128,
    num_layers=20,
    model_dims=1280,
    backend=backend,
)
model.load_from_checkpoint(repo_id="google/timesfm-1.0-200m")

then our process gives the GPU memory usage around 12237MiB

image

keefeleen avatar May 15 '24 07:05 keefeleen

that is a lot

blackcat1402 avatar May 16 '24 08:05 blackcat1402

I have 64gb of ram and 64 of swap and it oom XD sooooo

R3xpook avatar May 20 '24 10:05 R3xpook

Can you try setting the environment variable: XLA_PYTHON_CLIENT_PREALLOCATE=false ?

rajatsen91 avatar May 20 '24 15:05 rajatsen91