arrow
arrow copied to clipboard
[Python] `pyarrow.fs.HadoopFileSystem` throws OSError: Unable to load libhdfs
Describe the bug, including details regarding any error messages, version, and platform.
I'm trying to create an HDFS Connection via pyarrow.fs.HadoopFileSystem, but unfortunately I get an error:
from pyarrow.fs import HadoopFileSystem
hdfs = HadoopFileSystem(
host="localhost",
port=8001,
)
OSError: Unable to load libhdfs: Das angegebene Modul wurde nicht gefunden.
From https://arrow.apache.org/docs/python/filesystems.html#hadoop-distributed-file-system-hdfs I understand that libhdfs.so should be located in %HADOOP_HOME%lib/native/, which is the case. I also set the CLASSPATH environment variable to %HADOOP_HOME%/bin/hadoop.
What am I missing?
I use pyarrow==15.0.0.
Component(s)
Python
still watching this, :")
@oendnsk675 Could you share what you did and what's happened with it?