UrlDownload.jl

Excel support?

Open PyDataBlog opened this issue 5 years ago • 3 comments

Excel is one of the most popular data formats so it'd be great to have it supported like:

using UrlDownload
using DataFrames 

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/00352/Online%20Retail.xlsx"

df = urldownload(url) |> DataFrame

PyDataBlog avatar May 04 '20 23:05 PyDataBlog

The most popular package for Excel files is XLSX.jl. Unfortunately, it doesn't support reading from raw (in-memory) data. There is an issue for this, https://github.com/felipenoris/XLSX.jl/issues/26, but it has not been resolved yet.

Arkoniak avatar May 05 '20 10:05 Arkoniak

Most popular package for excel files is XLSX.jl. Unfortunately, it doesn't support raw data. There is an issue felipenoris/XLSX.jl#26, but it is not resolved currently.

For now, we can only wait for that issue to be fixed.

PyDataBlog avatar May 05 '20 10:05 PyDataBlog

Current workaround (with UrlDownload v0.3.0): save the raw download to a file, then read that file with XLSX.jl.

using UrlDownload
using DataFrames 
using XLSX

url = "https://archive.ics.uci.edu/ml/machine-learning-databases/00352/Online%20Retail.xlsx"
outfile = "/tmp/online_retail.xlsx"

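# parser = identity skips parsing; save_raw writes the downloaded bytes to outfile
urldownload(url, parser = identity, save_raw = outfile)
# "mysheet" is a placeholder; use the actual sheet name from the workbook
df = DataFrame(XLSX.readtable(outfile, "mysheet")...)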
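
If you'd rather not hard-code a path under /tmp, the same workaround can be wrapped in a small helper that uses a temporary directory. This is only a sketch: `xlsx_from_url` is a hypothetical name, not part of UrlDownload.jl or XLSX.jl. The `isa Tuple` branch assumes that some XLSX.jl releases return a `(data, column_labels)` tuple from `readtable` (hence the splat above), while others return a Tables.jl-compatible object.

```julia
using UrlDownload
using DataFrames
using XLSX

# Hypothetical helper (not part of UrlDownload.jl): download an .xlsx file
# into a temporary directory and read one sheet into a DataFrame.
function xlsx_from_url(url::AbstractString, sheet::AbstractString)
    # mktempdir removes the directory once the do-block finishes
    mktempdir() do dir
        path = joinpath(dir, "download.xlsx")
        # parser = identity skips parsing; save_raw writes the raw bytes to path
        urldownload(url, parser = identity, save_raw = path)
        tbl = XLSX.readtable(path, sheet)
        # Assumption: a Tuple result means (data, column_labels) and needs
        # splatting; anything else is passed to DataFrame directly
        tbl isa Tuple ? DataFrame(tbl...) : DataFrame(tbl)
    end
end
```

Usage would look like `df = xlsx_from_url(url, "mysheet")`, with "mysheet" replaced by the real sheet name. The DataFrame is fully built before the temporary directory is removed, so nothing depends on the file afterwards.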

Arkoniak avatar Jun 11 '20 09:06 Arkoniak