XLSX ingestion: integers are converted to decimal numbers

Open stevenferey opened this issue 3 months ago • 5 comments

Bug description

This follows a discussion in the Google Group Dataverse:
https://groups.google.com/g/dataverse-community/c/75OO7gDNpsw/

When ingesting an XLSX file, any non-zero numeric value is always interpreted as a decimal number.
This results in integers being automatically converted into decimals:

  • 2 → 2.0
  • -2 → -2.0
  • 2.5 → 2.5

This behavior can be problematic in some use cases (e.g., phone numbers, street numbers).

Notes:

  • This issue does not occur when ingesting a CSV file, where integers remain integers.
  • Reproducible example dataset: test dataset.

Steps to reproduce

  1. Create an XLSX file containing a column with integer values.
  2. Ingest into Dataverse.
  3. Observe that all integers are converted into decimals.

Expected behavior

  • Keep integers as integers during ingestion unless a decimal value is explicitly present.
    • 2 → 2
    • -2 → -2
    • 2.5 → 2.5
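
The expected behavior above could be sketched roughly as follows. This is a minimal illustration in Python, not Dataverse's actual ingest code: the function names `infer_column_type` and `format_column` are hypothetical, and it assumes the type is decided per column, from numeric cells that an XLSX reader hands back as floating-point values.

```python
import math

def infer_column_type(values):
    """Return "integer" if every non-empty value is a whole number, else "decimal".

    Hypothetical helper: assumes `values` holds floats (or None for empty cells),
    which is how numeric XLSX cells typically arrive from a spreadsheet reader.
    """
    nums = [v for v in values if v is not None]
    if nums and all(math.isfinite(v) and float(v).is_integer() for v in nums):
        return "integer"
    return "decimal"

def format_column(values):
    """Render a column as strings, keeping whole-number columns free of ".0"."""
    kind = infer_column_type(values)
    out = []
    for v in values:
        if v is None:
            out.append("")                # empty cell stays empty
        elif kind == "integer":
            out.append(str(int(v)))       # 2.0 is written as "2"
        else:
            out.append(repr(float(v)))    # a decimal column keeps "2.0", "2.5"
    return out
```

With this sketch, a column holding only `2.0` and `-2.0` would be written as `2` and `-2`, while a column that also contains `2.5` would stay decimal throughout.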

Impact

  • Incorrect data representation in business contexts.
  • Inconsistency between XLSX ingestion and CSV ingestion.

Affected versions

  • All versions of Dataverse

Help is always welcome. Is this bug something you or your organization plan to fix?

  • This is not currently planned by the team.

stevenferey avatar Oct 13 '25 09:10 stevenferey

Hi @stevenferey, I’m interested in working on this issue. Could you please assign it to me? Cheers.

pizofreude avatar Oct 13 '25 16:10 pizofreude

Hello @pizofreude, I don't have the rights to assign you the issue, but you can definitely work on it! Thanks

stevenferey avatar Oct 14 '25 09:10 stevenferey

@pizofreude assigned! Please go for it! If you have any questions, you can ask here or in #dev at https://dataverse.zulipchat.com

pdurbin avatar Oct 14 '25 12:10 pdurbin

@pizofreude hi! Just checking in. Do you need any help getting started?

pdurbin avatar Nov 18 '25 13:11 pdurbin

@pdurbin hi, I'm finishing a DE course right now, which took more time than I had initially planned, so I had to compromise my open source contribution atm. Will get back to it once I'm done with the course. Cheers and sorry for the wait!

pizofreude avatar Nov 21 '25 12:11 pizofreude