ideas icon indicating copy to clipboard operation
ideas copied to clipboard

DataStore column name mapping

Open davidread opened this issue 6 years ago • 2 comments

When importing a CSV into DataStore, the column names are set (by DataPusher/Express Loader) to be the same in the database as they are in the CSV.

This is problematic:

How about we get DataPusher/Express Loader to make some sensible changes to column names before it goes into the database, and then the mapping is saved for:

  • users can see it in the web interface
  • users can get it via an API
  • users can edit it in the web interface
  • subsequent DataPusher/Express Loader runs reuse the same mapping if possible

davidread avatar Mar 02 '18 14:03 davidread

+1

We could also have datapusher and xloader populate the data dictionaries 'labels' with the original column names.

As for taking a bunch of column names and transforming them into cleaner versions how about adding an action that will "preview" the transformation. That lets sites potentially override it if they have different naming preferences or even revert to the old behaviour if they depend on it.

wardi avatar Mar 02 '18 15:03 wardi

+1 on overriding unidecode(). It is removing French accents from the column headers of our French datasets, which is not what we want. Any plans to allow this? (I left a comment here https://github.com/ckan/ckanext-xloader/issues/145).

KatiRG avatar May 04 '23 18:05 KatiRG