Forbid duplicate data with labels from being ingested
This throws a form error if duplicate data with old labels is ingested through the update data pipeline. Does not affect project creation data upload.
The error shows which rows in the uploaded data contain this problem.
I think the part where it shows which rows are problem-causing is not user-friendly.
Currently, it shows them as CSV, but pandas has a to_html function for dataframes which could look better. It does not work well with Django form errors though, because the output is displayed as a plain string rather than rendered as HTML.
Current error message:

Also, what do you guys think about limiting the number of rows that are displayed? Currently, it shows all rows.
I'd say list the first 5 or 10, then just say the number of remaining differences. The error message is a bit confusing. Are you saying the data is already there and labeled, or just that it's there at all?
Should mean that the data is there, either unlabelled or labelled.
@AstridKery I changed it to "The following uploaded text + metadata combinations are already in the database and the uploaded labels will not be reflected in the database:". Would that make more sense?
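On the earlier suggestion to list only the first 5 or 10 rows and then give a count of the rest, a rough sketch of what that could look like. `summarize_rows` and its `limit` parameter are hypothetical names, and the rows are assumed to already be available as a list of strings.

```python
def summarize_rows(rows, limit=5):
    """Return the first `limit` rows, plus a count of any rows left out."""
    shown = rows[:limit]
    remaining = len(rows) - len(shown)
    lines = [str(row) for row in shown]
    if remaining > 0:
        # Only mention the remainder when something was actually cut off.
        lines.append(f"... and {remaining} more row(s)")
    return "\n".join(lines)


# Illustrative data only: 12 duplicate rows, 5 shown.
rows = [f"text {i}" for i in range(12)]
message = summarize_rows(rows, limit=5)
```

With a limit of 5 and 12 duplicates, the message lists five rows followed by a line saying 7 more were found. Short uploads are shown in full with no trailing count.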