data-diff icon indicating copy to clipboard operation
data-diff copied to clipboard

Support UUID x Text comparison

Open MattDelac opened this issue 2 years ago • 10 comments

Describe the bug Matt Delac 6:38 PM Hi @Erez Shinan, follow up of my PR. It sounds like we also need to support UUID x Text?

TypeError: Incompatible types for column 'owner_id': String_UUID() <-> Text()

On my use case, it happens between MySQL UUID <> Redshift Text. I am not sure where UUID fits the class Dialect object though

Describe the environment

Describe which OS you're using, which sqeleton version, and any other information that might be relevant to this bug.

MattDelac avatar Apr 12 '23 11:04 MattDelac

Any update on this issue? I'm seeing this happen on Postgres where the column is being converted to "String_UUID" in one table, but is classified as "Text" in the other table due to detection of non-UUID values.

timothyhinh79 avatar Jun 05 '23 18:06 timothyhinh79

Any update on this issue? I'm seeing this happen on Postgres where the column is being converted to "String_UUID" in one table, but is classified as "Text" in the other table due to detection of non-UUID values.

Hi @timothyhinh79, we're not currently working on this but may in the future, in the meantime we'd happily accept a community contribution

dlawin avatar Jun 06 '23 16:06 dlawin

@dlawin - I had this exact issue when comparing snowflake and postgres and I was able to come up with a simple approach that will overcome this issue in my fork. May be that can help. I will push it out and hopefully it can overcome your issue too.

ramdharam avatar Jun 27 '23 15:06 ramdharam

This issue has been marked as stale because it has been open for 60 days with no activity. If you would like the issue to remain open, please comment on the issue and it will be added to the triage queue. Otherwise, it will be closed in 7 days.

github-actions[bot] avatar Aug 27 '23 06:08 github-actions[bot]

Although we are closing this issue as stale, it's not gone forever. Issues can be reopened if there is renewed community interest. Just add a comment and it will be reopened for triage.

github-actions[bot] avatar Sep 04 '23 06:09 github-actions[bot]

@dlawin - I had this exact issue when comparing snowflake and postgres and I was able to come up with a simple approach that will overcome this issue in my fork. May be that can help. I will push it out and hopefully it can overcome your issue too.

Hey did you get that working?

MattDelac avatar Nov 02 '23 14:11 MattDelac

I still encounter this issue on the latest version (0.9.9) ...

MattDelac avatar Nov 02 '23 14:11 MattDelac

I think once we get https://github.com/datafold/data-diff/issues/762 worked out, it will resolve this issue too

mrn3 avatar Nov 02 '23 21:11 mrn3

I still encounter this issue on the latest version (0.9.9) ...

@MattDelac do you get this warning?

Mixed UUID/Non-UUID values detected in column {'.'.join(table_path)}.{col_name}, disabling UUID support

dlawin avatar Nov 02 '23 22:11 dlawin

I still encounter this issue on the latest version (0.9.9) ...

@MattDelac do you get this warning?

Mixed UUID/Non-UUID values detected in column {'.'.join(table_path)}.{col_name}, disabling UUID support

No now I get the following when I compare UUID (MySQL) to BINARY (Snowflake) image

MattDelac avatar Nov 02 '23 23:11 MattDelac

Hi @MattDelac,

I'm sorry for the delay in following up on this. Thank you for taking the time to open this PR!

We made a hard decision to sunset the data-diff package and won't provide further development or support.

W.r.t UUID x Text diffing. If that helps at all, we currently support it in Datafold Cloud with the caveat that upper/lower casing and "-" characters should match.

-Gleb

glebmezh avatar May 17 '24 13:05 glebmezh