Not able to find `detect_duplicate` in the documentation or in the code

Open ayulockin opened this issue 9 months ago • 2 comments

I am trying to dedup my dataset. The Atlas Duplicate Clustering section of the documentation says: "Make sure to enable duplicate clustering by setting detect_duplicate = True when building a map". I cannot find this argument in the Atlas API reference or in this GitHub repo.

The bottom-line question: can I dedup my dataset using Atlas?

ayulockin avatar Oct 03 '23 03:10 ayulockin

Hi, there is an error in that documentation. The field is `duplicate_detection`, and it should be set to `True`. See https://docs.nomic.ai/atlas_api.html#nomic.project.AtlasProject.add_text
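
For readers landing here, a minimal sketch of passing the corrected field name when building a map. The helper names `build_map_options` and `build_deduplicated_map` are hypothetical, and the sketch assumes the installed `nomic` client exposes `atlas.map_data` with a `duplicate_detection` keyword; check the API reference for your version, since the entry points have changed between releases.

```python
from typing import Any, Dict, List


def build_map_options(indexed_field: str, **options: Any) -> Dict[str, Any]:
    """Hypothetical helper: keyword arguments for a map-building call.

    Turns duplicate detection on by default and rejects the misspelled
    name that appeared in the documentation.
    """
    if "detect_duplicate" in options:
        raise ValueError("use 'duplicate_detection', not 'detect_duplicate'")
    # Caller-supplied options are applied last, so they can override the default.
    return {"indexed_field": indexed_field, "duplicate_detection": True, **options}


def build_deduplicated_map(rows: List[Dict[str, Any]], indexed_field: str):
    """Hypothetical wrapper around the Atlas map-building call.

    Assumption: `atlas.map_data` exists in the installed nomic version and
    accepts `data`, `indexed_field`, and `duplicate_detection`. Calling this
    needs a logged-in nomic client and network access.
    """
    from nomic import atlas  # imported lazily so the helper above works offline

    return atlas.map_data(data=rows, **build_map_options(indexed_field))
```

Calling `build_deduplicated_map(rows, "text")` with a list of dictionaries that each carry a `"text"` key would then request a map with duplicate detection enabled, provided the assumptions above hold for your client version.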

AndriyMulyar avatar Oct 16 '23 04:10 AndriyMulyar

Updated link: https://docs.nomic.ai/reference/python-api/atlas-dataset#creating-an-atlas-map

AndriyMulyar avatar Jan 19 '24 05:01 AndriyMulyar