pandas-ai icon indicating copy to clipboard operation
pandas-ai copied to clipboard

MongoDB Connector for PandasAI

Open bsab opened this issue 1 year ago • 14 comments

🚀 The feature

Integrate a MongoDB connector into PandasAI to allow users to easily access and manipulate data stored in MongoDB databases. This connector will leverage PyMongoArrow to efficiently convert MongoDB query results into Apache Arrow, Numpy, and Pandas formats, improving performance during data manipulation and analysis.

Motivation, pitch

T simplify the process of accessing and manipulating data stored in MongoDB for PandasAI users. Additionally, by utilizing PyMongoArrow, users will benefit from efficient data conversion between MongoDB and the common data formats used in data analysis, such as Apache Arrow, Numpy, and Pandas.

Alternatives

Additional context

bsab avatar Nov 03 '23 16:11 bsab

That's a great idea! Do you want to work on this @bsab? How do you think we could handle "nested" schemas?

gventuri avatar Nov 10 '23 01:11 gventuri

@gventuri yes, I can work on this feature. About the nested schema, I think we can solve it via pymongoarrow: https://mongo-arrow.readthedocs.io/en/latest/schemas.html

bsab avatar Nov 10 '23 14:11 bsab

@bsab sounds great, also pymongoarrow seems a great solution for that. Just assigned to you :)

gventuri avatar Nov 11 '23 00:11 gventuri

Hey I can work on this too ?? Plz let me. @bsab

buddhiraz avatar Nov 11 '23 11:11 buddhiraz

Hi @buddhiraz, yes we can team up!

bsab avatar Nov 13 '23 14:11 bsab

Hello @buddhiraz, I've just create the branch "https://github.com/bsab/pandas-ai/tree/feat/connector/mongodb" and added you as contributor. Feel free to work on it ;)

bsab avatar Nov 14 '23 15:11 bsab

Sure, I will work on it then.

On Tue, Nov 14, 2023 at 9:11 PM Sab Severino @.***> wrote:

Hello @buddhiraz https://github.com/buddhiraz, I've just create the branch "https://github.com/bsab/pandas-ai/tree/feat/connector/mongodb" and added you as contributor. Feel free to work on it ;)

— Reply to this email directly, view it on GitHub https://github.com/gventuri/pandas-ai/issues/728#issuecomment-1810481821, or unsubscribe https://github.com/notifications/unsubscribe-auth/ALKSCKYCDVCTGPESGXXCNSDYEOGKHAVCNFSM6AAAAAA64VKJ26VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTQMJQGQ4DCOBSGE . You are receiving this because you were mentioned.Message ID: @.***>

buddhiraz avatar Nov 15 '23 03:11 buddhiraz

how are you progressing @bsab @buddhiraz. Happy to help :)

PizBernina avatar Nov 19 '23 20:11 PizBernina

hey @PizBernina, yes I've hust invited you to collaborate on the relative branch.

bsab avatar Nov 20 '23 16:11 bsab

HI @bsab, I am also working on a similar issue of integrating RediSearch to this. Please let me know how can I start to look into this as I am new to databases connectors. Thanks

Bhavay-2001 avatar Nov 27 '23 08:11 Bhavay-2001

Hello @bsab I would like to help you on this issue.

kartheekyakkala avatar Dec 27 '23 19:12 kartheekyakkala

Hi @bsab , I think I can help you on this enhancement by adding testcases.

While reading the code, I found that some methods and their functionalities for connectors are same. So I think I can straight away write unit test cases for those methods.

Kindly correct me if my thinking is wrong. Thanks... :)

Vimal-Kumar-V avatar Feb 04 '24 16:02 Vimal-Kumar-V

unfinished yet?

mikumifa avatar May 06 '24 14:05 mikumifa

Hey @mikumifa, I'm really sorry but I'm not able to work on this issue due to time constraints. If you'd like, I can assign it to you. Let me know what you think.

bsab avatar May 07 '24 09:05 bsab