graphrag
graphrag copied to clipboard
SQL Server Storage Support for Indexing Pipeline
Description
This PR adds SQL Server as a storage option for parquet outputs within the indexing pipeline.
Related Issues
This PR is similar to the CosmosDB storage support PR(s) that was completed previously.
Proposed Changes
- Implement the SQLServerPipelineStorage class to interface parquet file outputs with SQL Server
- Update the factory class to include the SQL server option
- Update the output configs and config unit tests as needed
Checklist
- [x] I have tested these changes locally.
- [x] I have reviewed the code changes.
- [ ] I have updated the documentation (if necessary).
- [x] I have added appropriate unit tests (if applicable).
Additional Notes
NOTE: This PR is scoped to only handle parquet file outputs in the indexing pipeline. This PR does not support using SQL server storage for cache or vector embedding outputs