graphrag icon indicating copy to clipboard operation
graphrag copied to clipboard

SQL Server Storage Support for Indexing Pipeline

Open KennyZhang1 opened this issue 8 months ago • 0 comments

Description

This PR adds SQL Server as a storage option for parquet outputs within the indexing pipeline.

Related Issues

This PR is similar to the CosmosDB storage support PR(s) that was completed previously.

Proposed Changes

  • Implement the SQLServerPipelineStorage class to interface parquet file outputs with SQL Server
  • Update the factory class to include the SQL server option
  • Update the output configs and config unit tests as needed

Checklist

  • [x] I have tested these changes locally.
  • [x] I have reviewed the code changes.
  • [ ] I have updated the documentation (if necessary).
  • [x] I have added appropriate unit tests (if applicable).

Additional Notes

NOTE: This PR is scoped to only handle parquet file outputs in the indexing pipeline. This PR does not support using SQL server storage for cache or vector embedding outputs

KennyZhang1 avatar Apr 17 '25 18:04 KennyZhang1