[Issue]: How to configure GraphRAG with Azure OpenAI?
Is there an existing issue for this?
- [X] I have searched the existing issues
- [ ] I have checked #657 to validate if my issue is covered by community support
Describe the issue
encoding_model: cl100k_base
skip_workflows: []
llm:
  api_key: ${GRAPHRAG_API_KEY}
  type: azure_openai_chat
  model: gpt-4-turbo-preview
  model_supports_json: true # recommended if this is available for your model.
  api_base: https://openai-australiaeast-20240202.openai.azure.com/
  api_version: 2024-02-15-preview
  deployment_name: <azure_model_deployment_name>

parallelization:
  stagger: 0.3

async_mode: threaded # or asyncio

embeddings:
  async_mode: threaded # or asyncio
  llm:
    api_key: ${GRAPHRAG_API_KEY}
    type: azure_openai_embedding
    model: text-embedding-3-small
    api_base: https://openai-australiaeast-20240202.openai.azure.com/
    api_version: 2024-02-15-preview
    deployment_name: <azure_model_deployment_name>

chunks:
  size: 300
  overlap: 100
  group_by_columns: [id] # by default, we don't allow chunks to cross documents

input:
  type: file # or blob
  file_type: text # or csv
  base_dir: "input"
  file_encoding: utf-8
  file_pattern: ".*\.txt$"

cache:
  type: file # or blob
  base_dir: "cache"

storage:
  type: file # or blob
  base_dir: "output/${timestamp}/artifacts"

reporting:
  type: file # or console, blob
  base_dir: "output/${timestamp}/reports"

entity_extraction:
  prompt: "prompts/entity_extraction.txt"
  entity_types: [organization,person,geo,event]
  max_gleanings: 1

summarize_descriptions:
  prompt: "prompts/summarize_descriptions.txt"
  max_length: 500

claim_extraction:
  prompt: "prompts/claim_extraction.txt"
  description: "Any claims or facts that could be relevant to information discovery."
  max_gleanings: 1

community_reports:
  prompt: "prompts/community_report.txt"
  max_length: 2000
  max_input_length: 8000

cluster_graph:
  max_cluster_size: 10

embed_graph:
  enabled: false # if true, will generate node2vec embeddings for nodes

umap:
  enabled: false # if true, will generate UMAP embeddings for nodes

snapshots:
  graphml: false
  raw_entities: false
  top_level_nodes: false

local_search:

global_search:
I tried this configuration, but I get the error below. If I switch to an OpenAI API key there is no problem; the error occurs only with the Azure OpenAI API key.
❌ create_base_entity_graph
None
⠴ GraphRAG Indexer
├── Loading Input (InputFileType.text) - 1 files loaded (0 filtered) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 0:00:00 0:00:00
├── create_base_text_units
├── create_base_extracted_entities
├── create_summarized_entities
└── create_base_entity_graph
❌ Errors occurred during the pipeline run, see logs for more details.
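One thing that may be worth ruling out: the config as posted still shows `<azure_model_deployment_name>` for `deployment_name`, and it is not clear whether that is the real value or was redacted for posting. Below is a minimal sketch of a sanity check for one `llm:` block. The helper name `find_azure_config_problems` and the list of fields it checks are hypothetical, chosen from the keys in the config above; they are not part of GraphRAG, and the assumption that all three fields need real values for the Azure types is mine.

```python
import re

# Fields this sketch assumes must hold real values for the Azure types
# (an assumption drawn from the config above, not from GraphRAG's source).
AZURE_FIELDS = ("api_base", "api_version", "deployment_name")
PLACEHOLDER = re.compile(r"^<[^<>]+>$")  # e.g. "<azure_model_deployment_name>"


def find_azure_config_problems(llm: dict) -> list[str]:
    """Return human-readable problems found in one `llm:` mapping."""
    problems = []
    if not str(llm.get("type", "")).startswith("azure_"):
        return problems  # the checks below only apply to the Azure types
    for field in AZURE_FIELDS:
        value = llm.get(field)
        if value is None or str(value).strip() == "":
            problems.append(f"{field} is missing")
        elif PLACEHOLDER.match(str(value).strip()):
            problems.append(f"{field} is still a template placeholder: {value}")
    return problems


# The chat `llm:` block from the config above, as a plain dict.
chat_llm = {
    "type": "azure_openai_chat",
    "model": "gpt-4-turbo-preview",
    "api_base": "https://openai-australiaeast-20240202.openai.azure.com/",
    "api_version": "2024-02-15-preview",
    "deployment_name": "<azure_model_deployment_name>",
}

for problem in find_azure_config_problems(chat_llm):
    print(problem)
```

With the values as posted, this flags only `deployment_name`. The same check can be run against the `embeddings.llm` block, which carries the same placeholder.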
Steps to reproduce
No response
GraphRAG Config Used
# Paste your config here
Logs and screenshots
No response
Additional Information
- GraphRAG Version:
- Operating System:
- Python Version:
- Related Issues: