graphrag icon indicating copy to clipboard operation
graphrag copied to clipboard

[Issue]: <title> how to configure this with azure-openai for GraphRAG?

Open krishkarnan opened this issue 1 year ago • 5 comments

Is there an existing issue for this?

  • [X] I have searched the existing issues
  • [ ] I have checked #657 to validate if my issue is covered by community support

Describe the issue

encoding_model: cl100k_base skip_workflows: [] llm: api_key: ${GRAPHRAG_API_KEY} type: azure_openai_chat model: gpt-4-turbo-preview model_supports_json: true # recommended if this is available for your model. api_base: https://openai-australiaeast-20240202.openai.azure.com/ api_version: 2024-02-15-preview deployment_name: <azure_model_deployment_name>

parallelization: stagger: 0.3

async_mode: threaded # or asyncio

embeddings:

async_mode: threaded # or asyncio llm: api_key: ${GRAPHRAG_API_KEY} type: azure_openai_embedding model: text-embedding-3-small api_base: https://openai-australiaeast-20240202.openai.azure.com/ api_version: 2024-02-15-preview deployment_name: <azure_model_deployment_name>

chunks: size: 300 overlap: 100 group_by_columns: [id] # by default, we don't allow chunks to cross documents

input: type: file # or blob file_type: text # or csv base_dir: "input" file_encoding: utf-8 file_pattern: ".*\.txt$"

cache: type: file # or blob base_dir: "cache"

storage: type: file # or blob base_dir: "output/${timestamp}/artifacts"

reporting: type: file # or console, blob base_dir: "output/${timestamp}/reports"

entity_extraction: prompt: "prompts/entity_extraction.txt" entity_types: [organization,person,geo,event] max_gleanings: 1

summarize_descriptions: prompt: "prompts/summarize_descriptions.txt" max_length: 500

claim_extraction: prompt: "prompts/claim_extraction.txt" description: "Any claims or facts that could be relevant to information discovery." max_gleanings: 1

community_reports: prompt: "prompts/community_report.txt" max_length: 2000 max_input_length: 8000

cluster_graph: max_cluster_size: 10

embed_graph: enabled: false # if true, will generate node2vec embeddings for nodes

umap: enabled: false # if true, will generate UMAP embeddings for nodes

snapshots: graphml: false raw_entities: false top_level_nodes: false

local_search:

global_search:

I tried this, but I get this error, but if I change OpenAI API key, no problem, the problem is only with Azure OpenAI API key.

❌ create_base_entity_graph None ⠴ GraphRAG Indexer ├── Loading Input (InputFileType.text) - 1 files loaded (0 filtered) ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 100% 0:00:00 0:00:00 ├── create_base_text_units ├── create_base_extracted_entities ├── create_summarized_entities └── create_base_entity_graph ❌ Errors occurred during the pipeline run, see logs for more details.

Steps to reproduce

No response

GraphRAG Config Used

# Paste your config here

Logs and screenshots

No response

Additional Information

  • GraphRAG Version:
  • Operating System:
  • Python Version:
  • Related Issues:

krishkarnan avatar Aug 06 '24 08:08 krishkarnan