connectors icon indicating copy to clipboard operation
connectors copied to clipboard

[Github] Add support for `mdx` files

Open spong opened this issue 1 year ago • 2 comments

Problem Description

Currently the Github connector only supports syncing documents with .markdown, .md, and .rst file extensions as per the docs. I've been working to add support for exposing Search Connector indices as Knowledge Base content to the Security Assistant within Kibana and was hoping to make the Kibana Documentation available for reference, however since we've moved to using the mdx format for our docs it is not possible to ingest and embed these documents using the Github connector.

Proposed Solution

Add support for syncing and parsing mdx files. In conversations, it seems there might be compatibility issues at the Apache Tika layer, so might be more involved than just adding the file extension on within the Github connector.

Alternatives

Right now the workaround is to make a new branch on the repo and do a bulk rename mdx->md, which is not ideal for widespread adoption.

spong avatar Jun 12 '24 22:06 spong

@danajuratoni is there anything blocked from our end or we are waiting for this issue https://issues.apache.org/jira/browse/TIKA-4269 to get done?

moxarth-rathod avatar Jul 08 '24 12:07 moxarth-rathod

@DianaJourdan Could you please check this & update?

khushbu-elastic avatar Jul 12 '24 06:07 khushbu-elastic