office-hours-queue

Completed Newbie Project

Open ethanwang-04 opened this issue 2 years ago • 3 comments

Implemented document file upload, parsing, text extraction, and vector upload to Pinecone with metadata. Also implemented vector retrieval from Pinecone. @trangiabach

ethanwang-04 • Oct 27 '23 22:10
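To make the upload-with-metadata and retrieval steps described above concrete, here is a minimal sketch. It is not the code from this PR and it does not use the Pinecone client: `InMemoryIndex` and `_cosine` are hypothetical names for a toy in-memory stand-in that ranks stored vectors by cosine similarity, which is only assumed to approximate what the hosted index does.

```python
import math
from dataclasses import dataclass, field


def _cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


@dataclass
class InMemoryIndex:
    """Toy stand-in for a hosted vector index; NOT the Pinecone client."""

    records: dict = field(default_factory=dict)

    def upsert(self, vec_id: str, values: list[float], metadata: dict) -> None:
        # Store (or overwrite) a vector together with its metadata.
        self.records[vec_id] = (values, metadata)

    def query(self, vector: list[float], top_k: int = 3) -> list[tuple[str, float, dict]]:
        # Score every stored vector against the query, highest similarity first.
        scored = [
            (vec_id, _cosine(vector, values), metadata)
            for vec_id, (values, metadata) in self.records.items()
        ]
        scored.sort(key=lambda item: item[1], reverse=True)
        return scored[:top_k]


# Usage: two document chunks with made-up 2-d embeddings and source metadata.
index = InMemoryIndex()
index.upsert("syllabus-0", [1.0, 0.0], {"source": "syllabus.pdf", "chunk": 0})
index.upsert("hw1-0", [0.0, 1.0], {"source": "hw1.pdf", "chunk": 0})
best_id, score, meta = index.query([0.9, 0.1], top_k=1)[0]
```

Keeping the source file name and chunk number in the metadata is what lets a retrieved vector be traced back to the document it came from.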

⚠️ GitGuardian has uncovered 4 secrets following the scan of your pull request.

Please consider investigating the findings and remediating the incidents. Failure to do so may lead to compromising the associated services or software components.

🔎 Detected hardcoded secrets in your pull request
| GitGuardian id | GitGuardian status | Secret | Commit | Filename |
| --- | --- | --- | --- | --- |
| 8604658 | Triggered | Generic High Entropy Secret | 8f873f3934fb4bcda1f9545ca4dbf41ac9d0983e | backend/ohq/vector_db.py |
| 8604658 | Triggered | Generic High Entropy Secret | 8f873f3934fb4bcda1f9545ca4dbf41ac9d0983e | backend/ohq/vector_db.py |
| 8604658 | Triggered | Generic High Entropy Secret | 82b8c3e15c46118c3c0f732ad4341d7d4037022b | backend/ohq/vector_db.py |
| 8604658 | Triggered | Generic High Entropy Secret | 82b8c3e15c46118c3c0f732ad4341d7d4037022b | backend/ohq/vector_db.py |
🛠 Guidelines to remediate hardcoded secrets
  1. Understand the implications of revoking this secret by investigating where it is used in your code.
  2. Replace and store your secrets safely, following best practices.
  3. Revoke and rotate these secrets.
  4. If possible, rewrite git history. Rewriting git history is not a trivial act. You might completely break other contributing developers' workflow and you risk accidentally deleting legitimate data.

To avoid such incidents in the future consider


🦉 GitGuardian detects secrets in your source code to help developers and security teams secure the modern development process. You are seeing this because you or someone else with access to this repository has authorized GitGuardian to scan your pull request.

gitguardian[bot] • Oct 27 '23 22:10
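For the hardcoded secrets flagged in backend/ohq/vector_db.py, one common way to follow the "replace and store your secrets safely" step is to read the key from the environment at startup instead of committing it. The sketch below is an assumption about how that could look, not the fix applied in this PR; the helper name `load_api_key` and the variable name `PINECONE_API_KEY` are hypothetical.

```python
import os


def load_api_key(var_name: str = "PINECONE_API_KEY") -> str:
    """Return the API key from the environment, failing loudly if it is unset.

    The variable name is an assumed convention; use whatever the deployment defines.
    """
    key = os.environ.get(var_name, "").strip()
    if not key:
        raise RuntimeError(
            f"{var_name} is not set; export it in your shell or load it from an untracked env file"
        )
    return key
```

Failing at startup with a clear message is a deliberate choice here: a missing key then surfaces immediately rather than as an authentication error deep inside an upload. Moving the key out of the source does not undo the leak, so the exposed keys still need to be revoked and rotated as the guidelines above say.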

Implemented document file upload, parsing, text extraction, and vector upload to Pinecone with metadata. Also implemented vector retrieval from Pinecone. @trangiabach

@ethanwang-04 Overall, good job on this PR (cool stuff). I left a few comments here and there, and I'll add more as I spot other possible fixes!

trangiabach • Oct 28 '23 18:10

Implemented document file upload, parsing, text extraction, and vector upload to Pinecone with metadata. Also implemented vector retrieval from Pinecone. @trangiabach

I'd encourage writing some tests to make sure the vector database models work correctly, especially when parsing different kinds of documents into chunks.

trangiabach • Nov 10 '23 17:11
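As a rough illustration of the kind of test suggested above, the sketch below pairs a simple chunker with two pytest-style tests. The PR's actual parsing code is not shown in this thread, so `chunk_text` and its fixed-size, overlapping character windows are assumptions made for the example; real tests would call the project's own chunking function.

```python
def chunk_text(text: str, chunk_size: int = 200, overlap: int = 20) -> list[str]:
    """Split text into fixed-size character chunks that overlap by `overlap` characters."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    if not text:
        return []
    step = chunk_size - overlap
    # Stop once the remaining tail is already covered by the previous chunk's overlap.
    last_start = max(len(text) - overlap, 1)
    return [text[i:i + chunk_size] for i in range(0, last_start, step)]


def test_chunks_overlap_and_cover_text():
    chunks = chunk_text("0123456789", chunk_size=4, overlap=1)
    assert chunks == ["0123", "3456", "6789"]
    # Dropping each later chunk's overlapping prefix should rebuild the input.
    assert chunks[0] + "".join(c[1:] for c in chunks[1:]) == "0123456789"


def test_empty_document_yields_no_chunks():
    assert chunk_text("") == []
```

Similar cases could be added per document type (for example, text extracted from a PDF versus a plain-text file), each asserting that no text is lost between extraction and chunking.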