community icon indicating copy to clipboard operation
community copied to clipboard

Achieve and add OpenSSF Best Practice Badge for each Kubeflow Project

Open franciscojavierarceo opened this issue 9 months ago • 18 comments

We need to achieve and maintain a Core Infrastructure Initiative Best Practices Badge.

This includes:

  • Kubeflow Trainer
  • Kubeflow Pipelines
  • Kubeflow Spark Operator
  • Kubeflow Katib
  • Kubeflow KServe
  • Kubeflow Notebooks
  • Kubeflow Dashboard
  • Kubeflow Model Registry
  • Kubeflow Platform / manifests

The checklist is available here:

Below is the Graduation Stage Criteria - A list of these requirements are provided below, which were copied from this site: (https://github.com/cncf/toc/blob/main/.github/ISSUE_TEMPLATE/template-graduation-application.md) To graduate from sandbox or incubating status, or for a new project to join as a graduated project, a project must meet the incubating stage criteria plus:

  • [ ] Have achieved and maintained a Core Infrastructure Initiative Best Practices Badge. Github Repo - https://github.com/coreinfrastructure/best-practices-badge

PASSING

Basic project website content

  • [x] The project website MUST succinctly describe what the software does (what problem does it solve?). [description_good]
  • [x] The project website MUST provide information on how to: obtain, provide feedback (as bug reports or enhancements), and contribute to the software. [interact]
  • [x] The information on how to contribute MUST explain the contribution process (e.g., are pull requests used?) {Met URL} [contribution]
  • [x] The information on how to contribute SHOULD include the requirements for acceptable contributions (e.g., a reference to any required coding standard). {Met URL} [contribution_requirements]

FLOSS License

Documentation

  • [x] The project MUST provide basic documentation for the software produced by the project. {N/A justification} [documentation_basics]- [] The project MUST provide reference documentation that describes the external interface (both input and output) of the software produced by the project. {N/A justification} [documentation_interfacetion]

Other

  • [x] The project sites (website, repository, and download URLs) MUST support HTTPS using TLS. [sites_https]
  • [ ] The project MUST have one or more mechanisms for discussion (including proposed changes and issues) that are searchable, allow messages and topics to be addressed by URL, enable new people to participate in some of the discussions, and do not require client-side installation of proprietary software. [discussion]
  • [ ] The project SHOULD provide documentation in English and be able to accept bug reports and comments about code in English. [english]
  • [ ] The project MUST be maintained. [maintained]

CHANGE CONTROL

Public version-controlled source repository

  • [ ] The project MUST have a version-controlled source repository that is publicly readable and has a URL. [repo_public]
  • [ ] The project's source repository MUST track what changes were made, who made the changes, and when the changes were made. [repo_track]
  • [ ] To enable collaborative review, the project's source repository MUST include interim versions for review between releases; it MUST NOT include only final releases. [repo_interim]
  • [ ] It is SUGGESTED that common distributed version control software be used (e.g., git) for the project's source repository. [repo_distributed] Unique Versioning Numbering
  • [ ] The project results MUST have a unique version identifier for each release intended to be used by users. [version_unique]
  • [ ] It is SUGGESTED that the Semantic Versioning (SemVer) or Calendar Versioning (CalVer) version numbering format be used for releases. It is SUGGESTED that those who use CalVer include a micro level value. [version_semver]
  • [ ] It is SUGGESTED that projects identify each release within their version control system. For example, it is SUGGESTED that those using git identify each release using git tags. [version_tags]

Release notes

  • [ ] The project MUST provide, in each release, release notes that are a human-readable summary of major changes in that release to help users determine if they should upgrade and what the upgrade impact will be. The release notes MUST NOT be the raw output of a version control log (e.g., the "git log" command results are not release notes). Projects whose results are not intended for reuse in multiple locations (such as the software for a single website or service) AND employ continuous delivery MAY select "N/A". {N/A justification} {Met URL} [release_notes]
  • [ ] The release notes MUST identify every publicly known run-time vulnerability fixed in this release that already had a CVE assignment or similar when the release was created. This criterion may be marked as not applicable (N/A) if users typically cannot practically update the software themselves (e.g., as is often true for kernel updates). This criterion applies only to the project results, not to its dependencies. If there are no release notes or there have been no publicly known vulnerabilities, choose N/A. {N/A justification} [release_notes_vulns]

REPORTING

Release notes

  • [ ] The project MUST provide a process for users to submit bug reports (e.g., using an issue tracker or a mailing list). {Met URL} [report_process]
  • [ ] The project SHOULD use an issue tracker for tracking individual issues. [report_tracker]
  • [ ] The project MUST acknowledge a majority of bug reports submitted in the last 2-12 months (inclusive); the response need not include a fix. [report_responses]
  • [ ] The project SHOULD respond to a majority (>50%) of enhancement requests in the last 2-12 months (inclusive). [enhancement_responses]
  • [ ] The project MUST have a publicly available archive for reports and responses for later searching. {Met URL} [report_archive] Vulnerability report process
  • [ ] The project MUST publish the process for reporting vulnerabilities on the project site. {Met URL} [vulnerability_report_process]
  • [ ] If private vulnerability reports are supported, the project MUST include how to send the information in a way that is kept private. {N/A allowed} {Met URL} [vulnerability_report_private]
  • [ ] The project's initial response time for any vulnerability report received in the last 6 months MUST be less than or equal to 14 days. {N/A allowed} [vulnerability_report_response]

QUALITY

Working Build System

  • [ ] If the software produced by the project requires building for use, the project MUST provide a working build system that can automatically rebuild the software from source code. {N/A allowed} [build]
  • [ ] It is SUGGESTED that common tools be used for building the software. {N/A allowed} [build_common_tools]
  • [ ] The project SHOULD be buildable using only FLOSS tools. {N/A allowed} [build_floss_tools]

Automated Test Suite

  • [ ] The project MUST use at least one automated test suite that is publicly released as FLOSS (this test suite may be maintained as a separate FLOSS project). The project MUST clearly show or document how to run the test suite(s) (e.g., via a continuous integration (CI) script or via documentation in files such as BUILD.md, README.md, or CONTRIBUTING.md). [test]
  • [ ] A test suite SHOULD be invocable in a standard way for that language. [test_invocation]
  • [ ] It is SUGGESTED that the test suite cover most (or ideally all) the code branches, input fields, and functionality. [test_most]
  • [ ] It is SUGGESTED that the project implement continuous integration (where new or changed code is frequently integrated into a central code repository and automated tests are run on the result). [test_continuous_integration]

New Functionality Testing

  • [ ] The project MUST have a general policy (formal or not) that as major new functionality is added to the software produced by the project, tests of that functionality should be added to an automated test suite. [test_policy]
  • [ ] The project MUST have evidence that the test_policy for adding tests has been adhered to in the most recent major changes to the software produced by the project. [tests_are_added]
  • [ ] It is SUGGESTED that this policy on adding tests (see test_policy) be documented in the instructions for change proposals. [tests_documented_added]

Warning Flags

  • [ ] The project MUST enable one or more compiler warning flags, a "safe" language mode, or use a separate "linter" tool to look for code quality errors or common simple mistakes, if there is at least one FLOSS tool that can implement this criterion in the selected language. {N/A allowed} [warnings]
  • [ ] The project MUST address warnings. {N/A allowed} [warnings_fixed]
  • [ ] It is SUGGESTED that projects be maximally strict with warnings in the software produced by the project, where practical. {N/A allowed} [warnings_strict]

SECURITY

Secure Development Knowledge

  • [ ] The project MUST have at least one primary developer who knows how to design secure software. (See ‘details’ for the exact requirements.) [know_secure_design]
  • [ ] At least one of the project's primary developers MUST know of common kinds of errors that lead to vulnerabilities in this kind of software, as well as at least one method to counter or mitigate each of them. [know_common_errors]

Use basic good cryptographic practices

  • [ ] The software produced by the project MUST use, by default, only cryptographic protocols and algorithms that are publicly published and reviewed by experts (if cryptographic protocols and algorithms are used). {N/A allowed} [crypto_published]
  • [ ] If the software produced by the project is an application or library, and its primary purpose is not to implement cryptography, then it SHOULD only call on software specifically designed to implement cryptographic functions; it SHOULD NOT re-implement its own. {N/A allowed} [crypto_call]
  • [ ] All functionality in the software produced by the project that depends on cryptography MUST be implementable using FLOSS. {N/A allowed} [crypto_floss]
  • [ ] The security mechanisms within the software produced by the project MUST use default keylengths that at least meet the NIST minimum requirements through the year 2030 (as stated in 2012). It MUST be possible to configure the software so that smaller keylengths are completely disabled. {N/A allowed} [crypto_keylength]
  • [ ] The default security mechanisms within the software produced by the project MUST NOT depend on broken cryptographic algorithms (e.g., MD4, MD5, single DES, RC4, Dual_EC_DRBG), or use cipher modes that are inappropriate to the context, unless they are necessary to implement an interoperable protocol (where the protocol implemented is the most recent version of that standard broadly supported by the network ecosystem, that ecosystem requires the use of such an algorithm or mode, and that ecosystem does not offer any more secure alternative). The documentation MUST describe any relevant security risks and any known mitigations if these broken algorithms or modes are necessary for an interoperable protocol. {N/A allowed} [crypto_working]
  • [ ] The default security mechanisms within the software produced by the project SHOULD NOT depend on cryptographic algorithms or modes with known serious weaknesses (e.g., the SHA-1 cryptographic hash algorithm or the CBC mode in SSH). {N/A allowed} [crypto_weaknesses]
  • [ ] The security mechanisms within the software produced by the project SHOULD implement perfect forward secrecy for key agreement protocols so a session key derived from a set of long-term keys cannot be compromised if one of the long-term keys is compromised in the future. {N/A allowed} [crypto_pfs]
  • [ ] If the software produced by the project causes the storing of passwords for authentication of external users, the passwords MUST be stored as iterated hashes with a per-user salt by using a key stretching (iterated) algorithm (e.g., Argon2id, Bcrypt, Scrypt, or PBKDF2). See also OWASP Password Storage Cheat Sheet). {N/A allowed} [crypto_password_storage]
  • [ ] The security mechanisms within the software produced by the project MUST generate all cryptographic keys and nonces using a cryptographically secure random number generator, and MUST NOT do so using generators that are cryptographically insecure. {N/A allowed} [crypto_random]

Secured delivery against man-in-the-middle (MITM) attacks

  • [ ] The project MUST use a delivery mechanism that counters MITM attacks. Using https or ssh+scp is acceptable. [delivery_mitm]
  • [ ] A cryptographic hash (e.g., a sha1sum) MUST NOT be retrieved over http and used without checking for a cryptographic signature. [delivery_unsigned]

Publicly known vulnerabilities fixed

Other security issues

  • [ ] The public repositories MUST NOT leak a valid private credential (e.g., a working password or private key) that is intended to limit public access. [no_leaked_credentials]

ANALYSIS

Static code analysis

  • [ ] At least one static code analysis tool (beyond compiler warnings and "safe" language modes) MUST be applied to any proposed major production release of the software before its release, if there is at least one FLOSS tool that implements this criterion in the selected language. {N/A justification} {Met justification} [static_analysis]
  • [ ] It is SUGGESTED that at least one of the static analysis tools used for the static_analysis criterion include rules or approaches to look for common vulnerabilities in the analyzed language or environment. {N/A allowed} [static_analysis_common_vulnerabilities]
  • [ ] All medium and higher severity exploitable vulnerabilities discovered with static code analysis MUST be fixed in a timely way after they are confirmed. {N/A allowed} [static_analysis_fixed]
  • [ ] It is SUGGESTED that static source code analysis occur on every commit or at least daily. {N/A allowed} [static_analysis_often]

Dynamic code analysis

  • [ ] It is SUGGESTED that at least one dynamic analysis tool be applied to any proposed major production release of the software before its release. [dynamic_analysis]
  • [ ] It is SUGGESTED that if the software produced by the project includes software written using a memory-unsafe language (e.g., C or C++), then at least one dynamic tool (e.g., a fuzzer or web application scanner) be routinely used in combination with a mechanism to detect memory safety problems such as buffer overwrites. If the project does not produce software written in a memory-unsafe language, choose "not applicable" (N/A). {N/A allowed} [dynamic_analysis_unsafe]
  • [ ] It is SUGGESTED that the project use a configuration for at least some dynamic analysis (such as testing or fuzzing) which enables many assertions. In many cases these assertions should not be enabled in production builds. [dynamic_analysis_enable_assertions]
  • [ ] All medium and higher severity exploitable vulnerabilities discovered with dynamic code analysis MUST be fixed in a timely way after they are confirmed. {N/A allowed} [dynamic_analysis_fixed]

Silver Level (see criteria document) Gold Level (see criteria document)

franciscojavierarceo avatar Mar 14 '25 14:03 franciscojavierarceo

@kubeflow/kubeflow-steering-committee

franciscojavierarceo avatar Mar 14 '25 14:03 franciscojavierarceo

Part of: https://github.com/kubeflow/community/issues/655

andreyvelich avatar Mar 14 '25 14:03 andreyvelich

  1. Does the "FLOSS license" section mean that we need SBOMs ?
  2. @franciscojavierarceo I added Kubeflow Platform / manifests and Kubeflow Dashboard
  3. "The release notes MUST identify every publicly known run-time vulnerability fixed in this release that already had a CVE assignment or similar when the release was created.", "The project MUST publish the process for reporting vulnerabilities on the project site.", "If private vulnerability reports are supported, the project MUST include how to send the information in a way that is kept private.", "The project's initial response time for any vulnerability report received in the last 6 months MUST be less than or equal to 14 days." Is something I need to work on. Right now it is me handling this at the platform level, so indirectly benefiting all working groups https://github.com/kubeflow/manifests?tab=readme-ov-file#cve-scanning and we need to add an Email.
  4. "The project MUST use at least one automated test suite that is publicly released as FLOSS (this test suite may be maintained as a separate FLOSS project). The project MUST clearly show or document how to run the test suite(s) (e.g., via a continuous integration (CI) script or via documentation in files such as BUILD.md, README.md, or CONTRIBUTING.md).", "The project MUST have a general policy (formal or not) that as major new functionality is added to the software produced by the project, tests of that functionality should be added to an automated test suite.", "The project MUST have evidence that the [test_policy](https://www.bestpractices.dev/en/criteria#test_policy) for adding tests has been adhered to in the most recent major changes to the software produced by the project." I think that is covered by the platform integration tests at kubeflow/manifests and continuously improved by me and others (GSOC)
  5. "The project MUST have at least one primary developer who knows how to design secure software. (See ‘details’ for the exact requirements.) [[know_secure_design](https://www.bestpractices.dev/en/criteria#0.know_secure_design)]", "At least one of the project's primary developers MUST know of common kinds of errors that lead to vulnerabilities in this kind of software, as well as at least one method to counter or mitigate each of them." I think I am pushing quite well for that :-D I push for container security and hard multi-tenancy and have written Information Security Concepts, Security Blogposts and done penetration tests for Kubeflow.
  6. "The project MUST use a delivery mechanism that counters MITM attacks" is why we have the platform with Cert-manager, Istio and mTLS as dependency plus networkpolicies
  7. "There MUST be no unpatched vulnerabilities of medium or higher severity that have been publicly known for more than 60 days. Projects SHOULD fix all critical vulnerabilities rapidly after they are reported." Well that is also what I am working on, in multiple WGs (Platform, Pipelines, Katib, trainer) and I have added the CVE scanning at platform level so for all working groups last GSOC. https://github.com/kubeflow/manifests/actions/workflows/trivy.yaml
  8. "At least one static code analysis tool (beyond compiler warnings and "safe" language modes) MUST be applied to any proposed major production release of the software before its release, if there is at least one FLOSS tool that implements this criterion in the selected language.", "All medium and higher severity exploitable vulnerabilities discovered with static code analysis MUST be fixed in a timely way after they are confirmed." Is something we need to improve but at least formatting / linting is there at the platform level. https://github.com/kubeflow/manifests/actions/workflows/linting_bash_python_yaml_files.yaml
  9. I think a lot of that is also supporting the updated charter https://github.com/kubeflow/community/pull/837
  10. Most of the remaining items look checkable.

juliusvonkohout avatar Mar 18 '25 12:03 juliusvonkohout

@franciscojavierarceo Can you change the components list to add checkboxes? This makes easy to track which project needs to add the OpenSSF badge.

By the way, this is the PR for Kubeflow Pipelines to add OpenSSF badge. https://github.com/kubeflow/pipelines/pull/11543

rimolive avatar Apr 16 '25 14:04 rimolive

yeah of course @rimolive!

franciscojavierarceo avatar Apr 22 '25 13:04 franciscojavierarceo

@franciscojavierarceo I compared this version with the current website https://www.bestpractices.dev/en/criteria/0 .

I found a few diff. I'm not able to update this issue. Can you make this updates? Thanks. https://github.com/cncf/toc/blob/main/process/graduation_criteria.md#graduation-stage) -> https://github.com/cncf/toc/blob/main/.github/ISSUE_TEMPLATE/template-graduation-application.md BASIC LEVEL -> PASSING documentation -> [documentation_interface] Release Notes -> Bug-reporting process

CC @andreyvelich

varodrig avatar Apr 25 '25 02:04 varodrig

/area cncf-graduation

andreyvelich avatar Apr 29 '25 12:04 andreyvelich

this is the PR for Kubeflow Model Registry that added OpenSSF badge:

  • https://github.com/kubeflow/model-registry/pull/1011

tarilabs avatar May 04 '25 15:05 tarilabs

fyi, thought I'd report back on some progress cc @milosjava

  • as of 12 May 2025, Model Registry at 34%
  • as of 15 May 2025, Model Registry at 55%

tarilabs avatar May 17 '25 07:05 tarilabs

@tarekabouzeid do you want add the badge for Kubeflow platform ?

juliusvonkohout avatar May 19 '25 11:05 juliusvonkohout

@tarekabouzeid do you want add the badge for Kubeflow platform ?

Yeah sure, I will work on that.

tarekabouzeid avatar May 19 '25 11:05 tarekabouzeid

How are we going to add OpenSSF badge for Kubeflow Platform ? Is it different from adding badge for Kubeflow Dashboard ? cc @thesuperzapper

andreyvelich avatar May 28 '25 21:05 andreyvelich

I already have it 75 % done for kubeflow platform and linked here. https://github.com/kubeflow/manifests See also https://www.bestpractices.dev/en/projects/9940

juliusvonkohout avatar May 29 '25 16:05 juliusvonkohout

thanks @juliusvonkohout @tarekabouzeid please keep me posted on the "Vulnerability report process" part, as we would like to align from Model Registry; my understanding the Kubeflow community wanted a shared CVE reporting mechanism, and happy to comply.

tarilabs avatar May 30 '25 08:05 tarilabs

How are we going to add OpenSSF badge for Kubeflow Platform ? Is it different from adding badge for Kubeflow Dashboard ? cc @thesuperzapper

I created a new badge for central dashboard , https://www.bestpractices.dev/en/projects/10673 .

Also will create a similar PR to add the badge for the repo. I added @juliusvonkohout to edit the badge, @thesuperzapper can you please share your id so you can edit the entries as well ?

Update:

Adding OpenSSF badge PR : https://github.com/kubeflow/dashboard/pull/100

tarekabouzeid avatar May 30 '25 14:05 tarekabouzeid

thanks @juliusvonkohout @tarekabouzeid please keep me posted on the "Vulnerability report process" part, as we would like to align from Model Registry; my understanding the Kubeflow community wanted a shared CVE reporting mechanism, and happy to comply.

In the manifest repo we have Trivy workflow, which extract all components images and scan them and can download scan results, example here.

tarekabouzeid avatar May 30 '25 14:05 tarekabouzeid

@franciscojavierarceo @andreyvelich fup on https://github.com/kubeflow/community/issues/834#issuecomment-2888201228 as of today MR at 69% for the "passing badge" cc @milosjava

tarilabs avatar Jun 03 '25 16:06 tarilabs

With a bit flexibility of interpretation we have achieved https://www.bestpractices.dev/en/projects/9940.

juliusvonkohout avatar Jun 03 '25 17:06 juliusvonkohout

✅ Kubeflow Model Registry OpenSSF badge completed (link). cc @milosjava

tarilabs avatar Jul 07 '25 06:07 tarilabs

i think this is done right?

franciscojavierarceo avatar Sep 10 '25 04:09 franciscojavierarceo

Yeah, we can close this one. /close

andreyvelich avatar Sep 10 '25 13:09 andreyvelich

@andreyvelich: Closing this issue.

In response to this:

Yeah, we can close this one. /close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

google-oss-prow[bot] avatar Sep 10 '25 13:09 google-oss-prow[bot]