Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feat: Application Deployment Readiness Scoring #6382

Open
siddhant-khisty opened this issue Feb 10, 2025 · 0 comments
Open

Feat: Application Deployment Readiness Scoring #6382

siddhant-khisty opened this issue Feb 10, 2025 · 0 comments
Labels
enhancement New feature or request priority/low Less important tasks that can be delayed

Comments

@siddhant-khisty
Copy link
Collaborator

siddhant-khisty commented Feb 10, 2025

Description

Often when deploying applications to production environments, DevOps teams need to assess how resilient a workload is for a production environment. This can include things such as its availability, resilience, disaster response, etc. Devtron is looking to implement a mechanism to generate a scorecard that gives a number from 0-100 to denote how ready a workload is for production environments. This scorecard can help a user understand what are the best practices being followed, and what further steps can be taken to improve the production readiness for the workload. Some examples of the criteria include:

  • Pod - Topology Spread to ensure spread across multiple AZs
  • Autoscaling enabled
    • HPA
    • KEDA
  • PDB enabled and disreputable is at least 1
  • Quality of Service (QOS)

Deliverable

Analyse a list of set criteria that will analyze the application and generate a scorecard to determine the production readiness for a deployment.

In the scope
Implement a basic scoring mechanism that assesses readiness based on simple criteria such as

  • Is a PDB or Topology Spread configured?
  • Are the resource quotas well-defined?
  • Does the app have liveness & readiness probes?
  • Is autoscaling configured? Yes/No
  • Separation of workloads across environments such as Dev/UAT/Prod
  • High Availability of Nodes - Master & Worker

Future Scope
In the future, we would like to add additional criteria for creating the score card such as

  • Security Scanning criteria
  • Secret Type - External, CSI
  • Autoscaling based on HPA/VPA/KEDA/Karpenter
  • State of backup automation
  • Multi-region/cluster availability
  • Cluster Health
  • Monitoring setup

Key Competencies

  • Basic Programming
  • Curious enough to learn and implement

Recommended Skills to have:

  • Go or Python. You can choose the language of your choice
  • Kubernetes

Mentors

Skill Level: Easy, Medium
Time: ~175 hrs

@satyampsoni satyampsoni added enhancement New feature or request GSoC Google Summer of Code labels Feb 10, 2025
@satyampsoni satyampsoni added priority/low Less important tasks that can be delayed and removed GSoC Google Summer of Code labels Mar 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request priority/low Less important tasks that can be delayed
Projects
None yet
Development

No branches or pull requests

2 participants