Skip to content
This repository has been archived by the owner on May 20, 2023. It is now read-only.

234304 - Project in Artificial Intelligence U

Notifications You must be signed in to change notification settings

Ofir-Shechtman/PrometheusAI

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PrometheusAI

Table Of Contents:

  1. Register to OperateFirst
  2. Connect to Slack
  3. JupiterHub
  4. Intro to Prometheus Client
  5. Generate Personal access token
  6. How to run our project
  7. Other useful links

Predicting the Future Prometheus Metrics of a Kubernetes Cluster

Kubernetes is a leading open source framework for container orchestration - the process of automating deployment and managing applications in a containerized and clustered environment. At its basic level, Kubernetes runs and coordinates container-based applications across a cluster of machines. It is designed to completely manage the life cycle of containerized applications and services using methods that provide predictability, scalability, and high availability. A Kubernetes user provides the applications to be deployed in the form of pods - basic units of execution containing one or more tightly-coupled containers. These pods are then assigned to nodes, that represent actual computing resources such as physical or virtual servers. A collection of nodes forms a Kubernetes cluster, managed by the control plane. One of the most important tasks of a Kubernetes cluster admin is to constantly monitor the status of the cluster and react accordingly when situations of interest occur or anomalies in the workflow are detected (such as an application failure or a malfunctioning node). The de facto standard monitoring software framework utilized to that end is called Prometheus. As illustrated in the figure below, Prometheus collects real-time metrics from application services and nodes (up to millions of metrics per second), stores them in a time-series database, and provides a convenient visualization tool for the user to examine, query and analyze the data. image

While Prometheus is a very powerful monitoring system capable of high-quality real-time data gathering, it entirely lacks one important capability: predicting the expected values of future metrics. By detecting and observing repeatedly occurring patterns and trends in the monitored data and learning from them, it might be possible to not only detect abnormal conditions and situations, but also to forecast their occurrence in advance, thus providing an admin with sufficient time to prevent the possibly harmful or otherwise undesired event. The goal of this project is to fill this gap by implementing a predictive component for Prometheus metrics. This component will incorporate a variety of well-known time series forecasting (TSF) algorithms, based on statistical methods, deep neural networks, or their combination. It will receive as input a stream of Prometheus updates (i.e., files containing the last recorded values for the monitored metrics) and generate a stream of predicted future values for all involved metrics.

About

234304 - Project in Artificial Intelligence U

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages