Objective
- To learn how to deploy monitoring services like Prometheus and Grafana for live performance monitoring.
- To set up Jupyter notebook environments for interactive computing and data analysis on multi-node and GPU systems.
- To integrate more and more services like logging and alerting systems for better management of the systems.
- Make sure that the services which are deployed are highly available and scalable.
- To assess the deployment services performance with the help of monitoring and analytics tools.