Implement production systems infrastructure to meet the growing needs of the company.
Automate and maintain our continuous delivery pipeline for consistent software releases.
Scale infrastructure up as needed.
Perform root cause analysis of production issues and report recommendations for detecting future issues more quickly and for preventing future failures, whether through process or technology improvements.
Manage backups and disaster recovery, including backup monitoring and verification, and lead restoration tests and disaster recovery drills.
What makes you a great fit:
Knowledge of production operations and best practices.
Experience with more than one end-to-end DevOps cycle in previous projects.
Ability to effectively prioritize work with fast-changing requirements.
Strong background in managing Linux/Unix systems.
Knowledge of and experience with scripting languages - Python, Bash.
Excellent knowledge of Amazon Web Services products (EC2, ECS, ElastiCache, Route 53, VPC/private cloud configurations, and others).
Experience with Infrastructure as Code - CloudFormation, Terraform [Must].
Experience with MySQL, Nginx.
Experience with CI/CD platforms such as Jenkins.
Experience with Version Control systems - Git.
Experience with Configuration Management Systems - Ansible, Chef [Must], etc.
Experience with Monitoring Platforms - Nagios, Grafana, EFK, New Relic, etc.
Experience with maintaining and running large-scale web apps.
Experience with microservices - container technologies, Docker [Must].
Experience with secrets management tools - Vault.
Experience with service discovery and configuration tools - Consul.
Experience handling and analyzing large volumes of logs and performing anomaly detection.
Experience analyzing and resolving complex infrastructure resource and application deployment issues.