We are looking for 1-2 years of experienced candidates for this role
Rotational Shift timing:
6 am to 2 pm
2 pm to 11 pm
11 pm to 6 am
Responsibilities include:
Work closely with the application support team.
Monitor critical applications and services to minimize downtime and ensure their availability.
Collaborate with DevOps teams to maintain and monitor CI/CD pipelines.
Deploy new versions to production environments.
Work with project teams to ensure the reliability and maintainability of new and modified releases.
Provide input to risk management practices that will anticipate reliability-related incidents that could adversely impact operations.
Document processes and monitor application performance metrics.
Continuously improve proactive monitoring alert configuration and incident response processes to increase reliability and reduce Mean Time to Recovery (MTTR ).
Optimize performance and cost efficiency through continuous monitoring, trend analysis, and fine-tuning.
Monitor any abnormal usage that can impact the cost or performance and take corrective actions.
Proactively implement preventive measures to improve system reliability.
Maintain runbooks, Standard Operating Procedures (SOPs), diagrams, and documentation for swift incident response.
Conduct post-incident reviews to improve reliability and contribute to the development of resilience strategies.
Achieve Service Level Indicators (SLIs) that are set to meet reliability objectives.
Certifications :
Bachelor’s degree or an equivalent professional qualification.
Primary Skills :
Experience in SRE/DevOps with a focus on Ops.
1-2 years of experience in AWS Cloud Infrastructure.
Familiarity with CI/CD pipelines and version control systems.
Experience in Project Management and issue tracking tools such as JIRA/SysAid.
Technical Knowledge
Good Understanding of AWS key services (EBS, S3, AWS Compute, Storage, RDS etc).
Good Understanding Kubernetes or any Container Orchestration System.
Knowledge of Infrastructure as a Code.
Linux system administration knowledge.
Knowledge of RDBMS and Document databases.
Knowledge of Monitoring tools including AWS CloudWatch and NewRelic.
Additional certification in Microsoft, Linux, Cisco, AWS or similar technologies is a plus.
Soft Skills :
Communication
Customer Centricity
Business & Market Acumen
Psychological Safety
Empathy
Growth Mindset & Learning Agility
Ethical and Vigilant
Digital Mindset
Operational Excellence
Teamwork
Analytical thinking
Job Details
Role:
Site Reliability Engineer (SRE)
Location :
Trivandrum, Chennai
Close Date :
06-12-2024
Interested candidates may forward their detailed resumes to Careers@reflectionsinfos.com along with their notice period, current and expected CTC details. This is to notify jobseekers that some fraudsters are promising jobs with Reflections Info Systems for a fee. Please note that no payment is ever sought for jobs in Reflections. We contact our candidates only through our official website or LinkedIn and all employment related mails are sent through the official HR email id. Please contact careers@reflectionsinfos.com for any clarification/ alerts on this subject.