Minimum 7 years of work experience as an SRE (not Traditional Production Support) covering integration platforms on cloud-based deployments
Coding / automation scripting experience in any programming language, particularly for integration tier and middleware
Working as a DevOps Engineer or SRE in mission critical applications and infrastructure
Working experience with GCP (Google Cloud), particularly with GKE is important
Working with AppDynamics and Splunk for monitoring and setting up observability is key
CI CD tool chains, setting up and running deployment pipelines and propagating changes on different environments
Core Capabilities
Maintaining middleware such as Kafka (open source) and MQ as well as application servers (Tomcat)
Maintain Hazelcast Data storage platform clusters and Control M job schedulers
GCP and private-cloud operational support / administration activities such as provision, capacity management, reliability management, monitoring, restoration, etc
Kubernetes cluster management, monitoring and remediation. Knowledge of Docker is important
Automating deployments and scripting self-healing workflows based on telemetry
Define SLIs and configure SLOs, respond to threshold alerts and optimize monitoring capability
Work with code as well as configuration artifacts to debug and fix issues that may arise
Knowledge of applying SRE practices to daily operations is key
Must be inclined to work on proof-of-concept solutions to optimize reliability such as those incorporating AI models for event correlation and assisted triaging
Ability to work in shifts in office is mandatory; this is a 24 / 7 on-desk operation
Qualification
Computer Science and or Engineering degrees are preferred
SRE Foundation certification by DevOps Institute or any other equivalent certification on SRE by a recognized body is mandatory
CKA certification
GCP Cloud Digital Leader certification at a minimum is mandatory; Cloud Engineer level is a bonus
Hazelcast Platform Operations certification badge
Role & Responsibilities
Work as part of a 24 / 7 on-desk team in shifts that will manage middleware and associated applications that are being consumed globally incident, change, event, problem management
Debugging integrations and consumers at the code level
Work with CI CD pipelines and automate new change rollouts. Change deployment and sanity testing is part of the scope
Set up and configure an observability product, preferably AppDynamics or Splunk for end-to-end traceability and log analytics
Be the guardian to ensure high reliability of the applications, middleware, storage platforms, scheduler (and its jobs) and underlying cloud infrastructure
Define and set up SLIs as well as SLOs while continuously refining thresholds
Set up anomaly detection and auto-remediation workflows
Ensure all alerts and incidents within scope are actioned upon before breaching SLOs
Minimum 7 years of work experience as an SRE (not Traditional Production Support) covering integration platforms on cloud-based deployments
Coding / automation scripting experience in any programming language, particularly for integration tier and middleware
Working as a DevOps Engineer or SRE in mission critical applications and infrastructure
Working experience with GCP (Google Cloud), particularly with GKE is important
Working with AppDynamics and Splunk for monitoring and setting up observability is key
CI CD tool chains, setting up and running deployment pipelines and propagating changes on different environments
Core Capabilities
Maintaining middleware such as Kafka (open source) and MQ as well as application servers (Tomcat)
Maintain Hazelcast Data storage platform clusters and Control M job schedulers
GCP and private-cloud operational support / administration activities such as provision, capacity management, reliability management, monitoring, restoration, etc
Kubernetes cluster management, monitoring and remediation. Knowledge of Docker is important
Automating deployments and scripting self-healing workflows based on telemetry
Define SLIs and configure SLOs, respond to threshold alerts and optimize monitoring capability
Work with code as well as configuration artifacts to debug and fix issues that may arise
Knowledge of applying SRE practices to daily operations is key
Must be inclined to work on proof-of-concept solutions to optimize reliability such as those incorporating AI models for event correlation and assisted triaging
Ability to work in shifts in office is mandatory; this is a 24 / 7 on-desk operation
Qualification
Computer Science and or Engineering degrees are preferred
SRE Foundation certification by DevOps Institute or any other equivalent certification on SRE by a recognized body is mandatory
CKA certification
GCP Cloud Digital Leader certification at a minimum is mandatory; Cloud Engineer level is a bonus
Hazelcast Platform Operations certification badge
Role & Responsibilities
Work as part of a 24 / 7 on-desk team in shifts that will manage middleware and associated applications that are being consumed globally incident, change, event, problem management
Debugging integrations and consumers at the code level
Work with CI CD pipelines and automate new change rollouts. Change deployment and sanity testing is part of the scope
Set up and configure an observability product, preferably AppDynamics or Splunk for end-to-end traceability and log analytics
Be the guardian to ensure high reliability of the applications, middleware, storage platforms, scheduler (and its jobs) and underlying cloud infrastructure
Define and set up SLIs as well as SLOs while continuously refining thresholds
Set up anomaly detection and auto-remediation workflows
Ensure all alerts and incidents within scope are actioned upon before breaching SLOs
Teamwork, quality of life, professional and personal development: values that Virtusa is proud to embody. When you join us, you join a team of 36,000 people globally that cares about your growth — one that seeks to provide you with exciting projects, opportunities and work with state of the art technologies throughout your career with us.
Great minds, great potential: it all comes together at Virtusa. We value collaboration and the team environment of our company, and seek to provide great minds with a dynamic place to nurture new ideas and foster excellence.
Virtusa is an Equal Opportunity Employer. All applicants will receive fair and impartial treatment without regard to race, color, religion, sex, national origin, ancestry, age, legally protected physical or mental disability, protected veteran status, status in the U.S. uniformed services, sexual orientation, gender identity or expression, marital status, genetic information or on any other basis which is protected under applicable federal, state or local law.
Applicants may be required to attend interviews in person or by video conference. In addition, candidates may be required to present their current state or government-issued ID during each interview. All candidates must be authorized to work in the USA.
Learn more
Have any questions?
To join our bright team of professionals, you can apply directly to our website under the Careers tab and search all open jobs. https://www.virtusa.com/careers
Yes, you can. Virtusa gives you the flexibility to apply for multiple open positions that excite you about your future and align to your experience and career goals.
Yes, you can. Virtusa is a global Company, and we serve our clients through our global delivery model.
Our dedicated recruitment team will review your online application and match it to all our open jobs. We update our open jobs on a daily basis and encourage you to check back often.
Our team of recruiters will review your application, relevant job experience, and skills to appropriately align it to our open jobs. From there, the recruitment team will contact the qualified candidate to start the interview process.
Want to explore the ways you can engineer your career in technology? Our thought leaders share key career insights for candidates from entry-level job seekers to senior technologists.
Check your downloads folder for files and implementation instructions.
Assets are now available in your profile for future editing and use.