At Amazon Web Services (AWS), were building out our team to help our Global Financial Services (GFS) customers plan, build and mature their ability to ensure their Production environments and applications running on AWS meet their intended business outcomes and are resilient.
A Operations Reliability Lead assists Financial Services and other enterprise level customers in creating a resilient ongoing run function for their cloud environment, by leveraging automation, production KPIs, predictive analytics, AI/ML. They help to define operating models as a means for delivering live production workloads, deliver solutions through technologies such as Service Catalog, CloudWatch, CloudFormation and other Management Tools, and their integration with third-party products. This person will be the thought leader within GFS, setting and implementing strategy, deploying best practices, architecture & design approaches.
The consultant works with the customers development, SRE, operations, change, and service management teams to identify, define, and build the expected operational resiliency. They will focus on and set best practices, direction, and drive operations as it relates to resiliency, using automation. They will be responsible for defining & driving proactive support best practices for our customers, using predictive analysis and AI/ML.
ESTABLISHED TECHNICAL LEADERSHIP
As a Operations Consultant you will lead complex projects with autonomy and discretion, often involving multiple Amazon and customer teams. You will work with customers and partners.
We are looking for:
Passionate operations advocate of Cloud technologies looking to work on strategic, enterprise level customer AWS projects
Seasoned leaders who have been hands-on, production experience and can provide thought leadership, architectural definition and direction. A passion for data and automation
Senior individuals that have run operations teams, worked on company transformations and managed critical systems, who are looking to use their own experience to benefit some of the worlds biggest financial institutions
Previous deep experience creating automation within: SRE, support, change management organizations, devops, development pipelines
Deep understanding and embracing of operational KPIs, defining and using KPIs to drive continuous improvement, automation, proactive support models
Cloud transformation leaders wanting to go deep on cloud operating models, operations capabilities as they relate to system resiliency
RESPONSIBILITIES AND ABILITIES
Collaborate with other specialist consultants, account and sales teams, Engagement Managers, and training and support teams to help customers and partners build production ready environments on AWS
Solutions - Define and deliver on-site Professional Services engagements with partners and customers
Delivery - Engagements include on-site, semi-remote or remote projects to plan, build and mature AWS operations capabilities
Insights - Work with AWS engineering and support teams to convey partner and customer needs and feedback as input to technology roadmaps
Partnering Work with new vendors to help them become MSPs and enable and upskill existing partners
Experienced with the use of automation in the context of IT operations
Hands-on experience with key operations technologies such as: Monitoring (NewRelic, DataDog, AWS X-Ray, AppDynamics, NetCool, Zabbix etc.), Alerting (PagerDuty, NetCool, OpsGenie etc.), ITSM (ServiceNow, Jira Service Desk etc.), Scripting (Powershell, Bash, Batch files etc.), Dashboarding (Graphana, Kibana, Prometheus, Zabbix etc.),Logging (Elastic, Splunk etc.)
Understanding of enterprise IT operational capabilities examples include Change, Release, Incident Management, infrastructure management or applications management
Track record of hands-on delivery of processes, procedures or technical solutions e.g., Runbooks, ITSM Processes, governance or monitoring\alerting scripts, automation
Understanding of modern application delivery (such as , DevOps, CI\CD Pipelines etc.) methods and how to transition operations from traditional approaches to supporting product lead teams
An ideal candidate can lead discussions on areas such as:
How cloud operations and infrastructure teams function, their core responsibilities and interfaces
The change in responsibility and accountability that comes with making the most of cloud transformation program
How to transition ITIL processes from on premise to Public Cloud
How to translate theoretical models into customer needs without overlooking nuance such as institutional memory and culture
Demonstrated ability to think strategically about the business, product, and technical challenges of operating enterprise production environments
Familiarity with Application Delivery frameworks and approaches (COTS, DevOps, CI/CD, Waterfall)
Understanding of a public cloud platform from an operations perspective, experience of running transactional systems at scale ($1bn+) and managing multi-service complex environments
Infrastructure delivery knowledge, skills and experience
Experience of leading teams with a mix of technical and
One or more certification with an emphasis on public cloud (AWS Cloud Practitioner, AWS Solutions Architect Associate, AWS SysOps Administration Associate, Microsoft Certified: Azure Administrator Associate, GCP Associate Cloud Engineer etc.)
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.