Sr Operational Excellence Engineer
Blue Yonder
Date: 1 week ago
City: Hyderabad
Contract type: Full time
Overview:
If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success – and the success of our customers. Does your heart beat like ours? Find out here: Core Values Diversity, Inclusion, Value & Equity (DIVE) is our strategy for fostering an inclusive environment we can be proud of. Check out Blue Yonder's inaugural Diversity Report which outlines our commitment to change, and our video celebrating the differences in all of us in the words of some of our associates from around the world All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
- Seeking an experienced and highly skilled Lead Operational Excellence Engineer to join our team as we transform the world through our industry-leading cloud-based supply chain solutions.
- Operational Excellence is about improving our resilience and reliability of our cloud platform, ensuring our customers have the best possible experience and that the service is always up to meet their needs.
- This role focuses on leading the enhancement of our operational processes and tools, driving a DevOps culture, and ensuring continuous improvements in reliability and operational efficiency across various functional teams
- The team is completely new based in India and we’re hiring for roles at multiple levels. The incumbent will need to have leadership qualities because their role involves working cross-functionally with other teams to help drive process and automation improvements.
- Cloud Architecture: Microsoft Azure
- Observability: Elastic, Azure, Grafana, ELK
- Application Architecture: Scalable, Resilient, event driven, observable and secure multi-tenant Microservices architecture
- Infrastructure Architecture: Blue Green Deployments, High Availability, Disaster Recovery
- Frameworks/Others: Kubernetes, Docker, Kafka, Elasticsearch, GitHub CI/CD, ArgoCD, Argo Rollout, Crossplane, Prometheus
- Lead the definition and execution of high availability and disaster recovery test plans.
- Oversee the implementation and management of infrastructure change control processes.
- Drive enhancements in on-call and engineer engagement processes.
- Lead the development and maintenance of common operational dashboards and widgets.
- Establish and enforce guidelines for monitoring and alarming, mentoring teams in their implementation.
- Develop and manage processes for reporting operational impacts.
- Lead improvements in incident response processes and root cause analysis documentation.
- Automate the collection of service maturity data to support continuous delivery promotions.
- Collaborate with cross-functional teams to drive a DevOps culture and best practices.
- Identify gaps and implement tools, processes, and best practices for continuous improvement in reliability and operational excellence.
- Mentor junior engineers and provide technical leadership.
- Bachelor's degree in Computer Science, Engineering, or a related field; Master's degree preferred.
- 4.6 to 7.6 years of experience only in Site Reliability Engineering, DevOps, or a related field.
- Proven leadership experience with a track record of leading projects and mentoring teams.
- Strong understanding of cloud platforms, particularly Microsoft Azure.
- Extensive experience with high availability and disaster recovery planning and execution.
- Proficiency in infrastructure change control processes and tools.
- Expertise in on-call management and paging tools like Opsgenie or Pager Duty.
- Experience in defining and implementing monitoring, alarming, and operational dashboards.
- Strong problem-solving skills and a proactive approach to identifying and addressing operational issues.
- Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams.
- Passion for continuous improvement and operational excellence.
- Experience with supply chain solutions or similar industries.
- Certification in Microsoft Azure or related cloud platforms.
- Familiarity with incident management tools and processes.
- Knowledge of automation tools and scripting languages
If you want to know the heart of a company, take a look at their values. Ours unite us. They are what drive our success – and the success of our customers. Does your heart beat like ours? Find out here: Core Values Diversity, Inclusion, Value & Equity (DIVE) is our strategy for fostering an inclusive environment we can be proud of. Check out Blue Yonder's inaugural Diversity Report which outlines our commitment to change, and our video celebrating the differences in all of us in the words of some of our associates from around the world All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
How to apply
To apply for this job you need to authorize on our website. If you don't have an account yet, please register.
Post a resumeSimilar jobs
Network System Administrator - ITG (Switching and Routing)
Blue Yonder,
Hyderabad
1 week ago
Blue Yonder Title:
Network System Administrator (ITG)
Other Comparable titles:
Systems Administrator, Systems Engineer, IT Support Engineer.
Overview:
Leading AI-driven Global Supply Chain Solutions Software Product Company and one of Glassdoor’s “Best Places to Work”
Seeking an astute individual that has a strong technical foundation with the additional ability to be hands-on with the broader engineering team as part of...
Developer III - Software Engineering
UST Global,
Hyderabad
1 week ago
3 - 5 Years
1 Opening
Hyderabad
Role description
Role Proficiency:
Independently develops error free code with high quality validation of applications guides other developers and assists Lead 1 – Software Engineering
Outcomes:
Understand and provide input to the application/feature/component designs; developing the same in accordance with user stories/requirements.
Code debug test document and communicate product/component/features at development stages.
Select...
Software Development Engineer III
f5,
Hyderabad
2 weeks ago
At F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation.
Everything we do centers around people. That means we...