Vacancy expired!
Job Description
- Design, develop, and maintain secure and scalable cloud infrastructure platforms using the latest DevSecOps and Performance Engineering methodologies
- Create and implement best practices and processes for code quality, security, performance, and scalability using Sonarqube, Checkmarx & FOSSA
- Strong experience using Google Cloud Platform specific services like Compute Engine, CloudRun, GKE, Cloud operations suite, Anthos, Pub/Sub, Dataflow, Cloud Scheduler, Bigtable, AlloyDB and managed services.
- Google Cloud infrastructure provisioning including VPC, Subnet, Gateway, Security groups, managed services, Kubernetes Cluster etc.
- Expertise with automating Infrastructure as Code using Terraform, Packer, Ansible, Shell Scripting
- Lead cross-functional teams to drive the adoption of DevSecOps and Performance Engineering best practices across the organization
- Experience in implementing Auto scaling, Disaster Recovery, High Availability, Multi-region Active/Active & Active/Passive configurations & best practices is added advantage.
- Evaluate and select appropriate technologies and tools to support the development and deployment of products on the eCommerce foundation layer
- Collaborate with stakeholders to understand business needs and requirements, and translate them into technical non-functional specifications
- Work with domain teams to understand workload models for each Product and gather performance Requirements.
- Strategize & work with senior leaders across Ford's Enterprise Architecture, IT Operations to make significant, measurable impact on the eCommerce Platform
- Expertise with patch management, APM tools like Dynatrace/AppDynamics, Prometheus, Grafana, ELK/Splunk for monitoring and alerting.
- Team members in US
- Initial team size will include 12+ positions
- Responsible for overall Infrastructure Architecture and evolution of next gen platforms. Ideal candidates will research the existing products and recommend solutions to run workloads in futuristic Infrastructure Architecture landscape
- Conduct Infrastructure as Code reviews, automate and deploy Cloud Infrastructure
- Identify code vulnerabilities and performance bottlenecks, and recommend solutions to improve the overall quality and performance of the systems
- Create and maintain technical documentation, including architecture diagrams, design documents, and operational procedures for High Availability, Disaster Recovery scenarios
- Analyze thread dumps, heap dumps, kernel logs, network stats, APM metrics, application logs to troubleshoot CPU/Memory/Resource hot spots, API latency and application/platform health
- Analyze and identify root-cause and fix complex performance problems involving multiple teams, networks, and software in Google Cloud Platform that relate to scaling and performance
- Build Automation for repeatable DevSecOps tasks and help with improving Software Engineers' productivity
- Mentoring Team members to scale and perform at their next level
- Thought Leadership around Shift Left (Quality, Security, OSS use, Performance Engineering) & Shift Right (Chaos Engineering) and increasing the adoption in the eCommerce Platform