- Work closely with development teams to ensure that platforms are designed with Observability in mind
- Gain deep knowledge of both our complex internally developed applications and enterprise-class services
- Familiar with standard concepts, practices, and procedures within Software Engineering and AWS
- Support teams in adding OTEL attributes and configurations, using the open-source OpenTelemetry (OTEL) documentation
- Produce high-quality dashboards and streams to showcase primary service flows with distributed tracing
- Support teams onboarding into our distributed tracing platform, Lightstep.
- Ability to query traces to find application issues
- Comfortable reviewing code, championing issues, and shaping project roadmaps and backlogs
- Continuously look for opportunities to automate changes and implement them
- Implement and improve monitoring and alerting
- Build, automate, and improve observability dashboards to provide better visibility into the operational aspects of the systems
- Function well in a fast-paced, rapidly changing environment
- B.S. or higher in Computer Science or other technical discipline, or related practical experience
- Monitoring/logging tools, techniques, and configuration
- OpenTelemetry experience with code-level instrumentation
- Development languages: Java, Python, and/or Node.js
- Excellent written and verbal communication skills
- 3+ years of experience in AWS
- 5+ years of hands-on experience with several of these example technologies:
- EC2, ElastiCache, CloudFront, Auto Scaling, Containers, API gateways
- Networking: Load balancers (ALB/ELB), SSL/TLS, DNS, Firewall
- AWS certification or AWS experience with Lambda, ECS, and Fargate
- Observability tool experience (APM)
- Experience with AWS X-Ray or other similar tools.
- Monitoring: Splunk, Grafana, Dynatrace, Datadog, LogicMonitor
- Scripting languages: Python, PowerShell, Bash; specifically for systems automation
- Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, Engineers, Operators, Product Managers, etc.
- Problem solver with excellent written and interpersonal skills; ability to make sound, complex recommendations in a fast-paced, technical environment
- Humble, collaborative, team player, willing to step up and support your colleagues
- Effective communication, problem-solving, and interpersonal skills
- Commitment to deepening your knowledge and understanding of how to improve our existing applications
- Enthusiasm for cutting-edge technologies, complex problems, and building things
- Good command of English (speaking, writing, and reading)