In this role, we look forward to your managing very large-scale, highly-available Cloud and Big Data Platforms supporting exabytes of data for Analytics.
- Be a key part of the design, architecture, instrumentation and delivery of a massive data platform empowering Apple’s products.
- Work with cross-functional teams to solve challenging operational problems across a broad range of Apple manufacturing services.
- Lead innovation by exploring, investigating, instrumenting, recommending, benchmarking and implementing data centric technology solutions for the platform.
Provide hardware architectural guidance, planning, estimating cluster capacity, and creating roadmaps
Responsibilities.
Infrastructure Management: Design, implement, and maintain cloud infrastructure on AWS and GCP, leveraging best practices to ensure high availability, scalability, and resilience.
CI/CD Pipeline Development: Develop and optimize CI/CD pipelines for seamless deployments and efficient collaboration between development and operations.
Monitoring and Alerting: Set up, maintain, and continuously improve monitoring, alerting, and logging solutions to ensure application health, using tools like Prometheus, Grafana, CloudWatch, and Splunk.
Automation and Scripting: Build and manage infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation for automated provisioning and configuration management.
Database Management: Oversee the maintenance, backup, and performance tuning of Postgres databases to ensure data reliability and accessibility.
Data Processing Pipeline Support: Collaborate with the data engineering team to deploy and maintain data processing workflows in Spark and Trino, optimizing for performance and scalability.
Security and Compliance: Implement and maintain security best practices, including access control, network security, and data encryption, ensuring compliance with industry standards.
Troubleshooting and Optimization: Provide support to resolve infrastructure and application performance issues, conducting root cause analysis and implementing long-term solutions.