Reports to: Senior Director, Production Engineering
Job Location: Los Angeles/San Diego/Palo Alto, California
Job Status: Exempt, FT
About SHEIN
SHEIN is a global fashion and lifestyle e-retailer committed to making the beauty of fashion accessible to all. We use on-demand manufacturing technology to connect suppliers to our agile supply chain, reducing inventory waste and enabling us to deliver a variety of affordable products to customers around the world. From our global offices, we reach customers in more than 150 countries.
Founded in 2012, SHEIN has nearly 10,000 employees operating from offices around the world, with U.S. Headquarters located in Los Angeles and Global Headquarters located in Singapore. In SHEIN, we work with outstanding, creative, and capable peers. We share an energetic and open culture for capable people to discern, work and ignite as a team.
Position Summary
We are looking for a Director, Production Engineering to join our Production Engineering team. As a leader of this team, you will have an opportunity to shape Shein's production environment and culture as we grow. Production Engineering team at SHEIN are hybrid software/systems engineers whose overarching goal is to ensure that Production Services are "Always On." They strive to build the most reliable and performant systems on the planet. They are tasked with driving forward the operability of the platform to drive down the number of incidents while reducing MTTR. To accomplish this, the team combines software development, networking and systems engineering expertise, and a strong desire to be challenged by problems of scale and complexity to make our service better for our customers. If you are someone who takes ownership and can achieve high-level goals without clearly defined solutions, then we have a place for you here.
Job Responsibilities
Leadership: Provide strategic direction and leadership to the production engineering team, fostering a culture of innovation, collaboration, and accountability.
Infrastructure Management: Oversee the design, implementation, and optimization of our infrastructure, including servers, networks, databases, and cloud services, to support high-traffic e-commerce operations.
Performance Optimization: Continuously monitor and optimize system performance, implementing proactive measures to enhance scalability and reliability.
Automation: Drive automation initiatives to streamline operations, improve efficiency, and reduce manual intervention in system maintenance and deployment processes.
Incident Management: Develop and implement robust incident management processes to ensure timely resolution of production issues and minimize downtime.
Cross-functional Collaboration: Collaborate with product management, software engineering, and operations teams to align infrastructure initiatives with business objectives and project timelines.
Security and Compliance: Ensure that our infrastructure meets industry standards for security and compliance, implementing best practices and protocols to safeguard customer data and company assets.
Team Development: Mentor and develop team members, providing guidance on technical skills development, career growth, and performance management.
Drive efficiencies through software improvement and root cause analysis resulting in service delivery, maturity, and scalability.
Job Requirements
degree in Computer Science, Engineering, or a related field; advanced degree preferred.
experience (10+ years) demonstrating hands-on technical leadership and business impact in combining software engineering skills with systems engineering skills to solve complex automation and reliability challenges, preferably in the e-commerce industry.
technical expertise in cloud platforms (e.g., AWS, Azure, GCP), infrastructure as code (e.g., Terraform, Ansible)
problem-solving abilities and a proactive approach to identifying and addressing technical challenges.
communication skills with the ability to articulate complex technical concepts to non-technical stakeholders.
communication and collaboration skills, with the ability to work cross-functionally and influence stakeholders at all levels of the organization.
with incident management and post-mortem processes, including root cause analysis and remediation.
of industry best practices in SLA/SLO/SLI management, and observability tools (e.g., Prometheus, Grafana).
of containerization technologies (e.g., Docker, Kubernetes) and microservices architecture.
Nice to have
Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator) are a plus.
Fluent in Mandarin Chinese is a plus.
Pay: $168,200.00 min - $236,500.00 max annually, Bonus & RSU offered.