About The Team
About Sea Labs
Sea Labs is at the core of Sea's platform development, supporting diverse business lines ranging from e-commerce, supply chain, games, and payment to finance, among many others. The strong growth and unique positioning of Sea's e-commerce business, Shopee, spurred the launch of Sea Labs Indonesia. Since its inception, passionate engineers have charted the course to deliver the best experience for our users in Indonesia, and many of their solutions have even been adapted for other regional markets.
Sea's hyper-growing business scale has turned even the most innocent problems into huge technical challenges, and there is no better place to experience world-class projects first-hand if you love technology as much as we do. Together with our passionate and driven teams, you'll get to develop your skills, build on industry knowledge, and collaborate with global teams in a dynamic space. Browse our Sea Labs Indonesia team openings to see how you can make an impact with us.
About Team
The Games Site Reliability Engineering (SRE) team is mainly responsible for SRE support for Shopee Games. Our scope of work includes, but is not limited to, maintaining and improving the stability of our systems, optimizing resources, and improving efficiency. Shopee Games is a set of games and gamification features that drive user engagement, and games have already become a key engagement feature for Shopee. We have a mature container management platform and a variety of common components that are used extensively. We offer broad room for growth and challenges, and we welcome you to join us.
Job Description
- Be responsible for ensuring the reliability of the business, including but not limited to monitoring and alerting, incident management, business continuity management, resource and capacity management, and campaign support.
- Take part in the planning and development of operational tools to automate processes, improve efficiency, and reduce costs.
- Enhance the existing stability assurance system and drive the implementation of best practices and processes for SRE operations to ensure scalability, reliability, and performance.
- Collaborate with the Dev team and provide pertinent technical solutions based on their requirements. Proactively engage in effective communication to secure their support and ensure the successful delivery of relevant projects.
- Be responsible for 24/7 monitoring and response for the Games business: respond promptly to live incidents, quickly locate the cause, and recover service to ensure business stability.
Requirements
- Bachelor's degree or above in Computer Science or related fields
- 3+ years of experience as an SRE, DevOps Engineer, or System Engineer
- Expert in shell scripting; familiarity with Python or Go is a plus, and React and JavaScript are also highly preferred
- In-depth understanding of networking, Linux, and traffic scheduling
- Familiar with Jenkins and GitLab, and experienced in CI/CD process development and integration
- Familiar with commonly used middleware and databases, such as Codis, Redis, MQ, and MySQL
- Familiarity with Docker/k8s, including the related underlying technologies and principles, is preferred
- Able to respond promptly to and handle fault incidents
- An effective team player with a customer service orientation
- Meticulous and attentive to detail, with strong critical thinking, data analytics, and problem-solving capabilities
- Candidates with experience in independently leading technical projects will be given priority
- Able to communicate effectively in English to work with stakeholders in other regions
- Have a passion for reliable and performant systems, and care deeply about the end-user experience