Join a high-impact engineering team responsible for building, automating, and optimizing globally distributed load balancing platforms that support critical online services used by millions of users worldwide. This is an exciting opportunity for an experienced Site Reliability Engineer (SRE) to work on large-scale infrastructure, solve complex technical challenges, and contribute to the reliability, performance, and scalability of mission-critical systems in a collaborative international environment.
Key Responsibilities
- Design, implement, and automate scalable load balancing infrastructure supporting high-volume production environments.
- Collaborate with cross-functional engineering teams to enhance platform resilience, reliability, and operational efficiency.
- Diagnose and resolve complex issues across Linux systems, networking infrastructure, and application layers.
- Develop, maintain, and optimize APIs that improve platform functionality and streamline operations.
- Coordinate infrastructure deployments, software releases, and system updates while engaging with stakeholders to ensure smooth delivery.
- Contribute to the continuous improvement of globally distributed traffic management and network solutions.
Required Skills and Qualifications
Experience:
- 3+ years of experience in Site Reliability Engineering, Infrastructure Engineering, DevOps, or related fields.
- Strong expertise in Linux administration, system troubleshooting, and performance optimization.
- Solid understanding of networking concepts and web protocols including TCP, HTTP, TLS, DNS, and BGP.
- Experience designing, developing, or maintaining APIs.
- Knowledge of data center infrastructure, network architecture, and large-scale distributed systems.
- Experience supporting production environments with a focus on availability, scalability, and operational excellence.
Soft Skills:
- Strong analytical and problem-solving skills with a structured approach to troubleshooting.
- Ability to work effectively within collaborative engineering teams.
- Excellent communication skills and the ability to engage with technical and non-technical stakeholders.
- Self-motivated mindset with a passion for continuous improvement and automation.
Language Requirements:
- English: Fluent
- Japanese: Basic Level
Preferred Skills & Qualifications
- Experience working with globally distributed infrastructure environments.
- Knowledge of modern automation, monitoring, and observability practices.
- Exposure to cloud platforms and large-scale traffic management solutions.
- Experience supporting mission-critical online services.
About the Company
Our client is a leading global technology company operating large-scale online platforms and cloud infrastructure across multiple regions worldwide. Their engineering teams develop high-performance systems that power critical digital services, focusing on innovation, automation, reliability, and scalability. The company promotes a collaborative culture where talented engineers can make meaningful contributions while working with cutting-edge technologies.
Why You'll Love Working Here
- Competitive hourly compensation and the opportunity to work on high-impact global infrastructure projects.
- Minimal overtime, supporting a healthy work-life balance.
- Gain exposure to large-scale distributed systems, networking technologies, and advanced reliability engineering practices.
- Collaborate with highly skilled international engineering teams in a supportive and innovative environment.
- Enjoy additional benefits including casual dress code, side business flexibility, and a modern engineering culture focused on continuous learning and improvement.
Don't Miss Out - Apply Now!
