【職務内容】
As a Site Reliability Engineer, you will be responsible for developing solutions, implementing requirements, assisting in creating key processes and procedures, that facilitate product planning, execution and delivery. We aim to solve society's issues with AI, so our mission is to solve the Engineering Department's issues!
【必須要件】
Familiarity with at least one cloud platform (i.e. GCP, AWS, Azure, etc...)
Experience in designing and implementing scalable cloud-based solution architectures
Strong expertise in infrastructure-as-code solutions such as Terraform
Strong operational expertise in containerization technologies, especially Kubernetes
Knowledge of source control, CI/CD, infrastructure automation, orchestration, deployment automation and configuration management
Bi-lingual (business English & Japanese daily conversation or English daily conversation & Japanese native)
While our team is mostly english-speaking, you should be comfortable enough talking in Japanese with other internal stakeholders
【歓迎要件】
AWS Solutions Architect certifications or knowledge on par with those
Kubernetes development experience, such as creation of in-house Helm charts
Familiar with scripting languages (Shell, Python, Golang)
Familiar with extended infrastructure-related tooling such as Ansible or Chef
Experience in working with large software systems developed on Unix/Linux
Experience of working with monitoring and metrics systems (e.g Collectd, Grafana, Nagios, etc.)
Experience in working closely together with development, product and business teams
Knowledge of web application security and best practices
【求める人物像】
You are comfortable at explaining complex recommendations to engineering and infrastructure teams, while discussing technical trade-offs in product development with other work colleagues. You are highly resourceful, analytical, and have a combination of focus, flexibility, self-motivation, and integrity.