Senior Platform Engineer
Posted on Saturday, November 18, 2023
OneFootball is the world’s #1 digital football platform, with more than 100 million active users across the world. Founded in 2008, we have come a long way to provide our users with the best personalised digital football experience. We are a team of hundreds of football fanatics working from hubs in Berlin, London and Lisbon. Our purpose is to disrupt the status quo and make football accessible and enjoyable.
Cloud Runtime Squad - About The Team & Your Place In It:
You will be joining the Platform tribe within the OneFootball Product and Engineering domain. Our team is the critical force ensuring that all our services are not just running but thriving across our cloud platforms with unmatched efficiency, reliability, and security. As a media platform, our vision is to connect Football Fans with the best Football related content and media.
This is a senior-level engineering opportunity aimed at those with a robust technical background and a self-driven approach to excellence. Experience counts, but what truly matters is your capability to deliver and lead both yourself and your peers toward outstanding results. For this senior role, we require a minimum of 5+ years of experience in the same or equivalent position.
- Lead the design and implementation of the core infrastructure that powers OneFootball's applications and services.
- Innovate with cutting-edge automation tools to streamline processes and enhance system efficiencies.
- Craft and refine our cloud infrastructure, focusing on scalability, cost-effectiveness, and bulletproof security.
- Forge strong alliances with software engineers to architect services that are not just scalable and resilient but also performance-optimised.
- Establish and maintain proactive monitoring and alerting systems, ensuring rapid response to any incidents.
- Commit to a culture of continuous learning, staying abreast of the latest technological advancements and industry best practices in platform engineering.
- Embrace full ownership of the systems you build and deploy, responsible for their ongoing maintenance, enhancements and evolution.
- As part of our team, your role will not only involve building and deploying systems but also taking complete ownership of them. This means being responsible for the maintenance, improvement, and evolution of our platforms to ensure they meet our high standards of efficiency and security. You’ll champion the reliability and uptime of our systems, considering the full implications of your work on the user experience and the operational excellence of the Product and Engineering team.
Who You Are - Expanded Qualifications with a Focus on Ownership:
- A seasoned professional with at least 5 years of experience in platform engineering, SRE, or DevOps, ready to tackle and take ownership of complex infrastructure challenges.
- Highly skilled in scripting (e.g., Python, Bash) and infrastructure as code (e.g., Terraform, CloudFormation), demonstrating a profound sense of responsibility for the systems you develop and sustain.
- Deeply proficient in at least one cloud service (e.g., AWS, Azure, GCP), committed to exploiting these platforms for peak performance and dependability.
- Expert in containerization and orchestration technologies (e.g., Docker, Kubernetes) with a passion for managing the full lifecycle of services, from conception to production.
- Well-versed in the principles of continuous delivery, equipped with the know-how to implement and optimise relevant tools and processes.
- Profound understanding of networking, security, and distributed systems, committed to ensuring the long-term integrity and security of the platform.
- Experienced with monitoring and alerting tools (e.g., Prometheus, Grafana, Honeycomb, Lightstep), acknowledging that true ownership encompasses vigilant system monitoring and swift issue resolution.
- Proficiency in Golang
- Experience with Backstage, showcasing a commitment to enhancing developer workflows and taking ownership of the developer experience.
- Experience with Incident management processes (rotation, incident escalation etc..) and tooling around it.