Site Reliability Engineer, Internet - Remote

Remote, Toronto, ON

Tucows (NASDAQ:TCX, TSX:TC) is on a mission to make the Internet better by giving people everywhere the online access that empowers them to make individual contributions. As a company, we embrace a people-first philosophy rooted in respect, trust, and understanding to encourage freedom, inspire innovation, and promote inclusivity, creating an environment for everyone to thrive!

Tucows has been working on the Internet since the days when people unironically called it the Information Superhighway. We are a 25-year-old global start-up embracing agility and creativity to continually seize opportunities for growth. We have evolved from a start-up domain service provider into the second-largest domain wholesaler in the world, while expanding our business with Ting, an Internet services company partnering with towns and cities to change what customers expect from their Internet Service Provider. We are building fiber networks across the US and have already launched gigabit-speed service in Maryland, Virginia, North Carolina, Colorado, Idaho and California, laying the groundwork for rapid scale.

Our growth has been incredible, smart, and measured, built on a solid technical and financial foundation. We have doubled our workforce in the last 4 years and continue in rapid expansion mode, providing services to millions of customers around the world.

Today, we’re the second-largest domain wholesaler in the world, with tens of millions of domains under management (OpenSRS / Enom). We’re doing all kinds of interesting things, including running an MVNO cell phone service (Ting Mobile) and building true fiber-to-the-premises networks in towns and cities across the US (Ting Internet). We offer individual and small business domains and integration with various popular platforms (Hover/Ascio).

We’re a team of over 600 people serving tens of millions of customers around the world.

 

About the role:

In this role, you can expect to:

  • Be responsible for application availability, latency, performance, efficiency, and monitoring

  • Work closely with the software engineering and DevOps team members

  • Own internal SLAs and ensure that they go above and beyond expectations

  • Build tools for automating deployment, monitoring, and operations of the overall stack

  • Work to improve efficiency of the delivery pipeline

  • Participate in the on-call rotation to provide application support, manage incidents, and solve problems

  • Collaborate in a remote-first environment

  • Contribute back to upstream OSS when appropriate

 

You may be a good fit for our team if you have:

  • Familiarity with infrastructure management and operations lifecycle concepts

  • Configuration management (SaltStack, Ansible, Chef, Puppet, etc.) experience

  • Understanding of GitOps practices

  • Experience operating and maintaining production systems in a Linux computing environment

  • Solid understanding of Docker containers, service discovery, and load balancing

  • Intermediate-level Python development experience

  • Kubernetes experience is a plus but not required

 

 

We believe diversity drives innovation. We are committed to inclusion across race, religion, colour, national origin, gender, sexual orientation, age, marital status, veteran status or disability status. We celebrate multiple approaches and diverse points of view.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
