Senior Software Engineer

Burlington, MA, US, 01803


Sophos Overview – Cybersecurity Evolved

Sophos is a worldwide leader in next-generation cybersecurity, protecting more than 500,000 organizations and millions of consumers in more than 150 countries from today’s most advanced cyberthreats. Powered by threat intelligence, AI and machine learning from SophosLabs and SophosAI, Sophos delivers a broad portfolio of advanced products and services to secure users, networks and endpoints against ransomware, malware, exploits, phishing and the wide range of other cyberattacks. Sophos provides a single integrated cloud-based management console, Sophos Central – the centerpiece of an adaptive cybersecurity ecosystem that features a centralized data lake that leverages a rich set of open APIs available to customers, partners, developers, and other cybersecurity vendors. Sophos sells its products and services through reseller partners and managed service providers (MSPs) worldwide. Sophos is headquartered in Oxford, U.K. More information is available at www.sophos.com.


Job Purpose:

The Principal Infrastructure Engineer works within the Sophos Core Platform Group System Engineering team. This team drives system resiliency, scalability, and supportability through software and infrastructure development, system design consultation, and development of operational best practices. It is also responsible for the development and operation of all shared MongoDB and Redis infrastructure on the platform. You will work closely with our production operations, engineering services, and service development teams as you pursue your mission. As a principal member of the system engineering team, you will directly influence both system design and best practices across Sophos’ global cloud development organization.


Main Duties

  • Develop and operate shared infrastructure on the Central platform. This includes large-scale MongoDB, Redis, and Memcached infrastructure.
  • Actively refactor existing platform infrastructure and Java code to increase resiliency, scalability, and cost efficiency of Central applications and platform services.
  • Develop and maintain sustainable automation for deployment and management of Java applications and infrastructure in production.
  • Develop and promulgate system design and operational best practices to be adopted by development, engineering services, and operations teams.
  • Implement monitoring, telemetry, and debugging tools within the platform to improve operational response and troubleshooting capabilities.
  • Triage and troubleshoot system errors in production and pre-production Central platform environments. Analyze logs and communicate potential code issues to development teams.
  • Consult with application development teams on use of shared infrastructure and general system engineering aspects of service design.
  • Actively develop system improvements and refactorings as part of incident postmortems to improve system resiliency.
  • Influence service decomposition priorities as we extract services from the monolithic components on the platform.
  • Actively monitor system performance of “SOA” processes in production.


Skills & Experience:

  • BS in Computer Science or equivalent experience
  • 7+ years of experience in system engineering, infrastructure development, or SRE roles working with large-scale cloud systems.
  • Strong competency in the following essential skills:
    • Linux operating systems
    • Scripting / software development with languages such as Bash, Python, Go, and Java
    • Public cloud platforms such as Amazon Web Services (AWS), Azure, or GCP
    • Declarative infrastructure-as-code frameworks such as Terraform or CloudFormation
  • Conceptual understanding of design, implementation, and operational patterns for:
    • Distributed cloud systems
    • Large-scale distributed data stores. Expertise in MongoDB and Redis is particularly valuable in this role.
    • System monitoring & telemetry
  • Familiarity with relational, NoSQL, and in-memory data stores, particularly MongoDB and Redis. Ability to understand and modify both MongoDB and SQL statements and scripts to fulfill database configuration and setup needs for applications. Ability to perform database installation and configuration steps, including backup and restoration of test data in multiple environments.
  • Excellent written and verbal communication skills to coordinate with worldwide development and operations teams


Equal Opportunities

Sophos is committed to equal opportunity in all areas of its work. All qualified applicants will be treated in a fair and equal manner and in accordance with the law regardless of gender, marital status, race, religion, color, age, disability or sexual orientation.

If you choose to explore this opportunity, and subsequently share your CV or other personal details with Sophos, these details will be held by Sophos for 12 months in accordance with our Privacy Policy and used by our recruitment team to contact you regarding this or other relevant opportunities at Sophos.  If you would like Sophos to delete or update your details at any time, please follow the steps set out in the Privacy Policy describing your individual rights.  If you have any questions about Sophos’ data protection practices, please contact dataprotection@sophos.com.

At Sophos, we want every organization to be protected by innovative, next-generation IT security, even those that don't have a huge IT staff. We protect organizations of all sizes, all around the world, by making enterprise-grade security that is simple to deploy, manage, and use. It is our passion, and something we are truly proud of.
