This job listing has expired and the position may no longer be open for hire.

Senior Site Reliability Engineer - API at Mimecast North America, Inc. in Lexington, Massachusetts

Posted in Other 30+ days ago.





Job Description:

Job Locations

US-MA-Lexington














Title
Senior Site Reliability Engineer - API
ID
2021-5740
Position Type
Permanent - Full Time

Overview

Mimecast is looking for a Senior Site Reliability Engineer to join the API and Integrations Team. To be successful in the role you will be passionate about solving complex technical problems, have exceptional analytical and troubleshooting skills and a desire to improve systems and processes within engineering. You will tackle a wide range of complex software and system issues, including monitoring distributed systems in multiple geographic locations, responding to and safeguarding the availability, reliability and performance of the API services platform.

As a Site Reliability Engineer, you will be passionate about ensuring that our services are running optimally at all times and about detecting/addressing any anomalous behavior with a minimum of delay. When necessary, you will also be required to handle incidents and, most importantly, protect customers by minimising impacts on them by maintaining effective and swift communication and providing quick solutions in high pressure situations.

Responsibilities

    Monitoring, alerting, SLO tracking (Grafana, Graphite).
  • Detection / early warning / self-heal.

  • Facilitate emergency / incident response.

  • Change management and delivery pipeline into production.

  • Provisioning / automating deployment infrastructure.

  • Ensure safety, predictability, repeatability and auditability of all build and deploy processes.

  • Create, maintain and test for resilience & recovery.

  • Demand forecasting and capacity management

  • Efficiency and cost management.

  • Performance and scalability of the services.

  • Provide level of audit and control to security personnel.

  • Collaborate with the team to come up with technical designs and solutions with adequate documentation.

  • Work closely with the team to discuss solution and implementation strategy.

  • Help maintain the platform by fixing bugs and issues coming from the operational pipeline.

  • Champion best practices and enable and support members of the team.

Qualifications

The ideal candidate would have knowledge/experience in the following:


Essential:


  • Distributed Systems Architecture and best practices.

  • Strong background of traditional ops areas of expertise: Linux, Disk I/O, Networking.

  • Hands on experience in configuration management of server farms (using tools such as Puppet, Chef, Ansible etc).

  • Knowledge about encryption protocols, public-private key infrastructure, symmetric vs asymmetric keys, current cryptographic hashing algorithms and their use in HTTPS / SSH and other protocols.

  • Continuous delivery - principles and pragmatics of dealing with build pipelines, artifact repositories, zero-downtime deployment and modern software development engineering best practices.

  • Familiarity with docker and container ecosystem.

  • Using, developing and debugging on Linux.

  • Developing and debugging in a distributed environment.

  • Familiar with distributed version control systems such as Git.

  • Experience with Software Engineering collaboration tools such as Jira, Confluence etc.

  • Experience working in an agile environment using approaches such as scrum and Kanban.

  • Ability to read / debug Java code, experience with Java tuning.

  • Ability to analyze network behaviour, performance and application issues using standard tools.

  • Understanding of common Internet protocols and applications such as SMTP, DNS, HTTP, SSH, SNMP etc.

Desirable:(Experience in one or more of the following)


  • Microservice architecture.

  • Working knowledge of Public Key Infrastructure, Envelope encryption, Key Management, Identity Access Management.

  • Knowledge of network programming and protocol including REST, HTTP.

  • Hands on experience with open source monitoring and graphing solutions such as Nagios, Zabbix, Sensu, Graphite, Grafana, etc.

  • Extensive experience with at least one scripting language apart from BASH (Ruby, Perl, Python, Go).

  • PostgreSQL database configuration and management of replication.

  • Proving resilience via failure injection (chaos monkey), DR testing, scalability via load and stress testing.

  • Experience with NGINX, API Managers such as Kong and WSO2, and Identity and Access management servers.

  • Windows PowerShell scripting and management.

  • Knowledge of Exchange and Office 365, Azure and LDAP Directory Services, GSuite services.

Benefits and Rewards:


We offer a highly competitive rewards and benefits package, please speak with the Talent Acquisition Team at Mimecast to learn more about our rewards and benefits.

Check us out here too...


  • StackOverflow: https://stackoverflow.com/jobs/companies/mimecast

  • Mimecast Engineering blog: https://medium.com/mimecast-engineering

  • Our YouTube Channel: https://www.youtube.com/user/mimecast

  • LinkedIn Life page: https://www.linkedin.com/company/mimecast/life?trk=nav_type_life

Mimecast is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or protected veteran status and will not be discriminated against based on disability.Mimecast is also committed to providing reasonable accommodations to individuals with disabilities throughout the interview and employment process, and to use our online system to apply for a position. Call us at (617) 393-7000 or send an email togcraig@mimecast.comto request reasonable accommodations.

Mimecastwill not discharge or in any other manner discriminate against employees or applicants because they have inquired about, discussed, or disclosed their own pay or the pay of another employee or applicant. However, employees who have access to the compensation information of other employees or applicants as a part of their essential job functions cannot disclose the pay of other employees or applicants to individuals who do not otherwise have access to compensation information, unless the disclosure is (a) in response to a formal complaint or charge, (b) in furtherance of an investigation, proceeding, hearing, or action, including an investigation conducted by the employer, or (c) consistent with the contractor's legal duty to furnish information.


#LI-GC1


More jobs in Lexington, Massachusetts

Other
about 9 hours ago

The Judge Group Inc.
Other
1 day ago

Massachusetts General Hospital(MGH)
Other
2 days ago

HireMinds
More jobs in Other

Other
less than a minute ago

American Cybersystems, Inc.
Other
less than a minute ago

American Cybersystems, Inc.
Other
less than a minute ago

American Cybersystems, Inc.