English (United States)
Anew Logo

Software Product Engineering - Sr. Engineer - Site Reliability

Req #1150
Virtual
This job posting is no longer available

Job Description

Posted Tuesday, January 17, 2023 at 5:00 AM

Our goals are to provide excellent service, utilize advanced technology, and proficiently deliver results. To accomplish these goals, we constantly seek individuals who look for ways to do things better. We are a company whose culture cultivates teamwork, rewards excellence, focuses on quality for every aspect of our business, and promotes community involvement.
 

Tabula Rasa HealthCare (TRHC) is a leader in providing patient-specific, data-driven technology and solutions that enable healthcare organizations to optimize performance to improve patient outcomes, reduce hospitalizations, lower healthcare costs, and manage risk. Medication risk management is TRHC’s lead offering, and its cloud-based software applications, including EireneRx® and MedWise™, provide solutions for a range of payers, providers and other healthcare organizations.

 

TRHC empowers our employees to provide excellent service, utilize advanced technology, and proficiently deliver results. Our 32Fundamentals are what we are and who we are.  Our culture cultivates teamwork, rewards excellence, focuses on quality for every aspect of our business, and promotes community involvement. As a part of our team, you will help us bring innovative service models to healthcare, improving patient outcomes.

The Senior SRE - Cloud Technology, you will be responsible for ensuring the stability and reliability of TRHC public cloud hosted software. You will work with development teams to make sure that observability, resilience and performance are incorporated into their software solutions from the earliest design stages. You will act in a consulting role to promote a proactive approach to logging, monitoring and alerting, and establish the measures necessary to track continuous improvement in these areas. You will work with the Application Platform team to provide the necessary instrumentation and services to support the development teams in pursuing these objectives. You will work with Product Owners to define and commit to Service Level Objectives and Error Budgets for their products and provide them with tools to allow them to track their performance.   
 

The is a remote role.

ESSENTIAL JOB FUNCTIONS:

Primary Functions:

  • Define and maintain standards for observability across all TRHC products.
  • Collaborate with development teams to incorporate these standards into their software and provide the necessary tooling and code to enable compliance with the standards.
  • Build, monitor and troubleshoot shared infrastructure for monitoring, logging and alerting using both custom and third-party tools.
  • Integrate tooling with CI/CD pipelines to simplify compliance and onboarding of code bases.
  • Educate development team members on SRE principles, tooling, metrics and processes.
  • Assist teams to define their incident management process and to create playbooks to standardize incident response.

QUALIFICATION REQUIREMENTS:

  • 5 years professional experience with a minimum of 3 years of experience building infrastructure in multiple public cloud environments (AWS and/or GCP preferred)
  • Experience working in an environment that uses SRE practices like Service Level Indicators, Service Level Objectives, and Error Budgets.
  • Mastery of at least 1 compiled programming language (C/C++, Java, C#) and at least 1 scripting language (Python, JavaScript, Go, etc)
  • Experience with Infrastructure as Code using Terraform in a commercial environment is preferred.
  • Thorough understanding of CI/CD tools, processes and practices in a commercial environment.
  • Experience with tools like NewRelic, DataDog, and PagerDuty in a commercial environment.
  • Experience with a wide variety of AWS resources and familiar with the best practices for using them in a production environment.
  • Excellent communication skills.
  • Ability to work under general supervision by senior engineers.
  • Ability to read and interpret technical data.

EDUCATION:

Bachelor’s degree in a science or technical discipline. Will accept 4 years professional experience in lieu of a degree.


EXPERIENCE:

Five years of professional experience in a related discipline in addition to the above educational requirement.


OTHER SKILLS and ABILITIES:

Strong communication skills and the ability to interact with leaders across the organization.

#LI-Remote

#Dice 

The Company is proud to be an equal opportunity employer. All qualified applicants will receive consideration without regard to ancestry or national origin, race or color, religion or creed, age, disability, AIDS/HIV, gender, marital or family status, pregnancy, childbirth or related medical conditions, genetic information, military service, protected caregiver obligations, sexual orientation, protected financial status or other classification protected by applicable law.

Cookie Preferences

When you visit any website, it may store or retrieve information on your browser, mostly in the form of cookies. This information might be about you, your preferences, or your device and is mostly used to make the site work as you expect it to. The information does not usually directly identify you, but it can give you a more personalized web experience. Because Dayforce respects your right to privacy, you can choose not to allow some types of cookies by clicking on "Manage Preferences" below. However, blocking some types of cookies may impact your experience of the site and the services we are able to offer.For more information, refer to Dayforce Cookie Statement.