Senior Infrastructure Engineer
TITLE: Senior Infrastructure Engineer
LOCATION: Alexandria, Virginia; Hybrid
FLSA: Exempt
Division: Technology
TRAVEL REQUIREMENT:
Less than 10%, primarily to NCMEC branch offices and data center.
HOURS and SCHEDULE:
Monday – Friday, 9:00am to 5:00pm (37.5 hours within five (5) days per week); unless otherwise required or approved by management. Provide on-call, evening and weekend coverage as required. Scheduled Telework available.
REPORTS TO:
This position reports to the Executive Director, Information Technology Division.
SUPERVISION EXERCISED:
Oversees Infrastructure Team members
RESPONSIBILITY FOR PUBLIC CONTACT:
Vendor management; daily contact requiring courtesy, discretion, and sound judgment.
LICENSING AND CERTIFICATION:
N/A
GENERAL DESCRIPTION:
The National Center for Missing and Exploited Children is seeking an experienced and driven infrastructure engineer to manage it’s hybrid cloud infrastructure and deliver consistent quality of service to software engineering and operation staff alike. The infrastructure team is responsible for the support, delivery and maintenance of bare metal, VMWare and cloud servers and related infrastructure with the goal of providing exceptional availability for our software engineering teams operating in a DevOps culture as well as COTS products for external business units.
ESSENTIAL DUTIES AND RESPONSIBILITIES:
- Maintain physical and virtual server infrastructure in primary and disaster recovery data centers
- Script / automate core processes to improve efficiency as well as platform availability.
- Build and maintain monitoring solutions for mission critical systems and services.
- Manage operating systems (primarily Linux) patching, updates, installation, and configuration
- Conduct root cause analysis and remediation in response to system failures, errors and performance issues.
- Build, extend and maintain private, public, and hybrid cloud infrastructure.
- Design, build and maintain accessible infrastructure for use by operational and engineering staff
- General administration of server hosted COTS applications
- Ensure staff has appropriate operating system and application permissions to perform their job functions
EDUCATION AND EXPERIENCE:
- Bachelor’s degree in Computer Science, Information Technology or related field preferred.
- A minimum of 8 years of experience administering enterprise servers and storage, site reliability engineering or equivalent.
KNOWLEDGE, SKILLS AND ABILITIES:
- Ability to write automation scripts using Bash, Python, Groovy, or other appropriate languages.
- Knowledge of configuration management systems such as Ansible, Chef or Salt.
- Proven experience administering Linux servers and infrastructure in both physical and virtual environments.
- Experience administering database technologies including but not limited to MySQL and MSSQL.
- Knowledge of private, public, and hybrid cloud concepts.
- Patch management for large numbers of hosts
- Linux operating system hardening and security remediation
- Experience administering VMWare and Storage Area Network systems.
- Ability to utilize system monitoring tools such as Nagios or Sensu Go
- Experience administering network attached storage environments
- Ability to rapidly learn and implement new open source and commercial off the shelf tools.
- Strong problem solving and analytical skills.
- Adaptability and flexibility in an agile environment.
All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status.
The National Center for Missing and Exploited Children is an EO employer - M/F/Veteran/Disability/Sexual Orientation/Gender Identity.
Other details
- Pay Type Salary
- Headquarters, 333 John Carlyle, Alexandria, Virginia, United States of America
- Virtual