Systems Administrator, Senior (VMware / Kubernetes) at LCG Inc
Baltimore, MD 21224, USA -
Full Time


Start Date

Immediate

Expiry Date

09 Nov, 25

Salary

150000.0

Posted On

10 Aug, 25

Experience

15 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Computer Science, Puppet, Kubernetes, Storage Systems, Communication Skills, It Security, Automation Tools, Docker, Nih, Orchestration, Information Technology, Linux System Administration

Industry

Information Technology/IT

Description

Location: Baltimore, MD (Onsite, 5 days per week)
Required Clearance: Ability to obtain Public Trust
LCG, Inc. provides insight into the impact of public programs that advance our society. For more than 20 years, LCG, Inc. has been a leading provider of technology-based consulting services, biomedical research support, grants management, decision analytics, software engineering and IT operations that enhance the transparency, efficiency, and empowerment of programs with health and science missions

JOB OVERVIEW:

We are seeking a highly experienced System Administrator, Senior (VMware / Kubernetes) to join our team supporting NIH. This role is critical to ensuring the operational continuity and technical resilience of client IT systems, especially in emergency situations. The candidate will manage and maintain large-scale Linux systems and OpenStack-based private cloud environments, support biomedical research computing needs, and ensure secure system operations in collaboration with NIH stakeholders

QUALIFICATIONS

  • Bachelor’s degree in Information Technology, Computer Science, or related field (or equivalent experience).
  • 15+ years of hands-on experience in Linux system administration and cloud computing environments.
  • Proven expertise in OpenStack private cloud installation, configuration, and management.
  • Experience supporting IT systems in federal biomedical research or healthcare settings.
  • Knowledge of NIH IT security and compliance frameworks.
  • Familiarity with containerization technologies such as Docker and orchestration with Kubernetes.
  • Experience with modern Linux management and automation tools such as Puppet, Chef, or Ansible.
  • Familiarity with scientific computing environments and high-performance storage systems.
  • Strong analytical, documentation, and communication skills.
  • Ability to work effectively as part of a crisis/emergency response team.

How To Apply:

Incase you would like to apply to this job directly from the source, please click here

Responsibilities

Core System Administration Duties

  • Manage and orchestrate containerized applications using Kubernetes, supporting scalable and resilient research computing environments.
  • Install, configure, and administer VMware ESXi/vSphere environments supporting virtualized workloads and cloud integration.
  • Deploy, manage, and monitor Linux-based physical and virtual servers, including high-performance storage and HPE GPU-accelerated systems.
  • Perform rack-and-stack tasks for general-purpose equipment (GPE) and GPU servers, including physical installation, cabling, labeling, and power/network configuration in data center environments
  • Install and configure OpenStack private cloud infrastructure on NIH-provided hardware.
  • Manage Linux-based physical and virtual servers as well as high-performance storage systems.
  • Support the deployment and configuration of scientific applications using virtual machines or containers.
  • Use shell scripting and system programming for automation and task optimization.
  • Collaborate with NIH IT Security to ensure the secure operation of cloud and Linux systems.
  • Conduct routine system audits, software/hardware inventory, and performance monitoring.
  • Manage system upgrades, patches, and configuration changes.
  • Maintain documentation of all system configurations, procedures, and work products.
  • Provide technical training to client IT staff on OpenStack and system administration best practices.

Infrastructure, Security, and Cloud Management

  • Perform disaster recovery operations and data backups when required to ensure business continuity.
  • Support the migration of IT platforms, systems, and applications to cloud-based solutions as required by the government.
  • Build, configure, and manage UNIX/LINUX authentication systems and provide high-level UNIX/LINUX/Macintosh administration support as needed.
  • Support a heterogeneous hybrid cloud and on-premises environment, including DRaaS, IaaS, SaaS, and PaaS platforms.
  • Perform vulnerability assessments and recommend mitigation strategies to reduce risk to IRP information system resources.
  • Build, configure, and monitor security systems including firewalls, intrusion detection systems, antivirus, and patch deployment solutions.
  • Monitor server, SAN, and network operations to ensure optimal performance and adherence to operational standards.
  • Manage network VLANs and switch configurations in support of deployed infrastructure.
  • Conduct hardware maintenance and replacements for server, SAN, and network devices as needed.
  • Proficient in administering VMware, Windows Active Directory, DNS, DHCP, GPOs, SQL Server, and core networking technologies.

Emergency Management Responsibilities

  • Serve as an emergency staff member and participate in client response teams to ensure operational continuity.
  • Be available during emergencies (e.g., natural disasters, outages) to maintain continuity of critical functions such as:
  • Clinical IT systems supporting patient care and ASTRA studies.
  • Scientific equipment and data storage monitoring.
  • Secure communications with researchers, grantees, and NIH partners.
  • Maintain updated personal contact information for emergency communication purposes.
Loading...