Lance Powers

Senior Systems Engineer

image

Dynamic Leader with 18+ Years of Expertise in Systems Administration, DevOps, & Project Management

Seasoned senior systems engineer with over 18 years of expertise in systems administration, DevOps, and project management. Skilled in designing and optimizing systems architecture, cloud infrastructure, and leveraging data analysis for strategic decision-making. Proven ability to lead cross-functional teams to deliver innovative solutions that align with organizational goals. Known for driving continuous process improvement and ensuring system reliability. Adept at fostering collaborative relationships to enhance team productivity and achieve superior outcomes.


Work Experience

HPC System Engineer

University of California, San Francisco | 2024 - Present

Achieved an annual savings of $100K by migrating all virtual machines from VMware vSphere to a Proxmox VE cluster.

  • Led the migration from CentOS 7 to Rocky Linux 8, including automated in-place upgrades of machines running CentOS 7 before its end-of-life.
  • Designed, configured, and deployed Proxmox VE cluster utilizing ZFS over iSCSI to replace VMware vSphere infrastructure.
  • Designed, configured, and deployed Zabbix monitoring infrastructure to replace the existing Nagios monitoring system.
  • Benchmarked and performance-tuned all servers in the HPC cluster.
  • Developed Ansible playbooks to provision new compute and storage nodes in the HPC cluster.
  • Deployed an Unbound DNS server to eliminate the use of hosts files across the cluster.
  • Redesigned PXE boot environment and built kickstart configurations for provisioning new compute and storage nodes in the HPC cluster, adding support for UEFI.
  • Managed day-to-day operations, including installing software, provisioning systems, applying patches, managing filesystems, monitoring performance, updating documentation, troubleshooting alerts, and resolving user issues.

Senior Systems Engineer

CSC Global | 2023 - Present

Achieved annual savings of $1.2 million in licensing and infrastructure costs by analyzing and right-resizing eight OpenShift clusters.

  • Automated deployment of Nutanix virtual machines using Ansible, improving deployment speed by 300%.
  • Migrated virtual machines from legacy VMware infrastructure to Nutanix infrastructure utilizing virtual to virtual (V2V), achieving improved performance and reduced maintenance costs.
  • Led migration of legacy Red Hat Enterprise Linux virtual machines to latest releases, enhancing performance and security posture.
  • Troubleshooting issues related to the Nutanix AHV and ESXi hosts, virtual machines, and all other Nutanix and VMware infrastructure.
  • Automated a disaster recovery operation by migrating virtual machines from our east coast Nutanix data center to our west coast Nutanix disaster recovery site.
  • Provided Tier 3 support for critical incidents, resolving issues related to Nutanix AHV, VMware ESXi hosts, and other infrastructure components with thorough root cause analysis.

Senior Systems Engineer

OneMain Financial | 2019 - 2023

Led software architecture, deployment, and support for advanced financial data analytics platforms, enabling critical decision-making for 1K active users.

  • Managed system administration and documentation, ensuring seamless operation and compliance with industry standards.
  • Supervised technical output of system engineering team, achieving 100% improvement in system reliability and performance.
  • Handled escalations and queries from end users via email and Teams platform, maintaining high levels of user satisfaction and issue resolution.
  • Resolved file system issues promptly, minimizing downtime and ensuring continuity of services for mission-critical operations.
  • Acted as key liaison for a large-scale migration project, collaborating effectively with third-party contractors to achieve project milestones on time and within budget.
  • Rapidly identified and resolved critical issues with the Kerberos platform, implementing patches that restored service for thousands of users.
  • Proactively stayed updated with industry best practices in disaster recovery, security training, and data management, contributing to a culture of continuous improvement and compliance.
  • Recognized with two performance-based bonuses and regular commendations from senior leadership for exceptional project management, cross-functional collaboration, and problem-solving skills.

Senior Systems Architect

Endurance International Group (Newfold Digital) | 2016 - 2019

Led a 15-person engineering team in executing large-scale enterprise projects for multiple brands within the EIG family.

  • Directly assisted in developing operations teams through code changes for infrastructure improvements.
  • Managed integration and transition during the acquisition of the company from HostGator, ensuring seamless continuity of operations.
  • Developed and debugged code interfacing with OnApp and BaseKit SiteBuilder’s RESTful and SOAP APIs, facilitating enhanced functionality and integration.
  • Implemented Zabbix clustering to monitor a network of systems and developed custom Zabbix templates, enhancing system monitoring and reliability.
  • Optimized performance by implementing caching on over 3 million websites at EIG and HostGator, significantly improving load times and user experience.
  • Played a critical role in cleaning up approximately 90,000 compromised WordPress sites using Perl scripts and custom regular expressions, ensuring security and stability.
  • Designed a new OnApp cloud infrastructure for the Houston data center, facilitating the migration of customers from legacy platforms and improving scalability.
  • Revamped and upgraded the Dallas OnApp cloud management platform for 7,000 customers, achieving improved performance, stability, and minimal downtime.
  • Integrated Amazon structures into the chat infrastructure to scale live chat services during peak traffic, enhancing user experience across internal brands.
  • Implemented Galera cluster with MySQL in a multi-master setup for billing, improving database reliability and performance.
  • Automated server auditing checks for PCI and SOX compliance using custom PHP and Perl scripts, ensuring regulatory adherence and security.

Systems Architect

Endurance International Group | 2013 - 2016

Managed site reliability for 3 million websites, ensuring smooth functioning and prompt resolution of critical issues on a 24/7/365 on-call basis.

  • Deployed Puppet across new and existing systems, increasing manageability, productivity, and automating repetitive tasks.
  • Documented new and existing infrastructures, created flowcharts, and updated documentation in Confluence for improved clarity and accessibility.
  • Oversaw audit and AIDE reports from PCI and SOX core systems, identifying discrepancies and managing resolutions through JIRA tasks and Confluence updates.
  • Conducted peer reviews of code changes before staging and deployment to production environments, ensuring adherence to coding standards and minimizing errors.
  • Developed a centralized audit system using Perl and PHP to generate core server reports, enhancing security by tracking unauthorized access, filesystem changes, and compliance issues.
  • Optimized U.S. and India BaseKit site builder clusters, improving redundancy and substantially reducing costs by optimizing bare metal server requirements.
  • Created Perl software to scan billing databases for customers listed on OFAC and SDN lists, enabling proactive security measures and compliance with legal requirements.
  • Achieved 100% compliance with PCI and SOX standards, passing audits and maintaining rigorous standards across all systems and processes.

Senior Systems Administrator

HostGator.com | 2008 - 2013

Provided comprehensive technical assistance and support to all departments, ensuring smooth operation and addressing diverse technical needs across the organization.

  • Designed and built proactive monitoring tools to identify and preemptively resolve issues with problematic servers, enhancing system reliability and minimizing downtime.
  • Automated the generation of support tickets based on monitoring results, streamlining the resolution process and improving response times for critical issues.
  • Collaborated cross-functionally to implement solutions that aligned with departmental requirements and organizational objectives, fostering a culture of continuous improvement and efficiency.
  • Acted as a key technical resource, offering guidance and expertise to teams across the organization, contributing to enhanced productivity and operational effectiveness.

Systems & Network Administrator

SITCO, LLC | 2005 - 2008

Provided technical support for over 1,000 residential and business customers, addressing diverse systems and network support requirements.

Projects

OpenShift Resizing

Software

Achieved annual savings of $1.2M in licensing and infrastructure costs by analyzing and resizing eight OpenShift clusters.

VMware Migration to Proxmox

Software

Achieved an annual savings of $100K by migrating all virtual machines from VMware vSphere to a Proxmox VE cluster.

CentOS Migration to Rocky Linux

Open Source

Led the migration from CentOS 7 to Rocky Linux 8, including automated in-place upgrades of machines running CentOS 7 before its end-of-life.