WR
WILLARD ROBERTS
ad8ya4@r.postjobfree.com 832-***-**** Houston, TX 77396
Certified System Engineer with extensive experience in systems optimization, data transformation, automation, and strategic IT solutions. Recognized for excellence in system management and software development in Windows and Linux systems. Skilled problem solver with a strong data pipeline background focused on achieving high availability, resiliency, and performance.
Linux
Python and Shell scripting
Database Management (Oracle, MySQL, PostgresSQL)
Networking
Monitoring tools (Geneos, Splunk, New Relic)
Version Control Systems
AWS Cloud Computing
Incident Management
Problem-Solving
Agile methodology
Security Compliance
Docker
Kubernetes
Ansible
LTI - GE Power Systems Houston, TX
Unix System Administrator
08/2003 - 04/2025
Performed system administration tasks including user account management, software installation, and hardware maintenance. Installed, configured and maintained various Linux distributions such as RedHat Enterprise Linux, CentOS, Ubuntu.
Established procedures for backup of critical data on Unix servers and ensured periodic backups were taken regularly according to standards set by the organization.
Provided technical support for GE Aero Energy Products team with Unix related problems and inquiries.
Mindlance - Nike, Inc Houston, TX
Site Reliability Engineer
09/2022 - 08/2024
Provided production and QA support of security systems hosted on Windows and Linux OS.
Monitored system availability by developing New Relic alerts. Led incident response for production incidents, coordinating cross- functional teams to resolve critical issues with 98% SLA compliance. Responsible for analyzing and resolving performance bottlenecks. Responsible for building and configuring new applications on AWS EC2 instances in DEV and Production environments.
Developed and maintained over 50 data integration workflows using Apache Airflow, improving data processing efficiency by 20%. Leveraged Terraform to automate and standardize the build of AWS EC2 compute instances.
Conducted root cause analyses on production incidents, implementing solutions that reduced repeat incidents by 80%.
Managed and automated monthly patch cycles for Windows and Linux systems, achieving 100% compliance and zero downtime. Summary
Skills
Experience
Reduced manual work by developing Python scripts to automate support tasks.
Managed and optimized the Snowflake data lake, overseeing data storage, data ingestion, and data transformation processes to ensure efficient handling of large datasets and seamless data accessibility. Developed Python scripts that leverage pandas to clean, transform, and aggregate raw data. This included handling missing values, normalizing data formats, and performing complex calculations to prepare data for analysis and reporting.
Leveraged AWS S3 service for archival data store of data integration jobs. Ensured high availability and scalability of applications across multiple environments.
Documented best practices and procedures for incident response activities. Mindlance - TD Ameritrade Houston, TX
Site Reliability Engineer
08/2020 - 04/2022
Provided production support of Java based order management system. Responsible for maintaining 99.99% SLA for uptime and availability Developed Splunk queries for real-time monitoring and analysis of system performance, identifying and resolving issues within the Order Management System.
Implemented custom dashboards and alerts to proactively manage system health and performance.
Developed a custom support portal using Django, enabling the support team to run SQL queries via a user-friendly interface, significantly reducing the need for manual query execution and data retrieval. Deployed, configured, and managed Kubernetes clusters to orchestrate containerized applications. Ensured high availability and scalability of services, optimized resource usage, and implemented best practices for cluster management.
Provided ongoing support and troubleshooting for the Think or Swim Order Management System, diagnosing and resolving complex issues to ensure uninterrupted trading operations and system stability. Developed and maintained Continuous Integration/Continuous Deployment (CI/CD) pipelines to automate the deployment of applications and updates, ensuring efficient and reliable releases across development and production environments.
Enhanced alert visibility and response times by integrating web hooks into Slack, automating the delivery of critical alerts directly into predefined Slack channels alongside existing email notifications. Automated the provisioning of AWS EC2 instances using Rundeck integrated with Ansible
Krescendo Staffing Agency Houston, TX
President, Founder
06/2018 - 08/2020
Proactive client engagement
Develop new sales prospects
Conduct intake meetings with clients to agree on qualification criteria for candidates
Publish job ads on careers pages, job boards, and social media Sourced and contacted passive candidates online, including through platforms like LinkedIn.
Network with candidates offline during job fairs
Screen resumes and job applications
Monitored market trends and adjusted strategies accordingly. JPMorgan Chase Houston, TX Command, control, and operational support of internally developed and Site Reliability Engineer, VP
04/2005 - 03/2018
third-party applications on a distributed Linux environment. Partnered with the development team to scope out infrastructure requirements.
Automated system configuration with Ansible.
Responsible for testing and sign off on various code rollouts in pre-prod environments.
Leveraged Jira to plan, track, and manage all project work. Published runbooks and technical documents.
Developed post-release and start-of-day automated functional health checks.
Integrated Geneos monitoring solutions with ITPAM to automate alert handling and incident response. Configured Geneos to trigger automated remediation actions based on predefined conditions, enhancing operational efficiency.
Ensured all critical incoming and outgoing feeds are delivered within the agreed SLA.
Leveraged Control-M and AutoSys for nightly batch orchestration. Reduced toil by automating manual tasks with Python scripts. First to respond and engage in resolving live incidents. Leveraged advanced Unix and database skills in administration, maintenance, and troubleshooting technical incidents. Hosted weekly stability meetings with various development team leads to discuss impact and strategic solutions to prevent repeat recurrence. Experience in large system platform migrations.
Assist in DR and BCP planning and testing.
Responsible for global user communication related to system availability and performance.
Exxon Mobil Houston, TX
Unix System Administrator
09/2000 - 08/2003
Developed, configured and maintained Unix systems in a large-scale enterprise environment.
Provided technical support for users with Unix related problems and inquiries.
Installed, tested and patched operating systems, applications and security patches as needed.
Implemented security policies to protect against unauthorized access or malicious activity on the system.
Maintained documentation related to installed software packages and configurations for future reference purposes.
Petroleum Geo Services Houston, TX
Unix System Administrator
04/2000 - 09/2000
Installed, tested and patched operating systems, applications and security patches as needed.
Created user accounts, assigned privileges and monitored usage of resources on the system.
Managed file systems including creating new filesystems or expanding existing ones when required.
Performed regular system Legato backups to ensure data integrity and availability in case of disaster recovery scenarios. Interliant Houston, TX
Unix System Administrator
07/1998 - 04/2000
Carried out routine maintenance tasks such as disk space management and cleaning up old log files from the servers.
Managed email routing and internal message protocols to support reliable delivery of communications.
Configured DNS rules
Configured and managed Legato backup schedules
Texas Southern University Houston, TX
Bachelor of Science (B.S) in Transdisciplinary Energy and Environmental Systems 05/2024
Certified Ethical Hacker
EC-Council
Issued Jan 2017
Credential ID ECC80612486305
Education and Training
Certifications