Technical Project Lead - Site Reliability Engineer II

Location:
Union City, CA
Posted:
July 02, 2024

Resume:

Neel Rajgor

ad6xwr@r.postjobfree.com +1-630-***-**** San Jose, CA, USA LinkedIn

SUMMARY

Site Reliability Engineer with over 3 years of experience specializing in cloud-based infrastructure and automation. Proficient in managing and optimizing AWS and GCP environments, establishing CI/CD pipelines, and improving system performance and availability. Adept at creating and leveraging observability tools, collaborating with development teams, and conducting thorough root cause analyses. Dedicated to applying technical skills to build robust infrastructure, mitigate risks, and enhance system reliability.

SKILLS

Technologies: AWS, GCP, Azure, Docker, Python, SRE, DevOps, Linux, IoT, Kubernetes, Configuration Management, Networking, Infrastructure as Code (IaC), Monitoring & Observability, OS Services, Microservices, MongoDB, MySQL, Helm, HashiCorp Terraform.

Tools: Splunk, Puppet, Terraform, Ansible, Jenkins, Nagios, GitHub, GitLab, VictorOps, Prometheus, Grafana, GitLab CI/CD, Helm, PagerDuty, Jira, Confluence, Kanban, Salesforce, Slack, Zoom.

Languages: Python, Bash Scripting, SQL, HTML/CSS, C, C++, Java.

Certifications: AWS SAA 2024, IoT, Swift Programming, Blockchain.

EXPERIENCE

Site Reliability Engineer II - Technical Project Lead July 2021 - May 2024

Crest Data Systems – San Jose, CA

● Managed and optimized cloud-based infrastructure on AWS and GCP, ensuring high availability and scalability.

● Automated routine tasks and procedures using scripting languages such as Python and Bash. Developed and deployed CI/CD pipelines using Jenkins and GitLab CI, reducing deployment time by 30%.

● Used Linux and Splunk for log collection and aggregation, real-time system monitoring, and automation of data inputs and configuration.

● Conducted performance testing and optimization of critical systems, resulting in a 15% improvement in application response times. Adapted Splunk for real-time data analytics, developing graphs, reports, alerts, and dashboards.

● Facilitated rapid growth of Splunk Cloud by adapting to changes in automation, tooling, and processes, scaling the SRE team, providing training, and monitoring cloud environments using Splunk.

● Collaborated with development teams to optimize application performance and reliability, resulting in a 15% decrease in system downtime.

● Managed customer escalation calls, resolving issues swiftly through cross-functional collaboration and addressing escalated customer concerns to maintain high satisfaction and retention metrics.

● Conducted Root Cause Analyses (RCAs) as part of incident management processes, using systematic approaches to identify and address complex system issues swiftly.

● Participated in on-call rotations, providing timely support to Splunk users and troubleshooting for production systems.

ACADEMIC PROJECTS

Vehicle Fuel Monitoring & Services Nov 2020 - April 2021

● Designed and implemented an IoT-based vehicle tracking and monitoring system, utilizing sensors and GPS-GSM modules to store real-time data in a MySQL database. Developed an Android application for user access, employing Python, Raspberry Pi, MIT App Inventor, and Arduino.

Distributed System Jun 2020 - Aug 2020

● Researched distributed systems and developed a serverless application with AWS Lambda. Implemented multithreading for faster dataset processing, reducing processing time by 40%. Utilized AWS services such as EC2 and CloudFront to streamline development and improve website performance by 30%.

EDUCATION

Bachelor of Engineering - Computer Engineering August 2018 - May 2021

Trinity Academy of Engineering

References:

Available upon request.