You are on page 1of 4

Sai Thikkavarapu 🕾 +1 774-519-0973

Mckinney, TX
tsaik917@gmail.com

Career Objective

Seeking assignment on latest technology with a challenging environment and a growth-oriented organization.

Summary of Expertise

● 10+ years of Experience in SRE/DevOps/System Analyst/ Release Management/DB, Application


Deployment and development lifecycle.
● An effective communicator with excellent relationship building & interpersonal skills.
● Strong analytical, problem solving & organizational abilities. Possess a flexible & detail-oriented attitude.

Academic Background

● B.E in Computer Science from ANNA UNIVERSITY, India


● Masters in Embedded Software Engineering (Program of Computer Software) from Gannon University.

PROJECTS:

Bose Corporation
Duration: Sep 16 to Current

Technologies: AWS, APIGEE, Azure, GCP, Kubernetes, SonarQube, Qualys, Jenkins, Python, Prometheus,
Grafana, Datadog, JMeter, Linux (RHEL), Docker, Terraform, Ansible and EFK stack, Jaeger/Dynatrace
(Tracing), BASH, KAFKA, Various Databases (Oracle, Aurora - MYSQL/POSTGRES)

Roles: Application Support Engineer/ DevOps

Responsibilities:
✔ Worked as incident lead for Bose cloud applications which includes writing run-books and stakeholder
(product managers and developers) communication using status.io and working with 3rd party vendors like
PagerDuty, AWS, Apigee etc.
✔ worked to create many troubleshooting guides to prevent the production incidents.

✔ I am responsible for maintaining deployments of staging and integration env in automated fashion and creation
of automated tests which run on daily basis before go-live.
✔ Gathers and documents requirements from a diverse user community and Translates requirements to
functional design documents for complex systems/processes.
✔ Work with the development teams to ensure alignment of development needs and timelines with
infrastructure capacity and capabilities.

Page PAGE1 of NUMPAGES1


✔ Worked in Kubernetes operations managing various activities like deployment, logs viewing using stern,
creating docker images for apps, Scaling of applications, archival, Secrets management and role-binding
features.
✔ Release coordination’s for embedded Firmware builds to device (OTA release management) and software
release builds to cloud on daily/monthly releases.
✔ Supported Canary Releases in production and rollbacks whenever issues arise in production cloud and devices
exhibit anomalies.
✔ Worked on AWS features including security extensively as admin i.e. EC2, ELB, SES, VPC, RDS, S3, SNS, Billing
dashboards, CloudWatch, IAM, DNS, Route 53, WAF, Secrets Manager, Certificate Manager and worked with
AWS support for various maintenance activities.
✔ Used Terraform for automated provisioning of resources on native aws stack and ansible to manage various
nodes and software installations.
✔ Worked on security components of infrastructure like vulnerability remediation using Qualys and deployments
of newly created AMI to PROD.
✔ Created various Datadog dashboards for Production applications.

✔ Supported Legacy infrastructure (Java, Tomcat, Nginx) as tier1 ops, installation and troubleshooting of issues
like restarting ad updating tomcat configurations as required.
✔ Worked on Terraform and Ansible to provision deployments to QA, NonPROD and PROD envs.

✔ Worked on Qualys to monitor and patch vulnerabilities and AMI creations for higher envs.

✔ Installed and configured Qualys agent on various worker nodes for automated node scans and reported them
to various teams using EXCEL and Power BI Dashboards
✔ Configure and Test High Availability (HA) and Disaster Recovery (DR) scenarios for Databases like aurora
MSQL, DB etc. for customer facing applications.
✔ Created various custom dashboards for service monitoring using metrics from APIGEE Stats and monitoring
API
✔ Various APIGEE Admin tasks like deploying proxies, API keys, adding policies like JWT validations, Spike
Arrests etc.
✔ Custom Report Jobs creation to analyze various metrics like errors/tps/latencies on APIGEE proxies in
production.
✔ Develop and run Containerized Micro services through Orchestration tool Kubernetes.

✔ Worked Heavily on JMETER scripts for Performance.

✔ Configured many JMETER scripts on various environments and responsible for managing performance aspect
of services in production
✔ Did Performance testing for many applications (Java, Python) and scaled the resources for holiday traffic.

✔ Managed entire soak test environment including deployments and execution of soak test and providing
recommendations/reported various bugs.
✔ Installed and Configured blazemeter plugin for JMeter tests to have better reporting of test results for
stakeholders.

Page PAGE1 of NUMPAGES1


✔ Develop and improve existing Cloud Platform by writing Continuous integration and continuous delivery
scripts in Python and shell scripting.
✔ Used Blue/Green deployment method to upgrade various RDS instances of aurora and Postgres databases with
almost zero downtime.
✔ Configured various CloudWatch alerts for S3, WAF and RDS resources in AWS.

✔ Support the Production platform and involve in 24*7 On-Call Support on Rotation basis to handle any issues
that arise with production applications and doing RCA.
✔ Used BOTO3 to write various automations for features like S3 encryption, resource management like reserved
instances, Default VPC and their components deletion etc.
✔ Create and maintain Runbooks and Troubleshooting guides for internal applications for debugging issues in
various Environments.
✔ Created various Grafana dashboards for metrics like CPU/MEM, TPS, Latency and error codes for various
applications and guided developers on best practices.
✔ Configured various data sources like Prometheus, aurora MySQL etc. to pull various metrics and create
dashboards in Grafana for visibility of key KPI’s
✔ Written complex Prometheus queries and constructed Grafana dashboards for anomaly detections across
various applications.
✔ Wrote various alert rules to manage the anomalies in CPU/MEM, TPS, Latency and error codes.

✔ Develop Python Scripts for Performance/Stress Testing applications newly being Onboarded to benchmark
and provide resource recommendations.
✔ Support Jenkins to create build/deploy Jobs and create schedules to run them in automated way and manage
access for various development teams.
✔ Support SonarQube Tool to run Security Analysis and develop code-coverage scripts for applications.

DC HBX (District of Columbia – Health benefit exchange)


Duration: March 2015 to Sep 2016

Technologies: LINUX, Oracle RAC, PL/SQL, IBM-CURAM, Kubernetes, Web logic,Dynatrace, SVN,
SonarQube, Jenkins (BUILD-AUTOMATION),Groovy, Grafana, BASH

Roles: SRE/DevOps

Responsibilities:
✔ Gathers and documents requirements from a diverse user community and Translates requirements to
functional design documents for complex systems/processes.
✔ Planned and Implemented CURAM DATABASE upgrade from version 6.0.5.4 to 6.2 supporting DC-HBX
and DCAS related Applications.
✔ Did POC on Dockerizing applications and building custom docker images for developers.

✔ Creating, configuring, and maintaining Oracle Automated Database Maintenance using Shell scripting
techniques.

Page PAGE1 of NUMPAGES1


✔ Supported Applications running in Kubernetes on AWS cloud which involved entire Cluster Setup,
deployments, Role based Access-controls, Auto-Scaling and Namespace Management using python scripts.
✔ Controls migrations of programs, database changes, reference data changes and menu changes through the
development life cycle.
✔ Used Dynatrace to debug high latency calls coming from app to database.

✔ Performance tuning and Optimization (PTO) of Oracle Databases by Generating AWR (Automatic
Workload Repository) and ASH (Active Session History) reports using Oracle SQL DEVELOPER (Tool)
and analyzing them.
✔ Creation of AWS Development (CENTOS,RHEL) and Production stack and deploy the developed Code on to
AWS stack to ensure operational readiness of Code.
✔ AWS RDS Provisioning and upgrades using terraform.

✔ Managing Backup/Recovery and Cloning Operations of the Databases using RMAN and export/import
Utilities of Oracle and integrated them with Jenkins Jobs to run and notify their success on Grafana
dashboards.

AIRTEL
Duration: dec 2012 to Nov 2013
Technologies: LINUX, RHEL, UBUNTU, Shell scripting(bash)
Roles: Software Analyst

Responsibilities:

✔ Involed in patching of linux systems

✔ Establish monitoring, backup and logging procedures for all environments

✔ Administer and maintain multiple Centos/RHEL and other Linux environments, establish and enforce
configuration management controls and perform security patches.
✔ Database Optimization and Tuning using Execution Plans, SQL Trace and AWR Reports.

✔ Shell scripting to automate various monitoring of file system and other system resources

✔ Software installations and configurations.

Page PAGE1 of NUMPAGES1

You might also like