Professional Documents
Culture Documents
As a member of Numerix Global SysOps team you will be responsible for running and maintaining a 24x7
production environments hosted in multiple AWS regions.
Responsibilities
Monitor the health and utilization of AWS resources through CloudWatch and CloudWatch logs
Create alarms in CloudWatch to proactively respond to any resource issues
Use CloudWatch Log Insight to query log data for monitoring and troubleshooting incidents
Install CloudWatch agents onto EC2 instances for custom metrics data gathering
Create and restore snapshots of EBS volumes, databases (RDS, EC2)
Patching, upgrading, and security updates of installed OS EC2 instances
Create and manage AMIs, deploy new instances, and change resource instance types
Help define and design Disaster Recovery procedures and validate via planned scenario testing
Regular review of cost optimization reports, make recommendations, and implement
optimization tasks following change management processes
Work closely with the security team to help implement user access, troubleshoot access issues,
maintain security groups
Work closely with Value Stream teams during product development phases to help implement,
troubleshoot AWS resources
Understand a complex manual process and provide a simple and friendly user experience
through an automated process.
Deploy new or scale existing systems and software using automated build and deployment tools.
Write and review accurate and complete support procedures, system documentation, and issue
tracking entries.
Provide 24x7 on-call support during assigned periods.