Site Relaibility Engineer 2023

Site Reliability Engineer
About Media.net
Media.net is one of the world’s leading companies in the Contextual Advertising space that provides a
wide range of advertising and traffic monetization solutions. Since its founding, Media.net has constantly
broken new ground in building innovative contextual advertising solutions. We serve our contextual real-
time ads customized for each visitor and each page view across billions of visitors, across 10’s of
millions sites/domains on a server infrastructure that runs 1000+ CPUs, terabytes of RAM, and 100’s of
pentabytes of data.
With more than 1000+ employees, Media.net has one of the largest teams worldwide building a global
contextual advertising business. By market cap, Media.net is one of the Top 5 largest ad tech companies
worldwide. By revenue, Media.net is the #2 largest contextual advertising businesses worldwide.
Media.net is acquired by Chinese Consortium for $900 Million USD in 3rd Largest Ad Tech deal ever
and this acquisition will fuel Media.net’s global expansion strategy and provide access to China’s world-
class talent and capital markets.
Location(s) – Mumbai / Bangalore
About Team:
SRE team in Media.Net is responsible for managing scaling, performance, monitoring, security, availability
of the production environment. The main focus is to architect, develop, automate and deploy products and
infrastructure based on Linux and Linux application stacks.
Our environment consists of our own bare metal and private cloud across co-located data center facilities and
the AWS public cloud.
Our engineering teams follow DevOps practices and we rely heavily on open source tools like Jenkins,
Selenium, Git, Puppet, Docker, Kubernetes, Openstack, Nagios/Icinga, Kafka, Graphite, Hadoop, Graphite,
ELK, Vault etc. We use Python and Go majorly in SRE teams.
Who should apply for this post?
Key Responsibilities:
1. Engage with product and engineering team to design, build and maintain the system / software for
high availability and drive operational best practices
2. Identify and drive opportunities in making resilient systems that help maintain business continuity
3. Eagerly perform troubleshooting, RCA and implement permanent resolution of issues across the
stacks – hardware, software, database, network and so on
4. Implementation of proactive monitoring, alerting, trend analysis and self-healing systems
5. Develop continuous delivery for multiple platforms in production and staging environments
6. Find areas of existing manual intervention, and replace with automation where possible
7. Infrastructure and platform security
8. Effectively use and maintain Infrastructure and config management tools like puppet, chef, ansible,
terraform to deploy and manage infrastructure
9. Demonstrate technical mentoring and coaching to team members
10. Adaptable to work in a fast-paced environment and alter priorities as per business needs
Required Skills –
11. B. Tech / M. Tech or Equivalent in Computer Science, Information Technology or a related field
12. 2-5 years of experience in handling services in a large-scale distributed systems environment
13. Experience with Unix/Linux operating systems internals and administration (e.g. filesystems, inodes,
system calls, etc)
14. Deep understanding of network stack (e.g. TCP/IP, routing, network topologies and hardware, SDN,
etc)
15. Awareness of, and ability to reason about, modern software & systems architectures, including load-
balancing, queueing, caching, distributed systems failure modes generally, microservices, and so on
16. Excellent programming (Python, Go, Ruby or preferred scripting languages) and automation skills
17. You have expertise in some of the below tools/skills -
o Container orchestration technologies like Kubernetes and Mesos
o Virtualization platforms, either on-prem or cloud-based (We use Openstack and AWS)
o Understands Infrastructure as a code (we use Puppet, Ansible and Terraform) and
containerization tool sets (we use Docker).
o Data intensive applications and platforms like Kafka, Hadoop, Spark, Zookeeper, Cassandra,
PostgreSQL OLAP, Druid
o Relational databases like MySQL, Oracle, PostgreSQL etc
o NoSQL databases like Redis, MongoDB, Cassandra, CouchDB etc
o One or more CI tools like Jenkins, Teamcity
o Centralized logging systems, metrics, and tooling frameworks such as ELK, Prometheus, and
Grafana.
o Web and Application servers like Apache, Nginx, Tomcat
o Versioning tools such as git.
18. Ability to work independently and own problem statements end-to-end.
19. Great communication, interpersonal and teamwork skills.
Benefits and Culture –
At Media.Net people love their jobs, and not just because we offer the most competitive salaries in
the industry. Our excellent benefits include everything from great medical and life insurance to catered
meals. Our workspaces are comfortable and fun, complete with bean bag chairs, ping pong tables, and
all the snacks you can eat. We have no dress code (tee-shirts are a-ok!). We have flexible work hours
and flexible holidays, which means that teams pick their own work hours. Media.Net has its own
concierge desk that doubles up as a travel agency.
We are passionate about building the next generation of web products, and we believe that happy
employees are the key to achieving this goal. If you like the idea of working in an exciting
workspace on cutting-edge internet products that make a truly global impact (and wearing flip-flops
to work), then we want to get to know you!
Site Reliability Engineer – Recruitment Process
Round 1 –
- Online MCQ Test

- 45 minutes
Round 2 –
- Coding Round
- 3 hours
Round 3 –
- 3 Tech Rounds
- 60 minutes
Round 4 –
- Final Round
- 60 minutes
Each round will be an elimination round.

Verdict will be announced immediately after Round 4.
Compensation Details –
Site Reliability
Engineer
Component Amount Comments
Compensation (A) 13,00,008 per annum
Joining Bonus (B) 1,50,000 one time
Relocation Benefit (C) 1,05,000 one time reimbursement
Benefits (D) 22,682 non monetary
Total Package (A+B+C+D) 15,77,690

Site Relaibility Engineer 2023

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Site Relaibility Engineer 2023

Uploaded by

Copyright:

Available Formats

Site Reliability Engineer

Location(s) – Mumbai / Bangalore

Who should apply for this post?

Benefits and Culture –

- Online MCQ Test

Each round will be an elimination round.

You might also like