
How do I start my career as a Data Engineer?

I get asked this question quite often here on LinkedIn.


I get multiple DMs every time I post.

I even ran a poll, which showed that the majority of people faced challenges in starting their
career in data.

I am sure many people have already answered this question.


Benjamin Rogojan has an excellent post and a video on the same topic.

Still, it seemed like most roadmaps were either so detailed that they confuse beginners or so
superficial that there is no clear path.

One of my mentees said, "I spoke with many engineers, and they said just learn Python and SQL."
Though that's good advice, people still need a full roadmap.

Hence, I started creating a plan that even I will follow to learn and refresh the topics. The
planner also has a timeline attached to it, so that we can stay focused.

I also broke the topics down into 3 segments: a. Fundamentals b. Advanced c. Good to have

Fundamentals:

1. Database Concepts
2. Programming
3. SQL
4. Data Warehouse and Data Modelling
5. Cloud Fundamentals
6. Hadoop Ecosystem & Spark

Advanced Topics:
1. ETL using Python / Scala in Spark
2. Data Processing Libraries / Constructs
3. NoSQL Databases
4. Workflow Management and Schedulers
5. Data Streaming
6. DE in cloud (AWS / GCP / Azure)

Good to Have:
1. Dashboarding Tools
2. Docker and Containerization
3. DevOps / DataOps
4. Modern Data Stack
5. Data Governance and Observability

The fundamentals will help you build your base, and with them you can even start applying for job roles.

The advanced topics will help you with actual project building. Once you complete the advanced
topics, you can clear most of the DE interviews, even as a senior professional.

The good-to-have topics will help you understand the broader perspective and the actual importance
of the DE role. They will also help you create production-grade projects.

The most important aspect of the roadmap is to learn continuously every day and build on top of
what you have already learnt.

Find the link to the blog post and planner in the comments section.

Go on, crush it...

#dataengineering #dataengineer #datascience #etldeveloper #dataanalytics
