Professional Documents
Culture Documents
Data engineer
Hide description
You’ll be managing data pipelines for companies that deal with large volumes of data.
The questions you’ll be dealing with sound like, “How do I build a pipeline that can handle 10,000
requests per minute?” and, “How can I clean this dataset without loading it all in RAM?”.
The technologies you’ll be working with include Apache Spark, Hadoop and/or Hive, as well as
Kafka. You’ll most likely need to have a solid foundation in SQL.
Data analyst
Hide description
Your job will be to translate data into actionable business insights.
The questions you’ll be dealing with sound like, “What’s driving our user growth numbers?” and,
“How can we explain to management that the recent increase in user fees is turning people
away?”
The technologies you’ll be working with include Python, Tableau and Excel. SQL might also be
necessary.
Data scientist
Hide description
Your job will be to clean and explore datasets, and make predictions that deliver business value.
The questions you’ll be dealing with sound like, “How many different user types do we really
have?”, and “Can we build a model to predict which products will sell to which users?”
The technologies you’ll be working with include Python, scikit-learn, Pandas, SQL, and possibly
Flask, Spark and/or TensorFlow/PyTorch.