Data Infrastructure Engineer
As a Data Infrastructure Engineer, you will help design and build our data storage and processing infrastructure. You will be joining a small team that owns the soup to nuts on how we store and move data at Pinterest. You will be responsible for managing our EMR cluster and it's dependencies. Additionally, you will help build and manage data logging and pipeline tools. Your experience and interest with data metrics to ensure that data is processed accurately will make a huge impact across our various teams such as Search, Recommendation, Spam and Operations.
Extensive experience with Hadoop, HDFS and MapReduce.
Preferred knowledge of running Hive on EMR.
Strong knowledge in scripting and systems languages. Java and Python strongly preferred.
Experience with distributed logging systems like Kafka.
Experience building batch job scheduling systems.
Minimum BA/BS degree in Math or Computer Science
Boundless appetite for cutting edge open-source technologies and how they can be leveraged for our infrastructure
Enthusiasm for working collaboratively with talented people
Pinterest - 12 months ago
Pinterest is a virtual pinboard where users can organize and share the things they love.