Twitter is seeking engineers to help build and grow our next generation storage infrastructure. As a Hadoop Reliability Engineer, you will be joining a team that is making a huge impact on how we do storage infrastructure at Twitter, working with technologies such as Hadoop and Cassandra in both batch-oriented and real-time contexts.
Responsibilities:
Work with open source technologies and have the freedom to release your work upstream to the open source community.
Diagnose, and troubleshoot complex storage problems.
Build advanced tooling for monitoring, administration, and operations of multiple Hadoop clusters.
Work cross-functionally with various teams such as: engineering, analytics, vendors and the capacity planning team.
Be responsible for qualification and ownership of the physical storage technology.
Code primarily in Java, Python, and C/C++
.
Requirements:
Three or more years of hands-on experience in distributed systems.
BS or MS degree in Computer Science or Engineering, or equivalent experience.
Experience administering distributed systems in a production environment.
Solid knowledge of networking and UNIX systems management.
Familiarity with systems-automation tools such as puppet, CFEngine, etc.
Plus: Experience with operating system internals, filesystems, disk/storage technologies and storage protocols (SATA/ATA/SAS).
Twitter - 19 months ago
- save job
-
block