Amazon EC2 (AWS Elastic Compute Cloud) is the top cloud computing offering on the internet, changing the way computing is done. The Amazon EC2 Network Health team owns ensuring the network is performing the way that our customers expect. We own three things to make this happen; firstly the network monitoring tools that tell us when our customers are seeing problems; secondly we own continuous improvement of network performance; and thirdly we own the DNS, NTP and HTTP Proxy services that both our customers and internal systems rely upon as an "invisible" tier 0 dependency.
We are looking for System Engineers to join our Engineering team to operate and improve our tools in this space. We want someone passionate about four things:
Networking and DNS : These are our major operational focus, and we need someone who knows and can operate them front to back. We own both the servers and tens of thousands of clients here, so you need to be as comfortable configuring bind as a server to modifying its source to be resilient to server failure.
Perl/ Ruby Scripting : EC2 is in over 7 regions and 18 availability zones today, and will be in many more going forward. You can't operate across that many zones doing things by hand. We need people with zero tolerance for wasted and repeated work, who can build the scripts such that they never have to pay that cost again. We want System Engineers who are capable of thinking in big O terms, as when you have 10,000 things to monitor, N squared is a lot of work.
Operational Excellence at a massive scale : we have tens of thousands of hosts across 7 different EC2 regions, all dependent on our services at a Tier 0 layer (i.e. an outage stops them getting their work done). We need the type of person who can confidently come in and take actions to change things in such an environment, yet do so in such a way that doesn't endanger our customers. A combination of confident with a bias for action to make things happen, but detail-driven and careful to ensure the action is executed as carefully as needed.
Deep Problem Solving: When customers have difficult performance problems they come to us at a third layer of support when the upper two layers have been unable to piece it together. We need someone who is happy to take on these types of problem, and drive them to a solution, whether that be a fix to software we own, or an improvement to our monitoring tools to find the problem faster in future. We want the type of person for whom reading kernel source code or Cisco manuals is not going too deep. And who is confident and can communicate well enough to enlist other team's help.
If this sounds like you, we’d love to have you on board. Cloud computing is changing the world, and this is a great opportunity to be a participant instead of observer.
BS in Computer Science or related field, minimum of three years software development experience
Alternatively (no degree) minimum of five years of professional software development experience
Proficiency in programming with Ruby (essential) and ideally Perl, Java and C
Proficiency in TCP/IP networking, architecture and core technologies (DNS, HTTP, Routing, etc.)
Ability to operate in high pressure environment and successfully handle multiple priorities
Knowledge of network performance methodologies along with hands on empirical monitoring
Experience deploying or managing large scale distributed Linux environments
MySQL experience - Excellent communication and analytical skills
For more information on Amazon Web Services please visit: http://aws.amazon.com/ec2
- Please email AWS Recruiter, Sara Fleischman (email@example.com ) if you have questions.*
Amazon strives to be Earth's most customer-centric company where people can find and discover virtually anything they want to buy online. By...