We are looking for a Lead Operations Engineer with experience managing cloud-hosted infrastructure. You will work closely with our CTO, QA, and engineering teams to maintain and deploy cloud server infrastructure (using chef and AWS) and ensure performance, uptime, and scalability. This position has an expectation of on-call time for unexpected off-hours outages. You can be based in either of our locations (San Francisco, CA or Portland, OR).
Continually refining processes for code deploys, server maintenance, etc.
Growing the ops team in-house.
Putting in place NOC support.
Managing team including on call rotation
Automate failover and scaling infrastructure
Maintain and (where possible) improve supportability of service applications
Track and ensure known application issues reach resolution with QA and engineering
Develop tools or implement initiatives to improve uptime, maintainability, and user-facing experience
Maintain server and deployment documentation
Linux System Administration (Centos / RHEL or Debian / Ubuntu systems)
nginx, apache or lighttpd
Chef, Puppet, or other configuration management
Python, Perl, Ruby, or other high-level scripting language
Python WSGI frameworks (gunicorn, tornado, twisted, etc)
Memcached, redis, or other temporary key-value stores
Packaging (deb/dpkg, rpm)
SQL Schema design, query optimization
Component performance analysis and profiling
Other cloud infrastructures (Rackspace, Heroku,Google App Engine)
Content Delivery Networks (Akamai, Limelight, etc.)
What we offer
Free lunches every day and plenty of snacks
Competitive salaries and awesome medical, dental and vision benefits plus a 401(k)
Unlimited vacation (and we require a minimum of 2 weeks!)
Epic ping pong and FIFA battles
A transparent work environment
A developer community that inspires and works with you to change mobile gaming
PlayHaven - 5 months ago