The ideal candidate will have maintained large numbers of commodity infrastructure components supporting proprietary SaaS/PaaS applications in a high-availability production environment and have the ability to rapidly self-educate on new concepts and methods.
Responsibilities
Provide 24/7 (on-call rotation) operational support for production servers and systems
Rapidly triage and escalate incidents to internal groups and/or managed-service providers.
Monitoring of systems and capacity (Cacti, Nagios, Ganglia, SNMP, etc.)
Manage DNS, SSL, SSH architectures at remote datacenters
Work with engineers and architects to define system standards, requirements and infrastructure architecture
Work with engineering and server operations staff to ensure a successful release of all components from QA to Production.
Provide architectural input and capacity planning strategies to ensure systems scalability and functionality
Qualifications
3-5 years production 24/7 internet SaaS/PaaS systems administration
Experience with remote Linux administration (Centos 5+, Debian 5+) using SSH/CLI
Expertise supporting web applications using LAMP or RoR infrastructure
Experience with server clustering and load balancing technologies (HAProxy, nginx, etc.)
Expertise in hardening, testing and monitoring systems against security threats
Must have experience with scripting and automation of common tasks, and setting up Cacti, Nagios, Ganglia or other monitoring tools
Ability to work efficiently in a fast paced environment
Ability to work under pressure and in high stress situations
GREE - 6 months ago
- save job
-
block