Job Category: Operations
Location: Redmond, WA, US
Job ID: 813333-94890
CDM (Cloud Datacenter Management) is a team that delivers and operates the next generation of cloud and management services hosted on Azure. Do you have a passion with online services and excited in making an impact? Our team is looking for a strong Service Engineer that is self-driven, with the engineering excellence to build the next highly scalable System Center cloud services.
The Service Engineering team is the fourth pillar in the System Center Cloud and Data Center Management Dev/Test/PM. As part of this quad Service Engineering integrates in the planning, design, build, deploy, operate and optimizations of service delivery for public/private cloud. The next wave of these cloud services are to be taken from prototype, CTP, beta to launch as V1+.
The position is a prime opportunity for a Senior Service Engineer to help define the strategy for Microsoft’s Public/private cloud solutions for the enterprise and datacenter. Our Service Operations team is committed and a results driven team. We are responsible for incident root cause analysis, problem prevention and working closely with the product development teams on automation, manageability, security, optimization and deployment. As a member of this team - you will own all functional systems from an engineering perspective and be a key part in providing the service delivery components and solutions to support System Center Cloud Services.
As a Senior Engineer you will provide technical engineering guidance for the implementation, integration, and evolution of complex systems architectures; design and maintain services hosted on the Azure cloud platform. Sustained engineering expertise required in Quick Fix Engineering (QFE) tactics; provide application performance monitoring, tuning and optimization improvements and provide technical excellence for resolving critical production systems issues. Regular cadence of Service deployments and reacting to all aspects of incident and problem management, assisting operations engineers where needed.
The Service Engineer has a broad scope of participation in the service lifecycle - long term statistical trending and analysis, service capacity and threshold testing, systems documentation, service maintenance, the analysis and evaluation of new systems designs and technical strategies. You will work with other team members to troubleshoot complex live service issues, identify root causes and develop mitigation options.
Key Responsibilities of a Service Engineer:
o Working with Program Management, Developers and Testers to implement short and long term plans for our platform and services.
- Planning (concept phase):
o Service Level Objectives - reliability measures for our services like SLA and KPIs (Service Level Agreements and Key Performance Indicators)
o Drive Security and Compliance analysis and requirements
o Contribute to initial Architecture designs
o Identify key service management requirements for Production
o Perform service feature design reviews.
- Engineering (Design phase):
o Develop service monitoring strategies.
o Build systems management automation and tooling capabilities.
o Contribute to scaling modeling, capacity planning and Disaster recovery solutions.
o Release planning and readiness
- Release (Test/Implementation phase):
o Build out of environments
o Detailed Service rollout/release for operations team
o Co-ordination of implementation of Monitoring solutions
o Contribute to managing a large Azure hosted service - including SQL Azure, ACS and Storage.
- Operate (Sustained Engineering phase):
o Provide technical expertise to perform root cause analysis of service interrupting incidents and develop strategies to prevent reoccurrence.
o Statistical analysis of systems trending data and review logs to identify system Performance and Availability bottlenecks and potential incidents.
o Provide guidance to product development teams on operability, manageability, security and data-flow.
o Problem Engineering
o Manage and prioritize multiple tasks in accordance with high level objectives/projects.
- Additional role requirements
o SE escalation point for service level questions and issues.
o Close engagement with Developer’s and Test team.
The Ideal candidate would have the following qualifications and Experience:
Minimum of 5-7 years’ experience with a combination of the following:
- BA/BS in Business or Computer Science
- Experience managing a Hosted Service or Enterprise data center experience
- Azure experience a bonus
- Microsoft Operations Framework experience
- Strong Working knowledge of Socket, XML, .Net Framework, TCP/IP, security hardening procedures.
- Advance knowledge of networking topologies, protocols and networking architectures
- Expertise with monitoring large scale environments, capacity modeling, security modeling a must.
- Knowledge of networking topologies, protocols and infrastructure architectures
- Excellent interpersonal skills, written and oral communication communications skills and ability to be highly self-directed when required and work closely within a team.
Microsoft Corporation develops, manufactures, licenses and supports a range of software products for computing devices. The Company's...