The High Performance
Computing Technician is skilled in all HPC operational areas including
computational systems, networking, cybersecurity, storage and environmental
systems in a UNIX environment. Under minimal supervision, monitors daily
operations on one or two areas of the control center. Analyzes system failures
and initiates appropriate recovery procedures on all large systems, servers,
storage systems and network components within the NERSC and Esnet environments.
Contact appropriate vendors and initiate service calls in a timely manner.
Maintains appropriate documentation using a trouble ticketing system and the
ESnet Maintenance Calendar system. Maintains tools and procedures used to
monitor and support HPC operations. Provide technical support for monitoring
tools utilized by the group, including the ability to make basic modifications
to existing utilities as required. Inform and update NERSC staff and the user
community quickly and accurately when system failures or maintenances occur.
Perform basic user support functions such as generation of passwords and
disabling of user accounts when requested. Be able to provide procedural input
to the CONS manager or Lead Technician. Modify and maintain CONS documentation
including system and network documentation as appropriate, and Standard
Operating Procedures (SOP’s) as directed.
KEY SUCCESS FACTORS:
- Minimum of 2-years relevant technical
experience or a combination of experience and education. The HPC Tech 2 will
have experience in one of our key area: systems administration, storage
administration or wide area networking.
- Experienced in a UNIX or Linux
environment. Windows only experience will not be considered unless there
is also at least one or 2 semesters UNIX/Linux experience in the classroom (at
Work well under pressure in a fast-paced,
team-oriented environment. Be reliable, self-motivated and customer oriented.
Be able to read and comprehend technical
information such as operation manuals and technical diagrams.
Have excellent communications skills and be an
active listener. Be able to ask the right questions to obtain information and
get to the root of the problem.
The ability to monitor operations and detect
problems by understanding variations and changes to system performance. Understand
which tools to use to diagnose issues, respond to error messages, or debug
Assure that all NERSC and ESnet functions
operate to the highest degree of effectiveness.
Have system administration knowledge in order to
support the improvement and upgrade of all NERSC systems as needed.
Understand and perform system administration
functions involving operating systems in use at NERSC.
Understand basic computer security functions and
act in a manner that will not jeopardize NERSC security.
Adapt to rapidly changing hardware and software
environments and stay abreast of current computing and networking technologies
and industry standards.
Ability to lift 25 pounds.
Additional desired qualifications:
Work experience in computer
operations, communications protocols, data/video communications and
environmental systems is a plus. Must be able to analyze situations, show
initiative, and complete tasks proficiently and in a timely manner. Knowledge
of system administration functions involving multiple operating systems,
computer security and LAN/WAN network environments or a combination of
education and some experience is a must. Capable of giving clear and concise
verbal and written technical descriptions. Must be customer oriented. Able
to work in varying shifts as needed. K
nowledge of cybersecurity principles through experience, classes or certification.
This is a part-time, indefinite (career) appointment.
This position requires completion of a background check.
Incumbents in this position are represented by a union for collective
Equal Employment Opportunity: Berkeley Lab is an affirmative action/equal
opportunity employer committed to the development of a diverse workforce.
LBNL is a U.S. Department of Energy national laboratory located in Berkeley, California. It conducts unclassified scientific research and is...