NOC Technician Job Listing at Robert Half Technology in Boston, MA (Job ID 02100-130841)
Please send an updated resume to email@example.com
Senior NOC Engineer
The Sr. NOC Engineer is responsible for the monitoring, governance, process development, staff oversight, and day-to-day operations for the Network Operations Center. This senior member of the Technical Operations team operates as the Subject Matter Expert (SME) and Systems Engineer working with Software Development, Performance Testing, and Systems Engineering. The Sr. NOC Engineer will develop, maintain, and evolve monitoring requirements and technology selection. In addition, the individual will lead the activities required to monitor health, performance, and availability of the Software as a Service (SaaS) operation. Technology ownership and stewardship of the platform will include developing monitoring strategy as well as reporting and the ability to harvest performance and availability metrics for internal use and customer facing Service Level (SLA) reporting.
Developing operational procedures and providing guidance and leadership to NOC operators in adopting standard operating procedures necessary for the day-to-day maintenance and incident response methodologies (identification, triage, troubleshoot, escalate, close)
Work with Software Architecture, Software and Systems Engineering, and Operations teams to develop innovative solutions to attain high availability scalability and reliability
Provide technical leadership and do technical hands on scripting, tooling, automation for continuous operations
Detect incidents based on monitoring tools, notifications, and log files.
Develop new monitors, and modify existing monitors, as new monitoring needs are identified.
Triage incidents and perform documented steps to resolve when a known error is identified.
Logging incidents within the Incident Tracking system, clearly documenting symptoms needed for others to investigate the incident.
Act as incident owner, escalating to other support groups and following the status of the incident until it has been confirmed to be resolved.
Work closely with technical support, engineers, customers, and other groups as needed to narrow investigative efforts and resolve incidents.
Schedule new batch jobs, modify job schedules, and delete jobs from schedules based on operational and project needs. Monitor running jobs for operational impact. Identify scheduled job failures.
Maintain critical documentation assets, such as customer contact lists, escalation procedures, scheduled job inventories, and operational cookbooks.
Provide support via phone or pager on a scheduled basis as part of an on-call rotation
3+ years NOC Operations /Monitoring Experience
3+ years of monitoring development/deployment experience
3+ years in an advanced Information Technology and/or software development role
Broad experience deploying complex monitoring systems for mission critical, 24/7 environments
Strong Monitoring Foundational expertise including SNMP, WMI, Synthetic Transaction Engines and experience with various commercial, open source and home grown monitoring packages and methods (e.g. Splunk, Nagios, Zabbix, OneSite, Gomez, CA, HP Openview, etc)
Strong experience in script/automation development, adaptation and troubleshooting
Solid understanding of networking, including network devices, subnets, and routing protocols ; Ability to take and interpret packet captures (Ethereal, etc)
Solid understanding of systems, including server hardware, Windows and LINUX operating systems, iSCSI/FC SAN/NAS/DAS storage, Hypervisor/Virtualization (VMware, Hyper-V),
Independently implement and build tools and test major features and capabilities, as well as work jointly with othe
Apply on Company Website
Get alerts for jobs like this:
Get jobs like this tweeted to you:Software Dev. - General/IT jobs in Boston, MA
View similar jobs:
Lab Operations Manager
sanofi-aventis - Cambridge, MA
Accenture - Boston, MA
Accenture - Boston, MA
Locate this job: