Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Join a World Lead trading firm as they grow their capability to provide first class 24/7 operational support for our global network infrastructure. The successful candidate will join the Network Operations team within the global infrastructure function, responsible for managing and maintaining the Network Infrastructure and Network Automation services. The team will be operating in various complex environments, including low latency trading, high performance computing, and global WAN technologies.
The Role:
- Oversight and development of monitoring dashboards, responding proactively to alerts with appropriate level of priority. Taking responsibility for continuous improvement of alerting and the overall observability stack
- Running post incident reviews finding opportunities to improve the availability and reliability of our infrastructure offerings
- Delivery of BAU changes through automation wherever possible, working with trading desk, research and other infrastructure engineering teams in a highly collaborative manner
- Perform trend analysis using various data sources aiming to seek out potential issues, improve correlation and finding capacity concerns
- Defining SLOs to ensure the high availability of network services and infrastructure
- Provide considerable support of network automation and associated tooling such as CI/CD pipelines, orchestration, Ansible, Python, GitOps practices, amongst others
- Responsible for releasing into production environments by providing high quality reviews of merges and scheduled pushes of changes/features/capabilities
Your present skillset
- Experience monitoring and resolving incidents across:
- Low Latency LAN: Multicast, BGP, PTP, FPGA, Layer 1 Switching
- Datacentre HPC: VXLAN, EVPN, RoCEv2, Leaf/Spine Topology
- WAN and Internet /cloud connectivity: MPLS, BGP
- Expert troubleshooting skills of network infrastructure and collaborating with vendor support teams when required to perform deep investigation
- Good knowledge of infrastructure metric collection and visualisation tooling such as Splunk, Prometheus and Grafana
- Clear networking knowledge at Cisco CCNP level either through demonstrable experience, or verifiable qualifications
On-Call and out of hours work will be a future requirement of this role.
While a resume is preferable we also welcome tentative inquiries from well-qualified persons. Utmost confidentiality and discretion are assured. If interested, please email [email protected]
Key Skills
Ranked by relevanceReady to apply?
Join GQR and take your career to the next level!
Application takes less than 5 minutes