Tribus
Network Operations
Tribus, Australia (posted 10 hours ago)
Full-time, Engineering

About Us

We are a global quantitative and systematic investment manager, operating across all liquid asset classes worldwide. Our technology- and data-driven approach applies scientific methods to investing. By combining expertise in data, research, technology, and trading, we foster a collaborative culture that helps us tackle complex challenges and continuously innovate to deliver strong outcomes for our investors.

Your Future Role

We are expanding our capability to deliver first-class 24/7 operational support for a global network infrastructure. The successful candidate will join the Network Operations team within the global infrastructure function, responsible for managing and maintaining core Network Infrastructure and Network Automation services. This team operates across diverse and complex environments, including low-latency trading, high-performance computing, and global WAN technologies.

Key Responsibilities

  • Oversee and enhance monitoring dashboards, proactively responding to alerts and improving the observability stack.
  • Conduct post-incident reviews to identify opportunities to improve availability and reliability.
  • Deliver BAU changes through automation, collaborating closely with trading desks, research teams, and other infrastructure engineering functions.
  • Perform trend analysis to detect potential issues, improve correlation, and anticipate capacity concerns.
  • Define SLOs to ensure high availability of network services and infrastructure.
  • Provide significant support for network automation, including CI/CD pipelines, orchestration, Ansible, Python, and GitOps practices.
  • Manage production releases, ensuring high-quality reviews of merges and scheduled deployment of changes, features, and capabilities.

Your Skills and Experience

  • Proven experience monitoring and resolving incidents across:
      • Low Latency LAN: Multicast, BGP, PTP, FPGA, Layer 1 Switching
      • Datacentre HPC: VXLAN, EVPN, RoCEv2, Leaf/Spine Topology
      • WAN and Internet/Cloud Connectivity: MPLS, BGP
  • Strong troubleshooting skills with the ability to work with vendor support teams for deep investigations.
  • Solid knowledge of infrastructure metric collection and visualization tools such as Splunk, Prometheus, and Grafana.
  • Networking expertise at Cisco CCNP level, demonstrated through either qualifications or relevant experience.
  • Willingness to participate in on-call and out-of-hours support as part of the role.
