Track This Job
Add this job to your tracking list to:
- Monitor application status and updates
- Change status (Applied, Interview, Offer, etc.)
- Add personal notes and comments
- Set reminders for follow-ups
- Track your entire application journey
Save This Job
Add this job to your saved collection to:
- Access easily from your saved jobs dashboard
- Review job details later without searching again
- Compare with other saved opportunities
- Keep a collection of interesting positions
- Receive notifications about saved jobs before they expire
AI-Powered Job Summary
Get a concise overview of key job requirements, responsibilities, and qualifications in seconds.
Pro Tip: Use this feature to quickly decide if a job matches your skills before reading the full description.
Strong experience in Production administration of Red Hat OpenShift 4.x on bare-metal Must
Strong experience on Linux (RHEL/RHCOS) + Node-Level Troubleshooting Must
Strong experience OpenShift networking (OVN-K, Ingress, LoadBalancers---Must
Strong experience on ODF / Ceph storage in production –Must
Strong experience on OpenShift security, RBAC, and governance implementation is MUST
Main Role(Accountability)
We are seeking an experienced Red Hat OpenShift Container Platform (OCP) System Administrator to manage, operate, secure, and optimize enterprise-grade OpenShift 4.x clusters deployed on bare-metal infrastructure.
The role involves full ownership of Day-0, Day-1, and Day-2 operations, including installation, upgrades, patching, monitoring, troubleshooting, automation, and platform reliability. The administrator will support mission-critical production environments and provide L2/L3 infrastructure support to multiple projects.
This position requires deep expertise in OpenShift, Linux, Kubernetes, networking, storage, security, and observability, along with the ability to work under pressure in regulated, high-availability environments.
Key Responsibilities
- Platform Administration (Day-to-Day Operations)
- Administer and operate Red Hat OpenShift 4.x on bare-metal
- Monitor cluster health: nodes, pods, ETCD, operators, resources
- Perform capacity planning, tuning, scaling, cordon/drain operations
- Manage Master, Worker, Infra nodes, storage and networking components
- Onboard and support applications on OpenShift
- Provide L2/L3 technical support to projects and operations teams
- Upgrades, Patching & Lifecycle Management
- Execute minor and major OpenShift upgrades
- Perform RHCOS OS patching
- Upgrade cluster operators, ingress, registry, and storage components
- Manage API and Ingress certificates
- Validate upgrades in test environments before production rollout
- Networking (OpenShift & Bare-Metal)
- Manage OpenShift networking: OVN-Kubernetes, Multus, SR-IOV
- Troubleshoot Routes, Ingress, LoadBalancers, DNS, VIP issues
- Work with VLANs, MTU, BGP, HAProxy, MetalLB
- Coordinate with network and firewall teams for internal/external access
- Storage Management
- Manage and troubleshoot ODF / Ceph storage
- Handle PVC/PV provisioning, StorageClasses, quotas
- Resolve persistent storage issues for stateful workloads
- Security, Compliance & Governance
- Implement Pod Security Standards, NetworkPolicies, Zero-Trust
- Enforce CIS benchmarks and Red Hat security baselines
- Integrate image scanning tools (ACS, Trivy, Clair)
- Manage RBAC, secrets, vaults, SSO/IAM
- Maintain audit logs, compliance reports, and policy enforcement
- Monitoring, Logging & Observability
- Configure and maintain Prometheus, Alertmanager, Grafana
- Manage logging stacks (EFK / ELK / Loki)
- Create alerts, dashboards, and analyze performance trends
- Conduct periodic capacity and reliability assessments
- Backup, Restore & Disaster Recovery
- Manage ETCD backups, snapshots, and restore testing
- Implement application backup using OADP / Velero
- Participate in DR planning, drills, and execution
- CI/CD & DevOps Integration
- Support Tekton pipelines
- Implement GitOps using ArgoCD / Flux
- Manage image registries (Quay / internal registry)
- Ensure pipeline security, tagging, and automation
- Documentation & Governance
- Create and maintain SOPs, runbooks, architecture diagrams
- Maintain change records as per ITIL processes
- Perform Root Cause Analysis (RCA) for major incidents
- Participate in governance meetings and weekly status reporting
Key Skills
Ranked by relevanceReady to apply?
Join TAT IT Technolgies and take your career to the next level!
Application takes less than 5 minutes

