Site Reliability Engineer (SRE) Job at Openkyber, Georgia

b0ZER0twZWtEYzB2UVFLc2VpZ3hOQ3RVR3c9PQ==
  • Openkyber
  • Georgia

Job Description

Job Summary We are seeking an experienced Site Reliability Engineer (SRE) to join the Applied AI and Data Science program. This role focuses on deploying, monitoring, and optimizing cloud-based applications and infrastructure to ensure high availability and performance. The ideal candidate will have strong expertise in AWS, containerized microservices, infrastructure automation, and monitoring tools.

Key Responsibilities:
  • Release Management: Build and deploy application, service, and infrastructure releases; validate system integrity post-deployment; document release notes.
  • Production Support: Maintain 99.999% availability of critical systems; monitor infrastructure and applications; perform root cause analysis for outages; respond to incidents.
  • Monitoring & Alerting: Implement monitoring policies; build dashboards; track system efficiency and resource consumption; alert stakeholders for SLA deviations.
  • Optimization: Manage resource scaling; optimize system performance and resource utilization.
  • Team Collaboration: Assist with user support; coordinate with onshore/offshore teams; develop bug fixes; become an expert in system architecture and deployment pipelines.
Required Qualifications:
  • 6+ years of DevOps or SRE experience in large, complex environments.
  • Strong background in software development (OOP) and ability to read/debug code.
  • Expertise in AWS services (EKS, S3, DocumentDB) and Terraform for Infrastructure as Code.
  • Experience with Kubernetes, containerized microservices, and cloud deployments.
  • Proficiency with GitLab or similar CI/CD tools for pipeline management.
  • Hands-on experience with monitoring tools such as Datadog or Splunk.
  • Bachelors degree in a related field or equivalent experience.
Preferred Qualifications:
  • Familiarity with Python, Node.js, React, TypeScript, and GraphQL.
  • Exposure to relational (SQL) and NoSQL databases.
  • Experience with Docker, Redis, and ORM frameworks.
  • Knowledge of experimentation, statistical testing, and data analysis.
  • Masters degree in a related field is a plus.

Education: Bachelors Degree

Job Tags

Similar Jobs

Providence Health & Services

Security Officer Job at Providence Health & Services

 ...Management Department is a centralized location for managing a wide variety of programs that provide for the safety and security of St. Patrick Hospital & Health Sciences Center's (SPHHC) patients, visitors, employees and property. Members of the Safety Management... 

Danville Services

Direct Support Staff - St George Day Program Job at Danville Services

 ...those interested in the fields of nursing, medical supports, social work, behavior supports, and therapy, but anyone with a desire to...  ...assist with activities of daily living (ADLs) and provide care Experience in group home settings or long-term care is a plus, but not... 

Walmart Inc.

(USA) Packing Operator, Manufacturing (6:00am-6:30pm, Friday-Sunday) - $22.80/hr. Job at Walmart Inc.

 ...What you'll do at Position Summary... Walmart is opening its third owned and operated milk processing facility in Robinson, Texas...  ...vision and dental coverage. Financial benefits include 401(k), stock purchase and company-paid life insurance. Paid time off benefits... 

DSV - Global Transport and Logistics

Freight Forwarder, Gateway Exports Job at DSV - Global Transport and Logistics

 ...is local and close to our customers. Read more at Location: Grapevine, TX Division:Air & Sea Job Posting Title: Freight Forwarder, Gateway Exports Time Type: Full Time Summary An Air Export Gateway Freight Forwarder will be responsible for managing... 

FEMA

Emergency Management Specialist (Response) Job at FEMA

 ...Experience refers to paid and unpaid experience, including volunteer work done through National Service programs (e.g., Peace Corps, AmeriCorps) and other organizations (e.g., professional, philanthropic, religious, spiritual, community, student, social). Volunteer work...