Site Reliability Engineer (SRE) Job at Openkyber, Georgia

  • Openkyber
  • Georgia

Job Description

TEKsystems is hiring for a fully remote, Level 5 SRE for one of our clients. The role can sit in any US state and any timezone.

This is a short-term contract role funded through the end of January 2026, with a possible extension beyond that date.

Our client, a digital asset exchange platform where users can buy, sell, and store cryptocurrencies, is seeking a high-level, Senior SRE to join their AI Infrastructure team.

The following experience is REQUIRED unless noted otherwise:

  • Site Reliability Engineering (SRE) background
  • AI infrastructure familiarity (nice-to-have, not mandatory)
  • Strong Go and Python scripting skills
  • Terraform for infrastructure as code
  • GCP or AWS Cloud Infra (logging, observability, pub/sub, cloud syncs)
  • Vector.dev and Datadog for observability pipeline
  • Security risk assessment and remediation
  • Ability to own projects end-to-end with minimal supervision

Description

We are looking for a Site Reliability Engineer (SRE) to join the IT AI Infrastructure team to deploy, manage, and optimize AI-powered productivity tools and in-house AI solutions that enhance employee efficiency at scale. A successful candidate will have demonstrated success in similar roles within high-growth, security-conscious environments, bringing deep expertise in public cloud infrastructure (AWS/GCP), backend development (Python, Go, or Java), and automation tooling. The right person is passionate about building scalable and reliable AI infrastructure, driving automation, and collaborating across disciplines to integrate AI systems while maintaining strong security and compliance standards.

  • Deployment and Management of AI Tools: Deploy, configure, and manage AI-powered employee productivity tools and AI solutions built in-house.
  • Reliability and Performance: Ensure high availability, reliability, and optimal performance of AI platforms and services. Implement monitoring, alerting, and incident response procedures.
  • Scalability and Infrastructure: Design and implement scalable infrastructure to support the growing demands of AI tools and user base. Optimize resource utilization and manage capacity planning.
  • Automation and Tooling: Develop and maintain automation scripts and tools to streamline deployment, monitoring, and maintenance tasks. Contribute to the experimental sandbox environments for testing new AI solutions.
  • Collaboration and Support: Collaborate with cross-functional teams (Machine-Learning, HR, Security, Data Science, Developer Experience) to support the development and integration of AI solutions. Provide technical support and troubleshooting for AI-related issues.
  • Security and Compliance: Adhere to security and privacy policies while deploying and managing AI tools. Ensure compliance with regulatory requirements.
  • Monitoring and Metrics: Implement comprehensive monitoring and metrics to track the performance and health of AI systems. Analyze data to identify areas for improvement and optimization.
  • Incident Response: Participate in incident response and troubleshooting for AI-related outages or performance issues. Develop and maintain incident response plans.
  • Backend Development: Contribute to backend development tasks to support the integration and functionality of AI tools.
  • Public Cloud Management: Deploy and manage AI solutions on public cloud platforms (AWS/GCP), leveraging cloud-native services and best practices.
  • Written and Verbal Communication: Excellent communication skills and experience presenting technical information to non-technical audiences, including senior leadership.

Skills

  • Proven experience as a Site Reliability Engineer (SRE) or in a similar role
  • Strong understanding of AI technologies and platforms
  • Experience deploying and managing applications in a cloud environment (AWS/GCP)
  • Solid backend development experience with programming languages such as Python, Java, or Go
  • Strong proficiency in managing and configuring public cloud services (AWS/GCP) for scalability and reliability
  • Experience with automation tools and scripting (e.g., Ansible, Terraform, Bash, Python)
  • Excellent troubleshooting and problem-solving skills
  • Strong communication and collaboration skills
  • Strong understanding of security and compliance
  • Experience working in a highly regulated environment
  • Experience in a fast-paced, high-growth company

Additional Skills & Qualifications

Role: AI Site Reliability Engineer (Contractor, IC5 level)

Team: IT EMPA (Employee Productivity & Automation)

Duration: Open until end of January (possible extension)

Location: Remote

Responsibilities:

  • Manage and enhance AI-driven employee productivity tools (e.g., Glean, Google Workspace, Slack AI)
  • Implement observability solutions (logging, metrics, dashboards)
  • Automate infrastructure tasks using Terraform
  • Assess and mitigate security risks in AI systems
  • Build scaffolding APIs for unsupported Glean features
  • Collaborate with engineering teams to deliver production-ready solutions quickly

Job Type & Location

This is a Contract position based out of Oakland, CA.

Pay and Benefits

The pay range for this position is $90.00 - $100.00/hr.

Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:

  • Medical, dental & vision
  • Critical Illness, Accident, and Hospital
  • 401(k) Retirement Plan (pre-tax and Roth post-tax contributions available)
  • Life Insurance (Voluntary Life & AD&D for the employee and dependents)
  • Short and long-term disability
  • Health Spending Account (HSA)
  • Transportation benefits
  • Employee Assistance Program
  • Time Off/Leave (PTO, Vacation or Sick Leave)

Workplace Type

This is a fully remote position.

Application Deadline

This position is anticipated to close on Dec 5, 2025.

About TEKsystems:

We're partners in transformation. We help clients activate ideas and solutions to take advantage of a new world of opportunity. We are a team of 80,000 strong, working with over 6,000 clients, including 80% of the Fortune 500, across North America, Europe and Asia. As an industry leader in Full-Stack Technology Services, Talent Services, and real-world application, we work with progressive leaders to drive change. That's the power of true partnership. TEKsystems is an Allegis Group company.

The company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.

About TEKsystems and TEKsystems Global Services

We're a leading provider of business and technology services. We accelerate business transformation for our customers. Our expertise in strategy, design, execution and operations unlocks business value through a range of solutions. We're a team of 80,000 strong, working with over 6,000 customers, including 80% of the Fortune 500 across North America, Europe and Asia, who partner with us for our scale, full-stack capabilities and speed. We're strategic thinkers, hands-on collaborators, helping customers capitalize on change and master the momentum of technology. We're building tomorrow by delivering business outcomes and making positive impacts in our global communities. TEKsystems and TEKsystems Global Services are Allegis Group companies. Learn more at TEKsystems.com.

Job Tags

Contract work, Temporary work, For contractors, Remote work
