Reports To : Director of Cloud Infrastructure

About the Role

We are seeking a Senior Technical Architect to drive architecture, design, and technology strategy for our enterprise-level revenue optimization and performance management platform. The ideal candidate will bring deep experience in cloud-native architecture, distributed systems, and modern application frameworks, with proven expertise in security, scalability, integrations, and enterprise data processing pipelines.

As a senior leader, you will collaborate with engineering, DevOps, InfoSec, product, and business stakeholders to ensure our platform is resilient, secure, compliant, and scalable while supporting our roadmap for growth and innovation.

Key Responsibilities

Architecture & Design

  • Cloud-Native Architecture: Expertise in designing AWS-based cloud architectures for scalability, high availability, and cost optimization (EKS, EMR, RDS, Redshift, S3, Lambda, VPC, IAM).
  • Microservices & API Design: Strong experience in microservices architecture, service decomposition, API gateway patterns, REST, GraphQL, gRPC, and event-driven messaging (Kafka, SQS, SNS).
  • Data Architecture: Ability to design data pipelines, ETL/ELT frameworks, data lakes, data warehouses, and distributed processing systems, ensuring data quality, schema evolution, and reconciliation.
  • Security & Compliance by Design: Embed security, access control, encryption, and compliance (SOC2, PCI, GDPR, ISO 27001) into all layers of architecture.
  • Scalability & Resilience: Design systems that handle geo-distributed deployments, multi-tenancy, auto-scaling, failover, and disaster recovery.
  • Observability & Monitoring: Define logging, monitoring, alerting, and performance tuning standards for applications and infrastructure.
  • CI/CD & Deployment Architecture: Design pipelines that enforce code quality, automated testing, versioning, and secure deployments across environments.
  • Technical Documentation & Decision Records: Clearly document architecture diagrams, design decisions, trade-offs, and rationale for stakeholders and auditors.
  • Future-State Roadmapping: Ability to plan evolutionary architecture and modernization strategies, including monolith-to-microservices migration, cloud adoption, and AI/ML integration.
  • Performance & Cost Optimization: Design for efficient compute, storage, and network usage while maintaining required SLA, latency, and throughput.

Qualifications & Skills

  • 10+ years of experience in enterprise software engineering and architecture.
  • Strong expertise in AWS Cloud services including EKS, EMR, RDS, Redshift, S3, Glue, Lambda, VPC, and IAM.
  • Proven experience in microservices architecture, API design (REST, GraphQL, gRPC), and event-driven systems (Kafka, SQS, SNS).
  • Deep expertise in data pipelines, ETL/ELT, data lakes, data warehousing, and distributed processing.
  • Experience with containerization and orchestration (Docker, Kubernetes, Helm, service mesh).
  • Strong understanding of security architecture, IAM, OAuth2.0, OIDC, SAML, Auth0, and compliance frameworks (SOC2, PCI-DSS, GDPR, ISO 27001).
  • Proficiency in DevOps, CI/CD, GitOps, Jenkins, ArgoCD, GitHub Actions, and Infrastructure-as-Code (Terraform, Ansible, Pulumi).
  • Expertise in observability, monitoring, logging, tracing, and performance tuning using New Relic, OpenTelemetry, Prometheus, Grafana, ELK/EFK.
  • Extensive experience in database design and management: relational (MySQL, Postgres), NoSQL (DocumentDB, DynamoDB), and data warehouse (Redshift, Snowflake).
  • Experience designing geo-distributed, multi-tenant, high-availability, and resilient SaaS architectures.
  • Familiarity with frontend frameworks (Angular, React) ,Backend Framework (Java Spring Boot) and mobile application architecture.
  • Strong skills in architectural governance, technical debt management, and future-state roadmap planning.
  • Understanding of AI/ML workflows for anomaly detection, KPI forecasting, and optimization.
  • Expertise in cost optimization, scalability, disaster recovery, and high-performance infrastructure design.
  • Hands-on experience with software lifecycle best practices, agile methodologies, and code quality governance.
  • Ability to evaluate tools, frameworks, and platforms for strategic enterprise adoption.
  • Strong analytical, problem-solving, and decision-making skills with the ability to balance trade-offs.

Reports To : Director of Cloud Infrastructure

We are looking for a DevOps Engineer to be responsible for our infrastructure and deployments in our Multi-cloud environments. As a member of our engineering team, you will be in involved all things DevOps/SysOps/MLOps. You’ll be responsible for planning and building tools for system configuration and provisioning. This role also will be responsible for maintaining any required infrastructure SLAs both internal and external to the business. Our team is extremely collaborative. Interested candidates must be self-motivated, willing to learn, and willing to share new ideas to improve our team and process.

Responsibilities:

  • Performs technical maintenance of the configuration management tools and release engineering practices to ensure technical changes are documented, comply with standard configurations, and are sustainable.
  • Designs develops, automates, and maintains tools using an automate-first mindset to improve the quality and repeatability of software and infrastructure configuration development and deployment.
  • Will train software developers and system administrators in the use of pipeline tools and the implementation of quality standards.
  • Oversee integration work & provide automated solutions in support of multiple products.
  • Provide technical leadership, lead code reviews and mentor other developers.
  • Build systems that dynamically scale.
  • Plan deployment.

Requirements:

  • Experience with AWS and GCP.
  • Hands of experience in Kubernetes (at least 2years of K8s experience.)
  • Minimum 3+ years experience with Unix based systems.
  • Working knowledge of Ansible, or other Configuration Management.
  • Experience in leading scripting tools (Python/Ruby/Bash etc).
  • Experience with Jenkins or Cloud Native CI/CD.
  • Strong scripting and automation skills.
  • Solid understanding of web applications.
  • Experience in Windows and LInux Automations using ANsible or similar.
  • Excellent hands on skill in Terraform and CloudFormation.

Great to have:

  • Experience with Terraform
  • Experience with Azure
  • AWS Solution Arch (Pro) or DevOps Engineer (Pro)
  • Experience with continuous deployments (CD)
  • Experience with cloud-based autoscaling and elastic sizing
  • Experience with relational database administration and SQL
  • Experience with Redis, MongoDB, Memcached, Cassandra, or other non-relational storage

Reports To: Director of Cloud Infrastructure

About the Role.

We’re seeking a skilled Prompt Engineer specializing in Kubernetes and platform engineering tools to design and optimize prompts that enable Large Language Models (LLMs) to automate complex container orchestration and infrastructure management tasks. You will create precise, context-rich prompts that guide AI models to generate Kubernetes manifests, manage deployments, and interact with platform engineering workflows, boosting developer productivity and operational reliability.

Your work will bridge AI and cloud-native infrastructure, enabling seamless AI-driven automation for Kubernetes clusters, Helm charts, CI/CD pipelines, and platform tooling.

What You’ll Do

  • Develop and refine prompts that instruct LLMs to generate, validate, and optimize Kubernetes YAML manifests, Helm charts, and platform automation scripts.
  • Apply advanced prompt engineering techniques such as zero-shot, few-shot, and chain-of-thought prompting tailored for infrastructure-as-code and container orchestration contexts.
  • Collaborate with DevOps, SRE, and platform engineering teams to understand deployment patterns, best practices, and pain points to craft domain-specific prompt templates.
  • Integrate prompts with AI orchestration frameworks (e.g., LangChain, AutoGen) and Kubernetes management tools to enable autonomous or semi-autonomous platform operations.
  • Continuously evaluate prompt outputs for accuracy, security, and compliance with Kubernetes best practices (e.g., pod scheduling, resource quotas, readiness/liveness probes).
  • Document prompt designs, usage guidelines, and best practices to empower platform teams and AI developers.
  • Stay up-to-date with Kubernetes ecosystem advancements and AI-driven infrastructure automation trends.

Required Skills & Experience

  • Proven experience with prompt engineering for LLMs (OpenAI GPT-4.x, Anthropic Claude, Google Gemini, etc.) especially applied to Kubernetes or cloud infrastructure automation.
  • Strong understanding of Kubernetes architecture, deployment best practices (Helm, taints/tolerations, autoscaling, probes), and platform engineering workflows.
  • Familiarity with infrastructure-as-code tools (Helm, Terraform, Kubernetes manifests) and container orchestration concepts.
  • Proficiency in Python or TypeScript for scripting and integrating AI prompts with platform tooling.
  • Experience with AI orchestration frameworks such as LangChain, AutoGen, or Semantic Kernel.
  • Knowledge of vector databases (Pinecone, Weaviate, Chroma) and semantic search to enhance prompt context retrieval.
  • Ability to craft clear, positive, and domain-specific prompts that reduce ambiguity and improve AI output quality.
  • Understanding of security and compliance considerations in cloud-native environments.

Preferred Tools & Technologies

CategoryTools & Frameworks
LLM APIsOpenAI GPT-4.x, Anthropic Claude 3.x, Google Gemini 2.5, Cohere Command R
Prompt EngineeringLangChain, AutoGen, Semantic Kernel, PromptLayer, LangSmith
Kubernetes Toolskubectl, Helm, Kustomize, Terraform
Vector DatabasesPinecone, Weaviate, Chroma
OrchestrationLangChain Agents, AutoGen, crewAI
DevOps & CloudDocker, Kubernetes, AWS, GCP, Azure, CI/CD (GitHub Actions, Jenkins)
ObservabilityPrometheus, Grafana, Kube-state-metrics

Why Join Us?

  • Work at the intersection of AI and cloud-native technologies to redefine platform automation.
  • Collaborate with experts in AI, DevOps, and platform engineering to build innovative solutions.
  • Influence the future of autonomous infrastructure management powered by prompt engineering.
  • Access to cutting-edge AI tools and continuous learning opportunities.

This role is ideal for prompt engineers passionate about Kubernetes and platform engineering who want to leverage LLMs to automate and optimize cloud infrastructure management through expert prompt design and AI orchestration.

Reports To: Director of Cloud Infrastructure

Role Overview:
Join our Core Kubernetes Operator Development team, where we’re pushing the boundaries of Kubernetes innovation. As a Kubernetes Controller Developer (Golang), you will play a crucial role in building “01”, our cloud-agnostic Platform as a Service (PaaS), driven by full-fledged Kubernetes operators and agents.

This position requires a strong background in Kubernetes internals and Golang programming, particularly in developing and managing Kubernetes controllers. If you’re a proactive problem solver with experience in building cloud-native infrastructure, this is your opportunity to contribute to a transformative platform.

We highly encourage candidates with a solid programming foundation and a hunger to explore the cloud-native world to apply. Comprehensive onboarding and professional development support will be provided.

Key Responsibilities (Not limited to):

  • Collaborate in Agile teams, taking ownership of development stories with minimal supervision.
  • Partner with internal teams and clients to accurately capture technical requirements.
  • Design, build, deploy, and maintain Kubernetes controllers and operators using Golang.
  • Identify gaps in current systems and propose or implement technical improvements.
  • Apply best practices across the full software development lifecycle.
  • Create and execute unit, regression, and E2E tests for operator reliability.
  • Work in Linux environments and troubleshoot issues in containerized applications.
  • Contribute to CI/CD workflows for seamless testing and deployment.

Essential Skillset:

  • Kubernetes Controller Development: Proven expertise in building and maintaining controllers and operators.
  • Proficiency in Golang: 2+ years writing idiomatic, well-tested Go code for Kubernetes projects.
  • Deep understanding of Kubernetes APIs and libraries including client-go, CRDs, and API extensions.
  • Hands-on experience with:
    • Kubebuilder – For scaffolding controllers and CRDs
    • Operator SDK – For building Operators with OLM support
    • controller-runtime – For abstracting Kubernetes client logic
  • Strong testing skills, including unit, load, and E2E tests for operators.
  • Familiarity with containerization (Docker) and orchestration (Kubernetes).
  • Comfortable working in Linux with debugging tools and CLI.
  • 2+ years experience working with CI/CD tools like Jenkins, GitHub Actions, Tekton, or similar.

Preferred Skills (Nice to Have):

  • CKA or CKAD certifications.
  • Hands-on experience managing production-grade Kubernetes clusters.
  • Knowledge of Infrastructure as Code tools (e.g., Terraform).
  • Exposure to major cloud providers: AWS, GCP, or Azure.
  • Scripting experience in Shell or Python.

What We Offer:

  • A chance to build infrastructure automation tools that power real-world workloads.
  • Opportunity to work on bleeding-edge cloud-native technologies with a global impact.
  • Collaborative and innovation-driven culture, with strong engineering mentorship.
  • Remote-friendly setup and flexible work culture.
  • Career development in one of the most in-demand areas of DevOps.

Reports To: Director of Cloud Infrastructure

About the Role.

You will be part of our Platform Engineering team that takes GitOps and 100% (Infra as Code) IaC seriously. You will be involved in development of a Cloud Agnostic  Platform as a service (PaaS) product, we called it 01, that leverages Kubernetes. Our product aims to reduce significant time to roll out the Cloud Native application across the managed or  on premise.  This role requires Kubernetes experiences with strong programming skills.

Responsibilities:

  • Work in an Agile team and be able to take ownership of stories with minimal direction
  • Work closely with our clients, understand and capture their requirements
  • Able to see gaps and areas of improvement in process as well as technologies, providing recommendations and taking the initiative to fix issues
  • Quick learner and able to adapt to new technologies and teams quickly
  • People who can interact well in both group and one-to-one set-up Experience with software development lifecycle.

Primary Skillset:

  • Experience in cloud and container solutions such as Docker and Kubernetes 
  • Familiar with AWS, GCP or Azure
  • Experience in Linux Environment and debugging tools (2+ years)
  • Working experience in Linux script writing (Shell, Python etc) (2+ years)
  • Familiar with AWS, GCP or Azure (2+ years)
  • Familiar with at least one CI/CD tool: Jenkins, Gitlab CI (2+ years)
  • Experience in GOLANG (1+ years)

Secondary Skillset (Optional – nice to have)

  • Additional certification such as CKA (Certified Kubernetes Administrator), CKAD (Certified Kubernetes Application Developer) etc will be a bonus
  • Experience in containers, registries and microservices build using Springboot framework would be an added advantage
  • Experience in creating and managing production scale Kubernetes clusters
  • Experience with Terraform

Reports To:  Director of Cloud Infrastructure

We’re looking for a SecOps Engineer to help us get to cyber resiliency in our infrastructure. You will be responsible for ensuring security standards are set and adhered to for operational excellence in the areas of availability, productivity, capacity, and efficiency.

Responsibilities:

  • Define & execute on company’s strategy for a secure infrastructure, applications and third party vendors
  • Understanding the tooling behind the security
  • Analyse, identify and remediate security weaknesses in cloud infrastructure
  • Build cyber resiliency in our cloud infrastructure
  • Consult on security-critical infrastructure and system features
  • Championing improvements for a company’s security controls, identifying automation opportunities and tools that could improve the ability to detect and react to events
  • Internal infrastructure network testing, mainly within Kubernetes clusters
  • Vulnerability assessment (VMs, container images)
  • Container runtime security
  • Web application security testing (ability to identify vulnerabilities within the OWASP Top Ten)
  • Static Application Security Testing (SAST) and DAST (Dynamic Application Security Testing); mainly automated but some manual work required to eliminate false positives
  • Mobile Application Security Testing (MAST)
  • Participate in security code reviews
  • Develop KPIs to show how many issues found and how many issues fixed
  • Participate in security architecture reviews
  • Assist with automation to help development and operations consume security service

Requirements:

  • Experienced with Application Security Tools (SAST, DAST, SCA)
  • Knowledge on threat modelling and security design review methodologies
  • Promoting security knowledge sharing within technical organisations
  • Assisting in the design of enhancements to the cloud security strategy by identifying and alerting on appropriate event types
  • Managing CI/CD security strategy with integration of Security as Code (SaC) and Policy as Code (PaC)

Great to have:

  • Ability to carry out manual code security review (this would mean some knowledge of the Javascript, Java, PHP, Go, C#, Python, Terraform, HTML, XML and CSS languages)
  • Experience in helping an organisation to meet the PCI DSS and SOC-2 compliance standards
  • Experience in Kubernetes
  • AWS/GCP exposure.

Qualification:

  • Bachelor’s or master’s degree in Computer Science, Information Technology, or a related field, or 1 years equivalent practical Information Security professional experience.
  • 1+ years of Information Technology experience with a focus on Security.
  • 1+ years of experience in Cloud Administration, Networking, or another operations-related field.
  • Candidates should have exceptional troubleshooting and problem-solving skills.
  • Experience engineering and operating public-facing infrastructure
  • Knowledge with integrating crucial security tasks into CI/CD pipelines

Get the latest BerryBytes updates by subscribing to our Newsletter!

  • Home
  • About
  • Products
  • Services
  • Careers
  • Contact