top of page

Career

Platform Operation Engineer

Technology

|

Permanent

Technology

Permanent

About Us

Do you want to be part of Thailand banking transformation? Data is the core of the new financial services era, and we are open for the opportunity to be part to drive this change at the core. 

SCB DATAx is a new venture of the Siam Commercial Bank (SCB) holdings, a leading financial services and digital services holdings in Thailand and ASEAN. As part of the transformation of SCB into a group of product and technology companies, under the SCBx brand, SCB DATAx is the technology company to centralize data and provide AI and data science services and products to the group. With a leading-edge cloud native data & AI platform, our vision is to support the group to providing everyone in our region with the opportunity to prosper. 

We work on forward-thinking challenges of centralizing, analyzing and sharing information. We collaborate with companies and experts in many different domains, embrace diversity and all that while having a good laugh and joy in work.

Discover job openings on our career page. To apply, email with the role's title as the subject, attach your CV, and specify your contact information. We're eager to learn more about you.

 I acknowledge that I have read and agreed to DataX's Terms and Conditions and Privacy Notice

Benefits

Other

Preferred Qualifications

  • Technical Stack:

    • Azure Services: Azure Databricks, Azure Kubernetes Service (AKS), Azure Monitor, Azure Log Analytics, Azure Storage, Azure SQL Database, Azure Active Directory, Azure VPN Gateway, Azure Traffic Manager, Azure Front Door, Azure API Management, Azure Service Bus

    • Monitoring and Logging Tools: Azure Monitor, Grafana, Prometheus, ELK Stack, Application Insights

    • CI/CD and Automation Tools: Azure DevOps, Jenkins, Ansible, Terraform, Kubernetes Operators

    Security Tools: Azure Security Center, Azure Sentinel, Palo Alto NextGen Firewall, HashiCorp Vault

Qualifications

  • Education:

    • Bachelor’s degree in Computer Science, Information Technology, or a related field. Relevant certifications (e.g., Azure Administrator, Azure DevOps Engineer, Certified Kubernetes Administrator) are a plus.

    Experience:

    • Minimum of 3 years of experience in platform operations, preferably in cloud environments with Azure Databricks and AKS.

    • Proven experience in managing service level agreements (SLAs) and ensuring platform availability and performance.

    • Strong knowledge of Azure services, including Databricks, AKS, and related infrastructure components.

    • Experience with monitoring tools, incident management processes, and customer support practices.

    Skills:

    • Strong customer support mindset with excellent communication and interpersonal skills.

    • Ability to troubleshoot and resolve technical issues efficiently and effectively.

    • Proficiency in managing cloud platforms and services, with a focus on Azure.

    • Experience with automation and orchestration tools to improve operational efficiency.

    • Strong analytical and problem-solving skills to identify and address platform issues.

    • Accountability and ownership of platform operations and service delivery.

    • Ability to work collaboratively with cross-functional teams in a dynamic environment.

Responsibilities

  • Platform Management and Support:

    • Monitor and maintain the Azure Databricks data platform and AKS clusters to ensure high availability, performance, and reliability for internal users.

    • Perform routine operational tasks, including maintenance, upgrades, backups, and disaster recovery procedures for Azure services and platforms.

    • Troubleshoot and resolve technical issues related to Azure Databricks and AKS, providing timely updates to stakeholders and users.

    Service Availability and SLA Management:

    • Establish and manage SLAs for platform availability, performance, and incident response times.

    • Monitor service performance against SLAs and generate regular reports for stakeholders.

    • Implement measures to improve service availability and reduce downtime, including proactive monitoring and preventive maintenance.

    Incident and Service Request Management:

    • Serve as the first point of contact for service requests and incidents related to the platforms.

    • Manage the incident resolution process, coordinating with cross-functional teams to quickly address and resolve issues.

    • Document incidents, their root causes, and resolutions to improve incident management processes and prevent recurrence.

    Customer Support and Communication:

    • Provide exceptional customer support to internal users, ensuring their needs are met and concerns are addressed promptly.

    • Communicate effectively with users about platform status, incidents, and planned maintenance activities.

    • Gather and analyze user feedback to identify areas for platform improvement and new features.

    Collaboration and Process Improvement:

    • Collaborate with development, security, and infrastructure teams to ensure seamless platform operations and alignment with business goals.

    • Participate in the development and refinement of platform operation processes and procedures to enhance efficiency and effectiveness.

    • Stay updated on emerging technologies and best practices in platform operations and cloud services to recommend improvements.

About Team & Role

We are seeking a highly skilled and customer-focused Platform Operation Engineer to join our IT team. The ideal candidate will be responsible for ensuring the availability, performance, and reliability of our Azure Databricks data platform and Azure Kubernetes Service (AKS) for core GenAI and LLM applications serving internal users. This role requires a strong customer support mindset, accountability, and operational excellence in managing service requests, incidents, and service level agreements (SLAs).

bottom of page