Jobs in Kuala Lumpur
Malaysia
Browse 5 job opportunities in Kuala Lumpur, Malaysia.
Job Description: At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. We do this by driving Responsible Growth and delivering for our clients, teammates, communities and shareholders every day. Being a Great Place to Work is core to how we drive Responsible Growth. This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates' physical, emotional, and
The Role
Pave Bank is building the future of programmable banking, combining traditional banking with digital assets under a single, regulated platform. We're looking for a Site Reliability Engineer (SRE) to ensure our core systems are highly available, scalable, and performant as we grow.
As an SRE at Pave Bank, you'll work closely with Engineering, Product, Security and Operations teams to build robust infrastructure, automate operations, and maintain reliability across all services. Your work will directly impact the safety, performance, and scalability of our banking platform, helping our customers trust Pave Bank with their finances.

What You'll Be Doing
Monitor, maintain, and improve the reliability, availability, and performance of production systems and services.
Build and maintain infrastructure as code (IaC), deployment pipelines, and automation to support continuous delivery, scalability, and disaster recovery.
Respond to incidents, perform root-cause analysis, and drive postmortems to ensure lessons learned are applied.
Implement and enforce operational best practices: observability, logging, metrics, alerting, capacity planning, failover strategies, and backups.
Collaborate with Engineering, Product, Compliance, and Operations teams to ensure infrastructure meets reliability, compliance, and security standards.
Support service scaling, database operations, cloud infrastructure (GCP preferred), networking, and microservices orchestration.
Document operational runbooks, on-call procedures, and system architecture to support maintenance, knowledge sharing, and compliance.

What You'll Bring

Technical Skills and Experience
Strong programming or scripting skills (Go, Python, Bash, or similar) for automation, tooling, and operational tasks.
Hands-on experience with cloud infrastructure, ideally Google Cloud Platform (GCP).
Familiarity with containerization and orchestration (Docker, Kubernetes, or equivalent).
Experience with infrastructure-as-code tools (Terraform, Cloud Deployment Manager, or similar).
Experience with either FluxCD or ArgoCD for GitOps-based delivery.
Solid understanding of distributed systems, microservices architecture, and reliability patterns.
Experience setting up monitoring, logging, alerting, and observability (e.g., Prometheus, Grafana, ELK, distributed tracing).
Strong troubleshooting skills and ability to respond to incidents under pressure.
Knowledge of backup and disaster recovery strategies, database management, and secure operations.

Other Skills
Ownership mindset: proactive, responsible, and committed to system reliability.
Strong communication skills: able to coordinate across technical and non-technical stakeholders.
Comfortable working in a fast-paced, early-stage startup environment.
High integrity, attention to detail, and passion for fintech and programmable banking systems.

Nice to Have
Prior experience in fintech, banking, or other highly regulated industries.
Familiarity with compliance, security, and data protection best practices.
Experience with high-availability, high-throughput systems, or financial infrastructure.
Exposure to blockchain or crypto systems integrated with banking.
Experience optimizing cloud infrastructure for cost and performance under rapid growth.

Why Pave Bank?
Work alongside a founding team from Monzo and BigPay, bringing top-tier fintech expertise.
Tackle real-world reliability challenges in a regulated, fast-growing fintech environment.
Learn from and collaborate with experienced engineers while developing your SRE career.
Competitive salary and meaningful equity with room for growth.
Be part of a well-funded startup shaping the future of programmable banking.
ABOUT YOU
We are looking for an Operations Engineer who is technically curious, detail-oriented, a strong communicator, and proactive to join our Global Technical Operations (GTO) team. The best candidate will be someone who thrives in a fast-paced, highly collaborative, and exceptionally dynamic setting and is excited to monitor and investigate production issues across a global platform, help improve how we detect and respond to incidents, analyze trends and patterns in production data, and contribute to better communication with partners and stakeholders during incidents. Strong troubleshooting skills, observability platform experience, and scripting ability are essential, along with experience in SRE, DevOps, production operations, or NOC environments supporting high-availability platforms (payments, e-commerce, SaaS, or gaming). The ability to communicate clearly and effectively in English, both written and verbal, when writing incident updates, shift handoffs, and status page communications will be key to your success in this role. If you're passionate about keeping critical systems running and continuously improving operational processes and love being the first to spot issues and the one who drives them to resolution for game developers and players worldwide, we would love to hear from you!

Operations Engineer, Kuala Lumpur

ABOUT US
Xsolla is a global commerce company with robust tools and services to help developers solve the inherent challenges of the video game industry. From indie to AAA, companies partner with Xsolla to help them fund, distribute, market, and monetize their games. Grounded in the belief in the future of video games, Xsolla is resolute in the mission to bring opportunities together, and continually make new resources available to creators.
Headquartered and incorporated in Los Angeles, California, Xsolla operates as the merchant of record and has helped more than 1,500 game developers to reach more players and grow their businesses around the world. With more paths to profits and ways to win, developers have all the things needed to enjoy the game. For more information, visit xsolla.com.

Responsibilities:
Serve as the primary dashboard monitor during your shift: continuously watch the GTO Operational Dashboard in Datadog, detect anomalies by correlating signals across APM, logs, metrics, synthetic tests, and Real User Monitoring, and determine whether alerts warrant an incident ticket or can be resolved through immediate investigation.
Triage and investigate production incidents: create incident tickets in JIRA Service Management, perform initial technical investigation using Datadog (traces, logs, infrastructure and application metrics), determine blast radius and likely root cause domain, and route to the correct team (Product SRE, Infrastructure SRE, or Engineering) using the smart routing model.
Own lower-severity incidents end-to-end from detection through resolution: diagnose, execute runbook procedures, and resolve without escalation where possible. Escalate promptly when an incident is unresolved within defined thresholds or requires a code-level fix.
Support the TSO Lead during major incidents as the technical right hand in the war room: surface real-time data (error rates, impact scope, deployment history, related alerts), maintain the incident ticket with live timeline entries and linked evidence, and execute mitigation actions as directed.
Draft incident communications under TSO Lead direction, including internal Slack updates, stakeholder notifications, and customer-facing status page updates (status.xsolla.com). Support clear, timely communication throughout the incident lifecycle.
During non-incident periods, analyze incident trends, recurring issues, and production bugs: compile data from Datadog, JIRA, and Slack, identify patterns, and contribute findings to regular reports for product and engineering teams. Publish health reports of critical apps periodically.
Compile incident timelines and draft initial PIR documents for Post-Incident Review preparation. Track PIR action items post-session and flag overdue items to the TSO Lead.
Build and maintain operational automation (alert enrichment scripts, incident templates, Slack workflows, dashboard widgets) and contribute to runbook development, documenting new resolution procedures so they can be repeated by any Operations Engineer on any shift.
Conduct structured shift handoffs covering active incidents, at-risk services, upcoming deployments, and follow-up items.
Participate in knowledge transfer sessions with SREs to continuously expand independent resolution capability.
Cover for the TSO Lead during vacations, absences, or emergencies, including severity classification, escalation decisions, stakeholder communications, and basic Incident Commander functions.

Qualifications:
4+ years of experience in SRE, DevOps, production operations, NOC, or technical operations in a high-availability environment. Experience with platforms that handle payments, e-commerce, SaaS, or gaming workloads is preferred.
Strong troubleshooting and investigation skills: ability to take an alert or user-reported symptom and methodically trace it through the stack (application logs, APM traces, infrastructure metrics, database queries, and network paths).
Hands-on experience with Datadog (or equivalent observability platform: Grafana, Splunk, New Relic, Elastic): navigating APM, building log queries, reading infrastructure dashboards, interpreting SLO burn rates, and configuring monitors and alerts.
Proficiency in at least one scripting language: Python, Go, or Bash. You will write automation scripts, build operational tooling, and work with APIs.
Clear written and verbal communication skills in English: ability to write incident tickets, investigation notes, Slack updates, shift handoff reports, status page communications, and PIR drafts that are clear, concise, and useful to both technical and non-technical audiences.
Working knowledge of Kubernetes and cloud infrastructure (GCP preferred, AWS/Azure acceptable): understanding of pods, deployments, services, ingress, node health, and how to investigate Kubernetes-related production issues.
Understanding of SLOs, error budgets, and burn-rate alerting: knowing what a multi-window burn-rate alert means, how error budgets deplete, and how SLO breaches translate into incident severity.
Experience with incident management tooling: JIRA or JIRA Service Management, PagerDuty or OpsGenie, Slack, and Confluence.
Experience with or strong interest in AI/ML-assisted operations: anomaly detection, alert correlation, predictive monitoring, or automated remediation.
Comfort with 24x7 shift-based operations as part of a follow-the-sun model with handoff overlaps. Weekend on-call (rotating) is required.

Nice to have:
Experience in the gaming, payments, or fintech industry, particularly environments where transaction processing, checkout flows, or player-facing services must meet strict uptime requirements.
Familiarity with Datadog Service Catalog, synthetic monitoring, and RUM (Real User Monitoring).
Experience with distributed systems debugging: tracing failures across microservices, understanding cascading failures, and reading distributed traces end-to-end.
Exposure to database operations (MySQL, PostgreSQL, Redis, Kafka) at a level sufficient to investigate connection pool exhaustion, replication lag, slow queries, or queue backlogs during incidents.
Familiarity with CI/CD pipelines and deployment tooling (GitLab CI, ArgoCD, Helm), enough to correlate recent deployments with production issues and identify rollback targets.
JIRA Service Management administration experience: workflows, automation rules, SLA timers, and queues.
ITIL Foundation certification is a plus but not required; practical experience matters more.

RM144,000 - RM216,000 a year

BENEFITS
Convenient work tools:
Latest Mac workplaces + additional hardware to make you more effective at work
Google Chat, Gmail, Google Drive, Confluence, Jira, GitLab
Professional growth:
Free trainings and participation in specialized conferences
Rich knowledge exchange within the company
More perks:
Health insurance (medical, dental and optical) for employee and dependants
Flexible hours: organize your day according to your needs and sprint & teamwork demands
No dress code
Comfortable and new office environment

The duties of this position may change from time to time so the individual and organization can achieve their results. This job description is intended to describe the general level of work being performed. It is not intended to be all-inclusive.
By submitting your application, you consent to Xsolla conducting background checks, where permitted by law, after the final interview stage. All checks will comply with local regulations, and your information will be handled confidentially.
Xsolla KL Sdn Bhd takes your privacy very seriously, and will not sell or externally distribute any data received during the hiring process. Pursuant to the Personal Data Protection Act 2010 ("PDPA"), Xsolla KL Sdn Bhd is mindful and committed to the protection of your personal information and your privacy. Please direct any inquiries regarding your data privacy to careers@xsolla.com.
For more vacancies: Careers | Xsolla
At Lalamove, we believe in the power of community. Millions of drivers and customers use our technology every day to connect with one another and move things that matter. Delivery is what we do best and we ensure it is always fast and simple. Since 2013, we have tackled the logistics industry head on to find the most innovative solutions for the world's delivery needs. We are full steam ahead to make Lalamove synonymous with delivery and on a mission to impact as many local communities as we can. We have massively scaled our efforts across Asia and now have our sights on taking our best-in-class technology to the rest of the world. And we are looking for talented professionals to join us in this journey!
As a Senior Data Engineer at Lalamove, you will be joining the global Data team as a key member of our expanding technology team in our new market. Due to the importance of user privacy and our commitment to compliance laws, we need an additional engineer to support our operations in the expanding market, while collaborating closely with our global engineering team.

What you'll do:
Provide production support and incident response for our data platform in the expanding market.
Support and troubleshoot technical issues, including the data pipelines running on top of the data platform.
Collaborate with a geographically dispersed team of engineers to support compliance for the expanding market.
Support ad hoc requests related to expanding market data and operations.

What you'll need:
Legally permitted to work in Malaysia
5+ years of relevant experience in data engineering
Experience in supporting Big Data operations
Proficiency in SQL
Hands-on experience in Linux systems and command line operations
Experience in Java and Spring Boot framework
Good command of English; fluency in Mandarin is a plus

To all candidates: Lalamove respects your privacy and is committed to protecting your personal data. This Notice will inform you how we will use your personal data, explain your privacy rights and the protection you have under the law when you apply to join us. Please take time to read and understand this Notice.
Candidate Privacy Notice: https://www.lalamove.com/en-hk/candidate-privacy-notice