Suresh Raju

Scalable infrastructure & site reliability

Twenty-five years of experience in scalable cloud infrastructure, distributed systems, product development, technology management and customer success across America, Europe and Asia.

I build and stabilise data pipelines and platform infrastructure that drive customer adoption. I've mentored and led engineering teams to deliver on problems of performance, scale, reliability and data management — from early-stage startups to global enterprise platforms.

Nov 2023 – present Cloudera

Platform Engineering

  • Built scalable performant infrastructure on top of Kubernetes supporting the cloudera enterprise AI platform with additional focus on FedRamp.
  • Mentored teams and leading the adoption of Istio service mesh across different form factors of the platform.
  • Collaborated on cross-functional platform initiatives, driving company-wide adoption of standardized tooling across engineering teams and improving developer productivity metrics.
Apr 2021 – Jun 2023 Mux, San Francisco

Platform & Infrastructure

  • Built and scaled the different parts of the core platform on self managed Kubernetes upon which the video and data platforms were built to power the largest live streaming sporting events.
  • Helped scale the video analytics platform support 40 Million concurrent viewers backed by Envoy.
  • Part of the team developing and supporting an in-house Starlark based DSL for Kubernetes.
  • Helped the company attain ISO27001 certification.
  • Worked closely with other engineering teams to deliver a stellar developer experience and increased productivity.
Jun 2020 – Apr 2021 Optimizely, San Francisco

Site Reliability Engineering

  • Owned performance, reliability, CDN, DNS, data infrastructure and automation.
Jun 2019 – Feb 2020 Reltio, Redwood Shores

Senior Manager, Platform & Infrastructure

  • Built and managed large-scale infrastructure for a cloud-native MDM SaaS platform on AWS and GCP.
  • Responsible for automation, scale, performance and observability of the cloud infrastructure and software stack — Cassandra and Elasticsearch.
  • Led a distributed team of infrastructure engineers. Implemented SRE principles for 24×7 support operations.
Dec 2015 – May 2019 Oracle, Redwood Shores

Senior Manager, Oracle Management Cloud

  • Built the DevOps/SRE function for a monitoring and analytics platform deployed globally. Owned observability for CDN, EUM, APM, logs, security and monitoring.
  • Built scalable and performant infrastructure for accelerated promotion of services to production.
  • Full ownership of deployment, process automation and observability across the entire stack including Hadoop and Kafka clusters.
Nov 2005 – Nov 2015 Oracle / Sun — Bay Area, Prague, Bangalore

Technology Management, Open Source & Engagements

  • Led and executed data centre management projects across complex enterprise environments.
  • Defined and managed crisis engagements that resulted in millions of dollars in retained revenue.
  • Official committer for Eclipse on Solaris x86 (Sun Microsystems).
  • Represented Sun at conferences and universities on data centre management.
Jun 1999 – Oct 2005 Tata Infotech, Bangalore / Chicago

Lead Systems Engineer

  • Defined, managed and executed statements of work for Ascom, Qualcomm and World Book.
  • Liaised between on-site teams and offshore technology development.

Infrastructure

EnvoyIstioKubernetesLinuxSolaris

Cloud

AWSAzureGCPOCI

Platform

ArgoCDHarnessHelmKarpenterTerraformVault

Data & Streaming

ELKKafkaFlinkRedisSolrZookeeper

Edge & Networking

AkamaiCloudflareSecurityNetworking

Languages

GoJavaPythonPerlShell

Master of Computer Science & Applications

PSG College of Technology, Coimbatore