Platform Data Analyst - AI Media-Creation

New
R
RunwareAI media-creation platform
United KingdomFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page

Job Details

Required Skills
GrafanaPrometheusDatadogData analytics

Requirements

  • Strong experience with data analytics, observability, or monitoring
  • Hands-on with metrics/logging/tracing frameworks (Prometheus, Grafana, Datadog, New Relic, etc.)
  • Good understanding of backend systems and distributed architectures
  • Ability to turn raw metrics into actionable insights
  • Experience building dashboards for internal and external stakeholders
  • Familiarity with AI model monitoring (latency, throughput, error codes, GPU utilization)
  • Experience with AI/ML infrastructure, inference pipelines, GPUs (nice-to-have)
  • Understanding of Python APIs, FastAPI, or Node environments (nice-to-have)
  • Experience working with high-throughput real-time systems (nice-to-have)
  • Startup or scale-up experience (nice-to-have)
  • Problem-solver mindset
  • Proactivity — digging into data and flagging problems
  • Ability to work with ML, backend, DevOps, and product teams
  • Comfort with autonomous ownership

Responsibilities

  • Build and maintain E2E inference time tracking (global and per-model)
  • Monitor how implementation changes impact total request latency
  • Detect regressions introduced by suboptimal code paths
  • Provide automated alerts & historical trends
  • Build dashboards for internal use (engineering, product, leadership)
  • Provide client-facing usage dashboards (requests, errors, success rate, performance)
  • Support clients who need visibility to debug their integrations
  • Track model-level usage, API endpoints usage, adoption metrics, etc.
  • Implement metrics, logs, and traces that help the entire platform scale smoothly
  • Work closely with DevOps & backend teams to improve system observability
  • Provide insights that guide infra decisions (GPU allocation, autoscaling, caching, batching, etc.)
  • Select and maintain tooling (e.g., Prometheus/Grafana, Datadog, OpenTelemetry, ELK, BigQuery, etc.)
  • Ensure data pipelines are reliable, accessible, and always up-to-date
  • Build simple, easy-to-read dashboards for both technical and non-technical teams
View Full Description & ApplyYou'll be redirected to the employer's site
View details
Apply Now