Platform Data Analyst - AI Media-Creation
New
R
RunwareAI media-creation platform
United KingdomFull-TimeMiddle
Salary not disclosed
Apply NowOpens the employer's application page
Job Details
- Required Skills
- GrafanaPrometheusDatadogData analytics
Requirements
- Strong experience with data analytics, observability, or monitoring
- Hands-on with metrics/logging/tracing frameworks (Prometheus, Grafana, Datadog, New Relic, etc.)
- Good understanding of backend systems and distributed architectures
- Ability to turn raw metrics into actionable insights
- Experience building dashboards for internal and external stakeholders
- Familiarity with AI model monitoring (latency, throughput, error codes, GPU utilization)
- Experience with AI/ML infrastructure, inference pipelines, GPUs (nice-to-have)
- Understanding of Python APIs, FastAPI, or Node environments (nice-to-have)
- Experience working with high-throughput real-time systems (nice-to-have)
- Startup or scale-up experience (nice-to-have)
- Problem-solver mindset
- Proactivity — digging into data and flagging problems
- Ability to work with ML, backend, DevOps, and product teams
- Comfort with autonomous ownership
Responsibilities
- Build and maintain E2E inference time tracking (global and per-model)
- Monitor how implementation changes impact total request latency
- Detect regressions introduced by suboptimal code paths
- Provide automated alerts & historical trends
- Build dashboards for internal use (engineering, product, leadership)
- Provide client-facing usage dashboards (requests, errors, success rate, performance)
- Support clients who need visibility to debug their integrations
- Track model-level usage, API endpoints usage, adoption metrics, etc.
- Implement metrics, logs, and traces that help the entire platform scale smoothly
- Work closely with DevOps & backend teams to improve system observability
- Provide insights that guide infra decisions (GPU allocation, autoscaling, caching, batching, etc.)
- Select and maintain tooling (e.g., Prometheus/Grafana, Datadog, OpenTelemetry, ELK, BigQuery, etc.)
- Ensure data pipelines are reliable, accessible, and always up-to-date
- Build simple, easy-to-read dashboards for both technical and non-technical teams
View Full Description & ApplyYou'll be redirected to the employer's site