HUD

HUD is an IT firm that offers software development, IT staffing, and web consulting services.

51-100 employees

Founded 2007

Staffing Agency

Private Company


Staffing Agency, IT Management, Web Development, Information Technology, Software

Open Positions (1)

Fully remote-friendly. We already have several full-time, 100% remote hires, but if you’re in the San Francisco Bay Area or Singapore, we do have an office you can work together in. We prefer applicants who can attend meetings in Pacific Time (UTC-7:00/-8:00) or China/Singapore Time (UTC+8:00).

Full-Time, AI Agents

Research Engineer, Agentic AI Evals

Build out environments for HUD's CUA evaluation datasets, including evals for safety red-teaming, general business tasks, long-horizon agentic tasks, etc.
Deliver custom CUA datasets and evaluation pipelines requested by clients.
Contribute to improving the HUD evaluation harness, depending on your interests, skills, and current organizational priorities.

Docker, Python, React, +1 more

About HUD

HUD is at the forefront of AI development, specializing in creating robust evaluation frameworks for Computer Use Agents (CUAs). Their flagship product, the CUA Evals framework, is the industry's first comprehensive tool for assessing the performance of AI agents that browse the web. By providing detailed evaluations across a vast array of tasks, HUD empowers developers and AI labs to understand and improve the real-world functionality of AI agents.

Backed by Y Combinator and collaborating closely with leading AI labs, HUD is instrumental in scaling agent evaluation infrastructure. Their mission is to ensure AI agents perform reliably in real-world applications by delivering precise and thorough evaluation tools. This focus on rigorous testing is critical for the advancement of AI safety and alignment.

The company fosters a dynamic startup environment with a team composed of highly accomplished individuals, including international Olympiad medalists and experienced AI startup founders. HUD's engineering culture emphasizes technical aptitude, rapid learning, and a problem-solving mindset. They are actively building out their evaluation environments and infrastructure, using technologies such as Python, Docker, Linux, AWS, Kubernetes, Redis, and PostgreSQL. Opportunities span building new evaluations, optimizing infrastructure, sales, partnership development, and supporting research engineers.

HUD offers a remote-friendly work culture, with an office available for those in the San Francisco Bay Area. They prioritize candidates who can align with Pacific Time or China/Singapore Time for meetings, and they provide visa sponsorship for strong full-time candidates. With a current team of 5-10 people and plans for significant growth, HUD presents an exciting opportunity to contribute to the cutting edge of AI development.

Tech Stack

mx, cms, widgets, analytics, mobile