A technology startup in San Francisco seeks a research engineer to build evaluation environments for AI systems. The candidate should have proficiency in Python, Docker, and Linux. Experience with LLM evaluation frameworks is a plus. The role is full-time preferred, remote-friendly, and focuses on creating high-quality datasets for assessment purposes. A strong problem-solving ability is needed, and applicants with startup experience are encouraged to apply.
#J-18808-Ljbffr