트웰브랩스(TwelveLabs)-Lead Data Engineer, ML Data Platform
트웰브랩스(TwelveLabs)-Lead Data Engineer, ML Data Platform
1/2
트웰브랩스(TwelveLabs)서울 용산구경력 5-15년

Lead Data Engineer, ML Data Platform

포지션 상세

We’re a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world’s most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building foundation AI models that can accurately and instantly search exact moments within petabytes of video archives, generate coherent text summaries of videos, perform prompt-based video generation, and many more. The Twelve Labs platform provides access to its Large Visual Language Models (VLMs) through a suite of APIs that are trained on massive video datasets and learn to understand the meaning and context behind the visuals, conversations, and sounds within videos.

Twelve Labs recently raised $17M in seed funding, recognized as one of CB Insights’ AI 100 companies within a year of its founding, and secured a massive compute resource through partnering with Oracle. We are hyper focused on delivering the Twelve Labs platform to our customers so they can build video understanding into their products and power dream features they could have only imagined.

Part of the pathway to our rapid growth has been paved by the outstanding group of people united by the company’s mission. Beyond prominent venture capital firms such as Index Ventures and Radical Ventures, the Twelve Labs mission is backed by category building luminaries like Fei-Fei Li (Stanford HAI), Silvio Savarese (Salesforce), Oren Etzioni (AI2), Alexandr Wang (Scale), Lukas Biewald (W&B), Jack Conte (Patreon) and more.

We are committed to creating a diverse and inclusive work environment where our team members can bring their full selves to work, bring out their potential, and most importantly, thrive together. We welcome kind, brilliant, and open minded people from all walks of life to our team. If joining this mission speaks to you, we encourage you to apply!

주요업무

• Acquire and deliver massive and high-quality datasets for our large training runs
• Develop and implement best practices and data pipelines (ingest, annotate, and incorporate high-quality datasets into model training and evaluation) by working with internal and external data partners
• Improve our data infrastructure (e.g., management, versioning) by collaborating with software engineers and security engineers
• Collaborate with modeling and product teams to evaluate the impact of the data on our models and continuously improve the data quality
• Hire, provide career growth guidance, coaching, and training for engineers on your team
• Work across teams to understand and manage project priorities and product deliverables, evaluate trade-offs, and drive technical initiatives from execution to landing

자격요건

[Technical Skills]
• 5+ years of experience in managing unstructured and/or human-annotated data (e.g., collecting or assessing sample quality)
• 2+ years in handling large-scale data processing and building pipeline systems (e.g., Airflow)
• Owned data initiatives such as data cleaning, data validation, data augmentation, and image or video processing
• Experience with ML frameworks such as Pytorch and Tensorflow
• Experience in operating and developing large-scale distributed processing environments (e.g., Hadoop, Spark)
• Experience in development/operations in container environments
• Proficiency in Python

[Soft Skills]
• Open to learning and enthusiastic about new technologies
• Ability to work in a team and deliver results collaboratively
• Capability to create processes in the absence of clear structures
• Interest in finding optimal solutions to problems by understanding over/under engineering"

기술 스택 • 툴

태그

마감일

상시채용

근무지역

서울시 용산구 이태원로 27길 39-11
본 채용정보는 원티드랩의 동의없이 무단전재, 재배포, 재가공할 수 없으며, 구직활동 이외의 용도로 사용할 수 없습니다.

본 채용 정보는 에서 제공한 자료를 바탕으로 원티드랩에서 표현을 수정하고 이의 배열 및 구성을 편집하여 완성한 원티드랩의 저작자산이자 영업자산입니다. 본 정보 및 데이터베이스의 일부 내지는 전부에 대하여 원티드랩의 동의 없이 무단전재 또는 재배포, 재가공 및 크롤링할 수 없으며, 게재된 채용기업의 정보는 구직자의 구직활동 이외의 용도로 사용될 수 없습니다. 원티드랩은 에서 게재한 자료에 대한 오류나 그 밖에 원티드랩이 가공하지 않은 정보의 내용상 문제에 대하여 어떠한 보장도 하지 않으며, 사용자가 이를 신뢰하여 취한 조치에 대해 책임을 지지 않습니다.
<저작권자 (주)원티드랩. 무단전재-재배포금지>