Job title: Lead Data Platform Engineer (AI/ML, Start-up)
Company: Talent To Hire Inc.
Job description: Data Platform Software Lead Engineer – AI/ML Systems
Location: Hybrid (4 days onsite – Downtown Toronto)
Type: Full-Time | Start-up

About the Role
Join our team as a Data Platform Software Lead Engineer and drive the architecture that fuels our AI and ML pipelines, focusing on large-scale, code-based text datasets. You’ll design and build reliable systems for data ingestion, transformation, and delivery, enabling teams to train, iterate, and deploy models with confidence.

What You’ll Do
- Architect and implement scalable data platforms for code/text dataset ingestion, processing, and delivery.
- Build web-scale crawling and metadata extraction tools from open-source code repositories.
- Develop reliable, distributed pipelines with frameworks like Spark, Kafka, and Airflow/Prefect.
- Enable data visualization, sampling, and analytics for research teams to improve model performance.
- Collaborate with researchers, infrastructure, and compliance teams to meet technical and governance requirements.

Must-Have Skills
- 8+ years in data-intensive software engineering.
- Proficiency in Python, Go, or Scala; Spark or Ray; Airflow or Prefect; Kafka; Redis; Postgres or ClickHouse; GitHub APIs.
- Understanding of how datasets power AI/ML workflows.
- Proven experience in scalable data infrastructure and pipeline development.
- Skills in web crawling, scraping, and large-scale ingestion.
- Cloud-native experience (e.g., AWS, containerized compute, security).

Bonus Points
- Curating or preparing code-based datasets for LLMs or AI tooling.
- Familiarity with code parsing, tokenization, or embedding.
- Startup or fast-paced high-ownership environment experience.

Why Work With Us
- Influence the technical path of an AI-first startup.
- Work with cutting-edge cloud and ML/AI technologies.
- Receive competitive pay with equity.
- Collaborate with a diverse, high-caliber team.
- Thrive in a dynamic, innovation-driven workplace culture.

Screening Questions
- Describe a scalable data pipeline you’ve built for processing large code or text datasets.
- What tools or frameworks did you use for distributed data processing and why?
- Share an example of a web crawling or scraping system you’ve developed at scale.
- What components of your past data pipelines have you deployed in AWS (e.g., storage, compute, security)?
- Have you worked with code-based datasets or applied parsing/tokenization in model workflows?

Apply Now
Interested? Submit your resume and a brief response to the screening questions to: sasha@talenttohire.com
Expected salary:
Location: Toronto, ON
Job date: Thu, 14 Aug 2025 22:59:26 GMT