AIML – ML Engineer, MLPT


Summary

Posted: Oct 21, 2024

Role Number: 200574752

Do you think differently? Are you eager to break the status quo, bold and ambitious, unafraid to take risks, and passionate about building best-in-class technology? If so, what better place to do this than Apple? At Apple, we think different and push the boundaries of computing and intelligence. We build products that bring a smile to people's faces. The Foundation Model Infrastructure team, within the Machine Learning Platform Technologies organization, is the backbone of Apple Intelligence. It builds the frameworks, services, and tools that power the largest Apple foundation models on servers. Our infrastructure powers a wide range of services at Apple, including Apple Search, Apple Music, Apple TV, App Store, iMessage, Photos & Camera, Spotlight, Safari, Siri, and exciting upcoming Apple products, serving millions of queries every day with incredibly low latencies and drawing every ounce of compute from our hardware. As part of this group, you will have a chance to bring intelligence to billions of users across the world and to make a difference in people's lives. You will work on optimizing language, vision, and speech models with billions of parameters using state-of-the-art technologies, and on making them run at Apple scale.

Description

Work alongside the Foundation Model Research team to optimize inference for cutting-edge model architectures. Work closely with product teams to build production-grade solutions to launch models serving millions of customers in real time. Build tools to understand inference bottlenecks across different hardware and use cases. Mentor and guide engineers in the organization.

Minimum Qualifications

  • Demonstrated experience in leading and driving complex, ambiguous projects.
  • Experience with high-throughput services, particularly at supercomputing scale.
  • Proficient in running applications in the cloud (AWS, Azure, or equivalent) using Kubernetes, Docker, etc.
  • Familiar with GPU programming concepts using CUDA and with one of the popular ML frameworks, such as PyTorch or TensorFlow.

Preferred Qualifications

  • Proficient in building and maintaining systems written in modern languages (e.g., Go, Python).
  • Familiar with fundamental deep learning architectures such as Transformers and encoder/decoder models.
  • Familiarity with NVIDIA TensorRT-LLM, vLLM, DeepSpeed, NVIDIA Triton Server, etc.
  • Experience writing custom GPU kernels using CUDA or OpenAI Triton.

