Degrees/Certifications
• Bachelor's degree in relevant field or equivalent experience required
Must-Have Skills:
• AI/LLM Model Optimization: The ability to optimize and fine-tune AI/LLM models for improved performance, efficiency, and scalability.
• Research and Publication Experience: A track record of conducting original research and publishing papers in top-tier conferences or journals, demonstrating expertise in AI/LLM and a commitment to advancing the field.
• AI Coding in PyTorch or Similar Frameworks: Proficiency in coding AI models using popular frameworks such as PyTorch, TensorFlow, or Keras, with a focus on developing efficient, scalable, and maintainable code.
Nice-to-Have Skills:
• Model Training and Fine-Tuning: The ability to train and fine-tune AI/LLM models for specific tasks or domains, ensuring optimal performance and adaptability.
• Enablement and Implementation: Experience in enabling and implementing AI/LLM models on various platforms, including cloud and edge devices, to ensure seamless integration and deployment.
• Cloud and Edge Device Optimization: Knowledge of optimizing AI/LLM models for deployment on cloud and edge devices, ensuring efficient use of resources and high performance in real-world scenarios.
Minimum Years of Experience:
• 10 +
Client brings together a world-class team of researchers, developers, and engineers to create the future of virtual and augmented reality, which together will become as universal and essential as smartphones and personal computers are today. The compute performance and power efficiency requirements of Virtual and Augmented Reality require custom silicon. Client is seeking a Research Scientist to join our Research & Development teams. The ideal candidate will have experience working on AI Infrastructure and models related topics. The position will involve taking these skills and applying them to solve for some of the most crucial & exciting problems that exist in Client. The primary objective will be to develop creative solutions that enable compute and power efficient training and on-device inference of vision and language models for use cases in AR, VR and edge devices. We are hiring in multiple locations.
Responsibilities:
• Apply relevant AI infrastructure, AI algorithms and hardware acceleration techniques to build & optimize our intelligent ML systems that improve Client products and experiences Develop state-of-the art model compression and scalability techniques using Numerics, pruning, distillation etc.
• Optimize models on hardware to achieve the best performance given various real time latency and power constraints Goal setting related to project impact, AI algorithms, AI system design, and infrastructure/developer efficiency Directly or influencing partners to deliver impact through deep, thorough data-driven analysis Define use cases, and develop methodology & benchmarks to evaluate different approaches Apply in-depth knowledge of how the ML infra interacts with the other systems around it
Minimum Qualifications
• Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Client.
• Currently has, or is in the process of obtaining a PhD in the field of Computer Science, Electrical Engineering or equivalent practical experience. Degree must be completed prior to joining Client.
• Specialized experience in one or more of the following machine learning/deep learning domains: Model compression, hardware aware model optimizations, hardware accelerators architecture, GPU architecture, machine learning compilers, or ML systems, AI infrastructure, high performance computing, performance optimizations, or Machine learning frameworks (e.g. PyTorch), numerics and SW/HW co-design.
• Experience developing AI-System infrastructure , AI algorithms or AI hardware acceleration in C/C++ or Python.
• Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.
Preferred Qualifications
• Experience or knowledge of training/inference of Large scale AI models.
• Experience or knowledge of distributed systems or on-device algorithm development.
• Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as publications at leading workshops, journals or conferences such as CLR, NeurIPS, CVPR, ACL, ICML, MLSys, ISCA, MICRO, DAC etc.
• Demonstrated research and software engineering experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. GitHub).
• Experience working and communicating cross functionally in a team environment.
• Experience solving complex problems and comparing alternative solutions, tradeoffs, and diverse points of view to determine a path forward.
- **Only those lawfully authorized to work in the designated country associated with the position will be considered.**
- **Please note that all Position start dates and duration are estimates and may be reduced or lengthened based upon a client’s business needs and requirements.**
Your team at Rose International is always very helpful and responsive.
Barbara, Consultant
You are customer service oriented. No matter whether it was the Recruiter or someone in Human Resources/Payroll, you were responsive. That to me is key!
Tonya, Consultant
I am very happy with the Rose International, and the professionalism of the employees.
Robin, Consultant
Thanks for the opportunity. If in the future I ever need a job, I would like to work for Rose International.
David, Consultant
It is a great pleasure being a part of the Rose International Team.
Toni, Consultant
EMPLOYEE COMMENTS