AI Engineer

Roots Automation
Full-time
On-site
New York City, New York, United States
$160,000 - $220,000 USD yearly
Artificial Intelligence (AI)

About the team: 
Our team specializes in fine-tuning state-of-the-art custom models that meet specific business needs and surpass GPT-4 in accuracy and efficiency. In addition to harnessing the power of pre-trained models, we build and refine our own multimodal models, tailored to the unique requirements of our use cases. We take pride in developing custom inference endpoints that integrate these models seamlessly into our customers' workflows, providing low-latency, real-time document processing.

At Roots, we are committed to building a team of talented individuals who share our love for innovation and problem-solving. As we continue to expand, we are seeking a motivated AI Engineer to join our growing team.

Responsibilities

  • Develop machine learning systems to solve complex business problems and enhance the capabilities of InsurGPT. 

  • Stay updated on the latest advancements in machine learning research and explore innovative approaches to drive continuous improvement. 

  • Work closely with cross-functional teams to understand requirements, prioritize tasks, and deliver solutions aligned with business objectives. 

  • Design and maintain systems that support strong SLAs on latency and uptime, while managing tradeoffs in resource consumption (CPU, GPU, memory, network).

  • Improve our current MLOps infrastructure to streamline the deployment, monitoring, and oversight of our machine learning models. 

  • Improve vLLM/Triton inference endpoints for real-time integration of Large Language Models into product ecosystems. 

  • Collaborate with teams and stakeholders, communicating effectively and working from the office at least three days a week.

 

Qualifications 

  • Graduate degree in Computer Science, Electrical Engineering, Mathematics, Statistics, or a related field. 

  • Proven experience with LLM inference, including optimization and deployment with tools such as vLLM, TensorRT, and ONNX Runtime, with multi-GPU setups, and with models such as DeepSeek, to ensure real-time, high-performance model serving in production environments.

  • Proficiency in Python and familiarity with machine learning libraries and frameworks such as PyTorch, scikit-learn, pandas, and NumPy.

  • Excellent knowledge of, and strong practical skills in, major ML algorithms as applied to large language models, traditional NLP, computer vision, and information retrieval.

  • Strong problem-solving skills, analytical thinking, and attention to detail, with the ability to translate business requirements into technical solutions effectively. 

  • Excellent written and verbal communication skills, with a strong emphasis on the written word. We highly appreciate public articles or blogs that highlight communication skills. 

  • Demonstrated ability to work independently, prioritize tasks, and manage multiple projects simultaneously in a fast-paced and dynamic environment. 

The estimated base salary for this role in New York is $160,000 - $220,000 per year. Final compensation will be determined based on factors such as experience, skills, and qualifications. 

Roots Automation is an Equal Opportunity Employer. All applicants will be considered for employment without regard to race, color, religion, sexual orientation, gender identity, national origin, veteran status, or disability status. Roots Automation is a progressive and open-minded workplace where we do not tolerate discrimination or harassment in any form. If you are smart, passionate, and good at what you do, come as you are.

Roots Automation is also committed to providing reasonable accommodations to individuals with disabilities throughout the application process and employment. If you need assistance or an accommodation, please contact us for more information.