About the Role:
As an AI/LLM Engineer in our AI Foundation Services team, you will support the development of secure and compliant AI solutions with a focus on Large Language Models and Retrieval-Augmented Generation systems.
Your responsibilities:
- Design, develop, and implement end to end AI systems, particularly focusing on large language models (LLMs)
- Collaborate with cross-functional teams to integrate AI solutions into existing systems and processes
- Conduct research to explore new techniques and technologies in AI and machine learning
- Analyze large datasets to identify trends and insights that inform model development
- Continuously monitor and improve model performance to ensure accuracy and efficiency
- Communicate technical concepts effectively to both technical and non-technical stakeholders
Qualifications:
- Education in Computer Science, Machine Learning, or a related field
- Excellent problem-solving skills and attention to detail
- Proficiency in Python and strong programming skills
- Experience with LLMs and AI frameworks: Transformers, Pytorch / Tensorflow as well as RAG frameworks: Llama-index / LangChain
- Ability to integrate new functional AI systems with APIs, low latency backends and production deployment
- Quickly grasp new AI concepts from papers and blogs, conduct experiments, and propose system improvements
-
Excellent communication skills in English, both written and verbal
Nice-to-Have:
- Have experiences on working with distributed HPC clusters: Ray, Kubernetes, Docker
- Experience with fine-tuning LLM / Embed for downstream tasks
-
Experience with building, evaluating RAG systems
In case of equal qualification, disabled candidates will be considered preferentially.