Overview
We are looking for an experienced AI Engineer to design and deliver advanced LLM powered solutions that transform unstructured information into scalable, searchable knowledge systems. You will work on building high quality retrieval and generation pipelines, enabling intelligent applications with strong evaluation, observability, and performance optimization practices.
Project Overview:
This project focuses on building a next generation enterprise knowledge platform powered by large language models, semantic search, and graph technologies. The solution converts expert content into structured knowledge and enables advanced reasoning through agent driven workflows.
- Develop and maintain end to end LLM orchestration and retrieval pipelines
- Design and implement embedding generation and chunking strategies with effective context window management
- Optimize retrieval quality through tuning techniques and evaluation driven improvements
- Build entity extraction pipelines that transform unstructured content into graph structures using entity resolution and relationship normalization
- Implement semantic search solutions and prompt engineering patterns
- Develop agentic workflows to support complex reasoning tasks
- Integrate graph databases with LLM based platforms and services
- Create and maintain evaluation frameworks including ground truth datasets and regression testing processes
- Measure and improve system performance using metrics such as recall, precision at K, answer relevance, and faithfulness
- Improve observability, tracing, and cost efficiency across LLM pipelines
- Collaborate with team members to design scalable and reliable systems
- Experience developing applications with large language models and retrieval augmented generation
- Strong proficiency in Python and backend development practices
- Hands on experience with vector databases and semantic search implementations
- Understanding of embedding techniques, chunking strategies, and context management
- Experience building or maintaining data pipelines for unstructured content processing
- Familiarity with graph data modeling and knowledge graph concepts
- Experience implementing evaluation methodologies for AI systems including precision and recall metrics
- Experience working with cloud platforms and containerized environments such as Docker and Kubernetes
- Ability to design scalable and maintainable system architectures
- Strong collaboration and communication skills
- Experience with Google Cloud services including Spanner Graph
- Familiarity with Gemini Enterprise Agent Platform or similar tools
- Experience with LangChain, LlamaIndex, or similar frameworks
- Knowledge of entity resolution and graph based reasoning techniques
- Experience with observability tools for ML or AI systems
- Understanding of cost optimization strategies for LLM usage
✨ Our intelligent job search engine discovered this job and republished it for your convenience.
Please be aware that the job information may be incorrect or incomplete. The job announcement remains the property of its original publisher. To view the original job and its full details, please visit the job's URL on the owner’s page.
Please clearly mention that you have heard of this job opportunity on https://ijob.am.


