AI Engineer - Acon - BLR at KPMG Global Services
, , India -
Full Time


Start Date

Immediate

Expiry Date

20 Jul, 26

Salary

0.0

Posted On

21 Apr, 26

Experience

5 year(s) or above

Remote Job

Yes

Telecommute

Yes

Sponsor Visa

No

Skills

Python, Generative AI, Large Language Models, LangChain, LangGraph, FastAPI, Azure OpenAI, Azure Machine Learning, Vector Databases, Azure AI Search, RAG, Docker, Kubernetes, CI/CD, JavaScript, Streamlit

Industry

Business Consulting and Services

Description
Responsible for designing, developing, and deploying scalable Generative AI solutions using state-of-the-art large language models (LLMs) and transformer     architectures. - Responsible for building out sophisticated AI agent workflows, establishing a scalable cloud infrastructure, and ensuring the application can handle large-     scale document ingestion and concurrent user requests seamlessly. - Design and implement highly complex Retrieval-Augmented Generation (RAG) systems, specifically focusing on Agentic RAG (autonomous decision-making workflows) and Multimodal RAG (handling text, images, and other data types). Backend Development & Orchestration: Build robust, high-performance APIs and backend services using Python and FastAPI. Orchestrate complex LLM workflows and agents utilizing LangChain and LangGraph. - Search & Data Infrastructure: Develop data ingestion pipelines capable of handling exceptionally large documents. Configure, optimize, and maintain Vector Databases and Azure AI Search for low-latency, high-accuracy retrieval - Frontend & UI Development: Develop rapid, functional, and interactive user interfaces using Streamlit. Utilize JavaScript to build custom frontend components, integrate web elements, and enhance the overall user experience. - DevOps & Automation: Establish, maintain, and optimize CI/CD pipelines to ensure smooth, automated testing, building, and deployment of the GenAI application. Document Processing: Handson in parsing complex financial documents, unstructured enterprise documents (PDFs, PPTs, tables, Excel, Word, txt, charts, images etc) Required Skills - Programming Languages: Deep expertise in Python; solid working knowledge of JavaScript. - Strong experience with Azure OpenAI, Azure Machine Learning, Cognitive Search, and AI Studio. - Solid understanding of LLMs, embeddings, RAG pipelines, vector search, prompt engineering. - GenAI & LLM Frameworks: Extensive hands-on experience with LangChain and LangGraph/ Semantic Kernel / LlamaIndex. Proven ability to build beyond basic RAG into Agentic and Multimodal implementations. - Backend & APIs: Strong experience building scalable RESTful APIs with FastAPI. - Cloud & DevOps (Azure): Demonstrated experience deploying enterprise-grade applications on Azure. Deep understanding of Docker, containerization, and configuring automated CI/CD pipelines. -  LLM Evaluation & Observability: Hands-on experience with evaluation frameworks (e.g., RAGAS, TruLens, DeepEval) and monitoring/tracing tools (e.g.,   LangSmith, Langfuse, Arize Phoenix). - Databases & Search: Practical experience with Azure AI Search and leading Vector Databases (e.g., Pinecone, Milvus, Qdrant, or Azure Native). - System Architecture: Proven track record of designing systems for parallelism, large-scale data ingestion, and multi-user environments (SaaS architecture). - Frontend/Prototyping: Experience building responsive interfaces using Streamlit,HTML,CSS,Javascript. - Docker, Kubernetes (for model deployment) - Previous experience building commercial AI products or complex advisory/analytical AI agents from the ground up. - Strong understanding of data security and tenant isolation in cloud environments. - Understanding of responsible AI, data privacy, and application security, Guardrails. - Fine-tune and customize foundation models using domain-specific datasets and techniques.
Responsibilities
Design, develop, and deploy scalable Generative AI solutions and complex agentic workflows using LLMs and transformer architectures. Manage backend orchestration, data ingestion pipelines, and cloud infrastructure to ensure high-performance application delivery.
Loading...