Full-time

Multimodal AI Solutions Architect

Posted by Impetus • Industrial Area, Uttar Pradesh, India

📍 Industrial Area, Uttar Pradesh 🕒 February 26, 2026

About the Role

Responsibilities



  • Core AI/ML Fundamentals
  • Solid understanding of AI/ML concepts including:
  • Classification, regression, neural networks
  • OCR and transcription systems
  • Audio/Video processing and multimodal learning
  • OCR, Transcription & Audio/Video Intelligence
  • Implement specialized models for:
  • High‑accuracy document OCR
  • Real‑time audio transcription
  • Architect deep learning pipelines for audio/video analysis and generation.
  • Integrate multimodal models (e.G., LLaVA, Whisper) into broader GenAI systems.
  • Generative AI & LLM Expertise
  • Strong understanding of:
  • Generative AI techniques
  • Transformer architectures
  • RAG (Retrieval-Augmented Generation) pipelines
  • Modern LLM ecosystems
  • Hands-on experience with:
  • <...

Ready to Apply?

Submit your application today and take the next step in your career journey with Impetus.

Apply Now