Skip to content Skip to footer

Position: AI Solution Architect

Experience: 10+ years in AI/ML Solutioning & Architecture

Requirement: Fulltime (Work from Office)

Location: Hyderabad

About Nstarx

NStarX is an AI-native, cloud-native services & platform company that helps enterprises accelerate digital transformation through:

  • AI Engineering & Data Modernization
  • Enterprise GenAI & Agentic AI
  • Federated Learning & Confidential Computing
  • Predictive & Prescriptive AI Solutions
  • “Service-as-Software” – our core platform model
  • Domain solutions across Healthcare, Media, Manufacturing, Finance & Retail

For more information, please visit:
https://nstarxinc.com/

Job Description

We are looking for a highly skilled AI Architect with deep expertise in Generative AI, LLMs, Video Models (Digital Humans / Avatars), and end-to-end AI product architecture.

This role requires a hands-on technologist and strategic thinker who can design scalable AI systems, guide development teams, interact with global clients, and drive high-impact AI initiatives across cloud, on-prem GPU servers, and edge devices (AI PCs).

You will architect solutions that span the spectrum—from H200-class GPU compute for very large LLM workloads to lightweight, optimized models that run efficiently on edge devices.

This is a senior, high-ownership role for someone passionate about building real-world AI products at scale.

1. AI & GenAI Architecture

  • Design and architect LLM-based systems using both open-source (Llama, Mistral, etc.) and proprietary (OpenAI, Azure OpenAI, Anthropic, etc.) models.
  • Architect video-based AI systems, including Digital Human Avatars, Video Generation, Video-to-Text, and multimodal pipelines.
  • Build end-to-end GenAI pipelines including data ingestion, preprocessing, retrieval, fine-tuning (LoRA, QLoRA, DAPT), evaluation, guardrailing, and deployment.

2. Core ML & Data Engineering

  • Define and orchestrate data pipelines, ML workflows, vector search architecture, and embedding strategies.
  • Build scalable, secure ML engineering wrappers around models (inference servers, orchestration layers, API microservices).
  • Oversee experimentation frameworks, evaluation methodologies, and MLOps integration.

3. Cloud & Infrastructure

  • Architect AI solutions on AWS and Azure (preferred), including GPU clusters, model hosting, DevOps/MLOps, and autoscaling.
  • Work with Nvidia GPU server stacks (DGX, H200, H100, L40S) and edge AI systems (Intel, AMD, Qualcomm AI PCs).
  • Optimize AI workloads across heterogeneous compute environments.

4. Product & Delivery Ownership

  • Lead AI architecture across POC → MVP → GA → production-scale phases.
  • Contribute to roadmap planning, feasibility analysis, and technical risk assessment.
  • Ensure performance, scalability, cost efficiency, and robustness of AI products.

5. Governance, Security & Compliance

  • Embed data privacy, security controls, Responsible AI, and governance frameworks into product design.
  • Ensure adherence to enterprise AI policies, guardrails, and regulatory requirements.

6. Client Engagement & Communication

  • Interact with global clients (North America & Europe) to understand requirements, present architectures, and provide expert guidance.
  • Create clear architecture diagrams, documentation, and high-quality technical specifications for developers and stakeholders.
  • Serve as the technical face of the project in client discussions.

7. Leadership & Collaboration

  • Collaborate with AI Engineers, Data Scientists, Product Owners, Cloud Architects, and MLOps teams.
  • Mentor teams in AI design patterns, best practices, and solution development.
  • Conduct architecture reviews, code/design audits, and knowledge-sharing sessions.

Required Skills & Qualifications

Technical Skills
  • Deep expertise in Generative AI: LLMs, Vision/Video models, Digital Avatars, RAG systems, and multimodal architectures.
  • Strong experience in ML engineering, data pipelines, and scalable model APIs.
  • Hands-on experience with Nvidia GPU systems, CUDA stack, TensorRT, vLLM/Ollama, and model optimization.
  • Experience building AI on edge devices (Intel, AMD, Qualcomm NPUs, AI PCs).
  • Proficiency in AWS and Azure cloud ecosystems, including GPU-based deployments.
  • Strong knowledge of Python, ML frameworks (PyTorch, TensorFlow), model serving frameworks, and MLOps tools.
Professional Requirements
  • Minimum 10 years of experience in ML/AI solution architecture.
  • Proven track record of architecting POC, MVP, and production-grade AI products.
  • Strong architectural documentation and diagramming skills (Mermaid, Draw.io, Lucidchart, ArchiMate).
  • Excellent communication skills for client presentations and internal leadership discussions.
  • Ability to work in a fast-paced, multi-project environment across global teams.
Preferred Qualifications
  • Graduation from a Tier-1 institute (IIT, NIT, IIIT, or equivalent).
  • Certifications in AI/ML Architecture, Solution Architecture, or Cloud Architecture (AWS, Azure).
  • Experience with enterprise AI governance and generative AI compliance frameworks.
Why NStarX
  • You will be joining a fast-growing AI-native company backed by:
  • Strategic investment from SHI International
  • Advisory board including Silicon Valley CXOs & AI leaders
  • Compensation includes:
  • Competitive base + commission
  • Fast growth into leadership roles

To apply for this job email your details to recruiting-ind@nstarxinc.com

Privacy Overview
NStarX Logo

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Necessary

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.