AI Solutions Architect (GenAI & ML)

Full Time
Hyderabad, India (Office)
Posted 2 months ago
Applications have closed

Position: AI Solution Architect

Experience: 10+ years in AI/ML Solutioning & Architecture

Requirement: Fulltime (Work from Office)

Location: Hyderabad

About Nstarx

NStarX is an AI-native, cloud-native services & platform company that helps enterprises accelerate digital transformation through:

AI Engineering & Data Modernization
Enterprise GenAI & Agentic AI
Federated Learning & Confidential Computing
Predictive & Prescriptive AI Solutions
“Service-as-Software” – our core platform model
Domain solutions across Healthcare, Media, Manufacturing, Finance & Retail

For more information, please visit:
https://nstarxinc.com/

Job Description

We are looking for a highly skilled AI Architect with deep expertise in Generative AI, LLMs, Video Models (Digital Humans / Avatars), and end-to-end AI product architecture.

This role requires a hands-on technologist and strategic thinker who can design scalable AI systems, guide development teams, interact with global clients, and drive high-impact AI initiatives across cloud, on-prem GPU servers, and edge devices (AI PCs).

You will architect solutions that span the spectrum—from H200-class GPU compute for very large LLM workloads to lightweight, optimized models that run efficiently on edge devices.

This is a senior, high-ownership role for someone passionate about building real-world AI products at scale.

1. AI & GenAI Architecture

Design and architect LLM-based systems using both open-source (Llama, Mistral, etc.) and proprietary (OpenAI, Azure OpenAI, Anthropic, etc.) models.
Architect video-based AI systems, including Digital Human Avatars, Video Generation, Video-to-Text, and multimodal pipelines.
Build end-to-end GenAI pipelines including data ingestion, preprocessing, retrieval, fine-tuning (LoRA, QLoRA, DAPT), evaluation, guardrailing, and deployment.

2. Core ML & Data Engineering

Define and orchestrate data pipelines, ML workflows, vector search architecture, and embedding strategies.
Build scalable, secure ML engineering wrappers around models (inference servers, orchestration layers, API microservices).
Oversee experimentation frameworks, evaluation methodologies, and MLOps integration.

3. Cloud & Infrastructure

Architect AI solutions on AWS and Azure (preferred), including GPU clusters, model hosting, DevOps/MLOps, and autoscaling.
Work with Nvidia GPU server stacks (DGX, H200, H100, L40S) and edge AI systems (Intel, AMD, Qualcomm AI PCs).
Optimize AI workloads across heterogeneous compute environments.

4. Product & Delivery Ownership

Lead AI architecture across POC → MVP → GA → production-scale phases.
Contribute to roadmap planning, feasibility analysis, and technical risk assessment.
Ensure performance, scalability, cost efficiency, and robustness of AI products.

5. Governance, Security & Compliance

Embed data privacy, security controls, Responsible AI, and governance frameworks into product design.
Ensure adherence to enterprise AI policies, guardrails, and regulatory requirements.

6. Client Engagement & Communication

Interact with global clients (North America & Europe) to understand requirements, present architectures, and provide expert guidance.
Create clear architecture diagrams, documentation, and high-quality technical specifications for developers and stakeholders.
Serve as the technical face of the project in client discussions.

7. Leadership & Collaboration

Collaborate with AI Engineers, Data Scientists, Product Owners, Cloud Architects, and MLOps teams.
Mentor teams in AI design patterns, best practices, and solution development.
Conduct architecture reviews, code/design audits, and knowledge-sharing sessions.

Required Skills & Qualifications

Technical Skills

Deep expertise in Generative AI: LLMs, Vision/Video models, Digital Avatars, RAG systems, and multimodal architectures.
Strong experience in ML engineering, data pipelines, and scalable model APIs.
Hands-on experience with Nvidia GPU systems, CUDA stack, TensorRT, vLLM/Ollama, and model optimization.
Experience building AI on edge devices (Intel, AMD, Qualcomm NPUs, AI PCs).
Proficiency in AWS and Azure cloud ecosystems, including GPU-based deployments.
Strong knowledge of Python, ML frameworks (PyTorch, TensorFlow), model serving frameworks, and MLOps tools.

Professional Requirements

Minimum 10 years of experience in ML/AI solution architecture.
Proven track record of architecting POC, MVP, and production-grade AI products.
Strong architectural documentation and diagramming skills (Mermaid, Draw.io, Lucidchart, ArchiMate).
Excellent communication skills for client presentations and internal leadership discussions.
Ability to work in a fast-paced, multi-project environment across global teams.

Preferred Qualifications

Graduation from a Tier-1 institute (IIT, NIT, IIIT, or equivalent).
Certifications in AI/ML Architecture, Solution Architecture, or Cloud Architecture (AWS, Azure).
Experience with enterprise AI governance and generative AI compliance frameworks.

Why NStarX

You will be joining a fast-growing AI-native company backed by:
Strategic investment from SHI International
Advisory board including Silicon Valley CXOs & AI leaders
Compensation includes:
Competitive base + commission
Fast growth into leadership roles

About Nstarx

Job Description

Required Skills & Qualifications

Technical Skills

Professional Requirements

Preferred Qualifications

Why NStarX

Have Questions?

Services

Industries

About Us

Insights

Address

Contact

+1 314 720 4402