Position: AI Solution Architect
Experience: 10+ years in AI/ML Solutioning & Architecture
Requirement: Full-time (Work from Office)
Location: Hyderabad
About NStarX
NStarX is an AI-native, cloud-native services & platform company that helps enterprises accelerate digital transformation through:
- AI Engineering & Data Modernization
- Enterprise GenAI & Agentic AI
- Federated Learning & Confidential Computing
- Predictive & Prescriptive AI Solutions
- “Service-as-Software” – our core platform model
- Domain solutions across Healthcare, Media, Manufacturing, Finance & Retail
For more information, please visit:
https://nstarxinc.com/
Job Description
We are looking for a highly skilled AI Solution Architect with deep expertise in Generative AI, LLMs, Video Models (Digital Humans / Avatars), and end-to-end AI product architecture.
This role requires a hands-on technologist and strategic thinker who can design scalable AI systems, guide development teams, interact with global clients, and drive high-impact AI initiatives across cloud, on-prem GPU servers, and edge devices (AI PCs).
You will architect solutions across the full compute spectrum, from H200-class GPU compute for very large LLM workloads to lightweight, optimized models that run efficiently on edge devices.
This is a senior, high-ownership role for someone passionate about building real-world AI products at scale.
1. AI & GenAI Architecture
- Design and architect LLM-based systems using both open-source (Llama, Mistral, etc.) and proprietary (OpenAI, Azure OpenAI, Anthropic, etc.) models.
- Architect video-based AI systems, including Digital Human Avatars, Video Generation, Video-to-Text, and multimodal pipelines.
- Build end-to-end GenAI pipelines including data ingestion, preprocessing, retrieval, fine-tuning (LoRA, QLoRA, DAPT), evaluation, guardrailing, and deployment.
2. Core ML & Data Engineering
- Define and orchestrate data pipelines, ML workflows, vector search architecture, and embedding strategies.
- Build scalable, secure ML engineering wrappers around models (inference servers, orchestration layers, API microservices).
- Oversee experimentation frameworks, evaluation methodologies, and MLOps integration.
3. Cloud & Infrastructure
- Architect AI solutions on AWS and Azure (preferred), including GPU clusters, model hosting, DevOps/MLOps, and autoscaling.
- Work with NVIDIA GPU server stacks (DGX, H200, H100, L40S) and edge AI systems (Intel, AMD, Qualcomm AI PCs).
- Optimize AI workloads across heterogeneous compute environments.
4. Product & Delivery Ownership
- Lead AI architecture across POC → MVP → GA → production-scale phases.
- Contribute to roadmap planning, feasibility analysis, and technical risk assessment.
- Ensure performance, scalability, cost efficiency, and robustness of AI products.
5. Governance, Security & Compliance
- Embed data privacy, security controls, Responsible AI, and governance frameworks into product design.
- Ensure adherence to enterprise AI policies, guardrails, and regulatory requirements.
6. Client Engagement & Communication
- Interact with global clients (North America & Europe) to understand requirements, present architectures, and provide expert guidance.
- Create clear architecture diagrams, documentation, and high-quality technical specifications for developers and stakeholders.
- Serve as the technical face of the project in client discussions.
7. Leadership & Collaboration
- Collaborate with AI Engineers, Data Scientists, Product Owners, Cloud Architects, and MLOps teams.
- Mentor teams in AI design patterns, best practices, and solution development.
- Conduct architecture reviews, code/design audits, and knowledge-sharing sessions.
Required Skills & Qualifications
Technical Skills
- Deep expertise in Generative AI: LLMs, Vision/Video models, Digital Avatars, RAG systems, and multimodal architectures.
- Strong experience in ML engineering, data pipelines, and scalable model APIs.
- Hands-on experience with NVIDIA GPU systems, the CUDA stack, TensorRT, vLLM/Ollama, and model optimization.
- Experience building AI on edge devices (Intel, AMD, Qualcomm NPUs, AI PCs).
- Proficiency in AWS and Azure cloud ecosystems, including GPU-based deployments.
- Strong knowledge of Python, ML frameworks (PyTorch, TensorFlow), model serving frameworks, and MLOps tools.
Professional Requirements
- Minimum 10 years of experience in ML/AI solution architecture.
- Proven track record of architecting POC, MVP, and production-grade AI products.
- Strong architectural documentation and diagramming skills (Mermaid, Draw.io, Lucidchart, ArchiMate).
- Excellent communication skills for client presentations and internal leadership discussions.
- Ability to work in a fast-paced, multi-project environment across global teams.
Preferred Qualifications
- Graduation from a Tier-1 institute (IIT, NIT, IIIT, or equivalent).
- Certifications in AI/ML Architecture, Solution Architecture, or Cloud Architecture (AWS, Azure).
- Experience with enterprise AI governance and generative AI compliance frameworks.
Why NStarX
- You will be joining a fast-growing AI-native company backed by:
  - Strategic investment from SHI International
  - Advisory board including Silicon Valley CXOs & AI leaders
- Compensation includes:
  - Competitive base + commission
  - Fast growth into leadership roles
To apply for this job, email your details to recruiting-ind@nstarxinc.com