Anand Trivedi

Accomplished AI leader with 14 years of experience, specializing in building and scaling AI/ML solutions from the ground up. As a founding member at Aavenir, I led the AI division for 5 years, driving strategy from seed stage through Series B. A recognized "Top 40 Under 40 AI Professional" and a published author on Large Language Models, I excel at transforming business needs into commercialized, high-impact AI products.

Key Achievements

A snapshot of my most impactful contributions to AI strategy and product development.

Startup Leadership

Spearheaded AI strategy as a founding team member at Aavenir, scaling AI capabilities and infrastructure from inception through Series B funding.

Advanced RAG Implementation

Achieved 80% accuracy in contract analysis by architecting an advanced RAG system, which included a custom retriever and a fine-tuned LLM.

Custom LLM Optimization

Engineered and deployed custom LLMs, employing advanced fine-tuning (LoRA, QLoRA) and quantization to significantly improve inference speed and reduce costs.

Industry Recognition

Recognized as a Top 40 under 40 AI Professional (2024) and a Top 10 AI Speaker at AIM Bangalore (2024).

Published Author

Authored the upcoming book "Building LLMs with PyTorch" (BPB Publications, Feb 2025), establishing thought leadership in modern language model architecture.

Core Competencies

My expertise spans from high-level AI strategy to hands-on implementation. Interact with the chart to see my specializations, and use the filters to explore my technical stack.

AI Specializations

Technical Stack

Professional Journey

Explore my career path, from co-founding a startup to leading enterprise AI divisions.

Thought Leadership & Projects

I believe in contributing to the community. Here are some of my key publications and open-source projects.

Book: Building LLMs with PyTorch

Authored a comprehensive guide for BPB Publications, covering foundational concepts to advanced techniques in LLM development.

Scheduled: Feb 2025

Open Source LLM from Scratch

Built and published a complete Large Language Model, demonstrating foundational knowledge of Transformer architecture.

View on GitHub →

Diffusion Models for Generation

Developed a novel diffusion model for controlled anime face generation using latent space manipulation.

View on GitHub →