About the author: I'm Saravanan Gnanaguru (gsaravanan.dev), part of the NVIDIA Developer Program. I study NVIDIA technologies and publish articles here for others to learn from.
A knowledge-sharing repository for NVIDIA technology learnings, organised by technology area and inspired by the NVIDIA Docs catalogue.
| Technology | Description |
|---|---|
| NVIDIA NIM | NVIDIA Inference Microservices: production-ready AI model serving |
| NVIDIA TensorRT | Deep learning inference optimization SDK for NVIDIA GPUs |
| NVIDIA Triton Inference Server | Multi-framework, multi-model scalable inference serving |
| NVIDIA NeMo Framework | End-to-end LLM training, fine-tuning, and customization |
| NVIDIA CUDA Toolkit | Foundational parallel computing platform for GPU-accelerated applications |
| NVIDIA RAPIDS | GPU-accelerated data science and machine learning for Python |
| NVIDIA cuDNN | GPU-accelerated deep neural network primitives library |
| NVIDIA API Catalog | Hosted AI model APIs and getting started guide for developers |
| AI Trends | Key AI topics and emerging trends for practitioners |
This repository is published as a GitHub Pages site at https://chefgs.github.io/nvidia_tech_guides via GitHub Actions whenever new content is pushed to the main branch.
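The push-to-main publishing flow described above can be sketched as a GitHub Actions workflow. This is a minimal illustration using the standard GitHub Pages actions; the workflow file name, trigger details, and content path are assumptions, not taken from the repository itself:

```yaml
# .github/workflows/pages.yml (hypothetical file name)
# Publishes the repository content as a GitHub Pages site on every push to main.
name: Deploy to GitHub Pages

on:
  push:
    branches: [main]

# The Pages deployment action requires these permissions.
permissions:
  contents: read
  pages: write
  id-token: write

jobs:
  deploy:
    runs-on: ubuntu-latest
    environment:
      name: github-pages
      url: ${{ steps.deployment.outputs.page_url }}
    steps:
      - uses: actions/checkout@v4
      - uses: actions/configure-pages@v5
      # Package the site content as a Pages artifact.
      # The path is an assumption; a Jekyll or static-site build step
      # could run before this to generate the content.
      - uses: actions/upload-pages-artifact@v3
        with:
          path: .
      - id: deployment
        uses: actions/deploy-pages@v4
```

With a workflow like this in place, any content merged to `main` is picked up automatically and the site at the URL above is rebuilt without manual steps.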