Large Language Models - Tom Compagno

Choosing Your Runner – LM Studio vs. Ollama vs. Kobold

A granular comparison of the software tools used to actually load and "chat" with your quantized model files.

Multi-Modal RAG – Talking to Your Images and Videos

A deeper dive into "Vision-Language Models" (VLMs) that allow you to ask your local AI questions about your personal photo library or screenshots.

Learn more

The First Boot – Downloading and Running Your First GGUF

The final "how-to" step: finding a model on Hugging Face, loading it into your software, and sending your first offline prompt.

Learn more

The Hardware Check – Can Your PC Handle It?

A high-level guide to the "Big Three" requirements—VRAM, System RAM, and Storage—and how to audit your current specs for local LLM.

Learn more

The Language Masters – Large Language Models (LLMs)

Zooming in on Generative AI specifically built for human conversation and text generation.

Learn more

The Training Phase – Pre-training & Fine-Tuning

How an LLM goes from reading the entire internet to being a helpful assistant that follows instructions.

Learn more

The VRAM Bottleneck – Why the GPU is King

A deeper dive into Video RAM (VRAM), explaining why your graphics card’s memory is the single most important factor for speed and model size for local LLM.

Learn more

Why Go Local? The Case for Private AI

An introduction to the benefits of running models on your own machine, from total data privacy to avoiding monthly subscription fees.

Learn more