local-llm - Rushi's

Jun 02, 2026

You’ve seen the numbers. 7B. 70B. 405B. Everyone talks about parameter counts. But what are they? Why does size matter? And what actually happens when you hit “Generate”? This post covers the mechanics: what parameters are, where they live in the model architecture, how scaling affects them, and what that means if you’re running or choosing […]

May 31, 2026

You’ve heard the pitch: run AI privately, offline, on your own hardware — no API keys, no usage limits, no data leaving your machine. You open Hugging Face, find a model called Qwen3-30B-A3B-GGUF, download 20GB, try to run it, and your laptop grinds to a halt or produces nothing at all. The problem isn’t that local […]

May 29, 2026

Sharing local LLM models between Ollama and llama.cpp seems like a niche concern until you’ve burned through tens of GB of disk space on duplicate copies of the same model. The two tools use completely different storage formats by default, but you can configure them to share one file. […]

Apr 03, 2026

Google DeepMind released Gemma 4 on April 2, 2026 under Apache 2.0. It’s their fourth-generation open model family, and it runs locally with surprisingly little friction. Here are three ways to get it going, depending on what hardware you have in front of you. […]

Mar 11, 2026

If you’ve tried running a local model through Ollama with Claude Code and been greeted by this message: “There’s an issue with the selected model (qwen3-coder:30b). It may not exist or you may not have access to it. Run /model to pick a different model.” …even though the model is clearly installed and runs fine […]

Ctrl+AI+Ship

Tag: local-llm

LLM parameters: what they are and how they actually work

How to Pick the Right Model to Run on Your Local Machine

Sharing Local LLM Models Between Ollama and llama.cpp

A developer’s guide to Gemma 4 and Google’s open model play

Fixing the “Model May Not Exist” Error When Using Ollama with Claude Code