CO/AI
Wednesday · June 24, 2026 · Issue No. 905
Video

Open Source LLMs on GOD mode. Local LLMs MAXXED OUT on the RTX 5090!

Open source LLMs go local: maxing out the new RTX 5090

If you’re frustrated with subscription-based AI services or concerned about data privacy, there’s an alternative: running large language models (LLMs) locally on your own computer. A recent video demonstration pushed the new RTX 5090 graphics card to see what’s possible, and the results are impressive.

Why run AI models locally?

Running LLMs on your own computer offers several advantages:

  • No subscription fees
  • Complete privacy (no data sent to third parties)
  • 24/7 access without an internet connection
  • Freedom from usage restrictions

While open-source models might not match the capabilities of proprietary giants like ChatGPT, they’re surprisingly capable and improving rapidly.

The hardware matters

The RTX 5090’s 32GB of VRAM makes it possible to run sophisticated AI models that would choke lesser graphics cards. The demonstration ran models across a range of sizes on this GPU:

  • DeepSeek R1 7B (used 10GB VRAM)
  • DeepSeek R1 14B (used 18GB VRAM)
  • DeepSeek R1 32B (used 20-32GB VRAM depending on settings)
  • Gemma 2 7B Vision model (used 24.4GB VRAM)
  • Tiny 360M parameter models (barely used 1GB VRAM)
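
To get a feel for why VRAM use climbs with model size and settings, here is a rough back-of-the-envelope sketch in Python. It is not from the video: it assumes weight memory is simply parameter count times bits per weight, and it adds a fixed, made-up `overhead_gb` allowance for context cache and runtime buffers. The function names and the 2GB overhead are hypothetical, and real usage (like the figures listed above) will differ depending on the runtime, quantization format, and context length.

```python
def estimate_weight_vram_gb(params_billions: float, bits_per_weight: float) -> float:
    """Rough size of the model weights alone, in decimal gigabytes."""
    bytes_total = params_billions * 1e9 * bits_per_weight / 8
    return bytes_total / 1e9


def fits_in_vram(params_billions: float, bits_per_weight: float,
                 vram_gb: float = 32.0, overhead_gb: float = 2.0) -> bool:
    """True if weights plus an assumed fixed overhead fit in the given VRAM.

    overhead_gb is a placeholder guess, not a measured value.
    """
    needed = estimate_weight_vram_gb(params_billions, bits_per_weight) + overhead_gb
    return needed <= vram_gb


if __name__ == "__main__":
    # Compare a few model sizes at common weight precisions.
    for size in (7, 14, 32):
        for bits in (4, 8, 16):
            gb = estimate_weight_vram_gb(size, bits)
            ok = fits_in_vram(size, bits)
            print(f"{size}B at {bits}-bit: ~{gb:.1f} GB weights, fits in 32 GB: {ok}")
```

Under these assumptions, a 32B model needs about 16GB for weights at 4 bits per weight but about 32GB at 8 bits, which is one plausible reason the same model can land anywhere in a wide VRAM range depending on settings.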

Performance is impressive

The generation speeds, particularly with the smaller models, were remarkable:

  • DeepSeek R1 7B: ~78 tokens per second
  • DeepSeek R1 14B: ~