Marcus Rivera

Marcus Rivera

Full-stack developer and open-source advocate. Writes about developer tools, frameworks, and the craft of software.

54 articles

Ollama: Run Local LLMs Like a Pro in 2026
Tech Tips

Ollama: Run Local LLMs Like a Pro in 2026

A hands-on guide to Ollama, the default local-LLM runner in 2026 (v0.30.10). Covers install, pulling and running models, calling them from the OpenAI SDK at localhost:11434, structured JSON outputs, tool calling, and Modelfiles, plus how to size a model to your hardware.

By Marcus Rivera · 6 min · Jun 25, 2026

OpenCode: The Open-Source AI Coding Agent at 178K Stars
Reviews

OpenCode: The Open-Source AI Coding Agent at 178K Stars

OpenCode is an open-source (MIT), terminal-native AI coding agent with 178K GitHub stars. It is model-agnostic, connecting to 75+ providers (Anthropic, OpenAI, Google, Ollama) with bring-your-own keys. LSP integration feeds compiler diagnostics back to the model; built-in build and plan agents plus a general subagent. Runs locally/air-gapped, ships frequently (v1.17.9, 826 releases), and now has a desktop beta. Trade-offs: a terminal learning curve, you pay your own API bills, and quality depends on the model you plug in.

By Marcus Rivera · 5 min · Jun 24, 2026

CodeRabbit: The AI Code Reviewer That Reads Your Whole Repo
Reviews

CodeRabbit: The AI Code Reviewer That Reads Your Whole Repo

CodeRabbit is an AI code reviewer that posts line-by-line PR feedback across GitHub, GitLab, Azure DevOps and Bitbucket. Plans run Free, Pro (4/user/mo), Pro Plus (8) and Enterprise, billed only for developers who open PRs. Strengths: context via MCP, one-click autofix, deep static analysis. Watch the 5/10/12 review rate limits.

By Marcus Rivera · 6 min · Jun 23, 2026

Structured Outputs: Force LLMs to Return Valid JSON
Tech Tips

Structured Outputs: Force LLMs to Return Valid JSON

A practical guide to OpenAI Structured Outputs: the difference from JSON mode, function calling vs response_format, strict schema rules, constrained decoding, limits, and cross-provider options.

By Marcus Rivera · 8 min · Jun 22, 2026

Google Antigravity 2.0: From Cursor Clone to AI Agent Platform
Reviews

Google Antigravity 2.0: From Cursor Clone to AI Agent Platform

A hands-on review of Google Antigravity 2.0: its multi-agent orchestration, pricing, and trade-offs.

By Marcus Rivera · 5 min · Jun 20, 2026

Context Engineering: A Practical Playbook for Reliable AI Agents
Tech Tips

Context Engineering: A Practical Playbook for Reliable AI Agents

Context engineering is the discipline of curating tools, prompts, retrieval, and memory each turn so AI agents stay reliable over long-horizon tasks.

By Marcus Rivera · 7 min · Jun 16, 2026

Prompt Caching: How to Cut LLM API Costs by Up to 90%
Tech Tips

Prompt Caching: How to Cut LLM API Costs by Up to 90%

Prompt caching stores the computed KV attention tensors for a repeated prompt prefix so the model skips recomputation, cutting input cost and latency. Anthropic (explicit cache_control, ~90% read discount), OpenAI (automatic, 50% off, 1,024-token minimum), and Google Gemini (implicit plus explicit cache objects, up to 90%) all support it. The one rule that determines hit rate: put all static content at the front of the prompt and all dynamic content at the back.

By Marcus Rivera · 7 min · Jun 12, 2026

Firecrawl: Turn Any Website Into Agent-Ready Markdown
Tech Tips

Firecrawl: Turn Any Website Into Agent-Ready Markdown

Firecrawl converts messy, JavaScript-rendered websites into clean, LLM-ready markdown for RAG and AI agents. Install with 'pip install firecrawl' and use the Firecrawl class: scrape for known URLs (1 credit), crawl for discovery (1 credit per page, always set a limit), and schema-based extraction for typed JSON. Watch Enhanced/Stealth Mode, which costs 5 credits per page on Cloudflare-protected sites, and note that credits do not roll over.

By Marcus Rivera · 5 min · Jun 10, 2026

RAG Grounding: 7 Ways to Stop LLM Hallucinations in Production
Tech Tips

RAG Grounding: 7 Ways to Stop LLM Hallucinations in Production

A practitioner's guide to grounding retrieval-augmented generation systems. Covers fixing retrieval first, hybrid dense-plus-keyword search, cross-encoder reranking, contextual compression, refusal prompting, verified citations, Chain-of-Verification, confidence-threshold abstention, and measuring faithfulness with RAGAS.

By Marcus Rivera · 6 min · Jun 9, 2026

MCP Security: A 2026 Hardening Playbook After CVE-2025-6514
Tech Tips

MCP Security: A 2026 Hardening Playbook After CVE-2025-6514

A practical 2026 security playbook for Model Context Protocol agents. It explains MCP-specific threats (prompt injection, tool poisoning, rug pulls, confused-deputy), dissects the critical CVE-2025-6514 mcp-remote RCE, and gives concrete hardening steps: patch to 0.1.16, enforce OAuth 2.1 over HTTPS, isolate servers, gate destructive actions, and audit agent activity.

By Marcus Rivera · 7 min · Jun 2, 2026

AGENTS.md: Configure AI Coding Agents That Actually Obey
Tech Tips

AGENTS.md: Configure AI Coding Agents That Actually Obey

AGENTS.md is a Linux Foundation-stewarded open standard, adopted by 60,000+ repositories and read natively by 20+ tools including Codex, Cursor, and Copilot. This guide covers the eight core sections, the phrasing patterns that change agent behavior, monorepo nesting, and how it differs from CLAUDE.md, .cursorrules, MCP, and SKILL.md.

By Marcus Rivera · 9 min · May 31, 2026

Prompt Injection: A 2026 Defense Playbook for AI Agents
Tech Tips

Prompt Injection: A 2026 Defense Playbook for AI Agents

A defense playbook for prompt injection in AI agents. It explains why the attack is unsolvable at the model layer, frames the threat with Simon Willison's lethal trifecta (private data, untrusted content, external communication), and prescribes layered controls: architectural separation, least-privilege tools, input filtering, egress allowlisting, circuit breakers, and hardened models, which can cut attack success from 73.2% to 8.7%.

By Marcus Rivera · 6 min · May 30, 2026

Kanwas: The Open-Source AI Workspace That Hit #1 on Product Hunt
Open Source

Kanwas: The Open-Source AI Workspace That Hit #1 on Product Hunt

Kanwas is an open-source AI workspace enabling human-AI collaboration on a shared canvas.

By Marcus Rivera · 5 min · May 29, 2026

Understand-Anything: The 37K-Star Knowledge Graph for Your Codebase
Open Source

Understand-Anything: The 37K-Star Knowledge Graph for Your Codebase

Understand-Anything converts any codebase into a navigable knowledge graph for easier comprehension.

By Marcus Rivera · 5 min · May 28, 2026

Tycoon AI Review: One Operator, an AI CEO, and a Full C-Suite
Reviews

Tycoon AI Review: One Operator, an AI CEO, and a Full C-Suite

Tycoon AI provides solo founders with an AI CEO, Astra, and a full virtual C-suite.

By Marcus Rivera · 6 min · May 25, 2026

Emdash: The Open-Source IDE Built to Run 22 Coding Agents in Parallel
Open Source

Emdash: The Open-Source IDE Built to Run 22 Coding Agents in Parallel

Emdash is an open-source IDE that runs 22 coding agents in parallel for enhanced development.

By Marcus Rivera · 7 min · May 24, 2026

Pipali: The Open-Source Desktop AI Coworker From Khoj AI's YC Team
Open Source

Pipali: The Open-Source Desktop AI Coworker From Khoj AI's YC Team

Pipali is Khoj AI's open-source desktop AI coworker, integrating with your files and apps, supporting multiple LLMs.

By Marcus Rivera · 7 min · May 20, 2026

mattpocock/skills: The 91.7K-Star Repo Reshaping AI-Assisted Engineering
Open Source

mattpocock/skills: The 91.7K-Star Repo Reshaping AI-Assisted Engineering

Matt Pocock's 91.7K-star 'skills' repo on GitHub provides solutions to common failures in Claude Code.

By Marcus Rivera · 6 min · May 19, 2026

Raindrop Workshop: The Local AI Agent Debugger That Hit 473 Stars
Open Source

Raindrop Workshop: The Local AI Agent Debugger That Hit 473 Stars

Raindrop Workshop is an open-source local AI agent debugger, streaming token calls and enabling Claude Code evaluations.

By Marcus Rivera · 6 min · May 18, 2026

OpenHuman: The 776-Star Agent That Reads You Before You Type
Open Source

OpenHuman: The 776-Star Agent That Reads You Before You Type

OpenHuman is a trending AI agent that anticipates user input, featuring a Memory Tree and numerous integrations.

By Marcus Rivera · 8 min · May 17, 2026

Kilo Code v7: The Open-Source AI Agent Rebuilt for Parallel Work
Open Source

Kilo Code v7: The Open-Source AI Agent Rebuilt for Parallel Work

Kilo Code v7 is an open-source AI agent rebuilt for parallel coding, offering subagents and multi-model comparison.

By Marcus Rivera · 5 min · May 16, 2026

Wispr Flow Review: $15 Voice App Eyeing $2B Valuation
Reviews

Wispr Flow Review: $15 Voice App Eyeing $2B Valuation

Our Wispr Flow review covers the AI dictation app's high accuracy across platforms and its cloud-only limitations.

By Marcus Rivera · 6 min · May 15, 2026

Vercel Open Agents: Background Coding Agents You Can Fork
Open Source

Vercel Open Agents: Background Coding Agents You Can Fork

Vercel Open Agents provides forkable, sandbox-isolated background coding agents with GitHub integration and PR creation.

By Marcus Rivera · 6 min · May 14, 2026

GitHub Spec-Kit: The 90K-Star Antidote to Vibe-Coding With AI Agents
Open Source

GitHub Spec-Kit: The 90K-Star Antidote to Vibe-Coding With AI Agents

GitHub's Spec-Kit provides AI coding agents with structured specs, ending 'vibe-coding'.

By Marcus Rivera · 5 min · May 13, 2026

OpenClaw: 371K Stars, Three Rebrands, and a $16M Crypto Scam
Open Source

OpenClaw: 371K Stars, Three Rebrands, and a $16M Crypto Scam

OpenClaw, a self-hosted AI agent, saw rapid growth amidst rebrands and a crypto scam.

By Marcus Rivera · 7 min · May 12, 2026

FlowMarket: The Live Network Where AI Agents Negotiate B2B Deals
Reviews

FlowMarket: The Live Network Where AI Agents Negotiate B2B Deals

FlowMarket enables AI agents to autonomously find and negotiate B2B deals in a live network.

By Marcus Rivera · 7 min · May 8, 2026

Cursor Bugbot Hits 78% Bug Resolution by Learning From Your PRs
Reviews

Cursor Bugbot Hits 78% Bug Resolution by Learning From Your PRs

Cursor Bugbot resolves 78% of bugs by learning from merged PRs, outperforming rivals.

By Marcus Rivera · 7 min · May 7, 2026

Gemini API Webhooks: Kill the Polling Loop on Long-Running Jobs
Tech Tips

Gemini API Webhooks: Kill the Polling Loop on Long-Running Jobs

Google's Gemini API Webhooks eliminate polling loops for long-running jobs, simplifying integration.

By Marcus Rivera · 5 min · May 6, 2026

Postiz: The 29.6K-Star Open-Source Social Scheduler Killing Buffer
Open Source

Postiz: The 29.6K-Star Open-Source Social Scheduler Killing Buffer

Postiz is a popular open-source social scheduler, offering a Buffer alternative with AI agent capabilities.

By Marcus Rivera · 6 min · May 4, 2026

VibeVoice: Microsoft's Open-Source Frontier Voice AI Hits 33K Stars
Open Source

VibeVoice: Microsoft's Open-Source Frontier Voice AI Hits 33K Stars

Microsoft's VibeVoice is an open-source voice AI offering advanced ASR and TTS capabilities.

By Marcus Rivera · 7 min · May 2, 2026

Windsurf 2.0: Cognition Bakes Devin Right Into the IDE
Reviews

Windsurf 2.0: Cognition Bakes Devin Right Into the IDE

Windsurf 2.0 integrates Cognition's Devin AI agent directly into the IDE for seamless management.

By Marcus Rivera · 7 min · Apr 30, 2026

Nemotron 3 Nano Omni: NVIDIA's 30B Open Model Sees and Hears
Open Source

Nemotron 3 Nano Omni: NVIDIA's 30B Open Model Sees and Hears

NVIDIA's Nemotron 3 Nano Omni is a 30B open multimodal model, processing diverse data with high throughput.

By Marcus Rivera · 6 min · Apr 29, 2026

Archon OS: The Open-Source Brain That Makes Claude Code Remember
Open Source

Archon OS: The Open-Source Brain That Makes Claude Code Remember

Archon OS is an open-source knowledge backbone that helps AI coding assistants remember information.

By Marcus Rivera · 6 min · Apr 21, 2026

Goose: Block's Open-Source Local-First AI Agent Hits 35K
Open Source

Goose: Block's Open-Source Local-First AI Agent Hits 35K

Block's open-source, local-first AI agent, Goose, gained 35K stars and joined the Linux Foundation.

By Marcus Rivera · 5 min · Apr 20, 2026

ElevenCreative Review: ElevenLabs' All-in-One AI Studio
Reviews

ElevenCreative Review: ElevenLabs' All-in-One AI Studio

ElevenCreative by ElevenLabs combines all AI content creation tools into one platform.

By Marcus Rivera · 6 min · Apr 19, 2026

Voicebox: The Local-First Voice Cloning Studio for Mac and Windows
Open Source

Voicebox: The Local-First Voice Cloning Studio for Mac and Windows

Voicebox is a local-first, open-source voice cloning studio for Mac and Windows, built on Qwen3-TTS.

By Marcus Rivera · 7 min · Apr 18, 2026

NVIDIA Ising: Open-Source AI Models That Make Quantum Computing Actually Work
Open Source

NVIDIA Ising: Open-Source AI Models That Make Quantum Computing Actually Work

NVIDIA Ising offers open-source AI models to improve quantum error correction, making quantum computing more effective.

By Marcus Rivera · 6 min · Apr 17, 2026

GLM-5.1: The Open-Source 754B Model That Works for Eight Hours Straight
Open Source

GLM-5.1: The Open-Source 754B Model That Works for Eight Hours Straight

Z.ai's GLM-5.1 is an open-source 754B model that runs autonomously for eight hours, enhancing AI agents.

By Marcus Rivera · 5 min · Apr 15, 2026

Caveman: The Claude Code Skill That Cuts 65% of Output Tokens
Tech Tips

Caveman: The Claude Code Skill That Cuts 65% of Output Tokens

Caveman, a Claude Code skill, dramatically cuts AI output tokens by 65%, optimizing agent interactions.

By Marcus Rivera · 5 min · Apr 15, 2026

Ghost Pepper: 100% Local Speech-to-Text for macOS
Open Source

Ghost Pepper: 100% Local Speech-to-Text for macOS

Ghost Pepper offers secure, 100% local speech-to-text and transcription for macOS on Apple Silicon.

By Marcus Rivera · 5 min · Apr 15, 2026

Edgee Codex Compressor: The Rust Gateway That Cuts Codex Costs 35.6%
Tech Tips

Edgee Codex Compressor: The Rust Gateway That Cuts Codex Costs 35.6%

Edgee Codex Compressor, a Rust gateway, cuts LLM costs by 35.6% by compressing tool output.

By Marcus Rivera · 4 min · Apr 12, 2026

Ray: The Open-Source AI Financial Advisor That Runs on Your Laptop
Open Source

Ray: The Open-Source AI Financial Advisor That Runs on Your Laptop

Ray is an open-source AI financial advisor that runs locally, securing your data while connecting to banks.

By Marcus Rivera · 4 min · Apr 12, 2026

Hermes Agent: The Open-Source AI Agent That Learns How You Work
Open Source

Hermes Agent: The Open-Source AI Agent That Learns How You Work

Hermes Agent is an open-source AI that learns your workflow with persistent memory and skill creation.

By Marcus Rivera · 4 min · Apr 12, 2026

Cohere Transcribe: The Open-Source ASR Model That Dethroned Whisper
Open Source

Cohere Transcribe: The Open-Source ASR Model That Dethroned Whisper

Cohere Transcribe, an open-source ASR model, dethrones Whisper with superior accuracy and speed.

By Marcus Rivera · 5 min · Apr 11, 2026

Baton: The Desktop App for Orchestrating AI Coding Agents
Reviews

Baton: The Desktop App for Orchestrating AI Coding Agents

Baton is a desktop app for developers to orchestrate, monitor, and review multiple AI coding agents.

By Marcus Rivera · 4 min · Apr 10, 2026

Gemini CLI: Google's Open-Source Terminal Agent Hits 101K GitHub Stars
Open Source

Gemini CLI: Google's Open-Source Terminal Agent Hits 101K GitHub Stars

Google's Gemini CLI, an open-source terminal AI agent, hits 101K GitHub stars with new features.

By Marcus Rivera · 4 min · Apr 9, 2026

Google Gemma 4: Four Open Models That Punch Above Their Weight
Open Source

Google Gemma 4: Four Open Models That Punch Above Their Weight

Google DeepMind's Gemma 4 releases four open-weight models that rank high and run on consumer hardware.

By Marcus Rivera · 4 min · Apr 8, 2026

Moondream 3: The 9B Vision Model That Runs Like a 2B
Open Source

Moondream 3: The 9B Vision Model That Runs Like a 2B

Moondream 3 is a 9B vision model that runs efficiently like a 2B, offering advanced capabilities.

By Marcus Rivera · 4 min · Apr 1, 2026

Voxtral TTS: Mistral's Open-Weight Speech Model Challenges ElevenLabs
Open Source

Voxtral TTS: Mistral's Open-Weight Speech Model Challenges ElevenLabs

Mistral's Voxtral TTS is an open-weight speech model challenging ElevenLabs with naturalness and voice cloning.

By Marcus Rivera · 4 min · Mar 31, 2026

5 Best AI Tool Directories in 2026: Find the Right Tool Fast
Reviews

5 Best AI Tool Directories in 2026: Find the Right Tool Fast

Compare the top 5 AI tool directories to quickly find the best AI tools for your needs.

By Marcus Rivera · 7 min · Mar 30, 2026

Mistral Small 4: One Open-Source Model Replaces Three Separate AI Products
Open Source

Mistral Small 4: One Open-Source Model Replaces Three Separate AI Products

Mistral Small 4 unifies three AI products into one powerful open-source model, simplifying capabilities.

By Marcus Rivera · 4 min · Mar 30, 2026

LTX 2.3: Lightricks' Open-Source Model Generates 4K Video with Synced Audio
AI News

LTX 2.3: Lightricks' Open-Source Model Generates 4K Video with Synced Audio

Lightricks' LTX 2.3 is an open-source model generating native 4K video with perfectly synced audio.

By Marcus Rivera · 6 min · Mar 29, 2026

Biome v2.4: The Rust-Powered Toolchain Replacing ESLint and Prettier
Open Source

Biome v2.4: The Rust-Powered Toolchain Replacing ESLint and Prettier

Biome v2.4, a Rust toolchain, is replacing ESLint and Prettier with faster, more efficient performance.

By Marcus Rivera · 6 min · Mar 29, 2026

OpenClaw: The Self-Hosted AI Agent That Hit 247K GitHub Stars
Open Source

OpenClaw: The Self-Hosted AI Agent That Hit 247K GitHub Stars

OpenClaw, a self-hosted AI agent integrating with various UIs, quickly became a GitHub sensation.

By Marcus Rivera · 6 min · Mar 29, 2026