<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Hans Christian Thjømøe — Blog</title><description>Notes on software architecture, AI tooling, agentic workflows, and self-hosted local AI.</description><link>https://www.neoteric.no/</link><language>en-us</language><item><title>Claude Code&apos;s /simplify Stopped Fixing Code Yesterday</title><link>https://www.neoteric.no/blog/claude-code-s-simplify-stopped-fixing-code-yesterday/</link><guid isPermaLink="true">https://www.neoteric.no/blog/claude-code-s-simplify-stopped-fixing-code-yesterday/</guid><description>Claude Code 2.1.147 renamed /simplify to /code-review and dropped the auto-fix behavior. The new command reports bugs at chosen effort levels but no longer changes code.</description><pubDate>Fri, 22 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>agentic-workflows</category><category>tooling</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Your Private MCP Server Is Now Claude-Reachable</title><link>https://www.neoteric.no/blog/your-private-mcp-server-is-now-claude-reachable/</link><guid isPermaLink="true">https://www.neoteric.no/blog/your-private-mcp-server-is-now-claude-reachable/</guid><description>Anthropic shipped MCP tunnels on May 19. Claude agents can call internal databases, ticketing systems, and on-prem APIs through one outbound connection — no inbound firewall rules required.</description><pubDate>Thu, 21 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>mcp</category><category>agentic-workflows</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Your vLLM Thinking Budget Was Doing Nothing With MTP On</title><link>https://www.neoteric.no/blog/your-vllm-thinking-budget-was-doing-nothing-with-mtp-on/</link><guid isPermaLink="true">https://www.neoteric.no/blog/your-vllm-thinking-budget-was-doing-nothing-with-mtp-on/</guid><description>vLLM 0.21.0 shipped Friday with a quiet fix: thinking_token_budget was being silently ignored when MTP speculative decoding was enabled. If you serve reasoning models with spec decode, you have been paying for it.</description><pubDate>Mon, 18 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>local-ai</category><category>vllm</category><category>speculative-decoding</category><category>benchmarks</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Claude Code v2.1.100+ Burns ~20K Phantom Tokens Per Request</title><link>https://www.neoteric.no/blog/claude-code-v2-1-100-burns-20k-phantom-tokens-per-request/</link><guid isPermaLink="true">https://www.neoteric.no/blog/claude-code-v2-1-100-burns-20k-phantom-tokens-per-request/</guid><description>A server-side bug in Claude Code v2.1.100+ inflates every request by roughly 20K cache_creation tokens — about 40% overhead. Pin v2.1.98 until fixed.</description><pubDate>Sun, 17 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>agentic-workflows</category><category>benchmarks</category><category>industry-signal</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Your Local Qwen3.6 Throughput Probably Just Halved (and How to Fix It)</title><link>https://www.neoteric.no/blog/llama-cpp-mtp-flag-rename/</link><guid isPermaLink="true">https://www.neoteric.no/blog/llama-cpp-mtp-flag-rename/</guid><description>llama.cpp renamed the MTP flag on May 13. The old --spec-type mtp is silently ignored. If your tok/s dropped from 140 to 70 you are likely running without speculative decoding.</description><pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>llama.cpp</category><category>qwen</category><category>local-ai</category><category>speculative-decoding</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>MCP Server Roundup: Which Are Actually Worth Adding to Your Setup in May 2026</title><link>https://www.neoteric.no/blog/mcp-server-roundup-may-2026/</link><guid isPermaLink="true">https://www.neoteric.no/blog/mcp-server-roundup-may-2026/</guid><description>Eighteen months after Anthropic released MCP, the ecosystem is wide enough that picking the wrong servers slows your agent down. Here is the practical short list — what to install, what to skip, and the trap most people fall into.</description><pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>mcp</category><category>claude</category><category>tooling</category><category>agentic-workflows</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Speculative Decoding Explained: Why Your Local Model Got 2× Faster in 2026</title><link>https://www.neoteric.no/blog/speculative-decoding-why-local-ai-got-fast/</link><guid isPermaLink="true">https://www.neoteric.no/blog/speculative-decoding-why-local-ai-got-fast/</guid><description>The same Qwen3.6-27B that ran at 70 tokens/sec on a 4090 in January was running at 140 tokens/sec by April. Nothing changed about the model. Speculative decoding moved from research curiosity to default. Here is what it actually does.</description><pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>local-ai</category><category>llama-cpp</category><category>performance</category><category>speculative-decoding</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Third-Party Claude Agents Lose the Subscription Subsidy June 15</title><link>https://www.neoteric.no/blog/third-party-claude-agents-lose-the-subscription-subsidy-june-15/</link><guid isPermaLink="true">https://www.neoteric.no/blog/third-party-claude-agents-lose-the-subscription-subsidy-june-15/</guid><description>Anthropic is splitting Claude billing on June 15 — Agent SDK and ACP usage moves to a capped credit pool ($20/$100/$200) at full API rates.</description><pubDate>Sat, 16 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>agentic-workflows</category><category>industry-signal</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>The Local AI Inflection Point: May 2026</title><link>https://www.neoteric.no/blog/local-ai-inflection-point-may-2026/</link><guid isPermaLink="true">https://www.neoteric.no/blog/local-ai-inflection-point-may-2026/</guid><description>Three model releases in three weeks moved local AI from &apos;good enough for hobbies&apos; to &apos;good enough for production&apos;. Here&apos;s what changed and why it matters.</description><pubDate>Fri, 15 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>local-ai</category><category>qwen</category><category>gemma</category><category>self-hosted</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Running Qwen3.6-27B Locally: Hardware, Quantization, and What Actually Works</title><link>https://www.neoteric.no/blog/running-qwen-3-6-27b-locally/</link><guid isPermaLink="true">https://www.neoteric.no/blog/running-qwen-3-6-27b-locally/</guid><description>A practical guide to running Qwen3.6-27B on consumer hardware in 2026 — memory requirements per quant level, recommended runners, and the MTP trick that doubles your tokens per second.</description><pubDate>Mon, 11 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>qwen</category><category>local-ai</category><category>llama.cpp</category><category>homelab</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>A 27B Model on a Single GPU Is 10 Points Off Claude Opus 4.7</title><link>https://www.neoteric.no/blog/qwen-3-6-27b-vs-claude-opus-4-7-benchmarks/</link><guid isPermaLink="true">https://www.neoteric.no/blog/qwen-3-6-27b-vs-claude-opus-4-7-benchmarks/</guid><description>Qwen3.6-27B running locally now scores within 10 points of frontier closed models on SWE-bench Verified. The benchmark table, lined up side by side.</description><pubDate>Fri, 08 May 2026 00:00:00 GMT</pubDate><category>ai</category><category>qwen</category><category>claude</category><category>local-ai</category><category>benchmarks</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Claude Opus 4.7: 87.6% on SWE-bench and 1M Context at Standard Pricing</title><link>https://www.neoteric.no/blog/claude-opus-4-7-coding-leap/</link><guid isPermaLink="true">https://www.neoteric.no/blog/claude-opus-4-7-coding-leap/</guid><description>Anthropic shipped Opus 4.7 on April 16, 2026, with a seven-point SWE-bench jump, the 1M context window now generally available with no premium, and a new task budget primitive for agent loops.</description><pubDate>Fri, 17 Apr 2026 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>anthropic</category><category>coding</category><category>agents</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Gemma 4: Google&apos;s Open Model Family Goes Multimodal</title><link>https://www.neoteric.no/blog/google-gemma-4-open-models/</link><guid isPermaLink="true">https://www.neoteric.no/blog/google-gemma-4-open-models/</guid><description>Google released Gemma 4 on April 2, 2026 — four variants from 2B to 31B, with 256K context, native vision and audio, and Apache 2.0 licensing. Here&apos;s what it&apos;s for, where it fits, and how to run it.</description><pubDate>Sun, 05 Apr 2026 00:00:00 GMT</pubDate><category>ai</category><category>google</category><category>gemma</category><category>local-ai</category><category>multimodal</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>What&apos;s New in Optimizely CMS 13: The Big Picture</title><link>https://www.neoteric.no/blog/optimizely-cms-13-whats-new/</link><guid isPermaLink="true">https://www.neoteric.no/blog/optimizely-cms-13-whats-new/</guid><description>Optimizely CMS 13 went GA on April 1, 2026. Visual Builder is now the default editor, Content Manager replaces tree-first navigation, Optimizely Graph and Opti ID are mandatory, and the platform jumps to .NET 10. Here&apos;s what actually changed, where it&apos;s worth caring, and what the upgrade is going to cost you.</description><pubDate>Wed, 01 Apr 2026 00:00:00 GMT</pubDate><category>optimizely</category><category>cms</category><category>dotnet</category><category>headless</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Claude Opus 4.6: A Million-Token Context and a New Agent Team Model</title><link>https://www.neoteric.no/blog/claude-opus-4-6-million-token-context-and-agent-teams/</link><guid isPermaLink="true">https://www.neoteric.no/blog/claude-opus-4-6-million-token-context-and-agent-teams/</guid><description>Anthropic released Opus 4.6 on February 5, 2026, with a 1M token context beta, agent teams, adaptive thinking, and developer effort controls — all at the same price as 4.5.</description><pubDate>Fri, 06 Feb 2026 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>anthropic</category><category>agents</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Introducing Azure DevOps Workflow: Manage Work Items Without Leaving VS Code</title><link>https://www.neoteric.no/blog/azure-devops-workflow-vscode-extension/</link><guid isPermaLink="true">https://www.neoteric.no/blog/azure-devops-workflow-vscode-extension/</guid><description>A VS Code extension that brings Azure DevOps sprint boards, work item management, and AI-powered assistance directly into your editor.</description><pubDate>Tue, 25 Nov 2025 00:00:00 GMT</pubDate><category>vscode</category><category>azure-devops</category><category>productivity</category><category>open-source</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Claude Opus 4.5: Anthropic&apos;s New Flagship Model Sets the Bar for AI Coding</title><link>https://www.neoteric.no/blog/claude-opus-4-5-anthropics-new-flagship/</link><guid isPermaLink="true">https://www.neoteric.no/blog/claude-opus-4-5-anthropics-new-flagship/</guid><description>Anthropic&apos;s latest model achieves state-of-the-art results in agentic coding and brings meaningful improvements across reasoning, mathematics, and everyday tasks.</description><pubDate>Tue, 25 Nov 2025 00:00:00 GMT</pubDate><category>ai</category><category>claude</category><category>anthropic</category><category>coding</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item><item><title>Google Gemini 3 Pro: The New Leader in Multimodal AI</title><link>https://www.neoteric.no/blog/google-gemini-3-pro-multimodal-reasoning/</link><guid isPermaLink="true">https://www.neoteric.no/blog/google-gemini-3-pro-multimodal-reasoning/</guid><description>Google&apos;s Gemini 3 Pro brings generative interfaces, 1M token context, and state-of-the-art multimodal reasoning to developers and consumers alike.</description><pubDate>Tue, 25 Nov 2025 00:00:00 GMT</pubDate><category>ai</category><category>google</category><category>gemini</category><category>multimodal</category><author>post@neoteric.no (Hans Christian Thjømøe)</author></item></channel></rss>