<?xml version="1.0" encoding="UTF-8" ?>
<?xml-stylesheet type="text/xsl" href="rss.xsl"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    
    <title>Sieve</title>
    <description>Transparent context reduction for LLMs.</description>
    <link>https://llmsieve.dev/</link>
    <atom:link href="https://llmsieve.dev/feed_rss_updated.xml" rel="self" type="application/rss+xml" />

    
    
    <docs>https://github.com/llmsieve/llm-sieve</docs>
    <language>en</language>

    
    <pubDate>Mon, 15 Jun 2026 13:23:14 +0000</pubDate>
    <lastBuildDate>Mon, 15 Jun 2026 13:23:14 +0000</lastBuildDate>
    <ttl>1440</ttl>

    
    <generator>MkDocs RSS plugin - v1.19.0</generator>

    
    
    

    
    
    <item>
      <title>The two causes of your token bill</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Design</category>
        
      
      <description>Agent token bills have two separate causes. Why compression and reduction are complementary fixes, not rivals.
</description>
      <link>https://llmsieve.dev/blog/2026/06/15/the-two-causes-of-your-token-bill/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/15/the-two-causes-of-your-token-bill/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-two-causes-token-cost.png" type="image/png" length="0" />
      
    </item>
    
    <item>
      <title>Sieve, mem0, Zep: three shapes of agent memory</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Design</category>
        
      
      <description>mem0&#39;s SDK, Zep&#39;s managed platform, and Sieve&#39;s transparent proxy solve agent memory in three different shapes. How to pick.
</description>
      <link>https://llmsieve.dev/blog/2026/06/10/sieve-mem0-zep-three-shapes-of-agent-memory/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/10/sieve-mem0-zep-three-shapes-of-agent-memory/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-three-shapes-of-memory.png" type="image/png" length="51988" />
      
    </item>
    
    <item>
      <title>Persistent memory for Ollama, in about five minutes</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Guides</category>
        
      
      <description>Give any Ollama model persistent, encrypted memory in about five minutes — without changing your client code.
</description>
      <link>https://llmsieve.dev/blog/2026/06/10/persistent-memory-for-ollama-in-about-five-minutes/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/10/persistent-memory-for-ollama-in-about-five-minutes/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-persistent-memory-for-ollama.png" type="image/png" length="46761" />
      
    </item>
    
    <item>
      <title>The hidden cost of context</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Engineering</category>
        
      
      <description>Why LLM agent token bills grow faster than developers expect — and what context actually costs in compute at scale.
</description>
      <link>https://llmsieve.dev/blog/2026/06/10/the-hidden-cost-of-context/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/10/the-hidden-cost-of-context/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-hidden-cost-of-context.png" type="image/png" length="43338" />
      
    </item>
    
    <item>
      <title>What always-on agents stand to gain from a context proxy</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Design</category>
        
      
      <description>OpenClaw, Hermes, and the always-on agent workload — why gateway-style assistants are the heaviest context spenders yet.
</description>
      <link>https://llmsieve.dev/blog/2026/06/10/what-always-on-agents-stand-to-gain-from-a-context-proxy/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/10/what-always-on-agents-stand-to-gain-from-a-context-proxy/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-always-on-agents.png" type="image/png" length="61502" />
      
    </item>
    
    <item>
      <title>Why Sieve</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Design</category>
        
      
      <description>Why Sieve is a transparent proxy rather than a memory library or prompt compaction — and what that trade-off costs you.
</description>
      <link>https://llmsieve.dev/blog/2026/06/09/why-sieve/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/09/why-sieve/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-why-sieve.png" type="image/png" length="38298" />
      
    </item>
    
    <item>
      <title>Compute is the bottleneck. Tokens are just the price tag.</title>
      
      
        
      <author>Sieve Engineering</author>
        
      
      
      
        
      <category>Economics</category>
        
      
      <description>Compute is the binding constraint on AI. Tokens just reprice it. Workload shape is the one lever you fully control.
</description>
      <link>https://llmsieve.dev/blog/2026/06/09/compute-is-the-bottleneck-tokens-are-just-the-price-tag/</link>
      <pubDate>Mon, 15 Jun 2026 13:23:15 +0000</pubDate>
      <source url="https://llmsieve.dev/feed_rss_updated.xml">Sieve</source>
      
      <guid isPermaLink="true">https://llmsieve.dev/blog/2026/06/09/compute-is-the-bottleneck-tokens-are-just-the-price-tag/</guid>
      
      <enclosure url="https://llmsieve.dev/assets/social/social-card-compute-bottleneck.png" type="image/png" length="56922" />
      
    </item>
    
  </channel>
</rss>