Google AI Introduces STATIC: A Sparse Matrix Framework Delivering 948x Faster Constrained Decoding for LLM-based Generative Retrieval
This is one of those times I read something and go, "Wow! That was interesting! But wtf did any of that mean?" Somehow it's always theory math. Too many sweats in the field. Putting on my serious hat now.

This is an interesting idea. The core concept of distinguishing RAG (querying based on vector similarity) from generative retrieval with constrained decoding (querying based on an ID the model just made up) is another abstraction of AI using AI using AI. That idea, at its core, sounds like something someone at FTX would come up with. But then Google basically back-propagates through this ID generation and trains it to be relatively smart, and it works somehow. It's the classic move of AI algorithms: given some context of the past, make an educated guess instead of doing the hard work. And because we never ACTUALLY do the work, we just have to make sure that the answer we came up with is in the database and matches some filters we set up, which the new STATIC framework handles with sparse matrices, and it's something like 1000x faster and more energy efficient. Wild.
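To make the "make sure the answer is in the database" part concrete, here's a minimal sketch of constrained decoding over a closed set of document IDs. This is not Google's STATIC implementation (which, per the headline, uses sparse matrices for the constraint check); a plain prefix trie built from hypothetical ID strings plays the constraint's role here. At each step, the decoder may only emit characters that extend some valid ID, so it can never "make up" an ID that isn't in the database.

```python
# Minimal constrained-decoding sketch over a closed set of document IDs.
# NOT the STATIC framework; the trie here stands in for whatever structure
# (e.g. sparse matrices) encodes "which continuations are valid IDs."

VALID_IDS = ["doc-12", "doc-17", "doc-9"]  # hypothetical ID strings

def build_trie(ids):
    """Build a character-level prefix trie of all valid IDs."""
    trie = {}
    for s in ids:
        node = trie
        for ch in s:
            node = node.setdefault(ch, {})
        node["<end>"] = {}  # marks a complete valid ID
    return trie

def allowed_next(trie, prefix):
    """Return the set of characters that can legally follow `prefix`."""
    node = trie
    for ch in prefix:
        if ch not in node:
            return set()
        node = node[ch]
    return set(node)

def greedy_constrained_decode(trie, score):
    """Greedy decode: `score(prefix, ch)` stands in for the LLM's logits,
    but only characters the trie allows are ever considered."""
    out = ""
    while True:
        options = allowed_next(trie, out)
        if options == {"<end>"}:   # only legal move is to stop
            return out
        options.discard("<end>")
        out += max(options, key=lambda ch: score(out, ch))

trie = build_trie(VALID_IDS)
# A toy scorer that prefers '1', then '7', steers decoding to "doc-17".
result = greedy_constrained_decode(trie, lambda p, c: {"1": 2, "7": 1}.get(c, 0))
print(result)  # → doc-17
```

The point of the sketch: the model's "educated guess" is filtered at every step, so the output is guaranteed to exist in the index without ever running a similarity search over the whole collection.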