<feed xmlns="http://www.w3.org/2005/Atom"> <id>https://vinayvobbili.github.io/</id><title>Vinay Vobbilichetty</title><subtitle>Engineering notes on distributed LLM platforms, agentic AI, and security automation — from someone shipping production systems at scale.</subtitle> <updated>2026-06-06T23:30:45-04:00</updated> <author> <name>Vinay Vobbilichetty</name> <uri>https://vinayvobbili.github.io/</uri> </author><link rel="self" type="application/atom+xml" href="https://vinayvobbili.github.io/feed.xml"/><link rel="alternate" type="text/html" hreflang="en" href="https://vinayvobbili.github.io/"/> <generator uri="https://jekyllrb.com/" version="4.4.1">Jekyll</generator> <rights> © 2026 Vinay Vobbilichetty </rights> <icon>/assets/img/favicons/favicon.ico</icon> <logo>/assets/img/favicons/favicon-96x96.png</logo> <entry><title>The Day My AI SOC Went Quiet</title><link href="https://vinayvobbili.github.io/posts/the-day-my-ai-soc-went-quiet/" rel="alternate" type="text/html" title="The Day My AI SOC Went Quiet" /><published>2026-06-06T12:00:00-04:00</published> <updated>2026-06-06T12:00:00-04:00</updated> <id>https://vinayvobbili.github.io/posts/the-day-my-ai-soc-went-quiet/</id> <content type="text/html" src="https://vinayvobbili.github.io/posts/the-day-my-ai-soc-went-quiet/" /> <author> <name>Vinay Vobbilichetty</name> </author> <category term="Security" /> <category term="LLM" /> <summary>A multi-agent AI SOC stopped posting verdicts — no errors, no crash, no alert. Just silence. The failover had worked perfectly, and that was exactly the problem. A post-mortem on success-shaped failures in LLM systems, and why "it didn't throw" is the most dangerous sentence in production AI.</summary> </entry> <entry><title>detflow: A Detection-Engineering Copilot You Can pip install</title><link href="https://vinayvobbili.github.io/posts/detflow-detection-engineering-copilot/" rel="alternate" type="text/html" title="detflow: A Detection-Engineering Copilot You Can pip install" /><published>2026-06-06T09:00:00-04:00</published> <updated>2026-06-06T17:36:53-04:00</updated> <id>https://vinayvobbili.github.io/posts/detflow-detection-engineering-copilot/</id> <content type="text/html" src="https://vinayvobbili.github.io/posts/detflow-detection-engineering-copilot/" /> <author> <name>Vinay Vobbilichetty</name> </author> <category term="Security" /> <category term="LLM" /> <summary>I kept rebuilding the same four things inside every detection-as-code pipeline — lint a rule, draft one from plain English, check it against what you already run, and review it like a senior engineer. So I extracted them into detflow, a vendor-neutral OSS Python package. Deterministic lint and overlap with no dependencies, model-agnostic drafting and review, and a never-raises contract so it degrades instead of breaking.</summary> </entry> <entry><title>iocflow: Turning a Production AI SOC into a Shippable OSS Library</title><link href="https://vinayvobbili.github.io/posts/iocflow-agentic-ioc-lifecycle/" rel="alternate" type="text/html" title="iocflow: Turning a Production AI SOC into a Shippable OSS Library" /><published>2026-05-31T09:00:00-04:00</published> <updated>2026-06-06T17:36:53-04:00</updated> <id>https://vinayvobbili.github.io/posts/iocflow-agentic-ioc-lifecycle/</id> <content type="text/html" src="https://vinayvobbili.github.io/posts/iocflow-agentic-ioc-lifecycle/" /> <author> <name>Vinay Vobbilichetty</name> </author> <category term="Security" /> <category term="LLM" /> <summary>After building SOC-in-a-Box — a multi-agent AI SOC where one local LLM wears every hat behind a human-in-the-loop gate — I distilled the durable lesson into iocflow, an open-source Python package for the whole IOC lifecycle. Deterministic primitives (extract → enrich → comment → hunt → block) as tools, a LangGraph multi-agent team on top, and three-layer authority so the LLM never gets the final say on a destructive action.</summary> </entry> <entry><title>SOC-in-a-Box: One LLM, Eight Hats, A Production-Bar AI SOC on a Single GPU</title><link href="https://vinayvobbili.github.io/posts/building-soc-in-a-box/" rel="alternate" type="text/html" title="SOC-in-a-Box: One LLM, Eight Hats, A Production-Bar AI SOC on a Single GPU" /><published>2026-05-30T09:00:00-04:00</published> <updated>2026-06-06T17:36:53-04:00</updated> <id>https://vinayvobbili.github.io/posts/building-soc-in-a-box/</id> <content type="text/html" src="https://vinayvobbili.github.io/posts/building-soc-in-a-box/" /> <author> <name>Vinay Vobbilichetty</name> </author> <category term="Security" /> <category term="LLM" /> <summary>An AVP-sponsored multi-agent SOC where one local LLM plays Sentinel, Tier 2, IR Lead, Threat Intel, SOC Manager, Detection Engineer, and Threat Hunter — coordinated over a Redis Streams bus with a human-in-the-loop approval gate before any real-system action. The framework choices, the architectural trade-offs, and the backtest harness that lets us put real numbers on agent quality before going live.</summary> </entry> <entry><title>Three Chat Template Patterns That Silently Kill Your Prompt Cache</title><link href="https://vinayvobbili.github.io/posts/three-chat-template-patterns-kill-prompt-cache/" rel="alternate" type="text/html" title="Three Chat Template Patterns That Silently Kill Your Prompt Cache" /><published>2026-05-14T09:00:00-04:00</published> <updated>2026-05-14T22:06:37-04:00</updated> <id>https://vinayvobbili.github.io/posts/three-chat-template-patterns-kill-prompt-cache/</id> <content type="text/html" src="https://vinayvobbili.github.io/posts/three-chat-template-patterns-kill-prompt-cache/" /> <author> <name>Vinay Vobbilichetty</name> </author> <category term="LLM" /> <category term="Performance" /> <summary>Before swapping models on a prompt-caching LLM backend, three Jinja2 patterns in the chat template will quietly break your cache hit rate. A 5-minute check against the published tokenizer config catches all three — no GPU, no weights, no inference required.</summary> </entry> </feed>
