<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Event Archives - ShiftMag</title>
	<atom:link href="https://shiftmag.dev/category/event/feed/" rel="self" type="application/rss+xml" />
	<link>https://shiftmag.dev/category/event/</link>
	<description>Insightful engineering content &#38; community</description>
	<lastBuildDate>Wed, 27 May 2026 13:36:40 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shiftmag.dev/wp-content/uploads/2024/08/cropped-ShiftMag-favicon-32x32.png</url>
	<title>Event Archives - ShiftMag</title>
	<link>https://shiftmag.dev/category/event/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Trisha Gee: AI Won&#8217;t Fix Your Broken Pipeline &#8211; It Will Break It Faster</title>
		<link>https://shiftmag.dev/trisha-gee-ai-wont-fix-your-broken-pipeline-it-will-break-it-faster-9785/</link>
		
		<dc:creator><![CDATA[Ivan Pelivanovic]]></dc:creator>
		<pubDate>Wed, 27 May 2026 13:36:39 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Developer Productivity]]></category>
		<category><![CDATA[development]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9785</guid>

					<description><![CDATA[<p>At Devoxx UK, I spoke with Trisha Gee - author and one of the most recognized voices in the Java space - about what really happens when teams lean heavily on AI. Her take was far darker than the conference hype.</p>
<p>The post <a href="https://shiftmag.dev/trisha-gee-ai-wont-fix-your-broken-pipeline-it-will-break-it-faster-9785/">Trisha Gee: AI Won&#8217;t Fix Your Broken Pipeline &#8211; It Will Break It Faster</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img fetchpriority="high" decoding="async" width="1200" height="720" src="https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx-768x461.jpg 768w" sizes="(max-width: 1200px) 100vw, 1200px" /></figure>

<div class="wp-block-post-excerpt wp-block-hidden-desktop wp-block-hidden-mobile wp-block-hidden-tablet"><p class="wp-block-post-excerpt__excerpt">At Devoxx UK, I spoke with Trisha Gee &#8211; author and one of the most recognized voices in the Java space &#8211; about what really happens when teams lean heavily on AI. Her take was far darker than the conference hype. </p></div>


<p class="wp-block-paragraph"><strong>Trisha Gee</strong> has spent over two decades in software development, from startups to global enterprises &#8211; equally at home discussing DORA metrics and SPACE frameworks as business outcomes and organizational design.</p>



<p class="wp-block-paragraph">At <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx UK</a>, she gave a talk about how <strong>software engineering principles stay the same regardless of what tooling era you are in</strong>. </p>



<p class="wp-block-paragraph">I wanted to understand what that means right now when AI is writing a significant portion of the code.</p>



<h2 class="wp-block-heading"><span id="ai-exposes-the-weakest-link-not-just-the-fastest-path">AI exposes the weakest link, not just the fastest path</span></h2>



<p class="wp-block-paragraph">Trisha frames AI as an amplifier, not a solution. When I asked what that looks like beyond demos, she put it simply: <strong>it exposes the problems that were already there</strong>, the ones you didn&#8217;t know you had.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The most common thing I saw (I was working at Gradle, so we dealt with a lot of build tooling) was more code, more tests, and tests taking longer. The continuous delivery pipeline took a lot of pressure.</p>
</blockquote>



<p class="wp-block-paragraph">The broader pattern she describes is straightforward but easy to miss when you are excited about shipping faster. &#8220;Whichever part of your system is the weakest, it&#8217;s going to expose that part,&#8221; she said.</p>



<p class="wp-block-paragraph">Reframing it this way, while most conversations about AI adoption focus on what gets faster, Trisha highlights what deteriorates first.</p>



<h2 class="wp-block-heading"><span id="when-code-gets-cheap-everything-else-gets-expensive">When code gets cheap, everything else gets expensive</span></h2>



<p class="wp-block-paragraph">When I asked Trisha where teams should focus once code generation becomes cheap, her answer was <em>everywhere</em>.</p>



<figure class="wp-block-image size-large"><img decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-1024x614.jpg?x94846" alt="" class="wp-image-9877" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-768x461.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">&nbsp;Photo: DevoxxUK / Flickr</figcaption></figure>



<p class="wp-block-paragraph">What she means is that optimizing the <strong>writing of code without understanding the surrounding system does not move the needle</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It&#8217;s not about one thing which is going to fix one problem, it&#8217;s about really understanding the whole system, it&#8217;s about understanding even the whole organization, the whole enterprise. Where does IT and technology and software fit into that? What are you really trying to deliver? What is the business benefit?</p>
</blockquote>



<p class="wp-block-paragraph">She described this as <strong>working across two ends of the process</strong>. On the input side, teams need to get better at questioning requirements before writing anything. On the output side, they need to look at build pipelines, test parallelism, flaky tests, and DORA metrics.</p>



<p class="wp-block-paragraph">&#8220;If you can measure those things (your DORA metrics, build times, whether delivered requirements actually give users value) you can start to see which parts of the process are working and which need attention,&#8221; Trisha explained.</p>



<h2 class="wp-block-heading"><span id="measuring-the-wrong-things-optimizes-the-wrong-things">Measuring the wrong things optimizes the wrong things</span></h2>



<p class="wp-block-paragraph">She also makes a sharp point about measurement and optimization.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you measure lines of code for productivity, you&#8217;ll get more lines of code. But really productivity is not just about what we call these activity metrics. It&#8217;s not just lines of code. It&#8217;s not just pull requests, merges, features delivered.</p>
</blockquote>



<p class="wp-block-paragraph">The thing teams consistently miss is the full arc of delivery.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Developer experience and productivity is the whole piece. Did it get out to the user? Did it meet the user&#8217;s needs? Is the user paying for more of our stuff? Is the business getting what they need from what the developers are doing? What you&#8217;re measuring there impacts what you&#8217;re going to optimize.</p>
</blockquote>



<p class="wp-block-paragraph">That last line is worth sitting with. If your productivity metrics stop at pull requests merged, you are optimizing for pull requests merged.</p>



<h2 class="wp-block-heading"><span id="the-space-framework-and-why-three-metrics-beat-one"><br>The SPACE framework and why three metrics beat one</span></h2>



<p class="wp-block-paragraph">When I asked Trisha what teams should measure, she pointed to the SPACE framework. SPACE stands for <strong>satisfaction</strong>, <strong>performance</strong>, <strong>activity</strong>, <strong>communication and collaboration</strong>, and <strong>efficiency and flow</strong>.</p>



<p class="wp-block-paragraph">DORA metrics, which most teams are more familiar with, are a subset of it. Her recommendation is to <strong>pick metrics from three different dimensions</strong> rather than relying on a single category. The reasoning is that single-category metrics tend to be easy to game without improving anything real.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">So yes, you can write more code, but no, you didn&#8217;t do what the business wanted.</p>
</blockquote>



<figure class="wp-block-image size-large"><img decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-1024x614.jpg?x94846" alt="" class="wp-image-9879" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-768x461.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1.jpg 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: Marin Pavelić</figcaption></figure>



<p class="wp-block-paragraph">She also brought up Fred Brooks and <strong>communication overhead</strong> as something the industry consistently underweights. The harder metrics to capture, like satisfaction and flow, are often more revealing than the activity metrics that dashboards make easy to track.</p>



<p class="wp-block-paragraph">The business outcomes she keeps returning to are specific: &#8220;You need to measure, did it do what you wanted it to do? Did it get out to the user in time? Did they start spending more money with us? Did it fix your retention problem?&#8221;</p>



<p class="wp-block-paragraph">Those are the things which matter much more to the business.</p>



<h2 class="wp-block-heading"><span id="what-to-fix-before-adopting-ai">What to fix before adopting AI</span></h2>



<p class="wp-block-paragraph">I wondered what teams need to get right before AI tooling can actually help them. Trisha&#8217;s first answer was essentially: stop adopting AI the way you have adopted everything else. </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We generally get requirements, write the code, chuck it out there, and then you&#8217;re kind of done. That&#8217;s not how it should work.</p>
</blockquote>



<p class="wp-block-paragraph">What she advocates for instead is <strong>applying the scientific method to engineering decisions</strong>, which sounds obvious but rarely happens in real life.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Have a hypothesis, do your investigation, measure the results, have a conclusion. Generally speaking, we have not been great at that in our industry.</p>
</blockquote>



<p class="wp-block-paragraph">Applied to AI adoption specifically, that means being precise about what you are actually trying to achieve. What are we trying to achieve with AI? Do we want to deliver more features more quickly to the customer or do we want to perhaps deliver higher quality features? Because those two things are not necessarily the same thing Trisha concluded.</p>



<p class="wp-block-paragraph">Therefore the practical instruction she gives is to <strong>run short experiments, measure one change at a time, and iterate</strong>. But have a hypothesis, figure out how to measure it, measure it, get feedback, and iterate over that.</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="AI Won&amp;apos;t Fix What You Can&amp;apos;t Measure" width="500" height="281" src="https://www.youtube.com/embed/iH4UnTskOSM?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p class="wp-block-paragraph"><br></p>
<p>The post <a href="https://shiftmag.dev/trisha-gee-ai-wont-fix-your-broken-pipeline-it-will-break-it-faster-9785/">Trisha Gee: AI Won&#8217;t Fix Your Broken Pipeline &#8211; It Will Break It Faster</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>When systems go down, devs still juggle 10 tabs. PagerDuty says MCP fixes that</title>
		<link>https://shiftmag.dev/when-systems-go-down-devs-still-juggle-10-tabs-pagerduty-says-mcp-fixes-that-9657/</link>
		
		<dc:creator><![CDATA[Ivan Pelivanovic]]></dc:creator>
		<pubDate>Fri, 22 May 2026 14:13:25 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Developer Productivity]]></category>
		<category><![CDATA[MCP]]></category>
		<category><![CDATA[PagerDuty]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9657</guid>

					<description><![CDATA[<p>Production incidents are a context problem. By the time an engineers understand what's happening, they've already bounced across several different tools - and the incident is still ongoing. PagerDuty thinks MCP is the fix.</p>
<p>The post <a href="https://shiftmag.dev/when-systems-go-down-devs-still-juggle-10-tabs-pagerduty-says-mcp-fixes-that-9657/">When systems go down, devs still juggle 10 tabs. PagerDuty says MCP fixes that</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2-300x158.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2-1024x538.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2-768x403.jpg 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>

<div class="wp-block-post-excerpt wp-block-hidden-desktop wp-block-hidden-mobile wp-block-hidden-tablet"><p class="wp-block-post-excerpt__excerpt">Production incidents are a context problem. By the time an engineers understand what&#8217;s happening, they&#8217;ve already bounced across several different tools &#8211; and the incident is still ongoing. PagerDuty thinks MCP is the fix. </p></div>


<p class="wp-block-paragraph">When incidents hit production systems, engineers rarely stay inside one tool for long,  jumping from logs to dashboards to runbooks, trying to <strong>reconstruct what is actually happening</strong>.</p>



<p class="wp-block-paragraph">Talking to other builders, it seemed like almost everybody faces this context-switching problem. </p>



<p class="wp-block-paragraph"><strong>Rocío Bayon</strong> (Product Manager) and <strong>Sebastian Villanelo</strong> (Sr. Forward Deployed Engineer) from PagerDuty think MCP is how you fix it.</p>



<h2 class="wp-block-heading"><span id="pagerduty-built-their-mcp-to-cut-context-switching">PagerDuty built their MCP to cut context switching</span></h2>



<p class="wp-block-paragraph">Rocío explained that their MCP is solving the issue of context switching:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When an incident hits, the engineer has to go between 5 to 10 different tools to understand what&#8217;s happening.</p>
</blockquote>



<p class="wp-block-paragraph">That&#8217;s the real problem they&#8217;re trying to solve. </p>



<p class="wp-block-paragraph">PagerDuty&#8217;s framing of MCP was interesting: neither Rocío nor Sebastian described MCP as just another integration layer. They framed it as <strong>connective tissue</strong> that gathers logs, alerts, runbooks, and incident context into a single workflow.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">What the MCP does, it brings all that context into one platform where engineers are usually already working.</p>
</blockquote>



<p class="wp-block-paragraph">Most engineering organizations already have enormous amounts of observability data. The real problem is that <strong>it is scattered across systems</strong>, and engineers end up reconstructing operational context manually during incidents.</p>



<h2 class="wp-block-heading"><span id="retrieve-what-you-need-nothing-more">Retrieve what you need, nothing more</span></h2>



<p class="wp-block-paragraph">Sebastian framed the problem as signal retrieval. Rather than feeding the model more information, the goal is <strong>pulling the relevant operational state around a specific incident</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you have the right parameters or the queries and all this stuff, you will retrieve the exact information that you need.</p>
</blockquote>



<p class="wp-block-paragraph">That means <strong>narrowing context around the actual incident window</strong>. When an incident hits, it retrieves information around that time only, Sebastian explained.</p>



<p class="wp-block-paragraph">That also changes how they think about efficiency, reducing context switching directly affects operational speed, token usage, and cost.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You will see that information only with one call. And that saves a lot of tokens and time. That&#8217;s money and time.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-1024x614.jpg?x94846" alt="" class="wp-image-9873" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-768x461.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: Lea Lobor</figcaption></figure>



<h2 class="wp-block-heading"><span id="ai-helps-but-engineers-still-decide">AI helps but engineers still decide</span></h2>



<p class="wp-block-paragraph">Still, both of them were careful not to frame AI as autonomous incident management. </p>



<p class="wp-block-paragraph">Rocío repeatedly emphasized that <strong>MCP and AI systems are primarily helping with context gathering and operational visibility</strong>, while engineers remain responsible for the high-risk decisions:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The AI is helping you, but the engineer is the one who is assessing and making decisions where there&#8217;s a high risk.</p>
</blockquote>



<p class="wp-block-paragraph">That human layer is intentional. PagerDuty&#8217;s broader vision seems less about replacing on-call engineers and more about <strong>reducing the operational overhead surrounding incidents</strong>. Their MCP systems help gather information, surface relationships between systems, and accelerate investigation workflows, but humans still decide what actually happens next.</p>



<p class="wp-block-paragraph">Rocío also mentioned that their <strong>SRE agent</strong> is designed to support larger incident workflows beyond information retrieval:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It can also help you trigger those incident workflows. So it can help you resolve the incident. And it learns as it goes.</p>
</blockquote>



<h2 class="wp-block-heading">&#8220;MCP &#8211; the connective tissue between tools&#8221;</h2>



<p class="wp-block-paragraph">I asked Rocío and Sebastian, how does MCP fit into the tools they already use without becoming just another silo.</p>



<p class="wp-block-paragraph">And both of them clearly framed <strong>MCP as anti-silo infrastructure</strong> since it brings everything to one place. Rocío called MCP &#8220;the connective tissue between all these different tools.&#8221;</p>



<p class="wp-block-paragraph">That framing probably captures the broader architectural challenge better than anything else in the interview. </p>



<p class="wp-block-paragraph">Modern incident response already spans dozens of systems: observability platforms, deployment pipelines, CI/CD tooling, ticketing systems, infrastructure management, and communication layers. </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI systems inherit that fragmentation unless something explicitly connects operational state.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="engineers-trust-systems-that-behave-predictably">Engineers trust systems that behave predictably</span></h2>



<p class="wp-block-paragraph">Sebastian mentioned that <strong>teams often react very differently to MCP systems</strong>. Some embrace them immediately while others remain skeptical, especially around security and predictability. For him, trust improves once systems consistently produce expected outcomes:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When a person or a teammate says &#8220;ah, I&#8217;m retrieving what I&#8217;m expecting to retrieve&#8221;, that will help them to trust it.</p>
</blockquote>



<p class="wp-block-paragraph">A lot of AI tooling discussions still focus on model capability, reasoning quality, or benchmark performance. But <strong>operational systems are usually adopted much more pragmatically</strong>. Engineers trust systems that behave predictably, retrieve the right operational context, and fit into workflows they already rely on.</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="10 Tools, One IDE: PagerDuty&amp;apos;s Incident MCP" width="500" height="281" src="https://www.youtube.com/embed/YRIiKkG7JY0?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>
<p>The post <a href="https://shiftmag.dev/when-systems-go-down-devs-still-juggle-10-tabs-pagerduty-says-mcp-fixes-that-9657/">When systems go down, devs still juggle 10 tabs. PagerDuty says MCP fixes that</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</title>
		<link>https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Tue, 19 May 2026 18:39:50 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Vojin Jovanović]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9802</guid>

					<description><![CDATA[<p>A 1,000 libraries, 90% coverage, 1,700 in API tokens. Nobody typed a single test by hand.</p>
<p>The post <a href="https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/">Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">When humans maintained the GraalVM native image reflection metadata repository, <strong>coverage sat at just 14%</strong>. Tests were often stubs that technically compiled but covered nothing meaningful, nobody wanted to write them for someone else&#8217;s code, and the results showed.</p>



<p class="wp-block-paragraph">At <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx UK</a>, <strong>Vojin Jovanovic</strong> (Principal Researcher, Oracle Labs) and <strong>Mihailo Markovic</strong> (Software Engineer, Oracle), presented how they replaced that process with an autonomous AI agent pipeline. </p>



<p class="wp-block-paragraph">The result is <strong>90% dynamic access coverage across more than 1,000 JVM libraries</strong>, roughly 2 billion tokens spent, and a GitHub repository generating thousands of commits per week &#8211; while Vojin was at a hotel the night before the conference.</p>



<h2 class="wp-block-heading"><span id="the-problem-with-graalvm-reflection">The problem with GraalVM reflection</span></h2>



<p class="wp-block-paragraph">GraalVM Native Image takes a Java application, performs static analysis, and AOT compiles it into a single binary. The benefits are significant: <strong>startup roughly 10x faster than a standard JVM</strong>,<strong> </strong>dramatically lower memory footprint. </p>



<p class="wp-block-paragraph">But static analysis has a fundamental limitation: when a method calls Class.forName(&#8220;Foo&#8221;) with a dynamic argument, the analyser <strong>cannot determine at compile time what class will be needed</strong>. Reflection calls break the closed-world assumption.</p>



<p class="wp-block-paragraph">The solution is <strong>reachability metadata</strong> &#8211; a JSON file that tells the native image compiler which classes, methods, and fields need to be accessible at runtime. Writing this metadata requires running tests that exercise all the relevant code paths. </p>



<p class="wp-block-paragraph">For a library like Hibernate Core, that means covering 264 individual reflection call sites. For Tomcat, 205. Across the JVM ecosystem, the number is enormous, and until recently, it was almost entirely a manual process that humans were not doing well.</p>



<h2 class="wp-block-heading"><span id="start-simple-then-add-feedback">Start simple, then add feedback</span></h2>



<p class="wp-block-paragraph">The first approach was straightforward: give an LLM the library source code, tell it to generate comprehensive Java tests, collect the metadata via a JVMTI agent. </p>



<p class="wp-block-paragraph">The results were not impressive &#8211; 5.7% coverage for logback, 2.9% for H2. Vojin noted how this doesn’t feel like AGI.</p>



<p class="wp-block-paragraph">The shift came from <strong>adding GraalVM’s static analysis directly to the agent’s context</strong>. Instead of asking the LLM to guess which code paths matter, the pipeline runs a static analysis pass that identifies every dynamic access call site (the exact class, method, and line number) and feeds that report directly to the agent. With this addition, logback coverage jumped to 97%, H2 to 84.3%, in five iterations.</p>



<p class="wp-block-paragraph"><strong>The next layer was JaCoCo integration</strong>. After each generation round, the pipeline correlates coverage data with the remaining uncovered call sites and feeds only the uncovered ones back into the next iteration. The agent knows exactly what it hit and what it missed. Vojin noted:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We always create a checkpoint in those systems so we can go back to it if something goes wrong. And in these LLM-driven workflows, something is always going wrong.</p>
</blockquote>



<p class="wp-block-paragraph">With this feedback loop: logback reached 100%, H2 reached 96.1%.</p>



<h2 class="wp-block-heading"><span id="coverage-sometimes-still-isn%e2%80%99t-enough">Coverage sometimes still isn’t enough</span></h2>



<p class="wp-block-paragraph">For larger, more complex libraries (Guava, Tomcat, MongoDB) even the feedback loop left gaps. The team added a third technique:<strong> PGO</strong> (Profile-Guided Optimization) <strong>profiling from GraalVM’s Graal compiler</strong>. The profiler samples execution and produces a call trace, which can be correlated with static analysis to identify exactly where a test nearly reached a reflection call but diverged.</p>



<p class="wp-block-paragraph">The profiling feedback tells the agent not just what’s uncovered, but <strong>where in the call stack a test went in the wrong direction</strong> and what it would need to do differently. Results: Guava went from 50% to 72%, Tomcat from 45% to 83%, MongoDB reached 100%. </p>



<p class="wp-block-paragraph">The feedback also tells the agent (and the engineers) why certain calls cannot be covered: a security service only available on Java 6, a cleaner class incompatible with the current JVM. &#8220;If you cannot reach it, tell us why,&#8221; the prompt instructs, and the agent does.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="622" src="https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-1024x622.jpg?x94846" alt="" class="wp-image-9853" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-1024x622.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-300x182.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-768x466.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="cost-agents-and-model-selection">Cost, agents and model selection</span></h2>



<p class="wp-block-paragraph"><strong>Codex</strong> was the first agent framework the team tried. For logback  (a library with 33 dynamic access calls) Codex spent $35:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If we’re spending $35 per library for a thousand libraries, we’re not replacing humans.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>The alternative was P</strong>, a minimal agent that starts with a 200-token context describing basic file operations and bash execution. Same results, roughly 10x cheaper and the lesson is straightforward:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Simple task, use a simple agent. You already give it a lot of rules, a lot of context, and you’ve grounded it enough so it can perform on the level of these big agents.</p>
</blockquote>



<p class="wp-block-paragraph">On model selection, the team compared GPT 5.5 against several open-source alternatives &#8211; GLM, Kimi K2, DeepSeek, Gemma. <strong>GPT 5.5 consistently outperformed them on coverage</strong>. The counterintuitive finding was this: a more expensive model that makes the right decision in one shot can cost less overall than a cheaper model that wastes tokens going in the wrong direction.</p>



<h2 class="wp-block-heading"><span id="the-architecture-that-lets-it-run-without-you">The architecture that lets it run without you</span></h2>



<p class="wp-block-paragraph">The pipeline now operates as a <strong>third-generation system</strong>. When a user opens an issue requesting a library, the agent fetches the issue, runs the generation workflow, verifies the output, creates a pull request, reviews it, and merges or escalates to human review &#8211; automatically. The &#8220;human intervention&#8221; label on GitHub still exists, but its queue has shrunk dramatically.</p>



<p class="wp-block-paragraph"><strong>Documentation, not smarter prompting, was what made the difference</strong>. </p>



<p class="wp-block-paragraph">Vojin outlined what he calls <em><strong>the</strong> <strong>key context layers</strong></em>: </p>



<ul class="wp-block-list">
<li>raison d’être (why does this project exist, in two sentences), </li>



<li>state of direction (where the architecture stands today), </li>



<li>functional specification (how the system behaves), </li>



<li>architectural specification (how it is built), </li>



<li>decision records (what major choices were made and why), and </li>



<li>comprehensive logs that serve as checkpoints for recovery.</li>
</ul>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When you do all of these things, it takes almost a few days for a very big project. You will reduce your work by 50%, 60%, 70%.</p>
</blockquote>



<p class="wp-block-paragraph">The payoff is that agents with this context can diagnose failures, trace them through logs, and fix the underlying system, not just the immediate problem.</p>



<p class="wp-block-paragraph">The RAID system (an automated issue-resolution agent) was built in four prompts on a Sunday morning. It sweeps human intervention tickets, classifies them, performs deep analysis using the project logs, and either opens a GitHub issue for humans or attempts a fix in a forked branch with review. Jovanovic added:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Never work on the problem, always work on the system. You never go and fix a ticket. You always go fix the rules.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="where-things-stand">Where things stand</span></h2>



<p class="wp-block-paragraph">The repository currently supports 1,021 libraries. Without five large Hibernate libraries that predate the automated pipeline, <strong>dynamic access coverage across the ecosystem is 90%</strong>. </p>



<p class="wp-block-paragraph">The GitHub repository has accumulated roughly 2,977 branches. In the week before Devoxx, it logged approximately 8,000-9,000 commits, with agents committing every few minutes around the clock. </p>



<p class="wp-block-paragraph">Total cost for the project: approximately $1,700 in API tokens, plus personal compute on Jovanovic’s home desktop, running around the clock because the Oracle compliance process for cloud infrastructure takes time. The key point is simple:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Start with neural, simplest thing, get results, and then slowly chop off things and put them into algorithms, because they are much cheaper and faster.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-1024x614.jpg?x94846" alt="" class="wp-image-9850" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-768x460.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="we-caught-vojin-jovanovic-for-a-few-more-questions">We caught Vojin Jovanovic for a few more questions!</span></h2>



<p class="wp-block-paragraph">After the talk, we sat down with Vojin for a few minutes to ask him a couple more questions. </p>



<h3 class="wp-block-heading"><span id="you-tested-over-1000-libraries-what-broke-first-when-you-tried-to-scale">You tested over 1,000 libraries. What broke first when you tried to scale?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: Basically everything broke. We had mostly infrastructure issues, all kinds of GitHub failures. When you build a system at this scale, you need to assume that everything will fail and needs to recover. We broke GitHub rate limits. My machine was broken because it was running so many things. The key takeaway is that <strong>you need to build a system in a way that you can always continue</strong>. When things fail, you always checkpoint and continue from a checkpoint. We do work in sizable chunks, and when something fails, you just restart the chunk.</p>



<h3 class="wp-block-heading"><span id="is-just-asking-the-llm-enough">Is just asking the LLM enough?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: If you had asked me four weeks ago, I would say no. Now I would say <strong>you need to know how to ask it</strong>, and it will be enough. I was like, &#8220;GitHub is failing with a 504, abstract away all GitHub calls and retry.&#8221; It did it in two minutes. With today’s models, it’s a matter of minutes, not hours.</p>



<h3 class="wp-block-heading"><span id="what-did-you-learn-about-the-trade-off-between-cost-speed-and-coverage">What did you learn about the trade-off between cost, speed, and coverage?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: I haven’t seen a situation when doing something with an LLM is more expensive than doing that by a human typing on the keyboard. Build a system that uses the most efficient LLM for the job — you’re going to get far and not cost much money at all.</p>



<h3 class="wp-block-heading"><span id="when-does-using-multiple-agents-make-sense">When does using multiple agents make sense?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: Where I use it is for <strong>decisions and research</strong>. I use Claude Opus 4.7, Gemini 3.1, and GPT 5.5. I ask them all, let them discuss, and I discuss together with them. Each brings something to the table. Before, it was always Claude who was the smartest. Now GPT 5.5 is second and close to the first. Things are changing. The most important bit is getting the system designed right. Once you do that, coding, I don’t care who does it.</p>
<p>The post <a href="https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/">Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Uber Shares What Happens When 1.500 AI Agents Hit Production</title>
		<link>https://shiftmag.dev/uber-shares-what-happens-when-1-500-ai-agents-hit-production-9430/</link>
		
		<dc:creator><![CDATA[Ivan Pelivanovic]]></dc:creator>
		<pubDate>Mon, 04 May 2026 14:19:55 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[MCP]]></category>
		<category><![CDATA[Uber]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9430</guid>

					<description><![CDATA[<p>Hearing how Uber scaled to 1.500 AI agents made me realize just how quickly things can spiral when those agents start acting faster than humans can keep up.</p>
<p>The post <a href="https://shiftmag.dev/uber-shares-what-happens-when-1-500-ai-agents-hit-production-9430/">Uber Shares What Happens When 1.500 AI Agents Hit Production</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="2100" height="1400" src="https://shiftmag.dev/wp-content/uploads/2026/04/55192368682_89b60f358c_o-scaled.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/04/55192368682_89b60f358c_o-scaled.jpg 2100w, https://shiftmag.dev/wp-content/uploads/2026/04/55192368682_89b60f358c_o-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/04/55192368682_89b60f358c_o-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/04/55192368682_89b60f358c_o-768x512.jpg 768w" sizes="auto, (max-width: 2100px) 100vw, 2100px" /></figure>


<p class="wp-block-paragraph">At the <a href="https://events.linuxfoundation.org/mcp-dev-summit-north-america/" target="_blank" rel="noreferrer noopener">MCP Dev Summit North America</a> earlier this month, I was listening to <strong>Meghana Somasundara</strong>, (Agentic AI Lead, Uber), and <strong>Rush Tehrani</strong> (Senior Engineering Manager leading the Agentic AI Platform, Uber) talk about what they’re building.</p>



<p class="wp-block-paragraph">By their account, <strong>more than 90% of Uber’s 5.000+ engineers already use AI monthly</strong> for agentic workflows. They also have over <a href="https://shiftmag.dev/how-uber-engineers-use-ai-agents-8617/" target="_blank" rel="noreferrer noopener">1.500 monthly active agents internally</a>, running more than 60.000 executions per week. </p>



<p class="wp-block-paragraph">What stood out to me was Meghana’s framing of the real risk: not deliberate misuse, but <strong>an agent causing serious damage by accident</strong>, faster than any human could react:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It takes us humans a lot more effort to break things. But with agents,&nbsp;it&#8217;s&nbsp;a lot faster, a lot quicker, and the blast radius is a lot higher.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="what-problems-did-uber-face-when-scaling-ai">What problems did Uber face when scaling AI?</span></h2>



<p class="wp-block-paragraph">Meghana and Rush’s talk focused on three problems that nearly made those numbers impossible to reach. The first was <strong>the lack of a shared way of building</strong>.</p>



<p class="wp-block-paragraph">When agent adoption spreads organically across a large engineering organization, teams tend to build independently. At Uber Technologies, with over 10.000 internal services, that meant dozens of teams were building MCP servers and custom integrations on their own, without shared standards, central oversight, and any real way to reuse what others had already built.</p>



<p class="wp-block-paragraph">The result was predictable: <strong>duplicated work, and a growing stack of systems that only the original team really understood</strong>, as Meghana Somasundara explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The simple truth was, if you can&#8217;t manage the development lifecycle, you just can&#8217;t trust it in production.</p>
</blockquote>



<p class="wp-block-paragraph">When agents start making decisions across systems, inconsistent implementations stop being a minor issue but they become harder to track, debug and even harder to trust.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2026/04/55193657725_819f8b9753_o-1024x683.jpg?x94846" alt="" class="wp-image-9448" srcset="https://shiftmag.dev/wp-content/uploads/2026/04/55193657725_819f8b9753_o-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/04/55193657725_819f8b9753_o-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/04/55193657725_819f8b9753_o-768x512.jpg 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: Agentic AI Foundation (Flickr) &#8211; Meghana Somasundara (Agentic AI Lead, Uber) and Rush Tehrani (Senior Engineering Manager, Uber)</figcaption></figure>



<p class="wp-block-paragraph">The second problem included<strong> security</strong>. Agents operating across a complex service landscape could unknowingly call endpoints they shouldn’t, expose sensitive data, or trigger operations nobody intended. Add third-party MCP servers into the mix (Uber uses many external systems) and the governance problem scales quickly.</p>



<p class="wp-block-paragraph">They needed <strong>full visibility into call patterns</strong>: who was accessing what data, under what conditions, and what happened when things went wrong. Without that, running agents in production at scale becomes a trust problem.</p>



<p class="wp-block-paragraph"><strong>Finding the right tool</strong> quickly became the third problem, Rush asked himself:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">How does an agent or the engineer building it actually find the right one?</p>
</blockquote>



<p class="wp-block-paragraph">Not just any MCP server, but one that’s reliable, performs well, and doesn’t quietly degrade everything built on top of it.</p>



<p class="wp-block-paragraph">When discovery is left unmanaged, agents default to whatever is most visible rather than what actually works best. At smaller scale, that’s an annoyance, but across thousands of services, it becomes a <strong>systemic quality problem</strong>.</p>



<h2 class="wp-block-heading"><span id="how-uber-addressed-these-challenges">How Uber addressed these challenges</span></h2>



<p class="wp-block-paragraph">Uber&#8217;s answer to all three problems was a <strong>centralized MCP gateway and registry</strong>. </p>



<p class="wp-block-paragraph">Meghana describes it as a central control plane that turns Uber’s endpoints into MCP tools, with service owners deciding what gets exposed and how it’s defined.</p>



<p class="wp-block-paragraph">Every change flows through pull requests, passes security scans before deployment, and is continuously monitored in production, while a central registry (acting as the single source of truth) removes duplication and enforces tighter scrutiny on third-party MCPs.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2026/05/55193402013_db67d79bd4_k-1-1024x683.jpg?x94846" alt="" class="wp-image-9558" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55193402013_db67d79bd4_k-1-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55193402013_db67d79bd4_k-1-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55193402013_db67d79bd4_k-1-768x512.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55193402013_db67d79bd4_k-1.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: Agentic AI Foundation (Flickr)</figcaption></figure>



<p class="wp-block-paragraph">In their <strong>no-code Agent Builder</strong>, as Rush explained, engineers can pre-select specific tools from an MCP server so the model doesn’t have to decide which one to use, and they can also lock down parameters so the agent doesn’t have to infer them at runtime, ultimately reducing the number of decisions and things that can go wrong.</p>



<p class="wp-block-paragraph">Getting the infrastructure right shows up in adoption: <strong>their coding agent <em>Minions</em> generates about 1.800 code changes weekly</strong> and is used by 95% of Uber engineers, but that’s the output, not the real lesson.</p>



<p class="wp-block-paragraph">On the roadmap are evaluation metrics in the registry to help teams spot reliable servers before committing, and &#8220;skills&#8221;, reusable MCP patterns with built-in A/B testing that bake evaluation into how knowledge is shared.  </p>



<h2 class="wp-block-heading">Does any of this apply if&nbsp;you&#8217;re&nbsp;not Uber?&nbsp;&nbsp;&nbsp;</h2>



<p class="wp-block-paragraph">Uber operates at a scale most engineering teams never see (10.000+ services in play) but while the complexity is extreme, the underlying failure patterns Meghana and Rush describe aren’t unique to them.  </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Teams often end up building the same integrations in parallel, with governance only becoming a priority after something breaks, and discovery treated as an afterthought. These problems appear well before reaching 1.500 agents &#8211; once multiple teams start using the same MCP infrastructure without a shared layer.</p>
</blockquote>



<p class="wp-block-paragraph">The Uber model <strong>won&#8217;t translate directly to smaller organisations</strong>. But if you&#8217;re already running MCP servers across more than two teams and nobody owns discoverability or access control yet, that gap could surface soon. </p>
<p>The post <a href="https://shiftmag.dev/uber-shares-what-happens-when-1-500-ai-agents-hit-production-9430/">Uber Shares What Happens When 1.500 AI Agents Hit Production</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Tech Conferences Aren’t Dead. But Why We Go Is Changing.</title>
		<link>https://shiftmag.dev/tech-conferences-arent-dead-but-the-old-reasons-to-go-might-be-8787/</link>
		
		<dc:creator><![CDATA[Ivan Pelivanovic]]></dc:creator>
		<pubDate>Tue, 31 Mar 2026 14:01:02 +0000</pubDate>
				<category><![CDATA[Event]]></category>
		<category><![CDATA[tech conference]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=8787</guid>

					<description><![CDATA[<p>When was the last time a dev conference taught you something you couldn’t learn online? Probably never. But that’s the wrong benchmark - conferences were never just about information.</p>
<p>The post <a href="https://shiftmag.dev/tech-conferences-arent-dead-but-the-old-reasons-to-go-might-be-8787/">Tech Conferences Aren’t Dead. But Why We Go Is Changing.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="2047" height="1190" src="https://shiftmag.dev/wp-content/uploads/2026/03/54513893224_bb77612be3_k.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/03/54513893224_bb77612be3_k.jpg 2047w, https://shiftmag.dev/wp-content/uploads/2026/03/54513893224_bb77612be3_k-300x174.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/03/54513893224_bb77612be3_k-1024x595.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/03/54513893224_bb77612be3_k-768x446.jpg 768w" sizes="auto, (max-width: 2047px) 100vw, 2047px" /></figure>


<p class="wp-block-paragraph">Why would you, as a developer, fly halfway around the world to hear something you could Google in minutes?</p>



<p class="wp-block-paragraph">&#8220;Because there’s more to it than just getting plain information,&#8221; says <strong>Mark Hazell</strong>, organiser of <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx UK</a> and co-founder of Voxxed.</p>



<h2 class="wp-block-heading"><span id="some-things-just-can%e2%80%99t-be-replicated-online">Some things just can’t be replicated online</span></h2>



<p class="wp-block-paragraph">Conferences feel like one of the few places where simply showing up still counts. In a way, they’re a throwback, a reminder that not all value happens behind a screen.</p>



<p class="wp-block-paragraph">And that’s precisely what makes them stand out: remote work offers undeniable flexibility, but it often <strong>fragments our attention</strong>. It’s hard to find real focus, especially if you’re trying to keep a healthy work-life balance. At a conference, that changes, as Mark points out.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Simply not being distracted by incoming mail or slack messages is worth its weight in gold in terms of the knowledge you take away.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2026/03/54512970712_5db8a16201_k-1024x683.jpg?x94846" alt="" class="wp-image-8890" title="devoxxuk" srcset="https://shiftmag.dev/wp-content/uploads/2026/03/54512970712_5db8a16201_k-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/03/54512970712_5db8a16201_k-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/03/54512970712_5db8a16201_k-768x512.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/03/54512970712_5db8a16201_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<p class="wp-block-paragraph">The person next to you might be <strong>facing the same problem</strong>, or they might have already solved it. That kind of closeness makes learning immediate, practical, and way faster than online.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Many people tell me they watch a session on-demand from Devoxx UK and wish they could be in the room so they can chat with others who are facing similar challenges or are even further along in finding solutions.</p>
</blockquote>



<h2 class="wp-block-heading">But conferences are expensive&#8230;</h2>



<p class="wp-block-paragraph">Let’s face it: conferences aren’t cheap. Between tickets, flights, and hotels, the costs add up fast. And with companies tightening budgets and cutting back on travel, that expense really matters. If you don’t get real value in return, <strong>it can quickly feel like a waste of both time and money</strong>.</p>



<p class="wp-block-paragraph">Mark doesn’t deny it. Instead, he reframes the question: if you take your team to the right conference, you’ll see a strong return.</p>



<p class="wp-block-paragraph">The keyword here is well-chosen:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">I do think it’s key to research up front and find the conference that accelerates learning and problem solving in ways truly relevant to those attending. That way, instead of weeks of trial and error, your team can spend a day or two at the conference and return with practical techniques, ideas, and tooling suggestions that boost productivity and quality.</p>
</blockquote>



<p class="wp-block-paragraph">Picking the right conference is all about fit. How long will your team be out? Is the ticket worth it? Will they meet people facing similar challenges? That’s where the real value is, says Mark. Plan ahead, and <strong>early bird tickets, flights, and hotels cost a lot less than last-minute bookings</strong>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2026/03/54513711031_dec8550190_k-1024x683.jpg?x94846" alt="" class="wp-image-8887" title="devoxxuk" srcset="https://shiftmag.dev/wp-content/uploads/2026/03/54513711031_dec8550190_k-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/03/54513711031_dec8550190_k-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/03/54513711031_dec8550190_k-768x512.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/03/54513711031_dec8550190_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="big-stages-or-small-communities">Big stages or small communities?</span></h2>



<p class="wp-block-paragraph">It might seem that large flagship conferences have the upper hand with bigger budgets, bigger names, and more production. And in some cases, that’s true, Mark admits: &#8220;If a conference is run by a large company with deep pockets, it can be more financially resilient.&#8221;</p>



<p class="wp-block-paragraph">But that’s not the model Devoxx relies on, <strong>its strength comes from the community</strong>: they rely on a big team who volunteer their time and help them pull together all of the content, shape how the event looks and feels, and execute it on the ground.</p>



<p class="wp-block-paragraph">In fact, many of today’s most respected conferences began as small, grassroots initiatives, including Devoxx itself, which grew from the London Java Community.</p>



<p class="wp-block-paragraph">And for Mark, the real distinction isn’t size &#8211; it’s about <strong>quality and intent</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Whatever the size of the event, the content has to stay balanced and neutral. Without that, scale&nbsp;doesn’t&nbsp;mean much.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="when-people-feel-welcome-real-connections-follow">When people feel welcome, real connections follow</span></h2>



<p class="wp-block-paragraph">Modern conferences sit at the intersection of <strong>learning, hiring, and business</strong>. Sponsorships and recruitment are part of the reality, especially in expensive cities like London. But Mark doesn’t see it as a trade-off between developers and companies:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">I prefer the notion of weaving strands together to create a fabric that everyone is part of.</p>
</blockquote>



<p class="wp-block-paragraph">That&nbsp;means creating an environment where attendees&nbsp;benefit&nbsp;from sponsors being&nbsp;present&nbsp;and sponsors&nbsp;benefit&nbsp;from genuine interaction with the community.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2026/03/54513801801_8f95990774_k-1024x683.jpg?x94846" alt="" class="wp-image-8910" srcset="https://shiftmag.dev/wp-content/uploads/2026/03/54513801801_8f95990774_k-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/03/54513801801_8f95990774_k-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/03/54513801801_8f95990774_k-768x512.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/03/54513801801_8f95990774_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<p class="wp-block-paragraph">That same philosophy extends to how Devoxx grows by creating <strong>real opportunities for first-time speakers</strong>, helping them gain experience and build confidence. Many return to mentor the next group, creating a self-sustaining cycle that supports the broader developer community.</p>



<p class="wp-block-paragraph">When there’s no barrier, people talk more freely, ask more questions, and connect naturally, Mark says.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our philosophy is to create an environment where everyone is equal (sorry speakers, that means no private room out back to go hang out in), everyone is welcome and everyone is respected. This is noticeable and means the event has this really special, open vibe to it.</p>
</blockquote>



<p class="wp-block-paragraph">As Mark puts it, when people feel welcome and respected, they talk, share, and enjoy themselves, and meaningful connections naturally follow. &#8220;Sure, we do stuff like hosting evening socials, a party, a pub quiz,&#8221; he says, &#8220;but it’s really the collective buy-in from everyone to welcome and respect each other that makes all the difference.&#8221;</p>



<p class="wp-block-paragraph"><em>ShiftMag is recognized as a friend of the Devoxx UK conference.</em></p>
<p>The post <a href="https://shiftmag.dev/tech-conferences-arent-dead-but-the-old-reasons-to-go-might-be-8787/">Tech Conferences Aren’t Dead. But Why We Go Is Changing.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>CTOs Face Pressure to Deliver AI Gains, but Productivity Isn’t There Yet</title>
		<link>https://shiftmag.dev/ctos-face-pressure-to-deliver-ai-gains-but-productivity-isnt-there-yet-8615/</link>
		
		<dc:creator><![CDATA[Nikolina Oršulić]]></dc:creator>
		<pubDate>Wed, 18 Mar 2026 15:45:53 +0000</pubDate>
				<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[CTO Craft]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=8615</guid>

					<description><![CDATA[<p>Andy Skipper, founder of CTO Craft, warns that even seasoned CTOs struggle with the pressure to deliver AI-driven productivity while balancing innovation and reality.</p>
<p>The post <a href="https://shiftmag.dev/ctos-face-pressure-to-deliver-ai-gains-but-productivity-isnt-there-yet-8615/">CTOs Face Pressure to Deliver AI Gains, but Productivity Isn’t There Yet</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/03/andy-skipper-3.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/03/andy-skipper-3.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/03/andy-skipper-3-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/03/andy-skipper-3-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/03/andy-skipper-3-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">How are CTOs feeling about AI? </p>



<p class="wp-block-paragraph">According to <strong>Andy Skipper</strong>, founder of <a href="https://ctocraft.com/" target="_blank" rel="noreferrer noopener">CTO Craft</a>, they’re experiencing <strong>fear</strong>, <strong>uncertainty</strong>, and <strong>doubt</strong>.</p>



<p class="wp-block-paragraph">And if the technical leaders of companies are feeling that way, what can the rest of us expect? Certainly, we dream of productivity boosts and an AI El Dorado &#8211; but that’s not the reality.</p>



<p class="wp-block-paragraph">That’s why we sat down with Skipper to talk about <strong>how CTOs should manage expectations for AI</strong>, and how to navigate the hype versus reality.</p>



<h2 class="wp-block-heading"><span id="stakeholders-and-investors-are-watching-ctos-closely-and-the-pressure-is-rising">Stakeholders and investors are watching CTOs closely, and the pressure is rising</span></h2>



<p class="wp-block-paragraph">Many CTOs, Skipper notes, are navigating <strong>intense pressure from non-technical stakeholders and investors alike</strong>, especially with the massive resources being invested in AI and LLM technologies.</p>



<p class="wp-block-paragraph">He’s a bit careful about this:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI is not going to reduce costs or increase productivity in the way some non-technical people think just yet. It&#8217;s getting there, but it&#8217;s not there yet.</p>
</blockquote>



<p class="wp-block-paragraph">At the same time, Skipper points out a surprising upside: AI is giving engineering leaders a chance to <strong>reconnect with the code and architecture</strong> without writing all the code themselves:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">One of the things you have to accept as an engineering leader is that you are going to get further away from the code the more senior you become. AI gives people an opportunity to get back to architecture and development work, even if they aren’t coding themselves.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="cto-role-can-be-isolating">CTO role can be isolating</span></h2>



<p class="wp-block-paragraph">When Skipper became a CTO for the first time, he quickly realized just how isolating the role could be. There was nowhere for tech leaders to share challenges, get support, or navigate the non-technical side of the job.</p>



<p class="wp-block-paragraph">That gap inspired him to start CTO Craft, now a community helping senior engineering leaders navigate team dynamics, strategy, and AI.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When I was a CTO for the first time, I didn&#8217;t have somebody who I could talk to about the issues I was seeing or compare notes with people who had similar challenges. That&#8217;s what CTO Craft is all about &#8211; helping people understand where the challenges come from and understand they&#8217;re not alone in having those challenges.</p>
</blockquote>



<p class="wp-block-paragraph">As a coach and mentor, Andy works closely with CTOs around the world, helping them deal with issues like burnout, communication with nontechnical stakeholders, and, lately, how to adapt in the AI era.</p>



<h2 class="wp-block-heading"><span id="the-most-common-cto-mistake-always-chasing-the-newest-technologies">The most common CTO mistake? Always chasing the newest technologies</span></h2>



<p class="wp-block-paragraph">Many first-time CTOs struggle with burnout, overextending themselves to shield teams from stress, and balancing hands-on coding with high-level responsibilities. He explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">A lot of the people that I work with directly are suffering from burnout. First time CTOs commonly miss out self-preservation. And usually that&#8217;s a combination of too much expectation of their own energy levels, their own abilities, backlogs…</p>
</blockquote>



<p class="wp-block-paragraph">And after overextending themselves, first-time CTOs often make another common mistake: <strong>chasing the newest technologies</strong>. While adopting the latest tools and frameworks can seem exciting, Skipper warns that it’s not always the best choice for fast-moving teams trying to scale.</p>



<p class="wp-block-paragraph">&#8220;<strong>Using bleeding-edge tech can slow you down</strong>, make systems harder to maintain, and even complicate hiring because the talent pool for newer technologies might be limited,&#8221; he explains.</p>



<p class="wp-block-paragraph">As a coach, Skipper says these are just some of the recurring challenges he sees among engineering leaders, alongside a range of other operational and people-related issues.</p>



<h2 class="wp-block-heading"><span id="engineering-skills-alone-won%e2%80%99t-make-you-a-cto">Engineering skills alone won’t make you a CTO</span></h2>



<p class="wp-block-paragraph">For aspiring engineering leaders, Skipper highlights that growing into a successful CTO requires more than technical excellence: <strong>commercial understanding</strong>, <strong>communication</strong>, <strong>coaching</strong>, and <strong>vision-setting</strong> are just as crucial:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The difference between a good engineering manager and a great CTO is understanding how technology drives business success, while still inspiring and guiding your teams.</p>
</blockquote>



<p class="wp-block-paragraph">But technical and business skills are only part of the picture. <strong>Motivation and team management are equally critical</strong>. Skipper stresses that not everyone is motivated by the same things, and leaders need to understand individual drivers:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Having a vision in the first place is very important. But when it comes to actually bringing individuals along on the journey, they all need to be worked with differently. You can&#8217;t just set it and expect everyone to be motivated.</p>
</blockquote>



<p class="wp-block-paragraph">He also warns against a common mistake among CTOs: <strong>trying to shield their teams from the challenges of a pivot or rapid change</strong>. While the instinct is understandable, it often backfires and drains the leader&#8217;s emotional energy. Instead, transparency and realistic communication are key:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Being transparent, being realistic, measuring your words, not being super negative about everything, but still being realistic, I think all these things are really important.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="the-need-for-a-support-network-not-another-tech-stack">The need for a support network, not another tech stack</span></h2>



<p class="wp-block-paragraph">Skipper believes resilience and peer support are crucial for engineering leaders navigating the complexity of the CTO role. Sharing experiences and learning from others can help leaders realize they’re not alone when facing difficult decisions.</p>



<p class="wp-block-paragraph">Looking ahead, however, he admits that the pace of technological change makes it <strong>hard to predict what the role will look like in the future</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Five years from now, I honestly have no idea what the role of a CTO will look like. The way we build software is already changing rapidly, especially with AI. But the fundamentals like setting a vision, communicating it clearly, and connecting technology with business outcomes, will always remain essential.</p>
</blockquote>



<p class="wp-block-paragraph">For Skipper, that uncertainty makes peer support crucial: it helps leaders adapt, learn, and navigate a fast-changing profession.</p>



<p class="wp-block-paragraph">Ultimately, he believes the most important skill for CTOs is the <strong>ability to keep learning</strong> and tackle challenges without going it alone.</p>



<p class="wp-block-paragraph"><em>*<a href="https://www.infobip.com/" target="_blank" rel="noreferrer noopener">Infobip</a>, the global communications API leader that launched ShiftMag, was an Event Partner at CTO Craft 2026.</em></p>



<p class="wp-block-paragraph"></p>
<p>The post <a href="https://shiftmag.dev/ctos-face-pressure-to-deliver-ai-gains-but-productivity-isnt-there-yet-8615/">CTOs Face Pressure to Deliver AI Gains, but Productivity Isn’t There Yet</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>OpenAI Shares How They’re Turning Engineers into AI Team Leads</title>
		<link>https://shiftmag.dev/openai-shares-how-theyre-turning-engineers-into-ai-team-leads-8262/</link>
		
		<dc:creator><![CDATA[Ivan Brezak Brkan]]></dc:creator>
		<pubDate>Mon, 02 Mar 2026 15:56:59 +0000</pubDate>
				<category><![CDATA[Event]]></category>
		<category><![CDATA[Pragmatic Summit]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=8262</guid>

					<description><![CDATA[<p>Roles aren’t disappearing - capabilities are expanding, and often the problem isn’t the system, it’s the prompt. I saw that firsthand at this year’s Pragmatic Summit in San Francisco.</p>
<p>The post <a href="https://shiftmag.dev/openai-shares-how-theyre-turning-engineers-into-ai-team-leads-8262/">OpenAI Shares How They’re Turning Engineers into AI Team Leads</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Six months ago, if someone had told me that engineers would start <strong>naming their AI agents and treating them like teammates</strong>, I probably would’ve rolled my eyes.</p>



<p class="wp-block-paragraph">Honestly, even today, it still sounds a little… absurd. </p>



<p class="wp-block-paragraph">That is, until I heard directly at the Pragmatic Summit in San Francisco that&#8217;s happening right now inside OpenAI.</p>



<p class="wp-block-paragraph"><strong>Vijaye Raji</strong> and <strong>Thibaut Sottiaux</strong> from OpenAI say AI is shifting development from manual coding to guiding AI teams (setting goals and guardrails) while speeding up work and keeping core roles essential.</p>



<h2 class="wp-block-heading"><span id="close-the-laptop-join-the-meeting-come-back-to%c2%a0finished%c2%a0code-%c2%a0">Close the laptop. Join the meeting. Come back to finished code. </span></h2>



<p class="wp-block-paragraph">Raji’s (CTO, Applications, OpenAI) been at OpenAI for only six months, and already he’s seen <a href="https://openai.com/index/introducing-codex/" target="_blank" rel="noreferrer noopener">Codex</a> go from just a tool, to an extension, to an agent… and now it actually feels like a teammate.</p>



<p class="wp-block-paragraph">Inside OpenAI, they recently launched something called a <strong>Codex Box</strong>. </p>



<p class="wp-block-paragraph">Basically, engineers can grab a dev box on the server, fire off prompts, and let the system run things in parallel while they just work from their laptop. Sounds amazing, right?</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="538" src="https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1-1024x538.png?x94846" alt="Ivan Brezak Brkan" class="wp-image-8265" title="" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1-768x403.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo by Ivan Brezak Brkan</figcaption></figure>



<p class="wp-block-paragraph">Some engineers are using hundreds of billions of tokens per week across multiple agents &#8211; not for fun, but because that’s just how they build now. Raji said:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Software development inside OpenAI isn’t a single-threaded human loop anymore. It’s parallel. And that is going to become the new normal.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="designers-and-pms-are-writing-code-what%e2%80%99s-going-on">Designers and PMs are writing code. What’s going on?</span></h2>



<p class="wp-block-paragraph">Sottiaux (Engineering lead for Codex, OpenAI) described <strong>how the Codex team works</strong> today.</p>



<p class="wp-block-paragraph">&#8220;It changes constantly. Almost week to week,&#8221; he said. &#8220;We look for bottlenecks, solve them, and then a new one pops up.&#8221;</p>



<p class="wp-block-paragraph">At first, the slowest part was code generation, then it became code review, and now the friction often comes from <strong>understanding user needs faster</strong> &#8211; parsing feedback from Twitter, Reddit, and SDK experiments <strong>and turning that into product direction</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Speed up coding, and suddenly reviews become the bottleneck. Fix reviews, and CI/CD slows things down. That rhythm has become normal. Instead of debating every trade-off in design docs and discarding alternatives, teams try multiple implementations in parallel and focus on what actually works.</p>
</blockquote>



<p class="wp-block-paragraph">&#8220;Trying things is cheaper,&#8221; Sottiaux added. &#8220;So we try more things.&#8221;</p>



<p class="wp-block-paragraph">And the rules? They’re blurring. Designers are shipping more code, PMs are writing and testing ideas, and it’s not that roles disappear &#8211; everyone’s capabilities are expanding.</p>



<h2 class="wp-block-heading"><span id="usually-the-problem-is-the-prompt-not-the-system">Usually the problem is the prompt, not the system</span></h2>



<p class="wp-block-paragraph">What about <strong>long-running, autonomous tasks</strong>? </p>



<p class="wp-block-paragraph">AI coding tools might seem like advanced autocomplete &#8211; type a few words, get a few lines back. Helpful, yes, but still reactive. Sottiaux challenged that: </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Give the model a meaningful, well-defined objective, and it doesn’t just respond &#8211; it runs, for hours.</p>
</blockquote>



<p class="wp-block-paragraph">Inside OpenAI, the model runs on its own for hours, sometimes producing full reports. <strong>Engineers review the results</strong>, pick what works, and feed it back &#8211; this isn’t just suggestions anymore, it’s delegated execution.</p>



<p class="wp-block-paragraph">There was also an unusually honest anecdote shared during the discussion: a researcher admitted that whenever he thought he was smarter than Codex, it turned out the problem was the prompt, not the system. </p>



<p class="wp-block-paragraph">The bottleneck isn’t typing speed &#8211; it’s <strong>defining the goal clearly</strong>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="538" src="https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1024x538.jpg?x94846" alt="" class="wp-image-8266" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1024x538.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-300x158.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-768x403.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo by Ivan Brezak Brkan</figcaption></figure>



<h2 class="wp-block-heading"><span id="ai-tools-accelerate-work-and-ahape-ai-native-engineers">AI tools accelerate work and ahape AI-native engineers</span></h2>



<p class="wp-block-paragraph">During weekly analytics reviews, teams don’t assign follow-ups, they just <strong>trigger Codex threads</strong>. &#8220;Twenty minutes later, the answers are ready before the meeting even ends,&#8221; one leader said.</p>



<p class="wp-block-paragraph">In high-severity incidents, Codex gets effectively paged into calls to help figure out what went wrong and suggest the fastest recovery. &#8220;It’s like having small consultants working quietly in parallel,&#8221; they added.</p>



<p class="wp-block-paragraph">So what does this mean for junior engineers? </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">OpenAI is hiring new grads and running a strong internship program, believing the next generation will be AI-native and comfortable with these tools from day one.</p>
</blockquote>



<p class="wp-block-paragraph">At the same time, strong foundations, guardrails, and code reviews remain essential. As they put it, &#8220;Foundations will never go out of fashion.&#8221;</p>



<h2 class="wp-block-heading"><span id="engineers-will-guide-ai-teams-speeding-up-code-without-touching-every-line">Engineers will guide AI teams, speeding up code without touching every line</span></h2>



<p class="wp-block-paragraph">Vijaye has spent more than two decades in the industry. He has lived through the rise of developer tools, the shift to higher-level abstractions, the mobile wave, and the social platform era. In his view, <strong>none of those transitions felt quite like this one</strong>.</p>



<p class="wp-block-paragraph">What makes the current moment different isn’t just what the technology can do, it’s how quickly it is evolving. <strong>The speed of change</strong>, he suggested, is on another level entirely.</p>



<p class="wp-block-paragraph">And Sottiaux expects that pace to accelerate even further. </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">In the near term, I anticipate another order-of-magnitude jump in development speed, enabled by networks of agents collaborating toward large, shared goals. Instead of a single assistant responding to prompts, entire clusters could work together on complex builds.</p>
</blockquote>



<p class="wp-block-paragraph">As systems get more complex, engineers stop checking every line of code and start setting constraints, guardrails, and validating outputs. <strong>It’s less about manual control and more about guiding the system</strong>, and working through a single assistant that coordinates all the agents behind the scenes. </p>



<p class="wp-block-paragraph">Whether this ends up being the smartest leap in the industry or a step we rushed into too quickly, only time will tell.</p>


<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/02/Pragmatic-Summit-OpenAI.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/Pragmatic-Summit-OpenAI.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2026/02/Pragmatic-Summit-OpenAI-300x158.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/02/Pragmatic-Summit-OpenAI-1024x538.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/Pragmatic-Summit-OpenAI-768x403.jpg 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph"></p>
<p>The post <a href="https://shiftmag.dev/openai-shares-how-theyre-turning-engineers-into-ai-team-leads-8262/">OpenAI Shares How They’re Turning Engineers into AI Team Leads</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Chip Huyen: To Build or Not to Build &#8211; When AI Can Do It All?</title>
		<link>https://shiftmag.dev/chip-huyen-to-build-or-not-to-build-when-ai-can-do-it-all-8238/</link>
		
		<dc:creator><![CDATA[Ivan Brezak Brkan]]></dc:creator>
		<pubDate>Fri, 27 Feb 2026 14:00:19 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[Developer Experience]]></category>
		<category><![CDATA[Pragmatic Summit]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=8238</guid>

					<description><![CDATA[<p>I was at Pragmatic Summit when Chip Huyen reframed the AI conversation - if any product can be generated from a clear description, code isn’t the constraint, and true value lies elsewhere.</p>
<p>The post <a href="https://shiftmag.dev/chip-huyen-to-build-or-not-to-build-when-ai-can-do-it-all-8238/">Chip Huyen: To Build or Not to Build &#8211; When AI Can Do It All?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/02/chip.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/chip.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/02/chip-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/chip-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/chip-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">&#8220;If AI can replicate almost anything quickly and cheaply, <strong>what’s the point of building anything at all</strong>?&#8221; <a href="https://www.linkedin.com/in/chiphuyen/" target="_blank" rel="noreferrer noopener">Chip Huyen</a> asked at the start of her talk at the Pragmatic Summit.” </p>



<p class="wp-block-paragraph">And that question carries weight because she isn’t a casual AI observer: she’s an ex-Netflix researcher, former NVIDIA core developer, and an author who explores AI engineering.</p>



<p class="wp-block-paragraph">She told us a personal story: <strong>after building a product, someone recreated it with AI almost immediately</strong>. </p>



<p class="wp-block-paragraph">That moment forced her to confront hard questions &#8211; if anything can be copied, where’s the moat, the incentive, or the point of the effort?</p>



<h2 class="wp-block-heading">&#8220;I built a product &#8211; and someone copied it with AI&#8221;</h2>



<p class="wp-block-paragraph">After she built a product, someone emailed her a clone generated with AI. The message read: &#8220;I love what you’ve built. So I used AI to recreate exactly that. And here’s the link.&#8221;</p>



<p class="wp-block-paragraph">She described her reaction bluntly: <strong>&#8220;I’m flattered. But also, why the f**k?</strong>&#8220;</p>



<p class="wp-block-paragraph">That moment crystallized a new reality: if replication requires minimal effort, traditional defensibility weakens. Technical execution no longer guarantees leverage. She framed the shift clearly:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you can describe a software, then AI can build it for you. The constraint moves upstream. The critical question no longer asks how to build, but what to build.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="the-real-advantage-comes-from-context">The real advantage comes from context</span></h2>



<p class="wp-block-paragraph">But Chip <strong>pushed back against the idea that AI erases all opportunities</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Common problems are quickly handled by AI, but challenges with nuance and context remain &#8211; and those are where real value lies.</p>
</blockquote>



<p class="wp-block-paragraph">She illustrated this with chatbots: U.S. users expect instant replies, while in parts of Asia, waiting signals respect. These nuances matter. As AI handles common solutions, <strong>advantage goes to those who master context</strong> (cultural, behavioral, or domain-specific) where generic automation fails.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/02/1.chip-1-1024x614.png?x94846" alt="" class="wp-image-8281" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/1.chip-1-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/1.chip-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/1.chip-1-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/1.chip-1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Chip spoke at this year’s Pragmatic Summit in San Francisco.</figcaption></figure>



<h2 class="wp-block-heading"><span id="engineering-culture-is-changing">Engineering culture is changing</span></h2>



<p class="wp-block-paragraph">Workflows built around humans writing code (pull requests, line-by-line reviews, mentorship) don’t work the same when AI generates large chunks. </p>



<p class="wp-block-paragraph">Junior developers may disengage, and even seniors wonder<strong> &#8220;How do I give feedback to my AI?&#8221;</strong></p>



<p class="wp-block-paragraph">The focus moves from polishing code to designing instructions and systems. Mentorship now teaches structured thinking in a human–AI–human loop.</p>



<p class="wp-block-paragraph">And Chip <strong>didn&#8217;t have an answers about job displacement or copyright</strong>, she acknowledged uncertainty.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">I do think it’s a bit scary and I don’t really know what the futures look like but builders still shape tools that affect labor markets, creative industries, and institutions.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="when-ai-acts-who%e2%80%99s-accountable">When AI acts, who’s accountable?</span></h2>



<p class="wp-block-paragraph">As AI systems move beyond code editors, the risks grow. Chip drew a hard line: if AI acts in the real world (like a car hitting a pedestrian), mistakes can’t be undone. </p>



<p class="wp-block-paragraph">The question isn’t if AI can act, but <strong>whether it should without strict limits</strong>. </p>



<p class="wp-block-paragraph">Engineers now must build guardrails, monitoring, and escalation paths from the start &#8211; autonomy demands containment.</p>



<h2 class="wp-block-heading"><span id="enjoy-building-but-choose-wisely-what-to-build">Enjoy building, but choose wisely what to build</span></h2>



<p class="wp-block-paragraph">Chip closed on a personal note:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Fundamentally, I enjoy building. It just brings me joy.</p>
</blockquote>



<p class="wp-block-paragraph">In an environment where execution becomes cheap, intrinsic motivation gains weight. She compared building to music that creates tension and resolution, and to assembling Lego sets for friends. Not every project requires a moat. Not every product needs revenue logic.</p>



<p class="wp-block-paragraph">Her final reframing carried strategic weight. <strong>If replication becomes trivial, the advantage may belong to those who decide what deserves to exist</strong>. Vision, context, and responsibility define the new frontier. Execution follows.</p>
<p>The post <a href="https://shiftmag.dev/chip-huyen-to-build-or-not-to-build-when-ai-can-do-it-all-8238/">Chip Huyen: To Build or Not to Build &#8211; When AI Can Do It All?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>This CTO Says 93% of Developers Use AI, but Productivity Is Still 10%</title>
		<link>https://shiftmag.dev/this-cto-says-93-of-developers-use-ai-but-productivity-is-still-10-8013/</link>
		
		<dc:creator><![CDATA[Ivan Brezak Brkan]]></dc:creator>
		<pubDate>Wed, 18 Feb 2026 15:13:58 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[CTO]]></category>
		<category><![CDATA[Developer Experience]]></category>
		<category><![CDATA[Developer Productivity]]></category>
		<category><![CDATA[Laura Tacho]]></category>
		<category><![CDATA[Pragmatic Summit]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=8013</guid>

					<description><![CDATA[<p>I was in the room at this year’s Pragmatic Summit when Laura Tacho dropped the numbers: nearly all developers use AI coding assistants, over a quarter of production code is AI-written - and yet productivity gains haven’t budged past 10%.</p>
<p>The post <a href="https://shiftmag.dev/this-cto-says-93-of-developers-use-ai-but-productivity-is-still-10-8013/">This CTO Says 93% of Developers Use AI, but Productivity Is Still 10%</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">I had the chance to attend this year’s Pragmatic Summit and catch <a href="https://www.linkedin.com/in/lauratacho/" target="_blank" rel="noreferrer noopener">Laura Tacho</a> &#8211; CTO at DX, executive advisor, and Austrian Innovator of the Year &#8211; in her keynote. </p>



<p class="wp-block-paragraph">She presented her latest research, <em><a href="https://lauratacho.com/research" target="_blank" rel="noreferrer noopener">Measuring Developer Productivity &amp; AI Impact</a>,</em> based on three months of data collected through February 1.</p>



<p class="wp-block-paragraph">The research surveyed <strong>121.000 developers across 450+ companies</strong>. A striking 92.6% of them use an AI coding assistant at least once a month, and roughly 75% use one weekly. Clearly, AI isn’t just a side experiment anymore, it’s part of the workflow.</p>


<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="720" src="https://shiftmag.dev/wp-content/uploads/2026/02/1.cartoonized.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/1.cartoonized.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/02/1.cartoonized-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/1.cartoonized-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/1.cartoonized-768x461.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">Here are the top takeaways I found most compelling from Laura’s research.</p>



<h2 class="wp-block-heading"><span id="the-10-productivity-plateau">The 10% productivity plateau</span></h2>



<p class="wp-block-paragraph">The first thing most people think of with AI assistance is saving time. According to the research, developers say they’re <strong>saving about 4 hours a week</strong> &#8211; pretty much the same as Q2 2025, with Q4 2025 numbers sitting around 3.6-3.7 hours. </p>



<p class="wp-block-paragraph">It looks like the time-saving boost has leveled off. Productivity shows the same pattern:<strong> it jumped around 10% when AI first took off, and since then, it’s stayed steady at that level</strong>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/02/1-1-1024x614.png?x94846" alt="" class="wp-image-8072" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/1-1-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/1-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/1-1-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/1-1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">What’s really shifting is the <strong>amount of &#8220;AI-authored code&#8221;</strong> &#8211; that is, code that gets merged into the main repository or production environment with little to no human intervention. Laura breaks this down using the latest data:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Looking at about 4.2 million developers between November 2025 and February 2026, AI-authored code now makes up 26.9% of all production code &#8211; up from 22% last quarter. Daily AI users are also hitting a milestone: nearly a third of the code they merge, which passes review and goes into production, is written by AI.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/02/2.1-1024x614.png?x94846" alt="" class="wp-image-8074" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/2.1-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/2.1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/2.1-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/2.1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">One example Laura loves to highlight is how AI is speeding up the <strong>onboarding process</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Looking at the data quarter by quarter, from Q1 2024 through Q4 2025, onboarding time has been cut in half. Specifically, we’re measuring it by the &#8220;time to the 10th Pull Request (PR).&#8221;</p>
</blockquote>



<p class="wp-block-paragraph">This metric (widely seen as a key sign of successful onboarding) has now been cut in half. Because of that, Laura sees <strong>AI as a powerful tool for getting people up to speed</strong>, whether it’s new hires, engineers switching projects, or even non-engineers stepping into technical workflows.</p>



<p class="wp-block-paragraph">The faster someone gets up to speed, the longer the productivity boost lasts, usually for at least two years. This points to a bigger trend: AI is helping developers get up to speed faster, reducing mental load, and making it easier to onboard into complex codebases.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/02/5.1-1-1024x614.png?x94846" alt="" class="wp-image-8075" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/5.1-1-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/5.1-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/5.1-1-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/5.1-1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading"><span id="in-struggling-organizations-ai-exposes-flaws-instead-of-fixing-them">In struggling organizations, AI exposes flaws instead of fixing them</span></h2>



<p class="wp-block-paragraph">Laura also pointed out a part of the research that looks at <strong>how AI impacts company performance</strong>. This segment analyzed data from 67.000 developers between November 2025 and February 2026, and the findings are strikingly divided.</p>



<p class="wp-block-paragraph">Some companies are dealing with twice as many customer-facing incidents, while others see a 50% drop. </p>



<p class="wp-block-paragraph">The difference comes down to <strong>how AI is used</strong>: in well-structured organizations, AI acts as a &#8220;force multiplier,&#8221; helping teams move faster, scale with higher quality, and boost reliability. In struggling organizations, AI tends to highlight existing flaws rather than fix them. Based on this, Laura concludes:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Transformation is uncomfortable. Organizations that were ready to quit their cloud or agile transformations are now giving up on AI transformation, too. It’s difficult to look at an entire organization and realize that something fundamental must change to see a real impact on the bottom line.</p>
</blockquote>



<p class="wp-block-paragraph">According to her, adoption alone doesn’t guarantee results, just <strong>using the tools doesn’t automatically improve an organization</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">This is really a management problem. The hype made it sound like just trying AI would automatically pay off. But so far, most tools have been used for individual coding tasks. To see real impact, we need to use AI at the organizational level, not just for single tasks.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/02/6.1-1024x614.png?x94846" alt="" class="wp-image-8076" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/6.1-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/6.1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/6.1-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/6.1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">Laura also touched on the most popular AI tools among developers, specifically highlighting <strong>Codex</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The Codex desktop app launched on February 2 and has already topped one million downloads, with a 60% growth rate just last week. They recently rolled out GPT-5.3 Codex. Inside OpenAI, 95% of developers use Codex, and those users submit roughly 60% more Pull Requests each week.</p>
</blockquote>



<p class="wp-block-paragraph">As a real-world example, Laura highlights Cisco, where <strong>18.000 engineers use Codex daily</strong> for complex migrations and code reviews. This has cut their code review time in half. But Laura cautions that AI won’t fix deeper organizational issues unless you tackle those problems head-on, and that starts with acknowledging they exist.</p>



<p class="wp-block-paragraph">Since organizations remain constrained by human and systemic friction, Laura notes: </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">I am skeptical of any technology&#8217;s promise to improve performance without addressing those underlying constraints. If we don&#8217;t solve our systemic issues, we’ll just &#8220;carry them into space with us.&#8221; The real question isn&#8217;t how to colonize Mars, but how to achieve actual organizational impact.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="devex-is-more-important-than-ever">DevEx is more important than ever</span></h2>



<p class="wp-block-paragraph">To wrap things up, Laura revealed the secret to success for those who are &#8220;winning&#8221; with AI: </p>



<p class="wp-block-paragraph">1. <strong>They set clear goals and measure results</strong>.</p>



<ol class="wp-block-list">
<li></li>
</ol>



<p class="wp-block-paragraph">2. <strong>They recognize that Developer Experience (DevEx) matters more than ever</strong>.</p>



<p class="wp-block-paragraph">3. <strong>AI succeeds when factors like fast Continuous Integration (CI), clear documentation, and well-defined services are in place</strong>.</p>



<p class="wp-block-paragraph">At the end of the day, getting real organizational results means treating AI as a company-wide challenge. The research shows the barriers aren’t technical, they come down to change management and leadership support. Laura sums it up:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Successful organizations experiment by tackling real customer problems. Exploring Mars sounds exciting, but it’s not sustainable &#8211; it’s expensive and distracts from the core business. Focus your experiments on the customer to drive meaningful results. After all, somewhere, something incredible is waiting to be discovered.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/02/7-1-1024x614.png?x94846" alt="" class="wp-image-8077" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/7-1-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/7-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/7-1-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2026/02/7-1.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Really enjoyed your talk, and I really appreciated our chat afterward!</figcaption></figure>



<p class="wp-block-paragraph"><em><strong>The productivity gap isn&#8217;t unique. Read why <a href="https://shiftmag.dev/stack-overflow-survey-2025-ai-5653/">84% of developers use AI, but most don&#8217;t trust it</a>. The trust issue is part of the equation.</strong></em></p>
<p>The post <a href="https://shiftmag.dev/this-cto-says-93-of-developers-use-ai-but-productivity-is-still-10-8013/">This CTO Says 93% of Developers Use AI, but Productivity Is Still 10%</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Forget the Model, It’s Workflows That Make LLM Products Run</title>
		<link>https://shiftmag.dev/llms-can-improve-customer-operations-7716/</link>
		
		<dc:creator><![CDATA[Marko Crnjanski]]></dc:creator>
		<pubDate>Thu, 05 Feb 2026 14:18:39 +0000</pubDate>
				<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[How to Web]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[programming]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=7716</guid>

					<description><![CDATA[<p>Building with LLMs is nothing like traditional software. If we want something that actually works in production, we have to test it, monitor it, and keep iterating on real customer data.</p>
<p>The post <a href="https://shiftmag.dev/llms-can-improve-customer-operations-7716/">Forget the Model, It’s Workflows That Make LLM Products Run</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/02/New-ShiftMag-panel-interview-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">From his experience leading AI product teams, <strong>Andrew Mende</strong> (Senior Product Manager, Machine Learning at Booking.com) explained what it truly takes to ship LLM-based products in production.</p>



<h2 class="wp-block-heading"><span id="making-ai-products-reliable-requires-new-workflows">Making AI products reliable requires new workflows</span></h2>



<p class="wp-block-paragraph">For Mende, the buzz around AI is a rare shift, like the rise of smartphones. But what does it mean for product teams?</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">This moment unlocks new ways of solving customer problems that were previously impossible due to technical constraints. </p>
</blockquote>



<p class="wp-block-paragraph">He was clear: <strong>traditional product management approaches often fail with AI-driven products</strong>. </p>



<p class="wp-block-paragraph">LLM-based systems behave differently, demand new workflows, and bring new types of risk. </p>



<p class="wp-block-paragraph">Unlike deterministic software, <strong>LLMs are probabilistic</strong> (identical inputs can produce different outputs), making experimentation easy but production readiness challenging, and forcing teams to rethink how they test, evaluate, and monitor features.</p>



<p class="wp-block-paragraph">One of the biggest traps, Mende explained, is <strong>confusing a successful prototype with a scalable solution</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It’s easy to paste a prompt into ChatGPT and see results; much harder to make it reliable across thousands of real customer inputs.</p>
</blockquote>



<p class="wp-block-paragraph">Teams need <strong>structured datasets</strong>, big tables of real customer examples, to track accuracy, spot regressions, and see if changes actually work. Without them, it’s all guesswork.</p>



<h2 class="wp-block-heading"><span id="focus-on-accuracy-cost-and-speed">Focus on accuracy, cost, and speed</span></h2>



<p class="wp-block-paragraph">Mende’s practical approach to model selection focuses on <strong>accuracy, cost, and latency</strong>: start with the most capable model to see if the problem can be solved, then move to smaller or faster models to optimize performance. </p>



<p class="wp-block-paragraph">This requires testing <strong>multiple configurations</strong> (context size, prompts, and parameters) since even small changes affect results. Beyond the model, context selection, prompt instructions, and external tools are critical:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">For example, when a customer asks about a specific order, the system should fetch real-time data instead of relying on static knowledge. This combination of LLMs and tools turns simple prompts into full systems, but also increases complexity and maintenance costs.</p>
</blockquote>



<h2 class="wp-block-heading">LLMs can transform how users interact &#8211; if teams build the right infrastructure</h2>



<p class="wp-block-paragraph">Mende concluded his How to Web lecture by saying LLMs shine by transforming user interaction: for the first time, <strong>digital products can understand plain language</strong>, turning customer requests directly into actions.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">This shift brings digital experiences closer to human conversations and enables new product patterns that were out of reach just a few years ago.</p>
</blockquote>



<p class="wp-block-paragraph">The challenge now, Mende explained, is not whether LLMs work, but whether teams are willing to build the evaluation, monitoring, and infrastructure required to make them truly useful.</p>



<p class="wp-block-paragraph"></p>
<p>The post <a href="https://shiftmag.dev/llms-can-improve-customer-operations-7716/">Forget the Model, It’s Workflows That Make LLM Products Run</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 

Served from: shiftmag.dev @ 2026-06-16 04:36:35 by W3 Total Cache
-->