<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Artificial Intelligence Archives - ShiftMag</title>
	<atom:link href="https://shiftmag.dev/category/artificial-intelligence/feed/" rel="self" type="application/rss+xml" />
	<link>https://shiftmag.dev/category/artificial-intelligence/</link>
	<description>Insightful engineering content &#38; community</description>
	<lastBuildDate>Thu, 25 Jun 2026 13:35:46 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shiftmag.dev/wp-content/uploads/2024/08/cropped-ShiftMag-favicon-32x32.png</url>
	<title>Artificial Intelligence Archives - ShiftMag</title>
	<link>https://shiftmag.dev/category/artificial-intelligence/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>A Future Where Nobody Writes Code Manually Might Be Closer Than It Seems</title>
		<link>https://shiftmag.dev/future-no-one-writes-code-manually-10471/</link>
		
		<dc:creator><![CDATA[Ivan Simic]]></dc:creator>
		<pubDate>Thu, 25 Jun 2026 13:35:45 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[development]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=10471</guid>

					<description><![CDATA[<p>As AI tools reshape how software is built, the engineers in our new video say the job is shifting from writing every line by hand to guiding, reviewing, and orchestrating what AI produces.</p>
<p>The post <a href="https://shiftmag.dev/future-no-one-writes-code-manually-10471/">A Future Where Nobody Writes Code Manually Might Be Closer Than It Seems</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img fetchpriority="high" decoding="async" width="1280" height="720" src="https://shiftmag.dev/wp-content/uploads/2026/06/Thumb_2.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/Thumb_2.jpg 1280w, https://shiftmag.dev/wp-content/uploads/2026/06/Thumb_2-300x169.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/06/Thumb_2-1024x576.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/Thumb_2-768x432.jpg 768w" sizes="(max-width: 1280px) 100vw, 1280px" /></figure>


<p class="wp-block-paragraph">Once again, we brought together some of the finest minds Infobip has to answer tricky questions about the future of software. </p>



<p class="wp-block-paragraph">This time around, we spoke to four Infobip engineers about <strong>how they use AI in their daily work</strong> and how they view the AI revolution happening now.</p>



<h2 class="wp-block-heading"><span id="research-plan-execute">Research, plan, execute</span></h2>



<p class="wp-block-paragraph">With rapidly changing AI infrastructure, the things that used to be normal in software development are getting different, but some things stay the same. </p>



<p class="wp-block-paragraph"><strong>Petar Dučić</strong>, Engineering Director, said that the company&#8217;s mantra <strong>&#8220;you build it, you own it&#8221; has remained the same in the AI era</strong>. This simply means that engineers are responsible for whatever they build.</p>



<p class="wp-block-paragraph">Senior IT Research Scientist <strong>Ante Kapetanović</strong>, added that engineers need to <strong>separate their work phases </strong>efficiently:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You have to separate your research phase, your planning phase, and your coding implementation, whatever phase. This ultimately means that you own each step of the way. And basically, it is not AI-assisted coding, it is more human-assisting AI.</p>
</blockquote>



<h2 class="wp-block-heading">Engineering is now becoming even more necessary&#8230;</h2>



<p class="wp-block-paragraph">It&#8217;s true that using AI tools is, in many cases, a cheaper alternative to real people, but Petar pointed out that engineering is now becoming even more necessary, because there&#8217;s so many things that can go wrong, and <strong>we need real people to check them</strong> and undestand what&#8217;s going on.</p>



<p class="wp-block-paragraph">Senior Software Engineer<strong> Rino Čala</strong> pointed out that there&#8217;s <strong>three types of mistakes agentic tools make</strong>: logical mistakes, code-based mistakes and security mistakes. The solution is, as Rino puts it, just more tests:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">So it is definitely important to run tests, to run some local tests, CI tests, and do some static checks as well.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>Zvonimir Petković</strong>, Staf Engineer, then explained that <strong>security issues</strong> are the number one flaw with AI software tools:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Security is the main risk with deploying Gen-AI generated code. With the whole Vibe coding setup, nobody looks at the code, and oftentimes we have also non-engineers deploying code. The hiding sensitive data within the source code itself, this is the number one problem.</p>
</blockquote>



<p class="wp-block-paragraph">The second problem for Zvonimir is <strong>scalability</strong>. Something that is built in a couple days might work fine for a small team, but cannot be scaled to 5,000 people easily.</p>



<h2 class="wp-block-heading">&#8230; and engineers are now more orchestrators than code writers</h2>



<p class="wp-block-paragraph">A stark contrast to the narrative of AI taking away jobs for engineers is that, with more people actively using AI, there&#8217;s a <strong>bigger need for someone with a technical background </strong>to help with not just support, but education.</p>



<p class="wp-block-paragraph">&#8220;We&#8217;re slowly becoming context engineers&#8221;, added Ante, saying that engineers are now spending a lot of time managing their context in different AI tools. He is personally a big advocate for writing your own code and feels like this is a major part of being an engineer. Still, Ante admits that might not be the case in a couple years.</p>



<p class="wp-block-paragraph">Zvonimir, interestingly, had a take about exactly that:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The total trend is that in a few years&#8217; time, we&#8217;ll have the situation where nobody writes the code manually. Software engineers will be like persons who are the experts in that field, so they will be able to review what gen AI has generated.</p>
</blockquote>



<p class="wp-block-paragraph">In conclusion, as Rino puts it, engineers are now more in the role of orchestrators and organizers than they are code writes, since they spend a lot of time managing AI models to do things properly.</p>



<p class="wp-block-paragraph"><strong>Want to hear more? Check out the video.</strong></p>



<p class="wp-block-paragraph"><em>Special thanks to our fellow colleagues at Infobip, the publisher of ShiftMag!</em></p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe title="From Code Writers to Agent Managers" width="500" height="281" src="https://www.youtube.com/embed/KGTQ7vG6ofo?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>
<p>The post <a href="https://shiftmag.dev/future-no-one-writes-code-manually-10471/">A Future Where Nobody Writes Code Manually Might Be Closer Than It Seems</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>This IDE Plugin Shows the Energy Cost of Your AI Prompts</title>
		<link>https://shiftmag.dev/energy-cost-ai-prompts-10404/</link>
		
		<dc:creator><![CDATA[Ivan Simic]]></dc:creator>
		<pubDate>Tue, 23 Jun 2026 13:22:55 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Open Source]]></category>
		<category><![CDATA[environmental impact]]></category>
		<category><![CDATA[IDE]]></category>
		<category><![CDATA[plugin]]></category>
		<category><![CDATA[university of bristol]]></category>
		<category><![CDATA[ustwo]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=10404</guid>

					<description><![CDATA[<p>When tech company ustwo assessed one AI product’s carbon footprint, they found most came from AI inference. It raised a question: if AI has a measurable environmental impact, why is it almost invisible to everyday users?</p>
<p>The post <a href="https://shiftmag.dev/energy-cost-ai-prompts-10404/">This IDE Plugin Shows the Energy Cost of Your AI Prompts</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img decoding="async" width="1280" height="719" src="https://shiftmag.dev/wp-content/uploads/2026/06/PRISM-TEAM-ustwo.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/PRISM-TEAM-ustwo.png 1280w, https://shiftmag.dev/wp-content/uploads/2026/06/PRISM-TEAM-ustwo-300x169.png 300w, https://shiftmag.dev/wp-content/uploads/2026/06/PRISM-TEAM-ustwo-1024x575.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/PRISM-TEAM-ustwo-768x431.png 768w" sizes="(max-width: 1280px) 100vw, 1280px" /></figure>


<p class="wp-block-paragraph">That question led Paolo Rizzi (sustainability principal at ustwo), <strong>Nayan Jain</strong>, Executive Director of AI at ustwo and Nick Hegarty (Executive Director of Technology at ustwo)to start looking for tools that could help developers see the environmental cost of their AI usage while they worked.</p>



<p class="wp-block-paragraph">Having found no real tools for this, ustwo and the University of Bristol built one: <a href="https://marketplace.visualstudio.com/items?itemName=ustwo.prism-carbon-tracker&amp;ssr=false#overview" target="_blank" rel="noreferrer noopener">PRISM</a>. It launched last week, and we sat down with Nayan to talk about how it works and what it aims to change.</p>



<h2 class="wp-block-heading"><span id="prism-uses-ai-token-activity-to-estimate-energy-use-and-emissions">PRISM uses AI token activity to estimate energy use and emissions</span></h2>



<p class="wp-block-paragraph">The tools that were available mostly focused on data centers or broad &#8220;big picture&#8221; ideas, but <strong>none catered to the developers actually using AI tools</strong>. </p>



<p class="wp-block-paragraph">That gap led the ustwo team to think about ways to connect AI usage to real-world energy:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We quickly ran into a challenge that still exists today: a lack of transparent data from model providers. Without reliable information on energy consumption and infrastructure, it is difficult to build and validate a model with confidence.</p>
</blockquote>



<p class="wp-block-paragraph">Working around that, <strong>they decided to rely on tokens</strong>, a visible and relatively accurate measure of AI spend.</p>



<p class="wp-block-paragraph">The idea was to use token activity as a proxy for compute demand and estimate energy use and emissions using published research and carbon accounting principles, including the Green Software Foundation’s Software Carbon Intensity framework. </p>



<p class="wp-block-paragraph">Nick Hegarty, who helped narrow the focus: could they help developers understand the environmental impact of their AI use while they worked? That made the project possible.</p>



<h2 class="wp-block-heading"><span id="from-idea-to-ide">From idea to IDE</span></h2>



<p class="wp-block-paragraph">The answer was to create an <strong>in-editor tool</strong>, where the developer could see an estimation of their token costs and impact on energy consumption in real time. </p>



<p class="wp-block-paragraph">The theory here is that, with this data, engineers can see their habits and perhaps be more conscious about their usage:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our theory is that making this visible can guide engineers into more mindful habits around their AI consumption in the moment. Because AI providers don&#8217;t publish complete energy or emissions data, PRISM acts as a proxy for energy consumption by surfacing an estimate rather than an exact measurement.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>PRISM directly monitors token usage</strong>, the model being used, and the provider. For other tools, like GitHub Copilot, PRISM reads local activity logs. AI requests made by an application at runtime are captured through a local interceptor.</p>



<p class="wp-block-paragraph">The app then combines input and output tokens into an estimate. Nayan notes that these will be separated &#8220;as soon as robust factors exist&#8221;.</p>



<h2 class="wp-block-heading"><span id="how-red-was-that-prompt">How red was that prompt?</span></h2>



<p class="wp-block-paragraph">In practice, PRISM is <strong>more of a subtle indicator than a big flashing number </strong>that appears after every call. Nayan explained how it feels to use it:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">In the editor, a status indicator reflects your most recent call, colour coded. The headline feature is Relative Impact Classification, where each interaction is rated Green, Amber, or Red based on where it sits compared with the other requests in the same project.</p>
</blockquote>



<p class="wp-block-paragraph">Nayan continued to explain the colors:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Green is below the median, Amber sits between the 50th and 90th percentile, and Red is the top tenth. A few requests need to accumulate before the colours become meaningful, because the whole point is comparison within your own project rather than against an arbitrary threshold. </p>
</blockquote>



<p class="wp-block-paragraph">Clicking around the dashboard more, <strong>users can get information broken down by model usage</strong>, as well as other interesting metrics:</p>



<ul class="wp-block-list">
<li>Timeline of estimated carbon over the course of development</li>



<li>Heatmap that shades from green through amber to red</li>



<li>Breakdowns by branch and other visualisations.</li>
</ul>



<p class="wp-block-paragraph">However, Nayan explains that a relative, percentile-based design was chosen due to the inability of estimates to present absolute carbon figures. <strong>The goal of the tool is to explain and raise awareness more</strong>, and hopefully educate engineers on how their usage looks like from the eco standpoint.</p>



<h2 class="wp-block-heading"><span id="the-impact-is-awareness-not-less-ai-use">The impact is awareness, not less AI use</span></h2>



<p class="wp-block-paragraph">Ustwo has tested PRISM with UOB students and across the company&#8217;s engineering team, and the results have been positive so far:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Several users said that seeing estimated emissions made them more deliberate with AI tools.</p>
</blockquote>



<p class="wp-block-paragraph">Nayan added that engineers, having seen the data for their usage, tried to make adjustments to their style and became a bit more conscious of how they refine requests.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Some wrote shorter, more precise prompts instead of using multiple iterations, and others paid closer attention to model selection after seeing how much environmental impact different models could have for similar tasks.</p>
</blockquote>



<p class="wp-block-paragraph">But as Nayan said, what interested them the most wasn’t that developers used AI less, but that they became more aware of how they were using it. “Once the data was visible, users started noticing things they hadn’t considered before.”</p>



<h2 class="wp-block-heading"><span id="prism-won%e2%80%99t-solve-ai%e2%80%99s-environmental-impact-but-it-makes-it-more-visible">PRISM won’t solve AI’s environmental impact, but it makes it more visible</span></h2>



<p class="wp-block-paragraph">Right now, PRISM can <strong>provide data and insights for cloud and assistant-based models </strong>by identifying them, capturing token usage, and calculating the energy factor from a list of supported models. Locally run models are not yet supported, but might be in the future.</p>



<p class="wp-block-paragraph">As the tool grows, ustwo sees its ideal outcome at three levels. </p>



<p class="wp-block-paragraph"><strong>For engineers, the goal is awareness</strong>: giving users more information about their environmental impact during their work. Nayan says this is not about telling people what to do, but showing them a fuller picture of the tools they’re using. <strong>For organisations, the goal is to create a shared picture and open up more conversations</strong> about sustainability, governance, and responsible AI.</p>



<p class="wp-block-paragraph">Beyond those, ustwo is positive about the potential of collaboration in the field of environmental impact of AI. He concluded:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">PRISM won&#8217;t solve the environmental impact of AI on its own, but if it helps make that impact a little more visible, and sparks better conversations and behaviours as a result, then we&#8217;ve achieved something worthwhile.</p>
</blockquote>
<p>The post <a href="https://shiftmag.dev/energy-cost-ai-prompts-10404/">This IDE Plugin Shows the Energy Cost of Your AI Prompts</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>AI generates larger pull requests. Larger pull requests bring more bugs.</title>
		<link>https://shiftmag.dev/ai-generates-larger-pull-requests-larger-pull-requests-bring-more-bugs-9932/</link>
		
		<dc:creator><![CDATA[Mia Biberovic]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 06:07:28 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[CTO Craft]]></category>
		<category><![CDATA[Developer Productivity]]></category>
		<category><![CDATA[measuring developer productivity]]></category>
		<category><![CDATA[pull request]]></category>
		<category><![CDATA[Span]]></category>
		<category><![CDATA[Stephen Poletto]]></category>
		<category><![CDATA[tokenmaxxing]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9932</guid>

					<description><![CDATA[<p>What economist Charles Goodhart said in 1975 is completely applicable in 2026: When a measure becomes a target, it ceases to be a good measure.</p>
<p>The post <a href="https://shiftmag.dev/ai-generates-larger-pull-requests-larger-pull-requests-bring-more-bugs-9932/">AI generates larger pull requests. Larger pull requests bring more bugs.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-1.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-1.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-1-300x158.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-1-1024x538.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-1-768x403.jpg 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">When companies start tracking engineer token consumption on internal leaderboards, something has gone wrong in the measurement chain. </p>



<p class="wp-block-paragraph"><strong>Stephen Poletto</strong>, Field CTO at Span, used his CTO Craft Con talk in Toronto to argue that the AI tooling wave has arrived with a familiar problem attached: <strong>organizations are reaching for the most legible metric available rather than the most meaningful one</strong>. </p>



<p class="wp-block-paragraph">The result is a rerun of every previous failed attempt to quantify developer productivity, but this time with a pretty substantial compute bill.</p>



<h1 class="wp-block-heading"><span id="burn-baby-burn">Burn, baby, burn</span></h1>



<p class="wp-block-paragraph">Poletto opened with a data point that frames the problem neatly. Uber and ServiceNow both <strong>burned through their entire annual AI token budgets within the first five months</strong> of the year. That pace of consumption is being held up in some quarters as a sign of healthy adoption. </p>



<p class="wp-block-paragraph">Poletto&#8217;s position is that it mostly signals a measurement vacuum.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Just because you&#8217;re spending money and using these things doesn&#8217;t necessarily mean that you&#8217;re producing better outcomes. That&#8217;s the issue that I have with tokenmaxxing.</p>
</blockquote>



<p class="wp-block-paragraph">The <a href="https://fortune.com/2026/04/09/meta-killed-employee-ai-token-dashboard/" target="_blank" rel="noreferrer noopener">leaderboard dynamic at Meta</a>, where engineers were reportedly running expensive jobs purely to rank well on an internal token-consumption meter, illustrates the trap cleanly. Poletto named it directly: Goodhart&#8217;s Law. </p>



<p class="wp-block-paragraph">Coined by British economist Charles Goodhart in 1975, it’s completely applicable to the 2026 problem: <strong>when a measure becomes a target, it ceases to be a good measure.</strong> Or, in today’s words: Set token usage as a goal and people will optimize for token usage, not for shipping software that works.</p>



<p class="wp-block-paragraph">This isn&#8217;t a new failure mode. Poletto traced the same pattern through lines of code, pull request counts, and story points, each of which generated its own gaming behavior when elevated to headline metric status. Tokenmaxxing, in his framing, is:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The same old pitfalls of trying to quantify developer productivity all over again.</p>
</blockquote>



<p class="wp-block-paragraph">The alternative he proposed is a ratio: customer value delivered against the total cost of producing it, headcount, tooling, and token spend included. DORA metrics and PR throughput are not useless, he argued, but they measure the inside of the system, not its output. Treating them as primary goals disconnects engineering effort from the outcomes the business actually cares about.</p>



<h1 class="wp-block-heading"><span id="what-about-controling-the-pr-size">What about controling the PR size?</span></h1>



<figure class="wp-block-image size-large is-resized"><img loading="lazy" decoding="async" width="1024" height="576" src="https://shiftmag.dev/wp-content/uploads/2026/05/image-1024x576.png?x94846" alt="Span: AI creates bigger pull requests" class="wp-image-9936" style="width:1200px" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/image-1024x576.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/image-300x169.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/image-768x432.png 768w, https://shiftmag.dev/wp-content/uploads/2026/05/image.png 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Bigger pull requests mean spending more time on reworking AI generated code afterwards (source: <a href="https://www.span.app/blog/ai-writes-bigger-prs" type="link" id="https://www.span.app/blog/ai-writes-bigger-prs" target="_blank" rel="noreferrer noopener">Span</a>)</figcaption></figure>



<p class="wp-block-paragraph">Span&#8217;s own benchmark data, drawn from its customer base, puts some numbers around where teams currently sit. About<strong> half to two-thirds of net new code is now AI-generated</strong>, up from 10 to 20 percent a year ago. PR throughput is running at roughly 1.7 times pre-AI rates. Neither figure, by itself, says anything about whether those teams are delivering more value to customers.</p>



<p class="wp-block-paragraph">The quality picture is more nuanced than the headline defect numbers suggest. Span&#8217;s analysis found that when controlling for pull request size, AI has a negligible independent effect on defect rates. The actual driver is that <a href="https://www.span.app/blog/ai-writes-bigger-prs" target="_blank" rel="noreferrer noopener">AI generates larger pull requests</a>, and <strong>larger pull requests correlate with more bugs</strong>. That is, in principle, actionable: you don&#8217;t need to focus on the AI generated code, but to the PR scope discipline.</p>



<h1 class="wp-block-heading"><span id="different-approaches-to-reduce-human-review-burden">Different approaches to reduce human review burden</span></h1>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="682" src="https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-2-1024x682.jpg?x94846" alt="" class="wp-image-10177" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-2-1024x682.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-2-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-2-768x511.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/06/CTO-Day1-Stephen-poletto-2.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">CTO Craft Con</figcaption></figure>



<p class="wp-block-paragraph"><strong>Code review is absorbing the strain more visibly</strong>. Poletto cited <a href="https://www.span.app/blog/introducing-ai-impact-report" target="_blank" rel="noreferrer noopener">30 percent more rework time on AI-generated code</a> compared to human-generated code, along with more review round trips. Teams that are navigating this well are doing so through process changes rather than raw tooling, pre-review automation gates, semantic routing of review assignments by code ownership and reviewer availability, and environment-level QA that lets agents validate their own output before a pull request opens.</p>



<p class="wp-block-paragraph"><strong>Stripe, Ramp, and WorkOS</strong> all came up as examples of teams that have built cloud environments where agents can run tasks more autonomously, with the explicit goal of disqualifying broken work before it reaches a human reviewer. Ramp&#8217;s approach to screenshots, attaching before-and-after visuals to PRs so reviewers can see what changed at a glance, is a small example of the same principle: <strong>reduce the human review burden by doing more verification earlier</strong>.</p>



<p class="wp-block-paragraph"><a href="https://www.intercom.com/help/en/articles/9515824-what-is-fin" target="_blank" rel="noreferrer noopener">Fin from Intercom</a> took a different angle, capturing agent-human interaction logs from development sessions and using them to provide personalized coaching to engineers on how to work more effectively with AI tools. Poletto noted they also tracked which agent skills were actively used versus deprecated, applying the same funnel analysis logic that product teams use for user flows.</p>



<p class="wp-block-paragraph">The thread connecting all of these examples is that the teams seeing compounding returns are treating their development workflow as a system to be instrumented and optimized, not a collection of individual contributors to be nudged toward higher token counts.</p>



<h1 class="wp-block-heading"><span id="writing-code-is-no-longer-a-bottleneck">Writing code is no longer a bottleneck</span></h1>



<p class="wp-block-paragraph">Poletto said in closing:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You should treat your development system as a system that can be optimized. Telemetry, observability, helping understand where those dynamics are, can help you be more confident in where you&#8217;re investing.</p>
</blockquote>



<p class="wp-block-paragraph">The bottleneck, his data suggests, is no longer primarily writing code but <strong>deciding what to build and validating that it works once built</strong>. That shift puts pressure on skills that most engineering hiring and evaluation frameworks are not set up to reward.</p>
<p>The post <a href="https://shiftmag.dev/ai-generates-larger-pull-requests-larger-pull-requests-bring-more-bugs-9932/">AI generates larger pull requests. Larger pull requests bring more bugs.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Gian Segato, Anthropic: &#8220;AI Products Are a Lagging Indicator of Growth&#8221;</title>
		<link>https://shiftmag.dev/gian-segato-anthropic-ai-growth-10231/</link>
		
		<dc:creator><![CDATA[Ivan Simic]]></dc:creator>
		<pubDate>Fri, 12 Jun 2026 13:29:37 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[ai week milano]]></category>
		<category><![CDATA[anthropic]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=10231</guid>

					<description><![CDATA[<p>Watching AI product evolution from the sidelines makes you feel like things are going fast, but according to Anthropic's Gian Segato, that might not be the best metric.</p>
<p>The post <a href="https://shiftmag.dev/gian-segato-anthropic-ai-growth-10231/">Gian Segato, Anthropic: &#8220;AI Products Are a Lagging Indicator of Growth&#8221;</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/06/New-ShiftMag-panel-interview-2.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/New-ShiftMag-panel-interview-2.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/06/New-ShiftMag-panel-interview-2-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/06/New-ShiftMag-panel-interview-2-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/New-ShiftMag-panel-interview-2-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">On June 9th, 2026, Anthropic released Claude Fable 5, its most powerful model ever. Fable is much faster than Opus 4.8, which is itself much faster than the ones before it. It seems like every AI model is faster, better and can do more, but what does that mean in practice? At <a href="https://www.aiweek.it/en/" target="_blank" rel="noreferrer noopener nofollow">AI Week Milano</a>, we listened to<strong> Data Science Manager for the Research team at Anthropic, Gian Segato</strong>, explain how he views AI evolution.</p>



<p class="wp-block-paragraph">In a few years, we&#8217;ve come from an AI chatbot that summarizes documents and writes a bit of code to a knowledge worker that is, by all intents and purposes, the closest thing to an &#8220;AI coworker&#8221; we have. These are jumps in capability that the general public struggles to follow, but they&#8217;re not the <em>real </em>reality.</p>



<p class="wp-block-paragraph">Gian explained in his talk that most of these flagship and public features were developed months before Claude users used them. According to him, seeing a new feature such as coding or image creation is the best way to know where AI <em><strong>was</strong></em> a few months ago, not where it is now. Since AI capabilities move so fast, it&#8217;s hard to keep track of it all:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Products are fundamentally a lagging indicator. You cannot just judge capabilities by looking backwards. Products are built on top of capabilities.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="the-acceleration-is-accelerating">The acceleration is accelerating</span></h2>



<p class="wp-block-paragraph">A good (or just more realistic) way to measure the speed at which models are getting smarter is to look at a new metric that Gian suggested, which works in three steps:</p>



<ul class="wp-block-list">
<li>Take a task, for example, the creation of a simple app, solving a particular problem or reading a data set;</li>



<li>Find out how much it would take for a human expert to do it;</li>



<li>Then check whether a model can do it and how much time it takes.</li>
</ul>



<p class="wp-block-paragraph">Doing this gives you a clean exponential and trajectory of where you are, and where you&#8217;re going. Right now, Gian thinks that the best models can <strong>autonomously</strong> complete work that would take a human expert around two days. Of course, he&#8217;s not talking about answering our emails or vibe-coding mini games, but complex work that requires in-depth knowledge of the matter at hand and particular skills.</p>



<p class="wp-block-paragraph">The speed at which models can do these tasks is, however, doubling every four months. Just a few years ago, it was doubling every <strong>seven</strong> months; the acceleration is, as Gian puts it, accelerating. This means that the capabilities of models are doubling at a rate that is also becoming faster; they take less time to become twice as efficient.</p>



<p class="wp-block-paragraph">What&#8217;s happening is relatively simple: <a href="https://blogs.nvidia.com/blog/ai-scaling-laws/" target="_blank" rel="noreferrer noopener nofollow">we&#8217;re just throwing more power and information to AI models</a>, and they&#8217;re responding to it well.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="538" src="https://shiftmag.dev/wp-content/uploads/2026/06/Shift-In-article-IS-1024x538.png?x94846" alt="" class="wp-image-10294" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/Shift-In-article-IS-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/Shift-In-article-IS-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/06/Shift-In-article-IS-768x403.png 768w, https://shiftmag.dev/wp-content/uploads/2026/06/Shift-In-article-IS.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading"><span id="give-ai-more-power-and-it-gets-proportionally-smarter">Give AI more power and it gets proportionally smarter</span></h2>



<p class="wp-block-paragraph">For those unfamiliar with how <strong>Anthropic</strong> was created, the mention of a paper in which researchers figured out that just giving more power to AI models makes the AI models smarter came as a revelation. This means, in Segato&#8217;s words, that the line that portrays AI capability in relation to the amount of processing power is just a straight line, and one that researchers currently see no end to.</p>



<p class="wp-block-paragraph">The information is both slightly concerning and very impressive. Gian noted that the line spans eight orders of magnitude, which is immense and means that AI capability can scale incredibly while still not showing signs of vulnerability. For the sake of keeping this article short and easy to understand, let&#8217;s just say that this is almost unheard of, except in some physics research and laws of nature:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Typical things we build; engineering things, societal phenomena, you two or three-x them and they break down. There&#8217;s no equation that spans eight orders of magnitude except in physics and the laws of nature.</p>
</blockquote>



<p class="wp-block-paragraph">Calling AI a &#8220;law of physics&#8221; or a &#8220;force of nature&#8221; might be something that belongs in a conference keynote and is something an AI startup is just waiting to use for its new investor pitch after listening to this presentation.</p>



<p class="wp-block-paragraph">However, this also means that there&#8217;s really no way of telling how &#8220;deep&#8221; or &#8220;smart&#8221; an AI model can get if all that it takes for it to get there is just more hardware, more RAM, and more learning material. It also means that we&#8217;ll see more AI data centers as companies like Anthropic throw more chips and money to their models.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2026/06/WhatsApp-Image-2025-05-16-at-10.52.58-copia-1024x683.jpeg?x94846" alt="" class="wp-image-10300" srcset="https://shiftmag.dev/wp-content/uploads/2026/06/WhatsApp-Image-2025-05-16-at-10.52.58-copia-1024x683.jpeg 1024w, https://shiftmag.dev/wp-content/uploads/2026/06/WhatsApp-Image-2025-05-16-at-10.52.58-copia-300x200.jpeg 300w, https://shiftmag.dev/wp-content/uploads/2026/06/WhatsApp-Image-2025-05-16-at-10.52.58-copia-768x512.jpeg 768w, https://shiftmag.dev/wp-content/uploads/2026/06/WhatsApp-Image-2025-05-16-at-10.52.58-copia.jpeg 1080w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">AI Week Milano had more than 25,000 visitors this year. Credit: AI Week</figcaption></figure>



<h2 class="wp-block-heading"><span id="the-more-capability-the-more-risks">The more capability, the more risks</span></h2>



<p class="wp-block-paragraph">Gian continued that AI is now in the &#8220;tasks&#8221; era. This means that projects that take dozens of minutes for a human are regularly delegated to AI models, which do them easily. In six to 12 months, we&#8217;re probably going to enter something Anthropic calls the &#8220;projects&#8221; era, where AI models will be able to handle work that would take dozens of hours for humans.</p>



<p class="wp-block-paragraph">By then, we&#8217;re going to be looking at the equivalent of a CEO instructing a marketing director, rather than a human assigning a task to an AI tool. Seeing as the growth is both fast and linear, we&#8217;re approaching that era whether we&#8217;re prepared for it or not.</p>



<p class="wp-block-paragraph">With this growth come risks that can&#8217;t be ignored. For example, the more work models do, the less supervision is possible for us as humans. For example, if a model is doing a marketing campaign, a human can&#8217;t watch its every step:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Necessarily there&#8217;s going to be some element where we have to accept some lack of supervision. That&#8217;s both intellectually interesting and pretty scary. This is a little unsettling.</p>
</blockquote>



<p class="wp-block-paragraph">Gian&#8217;s &#8220;encouraging&#8221; words aside, this was the part of the presentation where he connected the researchers who&#8217;ve discovered the connection between power and AI model capability as the founders of Anthropic. This served as a diving board to present that Anthropic is really serious in terms of security and monitoring its models. Still, AI creators telling you they&#8217;re &#8220;doing all they can&#8221; to contain AI doesn&#8217;t really fill you with optimism. </p>



<p class="wp-block-paragraph">The talk from Anthropic&#8217;s researcher comes at a time when news of Mythos and Fable, the company&#8217;s extremely capable models and their effects on the internet are making headlines. Gian echoed this by saying that AI models that <em>might</em> potentially be used to engineer a new pandemic, having learned all there is to know about viruses and disease, might also be the best way to find out how cancer spreads and how the brain works. </p>



<p class="wp-block-paragraph">Of course, he believes that the <em>good guys</em> will be the best way of stopping the negative sides of AI from shining through. Whether the world feels like Anthropic are said <em>good guys </em>is another story entirely. </p>



<p class="wp-block-paragraph">Concluding his talk, Gian said that he believes the next period of human history will be the one where we will be able to &#8220;compress 100 years of industrial and scientific revolution into a very fun and interesting decade&#8221;.</p>
<p>The post <a href="https://shiftmag.dev/gian-segato-anthropic-ai-growth-10231/">Gian Segato, Anthropic: &#8220;AI Products Are a Lagging Indicator of Growth&#8221;</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>20 Year Old Unicorn Goes AI-First Without Mass AI Layoffs</title>
		<link>https://shiftmag.dev/ai-first-izabel-jelenic-infobip-10156/</link>
		
		<dc:creator><![CDATA[Anastasija Uspenski]]></dc:creator>
		<pubDate>Wed, 10 Jun 2026 12:03:47 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Developer Experience]]></category>
		<category><![CDATA[agentic AI]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[AI-first]]></category>
		<category><![CDATA[CTO]]></category>
		<category><![CDATA[infobip]]></category>
		<category><![CDATA[Izabel Jelenić]]></category>
		<category><![CDATA[MCP]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=10156</guid>

					<description><![CDATA[<p>A CTO with 20 years of experience through multiple tech shifts sees layoffs not as an AI effect, but as a correction after an unsustainable hiring boom. He sees AI as a reset: an opportunity for strong junior engineers, and a wake-up call for senior developers facing an existential shift in how they stay relevant.</p>
<p>The post <a href="https://shiftmag.dev/ai-first-izabel-jelenic-infobip-10156/">20 Year Old Unicorn Goes AI-First Without Mass AI Layoffs</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p class="wp-block-paragraph">Whether you are a skeptical senior AI engineer, a cautious junior, or an enthusiast who has been fully engaged in AI since the moment you discovered it, the interview I prepared will stay with you.</p>


<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/izo1-1.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/izo1-1.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/izo1-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/izo1-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/izo1-1-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">This May, I spoke with a CTO who truly understands the spirit of the moment and brings a grounded, realistic view of the current wave of AI expansion. <strong>He applies that perspective across a team of roughly 1,000 developers.</strong></p>



<p class="wp-block-paragraph">That is <a href="https://www.linkedin.com/in/izabel-jelenic-1855974/">Izabel Jelenić</a>, CTO of <a href="https://www.infobip.com/" target="_blank" rel="noreferrer noopener">Infobip</a>, first Croatian unicorn. As a co-founder, he has been there since the very beginning, when two university friends started a small startup that later grew into a billion-dollar global organization. <strong>Over the past 20 years, this veteran tech leader has navigated every major technological shift while helping scale the company worldwide.</strong></p>



<p class="wp-block-paragraph">The conversation comes at a moment when we are in the middle of the agentic AI era, where AI systems move beyond answering questions and begin executing tasks, supporting and automating real workflows. At Infobip, this shift has been embraced early and deliberately through an <strong>AI-first transformation that extends across the entire organization, engaging not only developers but employees in every function</strong>.</p>



<h2 class="wp-block-heading"><span id="developers-should-adopt-ai-gradually">Developers should adopt AI gradually</span></h2>



<p class="wp-block-paragraph">I asked Izabel to explain what AI-first means and how the idea of shifting to an AI-first mindset emerged.</p>



<p class="wp-block-paragraph">What followed was a comprehensive breakdown of its evolution, core principles, and practical execution. With over two decades in the tech industry, my interlocutor emphasizes that he has never witnessed a technological shift this monumental, signaling clear proof that AI is rapidly moving from mere hype into everyday utility. </p>



<p class="wp-block-paragraph">That makes mastering these capabilities and maximizing their potential absolutely crucial. A common pitfall, however, is assuming AI belongs solely to developers, which completely misses the mark given the technology&#8217;s universal transformative power:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">That is why Infobip promotes a mindset in which all technical and non-technical employees embrace curiosity toward AI so they can be part of the new business and technological order instead of letting it overtake them. That is the AI-first shift the company has chosen as its business philosophy for the future.</p>
</blockquote>



<p class="wp-block-paragraph">While technical teams naturally adopt new tools faster and more efficiently, he believes those outside software engineering shouldn&#8217;t be left behind. Providing non-technical employees with the right resources and training allows them to leverage AI agents as personal assistants, ultimately making their daily work both easier and faster.</p>



<p class="wp-block-paragraph">When it comes to software developers, the Infobip CTO sees two extremes:</p>



<ol id="block-2e1c8206-e7bb-4608-9086-979d2a7323a0" class="wp-block-list">
<li>Some cling tightly to writing code and represent the anti-AI camp.</li>



<li>Others rely fully on AI and embrace vibe coding.</li>
</ol>



<p class="wp-block-paragraph">In his view, <strong>neither approach is correct</strong>.</p>



<p class="wp-block-paragraph">Because technological development has not yet reached the &#8220;dark factory&#8221; stage (where systems run entirely automated and unassisted), rushing into a full AI concept remains premature. Instead, this direction serves as a north star—a long-term guide rather than an immediate reality. While it remains uncertain if a fully autonomous state is entirely achievable, forcing such an approach today is clearly unwise.</p>



<p class="wp-block-paragraph"><strong>Developers should adopt AI gradually</strong>. They should understand which tools suit them best, such as coding, review, and testing.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="682" src="https://shiftmag.dev/wp-content/uploads/2026/05/DSC04514-1024x682.jpg?x94846" alt="AI-first mindset" class="wp-image-9717" title="AI-first mindset" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/DSC04514-1024x682.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/DSC04514-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/DSC04514-768x512.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/DSC04514.jpg 2000w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo credit: Neven Kacun</figcaption></figure>



<h2 class="wp-block-heading">If you are a bad developer, AI can&#8217;t help you</h2>



<p class="wp-block-paragraph">From the CTO&#8217;s perspective, Infobip developers have adopted an AI-first mindset naturally and with little friction, though overdoing it remains a risk. There is a critical need to retain ownership and maintain a strong architecture, especially since AI often acts as an amplifier, scaling both good practices and underlying flaws:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you are a bad developer, AI will not help you write good code. If you are a good developer, AI can help you a lot in terms of speed. But you have to know how to guide it and recognize its mistakes, because it can be very convincing.</p>
</blockquote>



<p class="wp-block-paragraph">He explains that an AI-first mindset changes how organizations operate. <strong>AI speeds up processes by automating everything that can be automated</strong>. People can master this technology by adopting engineering logic, and they gain a strong advantage for the future.</p>



<p class="wp-block-paragraph">By ignoring AI, a person pushes themselves out of the industry. Izabel finds it surprising that some smaller companies do not adopt AI faster. They are in a growth phase where AI could make a huge difference. They could implement it quickly because teams are small and compact. Despite this, adoption remains low.</p>



<p class="wp-block-paragraph">The Infobip co-founder sees a common misconception that AI is mainly for developers. <strong>The biggest impact actually appears in GTM</strong> because AI automates business processes extremely well. Hyper-personalization becomes fast, advanced, and efficient. It can significantly help business scaling:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You can automatically analyze the market, get a list of potential clients, identify use cases that clients need, and do outreach. With hyper-personalization, you can prepare content in a way that immediately helps a person understand what a company sells, but from their perspective.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="ai-will-reshape-roles-in-tech-companies">AI will reshape roles in tech companies</span></h2>



<p class="wp-block-paragraph">Even though AI can quickly improve business organization, he notices that people mostly focus on coding. Coding has never been the biggest problem in IT businesses, <strong>the real bottlenecks usually appear in sales</strong>.</p>



<p class="wp-block-paragraph">This is where AI transforms work, making processes more efficient and cost-effective. Far from replacing client relationships or removing the human element, the goal is simply to automate repetitive operational tasks, leading to a much more streamlined workflow:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">People need to get on the AI train. If they do not use AI, new companies will emerge that are AI-native. They start with two people and AI. Their approach will feel more natural and much faster than those who use AI only for software development.</p>
</blockquote>



<p class="wp-block-paragraph">When asked to describe what an AI-first transformation looks like from the inside (using Infobip as an example) and whether the company will abandon certain processes or redistribute roles, the answer is far from black and white. This transition will happen gradually, <strong>marking the most significant technological shift in over two decades</strong>. </p>



<p class="wp-block-paragraph">Ultimately, enthusiastic professionals eager to learn will be the ones expanding their roles by actively implementing these new technologies:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">People experienced in one specialization can now work much more broadly. They can keep their core role, but they can accomplish much more. Someone who used to be only a backend developer can now work with databases or frontend. They may not reach the level of an experienced database engineer, but they can handle simple tasks. They do not need to wait for anyone.</p>
</blockquote>



<p class="wp-block-paragraph">The concept of Agentic AI is another major focal point, pointing toward a future where everyone operates with multiple digital assistants. By helping manage core business tasks faster and more efficiently, these agents streamline daily operations while actively expanding a professional&#8217;s overall skill set.</p>



<h2 class="wp-block-heading"><span id="ai-agents-are-still-nothing-without-people">AI agents are still nothing without people</span></h2>



<p class="wp-block-paragraph">As part of its AI-first mindset, Infobip is building its own <a href="https://www.infobip.com/mcp" target="_blank" rel="noreferrer noopener">Model Context Protocol (MCP)</a>, integrations, enabling AI agents to easily consume and interact with Infobip services. CTO believes this is vital because <strong>AI agents will soon be everywhere</strong>, embedded in business systems, running on personal devices, practically on anything you can imagine:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Once it became clear that AI agents are the primary means of interacting with the outside world, adoption accelerated rapidly. Companies realized how easy it is to connect these agents to external systems and services. The key point is that AI agents can now perform real-world tasks by consuming various APIs, especially those available on platforms like MCP. This capability makes them an integral part of business operations, and it’s why we believe their presence will become ubiquitous in the near future.</p>
</blockquote>



<p class="wp-block-paragraph">However, he stresses <strong>the importance of human involvement</strong>. It should not disappear:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Communication must continue to exist. We cannot fully hand it over to AI agents. They can help, but communication becomes even more important because everyone becomes more productive. Without alignment, you can end up with many generated tools that no one understands.</p>
</blockquote>



<p class="wp-block-paragraph">There is also a clear warning regarding hyperproduction, which can easily spiral into chaos if teams begin operating without proper coordination.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="682" src="https://shiftmag.dev/wp-content/uploads/2026/05/DSC04413-1024x682.jpg?x94846" alt="" class="wp-image-9720" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/DSC04413-1024x682.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/DSC04413-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/DSC04413-768x512.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/DSC04413.jpg 2000w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo credit: Neven Kacun</figcaption></figure>



<h2 class="wp-block-heading"><span id="there-are-fewer-junior-developers-but-they-are-still-valuable">There are fewer junior developers, but they are still valuable</span></h2>



<p class="wp-block-paragraph">I also wanted to ask an experienced professional what will happen to junior developers. Demand for them has decreased globally. AI now generates much of the code that juniors used to write.</p>



<p class="wp-block-paragraph">At the same time, a generational shift could lead to a talent shortage. Looking back at the end of the pandemic when the IT sector saw a massive influx of talent, Izabel notes that many recent layoffs in US companies are likely happening under the cover of AI:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">They use AI as an excuse for inflated numbers and inflated costs. That is why fewer juniors get hired. I think balance is returning. We will not see excessive hiring from before, but there will still be a need for young, smart people who bring new ideas and fresh energy into processes.</p>
</blockquote>



<p class="wp-block-paragraph">To combat these talent gaps, Infobip runs targeted internships and collaborates closely with universities, frequently transitioning these students into permanent roles. In CTO&#8217;s experience, these fresh perspectives bring genuine value and innovative ideas to the team. </p>



<p class="wp-block-paragraph"><strong>AI plays a pivotal role in this onboarding process by granting immediate access to learning resources</strong>, allowing junior engineers to upskill rapidly. By leveraging AI as a 24/7 virtual mentor alongside human guidance, juniors can accelerate their professional growth and secure key positions within the company much faster</p>



<h2 class="wp-block-heading"><span id="experienced-developers-face-an-identity-crisis-because-of-ai">Experienced developers face an identity crisis because of AI</span></h2>



<p class="wp-block-paragraph">Skepticism among experienced developers toward AI, in the view of Infobip&#8217;s co-founder, stems from a natural feeling of disappointment and an ego hit that is not easy to accept. At the same time, he points out that over twenty years, workflows, frameworks, programming languages, and machines have all continually evolved:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You spend 20 years writing code as a developer, refining it, running it, seeing how it works, writing tests, and then suddenly you stop looking at the code and start talking to a tool. You experience an identity crisis because you used to identify with the code you wrote.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>My interlocutor views these feelings as entirely legitimate</strong>, but encourages professionals to expand their scope and boost productivity. Ultimate purpose does not need to remain locked within a single job description when fulfillment can be found in wider responsibilities. </p>



<p class="wp-block-paragraph">This adaptation is admittedly difficult, but as he points out, once people move past the &#8220;this is useless&#8221; phase, they position themselves for rapid growth by embracing the new reality.</p>



<p class="wp-block-paragraph">Speaking from his experience as a CTO for 20 years, Izabel describes the transition to an AI-first mindset:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">These changes are quite painful. We also went through turbulence, but at some point you see that AI brings value and you have to use it. Claims about AI being prohibitively expensive often serve as mere excuses. Open-source models can already run locally on standard laptops, completely bypassing the need for intensive training in many scenarios. Ultimately, running a model and training one are two entirely different concepts. Today, hardware capabilities and overall quality are night and day compared to what was available just a few years ago.</p>
</blockquote>



<p class="wp-block-paragraph">The question may be how good an LLM you use, but you will not be able to work without it. The technology is here to stay. It is important to understand its strengths and weaknesses and use it as soon as possible.</p>



<h2 class="wp-block-heading"><span id="the-future-is-uncertain-but-exciting">The future is uncertain but exciting</span></h2>



<p class="wp-block-paragraph">Finally, when asked what the next twenty years of Infobip will look like, the answer was a candid &#8220;I don&#8217;t know.&#8221; Just as no one could have predicted the current shift, no one can definitively predict the next. Still, one thing is certain: there is immense excitement about this change. </p>



<p class="wp-block-paragraph">Within Infobip, both technical and non-technical teams are fully embracing AI. Another deep reshaping is underway, business as usual no longer exists. Instead, the industry is witnessing a real transformation:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Twenty years ago we said &#8220;We are just starting&#8221;, and now we are in the same situation again. That will probably continue for the next 20 years. Our best people keep learning new things that drive them forward instead of standing still.</p>
</blockquote>



<p class="wp-block-paragraph">This mindset feels humble, but also necessary in a time of major technological transformation. It reflects the Socratic idea that true wisdom begins with understanding how little we know.</p>



<p class="wp-block-paragraph"><em>Infobip is the publisher of ShiftMag, recognizing the need for high-quality content for developers.</em></p>
<p>The post <a href="https://shiftmag.dev/ai-first-izabel-jelenic-infobip-10156/">20 Year Old Unicorn Goes AI-First Without Mass AI Layoffs</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Trisha Gee: AI Won&#8217;t Fix Your Broken Pipeline &#8211; It Will Break It Faster</title>
		<link>https://shiftmag.dev/trisha-gee-ai-wont-fix-your-broken-pipeline-it-will-break-it-faster-9785/</link>
		
		<dc:creator><![CDATA[Ivan Pelivanovic]]></dc:creator>
		<pubDate>Wed, 27 May 2026 13:36:39 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Developer Productivity]]></category>
		<category><![CDATA[development]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9785</guid>

					<description><![CDATA[<p>At Devoxx UK, I spoke with Trisha Gee - author and one of the most recognized voices in the Java space - about what really happens when teams lean heavily on AI. Her take was far darker than the conference hype.</p>
<p>The post <a href="https://shiftmag.dev/trisha-gee-ai-wont-fix-your-broken-pipeline-it-will-break-it-faster-9785/">Trisha Gee: AI Won&#8217;t Fix Your Broken Pipeline &#8211; It Will Break It Faster</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="720" src="https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/tirsha-devoxx-768x461.jpg 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>

<div class="wp-block-post-excerpt wp-block-hidden-desktop wp-block-hidden-mobile wp-block-hidden-tablet"><p class="wp-block-post-excerpt__excerpt">At Devoxx UK, I spoke with Trisha Gee &#8211; author and one of the most recognized voices in the Java space &#8211; about what really happens when teams lean heavily on AI. Her take was far darker than the conference hype. </p></div>


<p class="wp-block-paragraph"><strong>Trisha Gee</strong> has spent over two decades in software development, from startups to global enterprises &#8211; equally at home discussing DORA metrics and SPACE frameworks as business outcomes and organizational design.</p>



<p class="wp-block-paragraph">At <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx UK</a>, she gave a talk about how <strong>software engineering principles stay the same regardless of what tooling era you are in</strong>. </p>



<p class="wp-block-paragraph">I wanted to understand what that means right now when AI is writing a significant portion of the code.</p>



<h2 class="wp-block-heading"><span id="ai-exposes-the-weakest-link-not-just-the-fastest-path">AI exposes the weakest link, not just the fastest path</span></h2>



<p class="wp-block-paragraph">Trisha frames AI as an amplifier, not a solution. When I asked what that looks like beyond demos, she put it simply: <strong>it exposes the problems that were already there</strong>, the ones you didn&#8217;t know you had.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The most common thing I saw (I was working at Gradle, so we dealt with a lot of build tooling) was more code, more tests, and tests taking longer. The continuous delivery pipeline took a lot of pressure.</p>
</blockquote>



<p class="wp-block-paragraph">The broader pattern she describes is straightforward but easy to miss when you are excited about shipping faster. &#8220;Whichever part of your system is the weakest, it&#8217;s going to expose that part,&#8221; she said.</p>



<p class="wp-block-paragraph">Reframing it this way, while most conversations about AI adoption focus on what gets faster, Trisha highlights what deteriorates first.</p>



<h2 class="wp-block-heading"><span id="when-code-gets-cheap-everything-else-gets-expensive">When code gets cheap, everything else gets expensive</span></h2>



<p class="wp-block-paragraph">When I asked Trisha where teams should focus once code generation becomes cheap, her answer was <em>everywhere</em>.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-1024x614.jpg?x94846" alt="" class="wp-image-9877" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx-768x461.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/crowd-devoxx.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">&nbsp;Photo: DevoxxUK / Flickr</figcaption></figure>



<p class="wp-block-paragraph">What she means is that optimizing the <strong>writing of code without understanding the surrounding system does not move the needle</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It&#8217;s not about one thing which is going to fix one problem, it&#8217;s about really understanding the whole system, it&#8217;s about understanding even the whole organization, the whole enterprise. Where does IT and technology and software fit into that? What are you really trying to deliver? What is the business benefit?</p>
</blockquote>



<p class="wp-block-paragraph">She described this as <strong>working across two ends of the process</strong>. On the input side, teams need to get better at questioning requirements before writing anything. On the output side, they need to look at build pipelines, test parallelism, flaky tests, and DORA metrics.</p>



<p class="wp-block-paragraph">&#8220;If you can measure those things (your DORA metrics, build times, whether delivered requirements actually give users value) you can start to see which parts of the process are working and which need attention,&#8221; Trisha explained.</p>



<h2 class="wp-block-heading"><span id="measuring-the-wrong-things-optimizes-the-wrong-things">Measuring the wrong things optimizes the wrong things</span></h2>



<p class="wp-block-paragraph">She also makes a sharp point about measurement and optimization.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you measure lines of code for productivity, you&#8217;ll get more lines of code. But really productivity is not just about what we call these activity metrics. It&#8217;s not just lines of code. It&#8217;s not just pull requests, merges, features delivered.</p>
</blockquote>



<p class="wp-block-paragraph">The thing teams consistently miss is the full arc of delivery.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Developer experience and productivity is the whole piece. Did it get out to the user? Did it meet the user&#8217;s needs? Is the user paying for more of our stuff? Is the business getting what they need from what the developers are doing? What you&#8217;re measuring there impacts what you&#8217;re going to optimize.</p>
</blockquote>



<p class="wp-block-paragraph">That last line is worth sitting with. If your productivity metrics stop at pull requests merged, you are optimizing for pull requests merged.</p>



<h2 class="wp-block-heading"><span id="the-space-framework-and-why-three-metrics-beat-one"><br>The SPACE framework and why three metrics beat one</span></h2>



<p class="wp-block-paragraph">When I asked Trisha what teams should measure, she pointed to the SPACE framework. SPACE stands for <strong>satisfaction</strong>, <strong>performance</strong>, <strong>activity</strong>, <strong>communication and collaboration</strong>, and <strong>efficiency and flow</strong>.</p>



<p class="wp-block-paragraph">DORA metrics, which most teams are more familiar with, are a subset of it. Her recommendation is to <strong>pick metrics from three different dimensions</strong> rather than relying on a single category. The reasoning is that single-category metrics tend to be easy to game without improving anything real.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">So yes, you can write more code, but no, you didn&#8217;t do what the business wanted.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-1024x614.jpg?x94846" alt="" class="wp-image-9879" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1-768x461.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/Trisha-Gee-1.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: Marin Pavelić</figcaption></figure>



<p class="wp-block-paragraph">She also brought up Fred Brooks and <strong>communication overhead</strong> as something the industry consistently underweights. The harder metrics to capture, like satisfaction and flow, are often more revealing than the activity metrics that dashboards make easy to track.</p>



<p class="wp-block-paragraph">The business outcomes she keeps returning to are specific: &#8220;You need to measure, did it do what you wanted it to do? Did it get out to the user in time? Did they start spending more money with us? Did it fix your retention problem?&#8221;</p>



<p class="wp-block-paragraph">Those are the things which matter much more to the business.</p>



<h2 class="wp-block-heading"><span id="what-to-fix-before-adopting-ai">What to fix before adopting AI</span></h2>



<p class="wp-block-paragraph">I wondered what teams need to get right before AI tooling can actually help them. Trisha&#8217;s first answer was essentially: stop adopting AI the way you have adopted everything else. </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We generally get requirements, write the code, chuck it out there, and then you&#8217;re kind of done. That&#8217;s not how it should work.</p>
</blockquote>



<p class="wp-block-paragraph">What she advocates for instead is <strong>applying the scientific method to engineering decisions</strong>, which sounds obvious but rarely happens in real life.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Have a hypothesis, do your investigation, measure the results, have a conclusion. Generally speaking, we have not been great at that in our industry.</p>
</blockquote>



<p class="wp-block-paragraph">Applied to AI adoption specifically, that means being precise about what you are actually trying to achieve. What are we trying to achieve with AI? Do we want to deliver more features more quickly to the customer or do we want to perhaps deliver higher quality features? Because those two things are not necessarily the same thing Trisha concluded.</p>



<p class="wp-block-paragraph">Therefore the practical instruction she gives is to <strong>run short experiments, measure one change at a time, and iterate</strong>. But have a hypothesis, figure out how to measure it, measure it, get feedback, and iterate over that.</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="AI Won&amp;apos;t Fix What You Can&amp;apos;t Measure" width="500" height="281" src="https://www.youtube.com/embed/iH4UnTskOSM?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p class="wp-block-paragraph"><br></p>
<p>The post <a href="https://shiftmag.dev/trisha-gee-ai-wont-fix-your-broken-pipeline-it-will-break-it-faster-9785/">Trisha Gee: AI Won&#8217;t Fix Your Broken Pipeline &#8211; It Will Break It Faster</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Killing PRs was the easy part. Now Technical Death Keeps the CTO Up.</title>
		<link>https://shiftmag.dev/killing-prs-was-the-easy-part-now-technical-death-keeps-the-cto-up-9910/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Tue, 26 May 2026 14:39:07 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[iBOOD]]></category>
		<category><![CDATA[Sander Hoogendoorn]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9910</guid>

					<description><![CDATA[<p>Everything you think is non-negotiable in software development, Sander Hoogendoorn's team quietly dropped - and nothing broke.</p>
<p>The post <a href="https://shiftmag.dev/killing-prs-was-the-easy-part-now-technical-death-keeps-the-cto-up-9910/">Killing PRs was the easy part. Now Technical Death Keeps the CTO Up.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander-.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander-.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander--300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander--1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander--768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph"><strong>Sander Hoogendoorn </strong>has been writing code for over 40 years and is currently CTO at iBOOD, a Dutch e-commerce company. </p>



<p class="wp-block-paragraph">His talk at <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx</a>, <em>The Last Pull Request</em>, was a live report from a team that quietly dismantled most of what the industry considers non-negotiable, and then kept shipping.</p>



<p class="wp-block-paragraph">Now there&#8217;s a new concern.</p>



<h2 class="wp-block-heading">AI didn&#8217;t change everything. Change didn&#8217;t wait for AI.</h2>



<p class="wp-block-paragraph">Sander opened with a timeline: source control, IDEs, the web, mobile, the cloud, microservices. Each wave reshaped what developers could build and how. AI is just the latest.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI is going to change everything? No.&nbsp;Everything already changed everything. And this is not the last step.</p>
</blockquote>



<p class="wp-block-paragraph">The point wasn&#8217;t to diminish AI but to put it in context. Every major shift expanded the tooling, and the problem space alongside it. For most teams, that problem space now sits in what Sander calls <strong>complex territory</strong>: no best practices, only things that might emerge from experimentation. Dave Snowden&#8217;s Cynefin framework is blunt about this: in a complex context, there is no right answer to find. You have to invent one.</p>



<p class="wp-block-paragraph">That&#8217;s the actual job, Sander says. Not typing code. <strong>Solving problems that have never been solved before</strong>.</p>



<h2 class="wp-block-heading"><span id="selfware">Selfware</span></h2>



<p class="wp-block-paragraph">Sander introduced a concept: selfware. Software built by non-developers (marketers, finance teams, executives) using AI to <strong>solve their own problems without involving engineering</strong>. </p>



<p class="wp-block-paragraph">At iBOOD, the content director is already doing it. So is the CMO:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We as tech are not fast enough. And I’ve seen this before. In the 80s and 90s, everyone started writing Excel spreadsheets.</p>
</blockquote>



<p class="wp-block-paragraph">The difference now is that the output isn’t a pivot table, it’s software. Unmanaged, untested, running on personal accounts with passwords nobody reviews, exporting customer data in ways that would make your compliance team cry. This is happening right now, and <strong>most engineering teams haven&#8217;t figured out what to do about it</strong>.</p>



<h2 class="wp-block-heading">No scrum, sprints, pull requests&#8230;</h2>



<p class="wp-block-paragraph">The list of things they stopped doing is long: no scrum, no sprints, no retros. Fewer standups. No scrum master, no product owner. Minimal estimates. <strong>No pull requests</strong> &#8211; because every branch is a merge waiting to happen, every review costs time, and reviewers rarely know what the code was supposed to do in the first place.</p>



<p class="wp-block-paragraph">What replaced it? <a href="https://shiftmag.dev/pair-programming-benefits-challenges-563/" target="_blank" rel="noreferrer noopener">Pair programming</a>. <a href="https://shiftmag.dev/mob-programming-why-do-it-882/" target="_blank" rel="noreferrer noopener">Mob programming</a>. Smaller changes, checked in faster, continuously. Everyone on the team is an architect. Everyone is accountable for everything, Sander says:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Perfection is achieved not when there’s nothing more to add but when there’s nothing left to take away.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="pair-me-with-claude">Pair me with Claude</span></h2>



<p class="wp-block-paragraph">Today, Sander&#8217;s 13-person team <strong>pairs with AI through most of their working day</strong>. It became the natural way to work. Currently that means Claude, though that could change next week.</p>



<p class="wp-block-paragraph">AI breaks things. Two weeks before the talk, Sander pushed AI-generated changes that silently removed all dependency injections from their web page constructors. None of the pages were serving data. He didn&#8217;t catch it until later:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">I’m not saying not to use AI. I do it every day. But I do think we should check what it’s doing.</p>
</blockquote>



<p class="wp-block-paragraph">What worries him most is what he calls <strong>technical death</strong> &#8211; a state where a team spends all its time keeping existing software alive, with nothing left for anything new. Technical debt compounding under AI-generated code nobody fully reviews. Complexity accumulating faster than it gets cleaned up. That&#8217;s the real risk.</p>



<h2 class="wp-block-heading"><span id="we-asked-sander-a-few-more-things">We asked Sander a few more things</span></h2>



<h3 class="wp-block-heading"><span id="your-team-dropped-pull-requests-was-that-an-ai-decision">Your team dropped pull requests. Was that an AI decision?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: No, we did that a long time ago, it has nothing to do with AI. The problem with pull requests is that <strong>they slow you down</strong>. The longer you wait with merging back into main, the harder it gets, because other people make changes too. And what you see very often is that people reviewing other people’s code tend not to know or even understand what the code was supposed to do. So they check formatting, linting, naming conventions. Which is pretty stupid, because that you can automate.</p>



<p class="wp-block-paragraph"><strong>Pull requests make sense in open source</strong>, where you have no idea who’s submitting changes or what the quality of their work is. But on your own team? I don’t see any problems with committing code from anybody automatically. We work together every day, we write code together. You just don’t need it.</p>



<h3 class="wp-block-heading"><span id="ai-is-part-of-your-team-now-what-happens-when-something-breaks">AI is part of your team now. What happens when something breaks?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: We don’t track who broke it. <strong>Everybody on my team is accountable for everything, including me</strong>. If I push something and the pipeline fails and I’m not around, somebody else picks it up. I have no doubt about that. So accountability is… I don’t care too much about it, because it’s distributed. We don’t blame people. We just fix it.</p>



<h3 class="wp-block-heading"><span id="you%e2%80%99ve-been-critical-of-agile-is-ai-exposing-that-teams-never-really-understood-it">You’ve been critical of Agile. Is AI exposing that teams never really understood it?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: I’m not critical about Agile. I think a lot of people misunderstand what Agile actually means. Agile does not mean Scrum. Actually, to be quite honest, Scrum is not really Agile. The Scrum Guide says Scrum is immutable, which basically means it’s not Agile, because Agile means you can improve on anything.</p>



<p class="wp-block-paragraph">There is nothing in Agile that says you need to do sprints. The key statement in the Agile Manifesto is the one at the top: <strong>we are uncovering better ways of developing software</strong>. Everything else doesn’t really matter. As long as you have that mindset, there’s always something to improve. No default way of working is going to solve the problem for you.</p>



<h3 class="wp-block-heading"><span id="where-does-this-go-in-two-or-three-years">Where does this go in two or three years?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: I think we will soon realize that the <strong>English language is too ambiguous and not concise enough to specify to an AI what to do</strong>. So what will happen is that we’ll develop better ways of having conversations with AI &#8211; more precise, less ambiguous. And what those languages are called? Programming languages. We will develop programming languages that allow us to talk to an AI in a way that the AI is able to create lower-level code from it.</p>



<p class="wp-block-paragraph">Programming will be programming, except with different tools. As they always have been.</p>
<p>The post <a href="https://shiftmag.dev/killing-prs-was-the-easy-part-now-technical-death-keeps-the-cto-up-9910/">Killing PRs was the easy part. Now Technical Death Keeps the CTO Up.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>When systems go down, devs still juggle 10 tabs. PagerDuty says MCP fixes that</title>
		<link>https://shiftmag.dev/when-systems-go-down-devs-still-juggle-10-tabs-pagerduty-says-mcp-fixes-that-9657/</link>
		
		<dc:creator><![CDATA[Ivan Pelivanovic]]></dc:creator>
		<pubDate>Fri, 22 May 2026 14:13:25 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Developer Productivity]]></category>
		<category><![CDATA[MCP]]></category>
		<category><![CDATA[PagerDuty]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9657</guid>

					<description><![CDATA[<p>Production incidents are a context problem. By the time an engineers understand what's happening, they've already bounced across several different tools - and the incident is still ongoing. PagerDuty thinks MCP is the fix.</p>
<p>The post <a href="https://shiftmag.dev/when-systems-go-down-devs-still-juggle-10-tabs-pagerduty-says-mcp-fixes-that-9657/">When systems go down, devs still juggle 10 tabs. PagerDuty says MCP fixes that</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2-300x158.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2-1024x538.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/rocio-and-sebastian-2-768x403.jpg 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>

<div class="wp-block-post-excerpt wp-block-hidden-desktop wp-block-hidden-mobile wp-block-hidden-tablet"><p class="wp-block-post-excerpt__excerpt">Production incidents are a context problem. By the time an engineers understand what&#8217;s happening, they&#8217;ve already bounced across several different tools &#8211; and the incident is still ongoing. PagerDuty thinks MCP is the fix. </p></div>


<p class="wp-block-paragraph">When incidents hit production systems, engineers rarely stay inside one tool for long,  jumping from logs to dashboards to runbooks, trying to <strong>reconstruct what is actually happening</strong>.</p>



<p class="wp-block-paragraph">Talking to other builders, it seemed like almost everybody faces this context-switching problem. </p>



<p class="wp-block-paragraph"><strong>Rocío Bayon</strong> (Product Manager) and <strong>Sebastian Villanelo</strong> (Sr. Forward Deployed Engineer) from PagerDuty think MCP is how you fix it.</p>



<h2 class="wp-block-heading"><span id="pagerduty-built-their-mcp-to-cut-context-switching">PagerDuty built their MCP to cut context switching</span></h2>



<p class="wp-block-paragraph">Rocío explained that their MCP is solving the issue of context switching:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When an incident hits, the engineer has to go between 5 to 10 different tools to understand what&#8217;s happening.</p>
</blockquote>



<p class="wp-block-paragraph">That&#8217;s the real problem they&#8217;re trying to solve. </p>



<p class="wp-block-paragraph">PagerDuty&#8217;s framing of MCP was interesting: neither Rocío nor Sebastian described MCP as just another integration layer. They framed it as <strong>connective tissue</strong> that gathers logs, alerts, runbooks, and incident context into a single workflow.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">What the MCP does, it brings all that context into one platform where engineers are usually already working.</p>
</blockquote>



<p class="wp-block-paragraph">Most engineering organizations already have enormous amounts of observability data. The real problem is that <strong>it is scattered across systems</strong>, and engineers end up reconstructing operational context manually during incidents.</p>



<h2 class="wp-block-heading"><span id="retrieve-what-you-need-nothing-more">Retrieve what you need, nothing more</span></h2>



<p class="wp-block-paragraph">Sebastian framed the problem as signal retrieval. Rather than feeding the model more information, the goal is <strong>pulling the relevant operational state around a specific incident</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you have the right parameters or the queries and all this stuff, you will retrieve the exact information that you need.</p>
</blockquote>



<p class="wp-block-paragraph">That means <strong>narrowing context around the actual incident window</strong>. When an incident hits, it retrieves information around that time only, Sebastian explained.</p>



<p class="wp-block-paragraph">That also changes how they think about efficiency, reducing context switching directly affects operational speed, token usage, and cost.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You will see that information only with one call. And that saves a lot of tokens and time. That&#8217;s money and time.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-1024x614.jpg?x94846" alt="" class="wp-image-9873" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71-768x461.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/Editorial-IP-71.jpg 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: Lea Lobor</figcaption></figure>



<h2 class="wp-block-heading"><span id="ai-helps-but-engineers-still-decide">AI helps but engineers still decide</span></h2>



<p class="wp-block-paragraph">Still, both of them were careful not to frame AI as autonomous incident management. </p>



<p class="wp-block-paragraph">Rocío repeatedly emphasized that <strong>MCP and AI systems are primarily helping with context gathering and operational visibility</strong>, while engineers remain responsible for the high-risk decisions:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The AI is helping you, but the engineer is the one who is assessing and making decisions where there&#8217;s a high risk.</p>
</blockquote>



<p class="wp-block-paragraph">That human layer is intentional. PagerDuty&#8217;s broader vision seems less about replacing on-call engineers and more about <strong>reducing the operational overhead surrounding incidents</strong>. Their MCP systems help gather information, surface relationships between systems, and accelerate investigation workflows, but humans still decide what actually happens next.</p>



<p class="wp-block-paragraph">Rocío also mentioned that their <strong>SRE agent</strong> is designed to support larger incident workflows beyond information retrieval:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It can also help you trigger those incident workflows. So it can help you resolve the incident. And it learns as it goes.</p>
</blockquote>



<h2 class="wp-block-heading">&#8220;MCP &#8211; the connective tissue between tools&#8221;</h2>



<p class="wp-block-paragraph">I asked Rocío and Sebastian, how does MCP fit into the tools they already use without becoming just another silo.</p>



<p class="wp-block-paragraph">And both of them clearly framed <strong>MCP as anti-silo infrastructure</strong> since it brings everything to one place. Rocío called MCP &#8220;the connective tissue between all these different tools.&#8221;</p>



<p class="wp-block-paragraph">That framing probably captures the broader architectural challenge better than anything else in the interview. </p>



<p class="wp-block-paragraph">Modern incident response already spans dozens of systems: observability platforms, deployment pipelines, CI/CD tooling, ticketing systems, infrastructure management, and communication layers. </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI systems inherit that fragmentation unless something explicitly connects operational state.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="engineers-trust-systems-that-behave-predictably">Engineers trust systems that behave predictably</span></h2>



<p class="wp-block-paragraph">Sebastian mentioned that <strong>teams often react very differently to MCP systems</strong>. Some embrace them immediately while others remain skeptical, especially around security and predictability. For him, trust improves once systems consistently produce expected outcomes:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When a person or a teammate says &#8220;ah, I&#8217;m retrieving what I&#8217;m expecting to retrieve&#8221;, that will help them to trust it.</p>
</blockquote>



<p class="wp-block-paragraph">A lot of AI tooling discussions still focus on model capability, reasoning quality, or benchmark performance. But <strong>operational systems are usually adopted much more pragmatically</strong>. Engineers trust systems that behave predictably, retrieve the right operational context, and fit into workflows they already rely on.</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="10 Tools, One IDE: PagerDuty&amp;apos;s Incident MCP" width="500" height="281" src="https://www.youtube.com/embed/YRIiKkG7JY0?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>
<p>The post <a href="https://shiftmag.dev/when-systems-go-down-devs-still-juggle-10-tabs-pagerduty-says-mcp-fixes-that-9657/">When systems go down, devs still juggle 10 tabs. PagerDuty says MCP fixes that</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</title>
		<link>https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Tue, 19 May 2026 18:39:50 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Vojin Jovanović]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9802</guid>

					<description><![CDATA[<p>A 1,000 libraries, 90% coverage, 1,700 in API tokens. Nobody typed a single test by hand.</p>
<p>The post <a href="https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/">Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">When humans maintained the GraalVM native image reflection metadata repository, <strong>coverage sat at just 14%</strong>. Tests were often stubs that technically compiled but covered nothing meaningful, nobody wanted to write them for someone else&#8217;s code, and the results showed.</p>



<p class="wp-block-paragraph">At <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx UK</a>, <strong>Vojin Jovanovic</strong> (Principal Researcher, Oracle Labs) and <strong>Mihailo Markovic</strong> (Software Engineer, Oracle), presented how they replaced that process with an autonomous AI agent pipeline. </p>



<p class="wp-block-paragraph">The result is <strong>90% dynamic access coverage across more than 1,000 JVM libraries</strong>, roughly 2 billion tokens spent, and a GitHub repository generating thousands of commits per week &#8211; while Vojin was at a hotel the night before the conference.</p>



<h2 class="wp-block-heading"><span id="the-problem-with-graalvm-reflection">The problem with GraalVM reflection</span></h2>



<p class="wp-block-paragraph">GraalVM Native Image takes a Java application, performs static analysis, and AOT compiles it into a single binary. The benefits are significant: <strong>startup roughly 10x faster than a standard JVM</strong>,<strong> </strong>dramatically lower memory footprint. </p>



<p class="wp-block-paragraph">But static analysis has a fundamental limitation: when a method calls Class.forName(&#8220;Foo&#8221;) with a dynamic argument, the analyser <strong>cannot determine at compile time what class will be needed</strong>. Reflection calls break the closed-world assumption.</p>



<p class="wp-block-paragraph">The solution is <strong>reachability metadata</strong> &#8211; a JSON file that tells the native image compiler which classes, methods, and fields need to be accessible at runtime. Writing this metadata requires running tests that exercise all the relevant code paths. </p>



<p class="wp-block-paragraph">For a library like Hibernate Core, that means covering 264 individual reflection call sites. For Tomcat, 205. Across the JVM ecosystem, the number is enormous, and until recently, it was almost entirely a manual process that humans were not doing well.</p>



<h2 class="wp-block-heading"><span id="start-simple-then-add-feedback">Start simple, then add feedback</span></h2>



<p class="wp-block-paragraph">The first approach was straightforward: give an LLM the library source code, tell it to generate comprehensive Java tests, collect the metadata via a JVMTI agent. </p>



<p class="wp-block-paragraph">The results were not impressive &#8211; 5.7% coverage for logback, 2.9% for H2. Vojin noted how this doesn’t feel like AGI.</p>



<p class="wp-block-paragraph">The shift came from <strong>adding GraalVM’s static analysis directly to the agent’s context</strong>. Instead of asking the LLM to guess which code paths matter, the pipeline runs a static analysis pass that identifies every dynamic access call site (the exact class, method, and line number) and feeds that report directly to the agent. With this addition, logback coverage jumped to 97%, H2 to 84.3%, in five iterations.</p>



<p class="wp-block-paragraph"><strong>The next layer was JaCoCo integration</strong>. After each generation round, the pipeline correlates coverage data with the remaining uncovered call sites and feeds only the uncovered ones back into the next iteration. The agent knows exactly what it hit and what it missed. Vojin noted:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We always create a checkpoint in those systems so we can go back to it if something goes wrong. And in these LLM-driven workflows, something is always going wrong.</p>
</blockquote>



<p class="wp-block-paragraph">With this feedback loop: logback reached 100%, H2 reached 96.1%.</p>



<h2 class="wp-block-heading"><span id="coverage-sometimes-still-isn%e2%80%99t-enough">Coverage sometimes still isn’t enough</span></h2>



<p class="wp-block-paragraph">For larger, more complex libraries (Guava, Tomcat, MongoDB) even the feedback loop left gaps. The team added a third technique:<strong> PGO</strong> (Profile-Guided Optimization) <strong>profiling from GraalVM’s Graal compiler</strong>. The profiler samples execution and produces a call trace, which can be correlated with static analysis to identify exactly where a test nearly reached a reflection call but diverged.</p>



<p class="wp-block-paragraph">The profiling feedback tells the agent not just what’s uncovered, but <strong>where in the call stack a test went in the wrong direction</strong> and what it would need to do differently. Results: Guava went from 50% to 72%, Tomcat from 45% to 83%, MongoDB reached 100%. </p>



<p class="wp-block-paragraph">The feedback also tells the agent (and the engineers) why certain calls cannot be covered: a security service only available on Java 6, a cleaner class incompatible with the current JVM. &#8220;If you cannot reach it, tell us why,&#8221; the prompt instructs, and the agent does.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="622" src="https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-1024x622.jpg?x94846" alt="" class="wp-image-9853" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-1024x622.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-300x182.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-768x466.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="cost-agents-and-model-selection">Cost, agents and model selection</span></h2>



<p class="wp-block-paragraph"><strong>Codex</strong> was the first agent framework the team tried. For logback  (a library with 33 dynamic access calls) Codex spent $35:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If we’re spending $35 per library for a thousand libraries, we’re not replacing humans.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>The alternative was P</strong>, a minimal agent that starts with a 200-token context describing basic file operations and bash execution. Same results, roughly 10x cheaper and the lesson is straightforward:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Simple task, use a simple agent. You already give it a lot of rules, a lot of context, and you’ve grounded it enough so it can perform on the level of these big agents.</p>
</blockquote>



<p class="wp-block-paragraph">On model selection, the team compared GPT 5.5 against several open-source alternatives &#8211; GLM, Kimi K2, DeepSeek, Gemma. <strong>GPT 5.5 consistently outperformed them on coverage</strong>. The counterintuitive finding was this: a more expensive model that makes the right decision in one shot can cost less overall than a cheaper model that wastes tokens going in the wrong direction.</p>



<h2 class="wp-block-heading"><span id="the-architecture-that-lets-it-run-without-you">The architecture that lets it run without you</span></h2>



<p class="wp-block-paragraph">The pipeline now operates as a <strong>third-generation system</strong>. When a user opens an issue requesting a library, the agent fetches the issue, runs the generation workflow, verifies the output, creates a pull request, reviews it, and merges or escalates to human review &#8211; automatically. The &#8220;human intervention&#8221; label on GitHub still exists, but its queue has shrunk dramatically.</p>



<p class="wp-block-paragraph"><strong>Documentation, not smarter prompting, was what made the difference</strong>. </p>



<p class="wp-block-paragraph">Vojin outlined what he calls <em><strong>the</strong> <strong>key context layers</strong></em>: </p>



<ul class="wp-block-list">
<li>raison d’être (why does this project exist, in two sentences), </li>



<li>state of direction (where the architecture stands today), </li>



<li>functional specification (how the system behaves), </li>



<li>architectural specification (how it is built), </li>



<li>decision records (what major choices were made and why), and </li>



<li>comprehensive logs that serve as checkpoints for recovery.</li>
</ul>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When you do all of these things, it takes almost a few days for a very big project. You will reduce your work by 50%, 60%, 70%.</p>
</blockquote>



<p class="wp-block-paragraph">The payoff is that agents with this context can diagnose failures, trace them through logs, and fix the underlying system, not just the immediate problem.</p>



<p class="wp-block-paragraph">The RAID system (an automated issue-resolution agent) was built in four prompts on a Sunday morning. It sweeps human intervention tickets, classifies them, performs deep analysis using the project logs, and either opens a GitHub issue for humans or attempts a fix in a forked branch with review. Jovanovic added:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Never work on the problem, always work on the system. You never go and fix a ticket. You always go fix the rules.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="where-things-stand">Where things stand</span></h2>



<p class="wp-block-paragraph">The repository currently supports 1,021 libraries. Without five large Hibernate libraries that predate the automated pipeline, <strong>dynamic access coverage across the ecosystem is 90%</strong>. </p>



<p class="wp-block-paragraph">The GitHub repository has accumulated roughly 2,977 branches. In the week before Devoxx, it logged approximately 8,000-9,000 commits, with agents committing every few minutes around the clock. </p>



<p class="wp-block-paragraph">Total cost for the project: approximately $1,700 in API tokens, plus personal compute on Jovanovic’s home desktop, running around the clock because the Oracle compliance process for cloud infrastructure takes time. The key point is simple:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Start with neural, simplest thing, get results, and then slowly chop off things and put them into algorithms, because they are much cheaper and faster.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-1024x614.jpg?x94846" alt="" class="wp-image-9850" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-768x460.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="we-caught-vojin-jovanovic-for-a-few-more-questions">We caught Vojin Jovanovic for a few more questions!</span></h2>



<p class="wp-block-paragraph">After the talk, we sat down with Vojin for a few minutes to ask him a couple more questions. </p>



<h3 class="wp-block-heading"><span id="you-tested-over-1000-libraries-what-broke-first-when-you-tried-to-scale">You tested over 1,000 libraries. What broke first when you tried to scale?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: Basically everything broke. We had mostly infrastructure issues, all kinds of GitHub failures. When you build a system at this scale, you need to assume that everything will fail and needs to recover. We broke GitHub rate limits. My machine was broken because it was running so many things. The key takeaway is that <strong>you need to build a system in a way that you can always continue</strong>. When things fail, you always checkpoint and continue from a checkpoint. We do work in sizable chunks, and when something fails, you just restart the chunk.</p>



<h3 class="wp-block-heading"><span id="is-just-asking-the-llm-enough">Is just asking the LLM enough?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: If you had asked me four weeks ago, I would say no. Now I would say <strong>you need to know how to ask it</strong>, and it will be enough. I was like, &#8220;GitHub is failing with a 504, abstract away all GitHub calls and retry.&#8221; It did it in two minutes. With today’s models, it’s a matter of minutes, not hours.</p>



<h3 class="wp-block-heading"><span id="what-did-you-learn-about-the-trade-off-between-cost-speed-and-coverage">What did you learn about the trade-off between cost, speed, and coverage?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: I haven’t seen a situation when doing something with an LLM is more expensive than doing that by a human typing on the keyboard. Build a system that uses the most efficient LLM for the job — you’re going to get far and not cost much money at all.</p>



<h3 class="wp-block-heading"><span id="when-does-using-multiple-agents-make-sense">When does using multiple agents make sense?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: Where I use it is for <strong>decisions and research</strong>. I use Claude Opus 4.7, Gemini 3.1, and GPT 5.5. I ask them all, let them discuss, and I discuss together with them. Each brings something to the table. Before, it was always Claude who was the smartest. Now GPT 5.5 is second and close to the first. Things are changing. The most important bit is getting the system designed right. Once you do that, coding, I don’t care who does it.</p>
<p>The post <a href="https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/">Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>How Developers Should Build AI Tools &#8211; So The EU Doesn’t Lose IT</title>
		<link>https://shiftmag.dev/how-developers-should-build-ai-tools-so-the-eu-doesnt-lose-it-9482/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Fri, 15 May 2026 13:20:37 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Ervin Jagatić]]></category>
		<category><![CDATA[infobip]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9482</guid>

					<description><![CDATA[<p>What happens when regulators ask an AI company to explain exactly how its system made a decision? </p>
<p>The post <a href="https://shiftmag.dev/how-developers-should-build-ai-tools-so-the-eu-doesnt-lose-it-9482/">How Developers Should Build AI Tools &#8211; So The EU Doesn’t Lose IT</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">The August 2026 deadline for the <a href="https://artificialintelligenceact.eu/" target="_blank" rel="noreferrer noopener">EU AI Act</a> is getting close, and companies and developerds building AI products are starting to feel it. </p>



<p class="wp-block-paragraph">High-risk AI systems need to be compliant by then, and the ones doing it well aren&#8217;t treating it as a last-minute legal scramble. They&#8217;re <strong>building compliance in from the start</strong>. </p>



<p class="wp-block-paragraph">We sat down with <strong>Ervin Jagatic</strong> (AI Business Unit Director, Infobip) to talk about what that actually looks like at Infobip, and why compliance-by-design is turning into something engineers think about, not just lawyers.</p>



<h2 class="wp-block-heading"><span id="compliance-starts-in-the-design-phase">Compliance starts in the design phase</span></h2>



<p class="wp-block-paragraph">AI Act compliance doesn&#8217;t start at deployment. Ervin is clear on this: <strong>it has to enter during system architecture, before a single line of agent code is written</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Compliance enters during the design phase &#8211; system architecture, data flow planning. Every layer of our AI Agents product, from planning to memory to tool execution, needs to be designed with traceability and human oversight in mind. We can&#8217;t bolt that on after the orchestrator is already coordinating multiple sub-agents autonomously.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="the-ai-act-is-changing-product-development-in-3-ways"><strong>The AI Act is changing product development in 3 ways</strong></span></h2>



<p class="wp-block-paragraph">That shift has already changed how Infobip&#8217;s teams design and ship AI-powered features. Ervin points to three major changes that came directly from the AI Act.</p>



<h3 class="wp-block-heading"><span id="1-transparency-and-auditability">1. Transparency and auditability</span></h3>



<p class="wp-block-paragraph">Transparency is the first. Infobip&#8217;s <strong>AI Agents documentation is explicit</strong>: &#8220;you cannot script exact responses&#8221; &#8211; agents &#8220;generate responses dynamically.&#8221; </p>



<p class="wp-block-paragraph">That unpredictability is exactly why the company expanded its logging and analytics infrastructure, Ervin explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The AI Act&#8217;s transparency obligations pushed us to build comprehensive logging into our Insights and Analytics layer. Every agent execution now produces detailed logs &#8211; requests, responses, processing steps. That&#8217;s not just good engineering, it&#8217;s a direct response to auditability requirements.</p>
</blockquote>



<h3 class="wp-block-heading"><span id="2-explicit-guardrails-instead-of-assumptions">2. Explicit guardrails instead of assumptions</span></h3>



<p class="wp-block-paragraph">The second shift relates to behavioral boundaries and guardrails. Infobip now <strong>requires customers to define capability boundaries, mandatory restrictions, and compliance rules directly inside every agent’s system prompt</strong>, Ervin points out:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our own documentation warns that if you do not explicitly define these constraints, the agent makes assumptions. That design philosophy, forcing explicit guardrails rather than relying on implicit model behavior, comes directly from the Act’s emphasis on risk mitigation by design.</p>
</blockquote>



<h3 class="wp-block-heading"><span id="3-human-oversight-is-a-part-of-the-architecture">3. Human oversight is a part of the architecture</span></h3>



<p class="wp-block-paragraph">The third shift is human oversight &#8211; not as an external policy layer, but <strong>built directly into the product architecture</strong>. Ervin explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><a href="https://www.infobip.com/agentos" target="_blank" rel="noreferrer noopener">AgentOS</a> uses a human-in-the-loop model where complex issues are escalated from AI agents to human agents. We are talking about a core architectural decision that applies human oversight requirements while also improving the product.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="why-compliance-by-design-is-becoming-the-standard">Why compliance-by-design is becoming the standard</span></h2>



<p class="wp-block-paragraph">Ervin believes compliance-by-design is quickly becoming <strong>the</strong> <strong>new industry standard</strong>, particularly for teams building enterprise-grade AI systems:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">For developers and ML engineers at Infobip, compliance-by-design means several practical things. It means every AI agent we build has a defined architecture where an orchestrator coordinates sub-agents, each with explicit scope, tools, and behavioral rules.</p>
</blockquote>



<p class="wp-block-paragraph">It also <strong>changes how engineering teams think about data</strong>. &#8220;It means our engineers think about data lineage and provenance from the moment they design a training pipeline, not because someone from legal asked them to, but because the architecture demands it,&#8221; Ervin points out.</p>



<p class="wp-block-paragraph">To support that approach, Infobip <strong>invested heavily in tooling and analytics infrastructure</strong> that now serves both operational and regulatory purposes, Ervin said:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our Insights and Analytics platform is our compliance infrastructure. When a regulator asks ‘show me how this AI system made this decision,’ we need to answer that question with structured evidence, not anecdotes.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="risk-assessment-depends-on-the-use-case">Risk assessment depends on the use case</span></h2>



<p class="wp-block-paragraph">Internally, the company approaches risk assessment through a framework closely aligned with the <strong>AI Act’s four-tier classification model</strong>: unacceptable, high, limited, and minimal risk. However, Ervin notes that Infobip applies this framework at the feature level rather than only at the system level:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">This is important because a platform like Infobip’s serves vastly different use cases. An AI gamification tool for lead generation on WhatsApp is a fundamentally different risk profile than an AI agent that handles authentication.</p>
</blockquote>



<p class="wp-block-paragraph">The company <strong>evaluates risk based on several factors</strong>, including the sensitivity of the data involved, the autonomy of the AI component, and the intended use case, Ervin explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our internal process follows a lifecycle approach. During identification, we map known and foreseeable risks, including risks from reasonably foreseeable misuse. During estimation, we assess probability and severity. During mitigation, we implement design controls, testing procedures, and human oversight.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>Monitoring continues after deployment</strong> through analytics infrastructure designed for drift detection, incident investigation, and performance tracking. For enterprise customers, risk assessment also becomes a collaborative process between Infobip and client compliance teams.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">A bank using our AI agents to automate customer support has different risk considerations than a retail brand using the same technology for product recommendations. The platform is the same; the risk profile is not.</p>
</blockquote>



<h2 class="wp-block-heading">August 2026 is approaching&#8230;</h2>



<p class="wp-block-paragraph">As August 2026 closes in, Ervin says the conversation has shifted:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The question is no longer whether to integrate compliance into product development. The question is whether you&#8217;ve built the infrastructure to do it at speed.</p>
</blockquote>
<p>The post <a href="https://shiftmag.dev/how-developers-should-build-ai-tools-so-the-eu-doesnt-lose-it-9482/">How Developers Should Build AI Tools &#8211; So The EU Doesn’t Lose IT</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 

Served from: shiftmag.dev @ 2026-06-26 12:54:27 by W3 Total Cache
-->