<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Marin Pavelić, Author at ShiftMag</title>
	<atom:link href="https://shiftmag.dev/author/marinpavelic/feed/" rel="self" type="application/rss+xml" />
	<link>https://shiftmag.dev/author/marinpavelic/</link>
	<description>Insightful engineering content &#38; community</description>
	<lastBuildDate>Wed, 27 May 2026 13:47:52 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shiftmag.dev/wp-content/uploads/2024/08/cropped-ShiftMag-favicon-32x32.png</url>
	<title>Marin Pavelić, Author at ShiftMag</title>
	<link>https://shiftmag.dev/author/marinpavelic/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Killing PRs was the easy part. Now Technical Death Keeps the CTO Up.</title>
		<link>https://shiftmag.dev/killing-prs-was-the-easy-part-now-technical-death-keeps-the-cto-up-9910/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Tue, 26 May 2026 14:39:07 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[iBOOD]]></category>
		<category><![CDATA[Sander Hoogendoorn]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9910</guid>

					<description><![CDATA[<p>Everything you think is non-negotiable in software development, Sander Hoogendoorn's team quietly dropped - and nothing broke.</p>
<p>The post <a href="https://shiftmag.dev/killing-prs-was-the-easy-part-now-technical-death-keeps-the-cto-up-9910/">Killing PRs was the easy part. Now Technical Death Keeps the CTO Up.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img fetchpriority="high" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander-.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander-.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander--300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander--1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/iBOOD-sander--768x403.png 768w" sizes="(max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph"><strong>Sander Hoogendoorn </strong>has been writing code for over 40 years and is currently CTO at iBOOD, a Dutch e-commerce company. </p>



<p class="wp-block-paragraph">His talk at <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx</a>, <em>The Last Pull Request</em>, was a live report from a team that quietly dismantled most of what the industry considers non-negotiable, and then kept shipping.</p>



<p class="wp-block-paragraph">Now there&#8217;s a new concern.</p>



<h2 class="wp-block-heading">AI didn&#8217;t change everything. Change didn&#8217;t wait for AI.</h2>



<p class="wp-block-paragraph">Sander opened with a timeline: source control, IDEs, the web, mobile, the cloud, microservices. Each wave reshaped what developers could build and how. AI is just the latest.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI is going to change everything? No.&nbsp;Everything already changed everything. And this is not the last step.</p>
</blockquote>



<p class="wp-block-paragraph">The point wasn&#8217;t to diminish AI but to put it in context. Every major shift expanded the tooling, and the problem space alongside it. For most teams, that problem space now sits in what Sander calls <strong>complex territory</strong>: no best practices, only things that might emerge from experimentation. Dave Snowden&#8217;s Cynefin framework is blunt about this: in a complex context, there is no right answer to find. You have to invent one.</p>



<p class="wp-block-paragraph">That&#8217;s the actual job, Sander says. Not typing code. <strong>Solving problems that have never been solved before</strong>.</p>



<h2 class="wp-block-heading"><span id="selfware">Selfware</span></h2>



<p class="wp-block-paragraph">Sander introduced a concept: selfware. Software built by non-developers (marketers, finance teams, executives) using AI to <strong>solve their own problems without involving engineering</strong>. </p>



<p class="wp-block-paragraph">At iBOOD, the content director is already doing it. So is the CMO:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We as tech are not fast enough. And I’ve seen this before. In the 80s and 90s, everyone started writing Excel spreadsheets.</p>
</blockquote>



<p class="wp-block-paragraph">The difference now is that the output isn’t a pivot table, it’s software. Unmanaged, untested, running on personal accounts with passwords nobody reviews, exporting customer data in ways that would make your compliance team cry. This is happening right now, and <strong>most engineering teams haven&#8217;t figured out what to do about it</strong>.</p>



<h2 class="wp-block-heading">No scrum, sprints, pull requests&#8230;</h2>



<p class="wp-block-paragraph">The list of things they stopped doing is long: no scrum, no sprints, no retros. Fewer standups. No scrum master, no product owner. Minimal estimates. <strong>No pull requests</strong> &#8211; because every branch is a merge waiting to happen, every review costs time, and reviewers rarely know what the code was supposed to do in the first place.</p>



<p class="wp-block-paragraph">What replaced it? <a href="https://shiftmag.dev/pair-programming-benefits-challenges-563/" target="_blank" rel="noreferrer noopener">Pair programming</a>. <a href="https://shiftmag.dev/mob-programming-why-do-it-882/" target="_blank" rel="noreferrer noopener">Mob programming</a>. Smaller changes, checked in faster, continuously. Everyone on the team is an architect. Everyone is accountable for everything, Sander says:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Perfection is achieved not when there’s nothing more to add but when there’s nothing left to take away.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="pair-me-with-claude">Pair me with Claude</span></h2>



<p class="wp-block-paragraph">Today, Sander&#8217;s 13-person team <strong>pairs with AI through most of their working day</strong>. It became the natural way to work. Currently that means Claude, though that could change next week.</p>



<p class="wp-block-paragraph">AI breaks things. Two weeks before the talk, Sander pushed AI-generated changes that silently removed all dependency injections from their web page constructors. None of the pages were serving data. He didn&#8217;t catch it until later:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">I’m not saying not to use AI. I do it every day. But I do think we should check what it’s doing.</p>
</blockquote>



<p class="wp-block-paragraph">What worries him most is what he calls <strong>technical death</strong> &#8211; a state where a team spends all its time keeping existing software alive, with nothing left for anything new. Technical debt compounding under AI-generated code nobody fully reviews. Complexity accumulating faster than it gets cleaned up. That&#8217;s the real risk.</p>



<h2 class="wp-block-heading"><span id="we-asked-sander-a-few-more-things">We asked Sander a few more things</span></h2>



<h3 class="wp-block-heading"><span id="your-team-dropped-pull-requests-was-that-an-ai-decision">Your team dropped pull requests. Was that an AI decision?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: No, we did that a long time ago, it has nothing to do with AI. The problem with pull requests is that <strong>they slow you down</strong>. The longer you wait with merging back into main, the harder it gets, because other people make changes too. And what you see very often is that people reviewing other people’s code tend not to know or even understand what the code was supposed to do. So they check formatting, linting, naming conventions. Which is pretty stupid, because that you can automate.</p>



<p class="wp-block-paragraph"><strong>Pull requests make sense in open source</strong>, where you have no idea who’s submitting changes or what the quality of their work is. But on your own team? I don’t see any problems with committing code from anybody automatically. We work together every day, we write code together. You just don’t need it.</p>



<h3 class="wp-block-heading"><span id="ai-is-part-of-your-team-now-what-happens-when-something-breaks">AI is part of your team now. What happens when something breaks?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: We don’t track who broke it. <strong>Everybody on my team is accountable for everything, including me</strong>. If I push something and the pipeline fails and I’m not around, somebody else picks it up. I have no doubt about that. So accountability is… I don’t care too much about it, because it’s distributed. We don’t blame people. We just fix it.</p>



<h3 class="wp-block-heading"><span id="you%e2%80%99ve-been-critical-of-agile-is-ai-exposing-that-teams-never-really-understood-it">You’ve been critical of Agile. Is AI exposing that teams never really understood it?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: I’m not critical about Agile. I think a lot of people misunderstand what Agile actually means. Agile does not mean Scrum. Actually, to be quite honest, Scrum is not really Agile. The Scrum Guide says Scrum is immutable, which basically means it’s not Agile, because Agile means you can improve on anything.</p>



<p class="wp-block-paragraph">There is nothing in Agile that says you need to do sprints. The key statement in the Agile Manifesto is the one at the top: <strong>we are uncovering better ways of developing software</strong>. Everything else doesn’t really matter. As long as you have that mindset, there’s always something to improve. No default way of working is going to solve the problem for you.</p>



<h3 class="wp-block-heading"><span id="where-does-this-go-in-two-or-three-years">Where does this go in two or three years?</span></h3>



<p class="wp-block-paragraph"><strong>Sander</strong>: I think we will soon realize that the <strong>English language is too ambiguous and not concise enough to specify to an AI what to do</strong>. So what will happen is that we’ll develop better ways of having conversations with AI &#8211; more precise, less ambiguous. And what those languages are called? Programming languages. We will develop programming languages that allow us to talk to an AI in a way that the AI is able to create lower-level code from it.</p>



<p class="wp-block-paragraph">Programming will be programming, except with different tools. As they always have been.</p>
<p>The post <a href="https://shiftmag.dev/killing-prs-was-the-easy-part-now-technical-death-keeps-the-cto-up-9910/">Killing PRs was the easy part. Now Technical Death Keeps the CTO Up.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</title>
		<link>https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Tue, 19 May 2026 18:39:50 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Vojin Jovanović]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9802</guid>

					<description><![CDATA[<p>A 1,000 libraries, 90% coverage, 1,700 in API tokens. Nobody typed a single test by hand.</p>
<p>The post <a href="https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/">Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/ai-agents-java-libraries-1200x630-1-768x403.png 768w" sizes="(max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">When humans maintained the GraalVM native image reflection metadata repository, <strong>coverage sat at just 14%</strong>. Tests were often stubs that technically compiled but covered nothing meaningful, nobody wanted to write them for someone else&#8217;s code, and the results showed.</p>



<p class="wp-block-paragraph">At <a href="https://www.devoxx.co.uk/" target="_blank" rel="noreferrer noopener">Devoxx UK</a>, <strong>Vojin Jovanovic</strong> (Principal Researcher, Oracle Labs) and <strong>Mihailo Markovic</strong> (Software Engineer, Oracle), presented how they replaced that process with an autonomous AI agent pipeline. </p>



<p class="wp-block-paragraph">The result is <strong>90% dynamic access coverage across more than 1,000 JVM libraries</strong>, roughly 2 billion tokens spent, and a GitHub repository generating thousands of commits per week &#8211; while Vojin was at a hotel the night before the conference.</p>



<h2 class="wp-block-heading"><span id="the-problem-with-graalvm-reflection">The problem with GraalVM reflection</span></h2>



<p class="wp-block-paragraph">GraalVM Native Image takes a Java application, performs static analysis, and AOT compiles it into a single binary. The benefits are significant: <strong>startup roughly 10x faster than a standard JVM</strong>,<strong> </strong>dramatically lower memory footprint. </p>



<p class="wp-block-paragraph">But static analysis has a fundamental limitation: when a method calls Class.forName(&#8220;Foo&#8221;) with a dynamic argument, the analyser <strong>cannot determine at compile time what class will be needed</strong>. Reflection calls break the closed-world assumption.</p>



<p class="wp-block-paragraph">The solution is <strong>reachability metadata</strong> &#8211; a JSON file that tells the native image compiler which classes, methods, and fields need to be accessible at runtime. Writing this metadata requires running tests that exercise all the relevant code paths. </p>



<p class="wp-block-paragraph">For a library like Hibernate Core, that means covering 264 individual reflection call sites. For Tomcat, 205. Across the JVM ecosystem, the number is enormous, and until recently, it was almost entirely a manual process that humans were not doing well.</p>



<h2 class="wp-block-heading"><span id="start-simple-then-add-feedback">Start simple, then add feedback</span></h2>



<p class="wp-block-paragraph">The first approach was straightforward: give an LLM the library source code, tell it to generate comprehensive Java tests, collect the metadata via a JVMTI agent. </p>



<p class="wp-block-paragraph">The results were not impressive &#8211; 5.7% coverage for logback, 2.9% for H2. Vojin noted how this doesn’t feel like AGI.</p>



<p class="wp-block-paragraph">The shift came from <strong>adding GraalVM’s static analysis directly to the agent’s context</strong>. Instead of asking the LLM to guess which code paths matter, the pipeline runs a static analysis pass that identifies every dynamic access call site (the exact class, method, and line number) and feeds that report directly to the agent. With this addition, logback coverage jumped to 97%, H2 to 84.3%, in five iterations.</p>



<p class="wp-block-paragraph"><strong>The next layer was JaCoCo integration</strong>. After each generation round, the pipeline correlates coverage data with the remaining uncovered call sites and feeds only the uncovered ones back into the next iteration. The agent knows exactly what it hit and what it missed. Vojin noted:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We always create a checkpoint in those systems so we can go back to it if something goes wrong. And in these LLM-driven workflows, something is always going wrong.</p>
</blockquote>



<p class="wp-block-paragraph">With this feedback loop: logback reached 100%, H2 reached 96.1%.</p>



<h2 class="wp-block-heading"><span id="coverage-sometimes-still-isn%e2%80%99t-enough">Coverage sometimes still isn’t enough</span></h2>



<p class="wp-block-paragraph">For larger, more complex libraries (Guava, Tomcat, MongoDB) even the feedback loop left gaps. The team added a third technique:<strong> PGO</strong> (Profile-Guided Optimization) <strong>profiling from GraalVM’s Graal compiler</strong>. The profiler samples execution and produces a call trace, which can be correlated with static analysis to identify exactly where a test nearly reached a reflection call but diverged.</p>



<p class="wp-block-paragraph">The profiling feedback tells the agent not just what’s uncovered, but <strong>where in the call stack a test went in the wrong direction</strong> and what it would need to do differently. Results: Guava went from 50% to 72%, Tomcat from 45% to 83%, MongoDB reached 100%. </p>



<p class="wp-block-paragraph">The feedback also tells the agent (and the engineers) why certain calls cannot be covered: a security service only available on Java 6, a cleaner class incompatible with the current JVM. &#8220;If you cannot reach it, tell us why,&#8221; the prompt instructs, and the agent does.</p>



<figure class="wp-block-image size-large"><img decoding="async" width="1024" height="622" src="https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-1024x622.jpg?x94846" alt="" class="wp-image-9853" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-1024x622.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-300x182.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k-768x466.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55258392222_e98c47ff10_k.jpg 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="cost-agents-and-model-selection">Cost, agents and model selection</span></h2>



<p class="wp-block-paragraph"><strong>Codex</strong> was the first agent framework the team tried. For logback  (a library with 33 dynamic access calls) Codex spent $35:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If we’re spending $35 per library for a thousand libraries, we’re not replacing humans.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>The alternative was P</strong>, a minimal agent that starts with a 200-token context describing basic file operations and bash execution. Same results, roughly 10x cheaper and the lesson is straightforward:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Simple task, use a simple agent. You already give it a lot of rules, a lot of context, and you’ve grounded it enough so it can perform on the level of these big agents.</p>
</blockquote>



<p class="wp-block-paragraph">On model selection, the team compared GPT 5.5 against several open-source alternatives &#8211; GLM, Kimi K2, DeepSeek, Gemma. <strong>GPT 5.5 consistently outperformed them on coverage</strong>. The counterintuitive finding was this: a more expensive model that makes the right decision in one shot can cost less overall than a cheaper model that wastes tokens going in the wrong direction.</p>



<h2 class="wp-block-heading"><span id="the-architecture-that-lets-it-run-without-you">The architecture that lets it run without you</span></h2>



<p class="wp-block-paragraph">The pipeline now operates as a <strong>third-generation system</strong>. When a user opens an issue requesting a library, the agent fetches the issue, runs the generation workflow, verifies the output, creates a pull request, reviews it, and merges or escalates to human review &#8211; automatically. The &#8220;human intervention&#8221; label on GitHub still exists, but its queue has shrunk dramatically.</p>



<p class="wp-block-paragraph"><strong>Documentation, not smarter prompting, was what made the difference</strong>. </p>



<p class="wp-block-paragraph">Vojin outlined what he calls <em><strong>the</strong> <strong>key context layers</strong></em>: </p>



<ul class="wp-block-list">
<li>raison d’être (why does this project exist, in two sentences), </li>



<li>state of direction (where the architecture stands today), </li>



<li>functional specification (how the system behaves), </li>



<li>architectural specification (how it is built), </li>



<li>decision records (what major choices were made and why), and </li>



<li>comprehensive logs that serve as checkpoints for recovery.</li>
</ul>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When you do all of these things, it takes almost a few days for a very big project. You will reduce your work by 50%, 60%, 70%.</p>
</blockquote>



<p class="wp-block-paragraph">The payoff is that agents with this context can diagnose failures, trace them through logs, and fix the underlying system, not just the immediate problem.</p>



<p class="wp-block-paragraph">The RAID system (an automated issue-resolution agent) was built in four prompts on a Sunday morning. It sweeps human intervention tickets, classifies them, performs deep analysis using the project logs, and either opens a GitHub issue for humans or attempts a fix in a forked branch with review. Jovanovic added:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Never work on the problem, always work on the system. You never go and fix a ticket. You always go fix the rules.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="where-things-stand">Where things stand</span></h2>



<p class="wp-block-paragraph">The repository currently supports 1,021 libraries. Without five large Hibernate libraries that predate the automated pipeline, <strong>dynamic access coverage across the ecosystem is 90%</strong>. </p>



<p class="wp-block-paragraph">The GitHub repository has accumulated roughly 2,977 branches. In the week before Devoxx, it logged approximately 8,000-9,000 commits, with agents committing every few minutes around the clock. </p>



<p class="wp-block-paragraph">Total cost for the project: approximately $1,700 in API tokens, plus personal compute on Jovanovic’s home desktop, running around the clock because the Oracle compliance process for cloud infrastructure takes time. The key point is simple:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Start with neural, simplest thing, get results, and then slowly chop off things and put them into algorithms, because they are much cheaper and faster.</p>
</blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-1024x614.jpg?x94846" alt="" class="wp-image-9850" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-1024x614.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-300x180.jpg 300w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k-768x460.jpg 768w, https://shiftmag.dev/wp-content/uploads/2026/05/55259209226_a368299446_k.jpg 2048w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption">Photo: DevoxxUK / Flickr</figcaption></figure>



<h2 class="wp-block-heading"><span id="we-caught-vojin-jovanovic-for-a-few-more-questions">We caught Vojin Jovanovic for a few more questions!</span></h2>



<p class="wp-block-paragraph">After the talk, we sat down with Vojin for a few minutes to ask him a couple more questions. </p>



<h3 class="wp-block-heading"><span id="you-tested-over-1000-libraries-what-broke-first-when-you-tried-to-scale">You tested over 1,000 libraries. What broke first when you tried to scale?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: Basically everything broke. We had mostly infrastructure issues, all kinds of GitHub failures. When you build a system at this scale, you need to assume that everything will fail and needs to recover. We broke GitHub rate limits. My machine was broken because it was running so many things. The key takeaway is that <strong>you need to build a system in a way that you can always continue</strong>. When things fail, you always checkpoint and continue from a checkpoint. We do work in sizable chunks, and when something fails, you just restart the chunk.</p>



<h3 class="wp-block-heading"><span id="is-just-asking-the-llm-enough">Is just asking the LLM enough?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: If you had asked me four weeks ago, I would say no. Now I would say <strong>you need to know how to ask it</strong>, and it will be enough. I was like, &#8220;GitHub is failing with a 504, abstract away all GitHub calls and retry.&#8221; It did it in two minutes. With today’s models, it’s a matter of minutes, not hours.</p>



<h3 class="wp-block-heading"><span id="what-did-you-learn-about-the-trade-off-between-cost-speed-and-coverage">What did you learn about the trade-off between cost, speed, and coverage?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: I haven’t seen a situation when doing something with an LLM is more expensive than doing that by a human typing on the keyboard. Build a system that uses the most efficient LLM for the job — you’re going to get far and not cost much money at all.</p>



<h3 class="wp-block-heading"><span id="when-does-using-multiple-agents-make-sense">When does using multiple agents make sense?</span></h3>



<p class="wp-block-paragraph"><strong>Vojin</strong>: Where I use it is for <strong>decisions and research</strong>. I use Claude Opus 4.7, Gemini 3.1, and GPT 5.5. I ask them all, let them discuss, and I discuss together with them. Each brings something to the table. Before, it was always Claude who was the smartest. Now GPT 5.5 is second and close to the first. Things are changing. The most important bit is getting the system designed right. Once you do that, coding, I don’t care who does it.</p>
<p>The post <a href="https://shiftmag.dev/teaching-ai-agents-to-test-1000-java-libraries-and-letting-them-run-while-you-sleep-9802/">Teaching AI Agents to Test 1,000 Java Libraries – and Letting Them Run While You Sleep</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>How Developers Should Build AI Tools &#8211; So The EU Doesn’t Lose IT</title>
		<link>https://shiftmag.dev/how-developers-should-build-ai-tools-so-the-eu-doesnt-lose-it-9482/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Fri, 15 May 2026 13:20:37 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Ervin Jagatić]]></category>
		<category><![CDATA[infobip]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9482</guid>

					<description><![CDATA[<p>What happens when regulators ask an AI company to explain exactly how its system made a decision? </p>
<p>The post <a href="https://shiftmag.dev/how-developers-should-build-ai-tools-so-the-eu-doesnt-lose-it-9482/">How Developers Should Build AI Tools &#8211; So The EU Doesn’t Lose IT</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/05/eu-ai-act-compliance-1200x630-1-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">The August 2026 deadline for the <a href="https://artificialintelligenceact.eu/" target="_blank" rel="noreferrer noopener">EU AI Act</a> is getting close, and companies and developerds building AI products are starting to feel it. </p>



<p class="wp-block-paragraph">High-risk AI systems need to be compliant by then, and the ones doing it well aren&#8217;t treating it as a last-minute legal scramble. They&#8217;re <strong>building compliance in from the start</strong>. </p>



<p class="wp-block-paragraph">We sat down with <strong>Ervin Jagatic</strong> (AI Business Unit Director, Infobip) to talk about what that actually looks like at Infobip, and why compliance-by-design is turning into something engineers think about, not just lawyers.</p>



<h2 class="wp-block-heading"><span id="compliance-starts-in-the-design-phase">Compliance starts in the design phase</span></h2>



<p class="wp-block-paragraph">AI Act compliance doesn&#8217;t start at deployment. Ervin is clear on this: <strong>it has to enter during system architecture, before a single line of agent code is written</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Compliance enters during the design phase &#8211; system architecture, data flow planning. Every layer of our AI Agents product, from planning to memory to tool execution, needs to be designed with traceability and human oversight in mind. We can&#8217;t bolt that on after the orchestrator is already coordinating multiple sub-agents autonomously.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="the-ai-act-is-changing-product-development-in-3-ways"><strong>The AI Act is changing product development in 3 ways</strong></span></h2>



<p class="wp-block-paragraph">That shift has already changed how Infobip&#8217;s teams design and ship AI-powered features. Ervin points to three major changes that came directly from the AI Act.</p>



<h3 class="wp-block-heading"><span id="1-transparency-and-auditability">1. Transparency and auditability</span></h3>



<p class="wp-block-paragraph">Transparency is the first. Infobip&#8217;s <strong>AI Agents documentation is explicit</strong>: &#8220;you cannot script exact responses&#8221; &#8211; agents &#8220;generate responses dynamically.&#8221; </p>



<p class="wp-block-paragraph">That unpredictability is exactly why the company expanded its logging and analytics infrastructure, Ervin explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The AI Act&#8217;s transparency obligations pushed us to build comprehensive logging into our Insights and Analytics layer. Every agent execution now produces detailed logs &#8211; requests, responses, processing steps. That&#8217;s not just good engineering, it&#8217;s a direct response to auditability requirements.</p>
</blockquote>



<h3 class="wp-block-heading"><span id="2-explicit-guardrails-instead-of-assumptions">2. Explicit guardrails instead of assumptions</span></h3>



<p class="wp-block-paragraph">The second shift relates to behavioral boundaries and guardrails. Infobip now <strong>requires customers to define capability boundaries, mandatory restrictions, and compliance rules directly inside every agent’s system prompt</strong>, Ervin points out:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our own documentation warns that if you do not explicitly define these constraints, the agent makes assumptions. That design philosophy, forcing explicit guardrails rather than relying on implicit model behavior, comes directly from the Act’s emphasis on risk mitigation by design.</p>
</blockquote>



<h3 class="wp-block-heading"><span id="3-human-oversight-is-a-part-of-the-architecture">3. Human oversight is a part of the architecture</span></h3>



<p class="wp-block-paragraph">The third shift is human oversight &#8211; not as an external policy layer, but <strong>built directly into the product architecture</strong>. Ervin explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><a href="https://www.infobip.com/agentos" target="_blank" rel="noreferrer noopener">AgentOS</a> uses a human-in-the-loop model where complex issues are escalated from AI agents to human agents. We are talking about a core architectural decision that applies human oversight requirements while also improving the product.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="why-compliance-by-design-is-becoming-the-standard">Why compliance-by-design is becoming the standard</span></h2>



<p class="wp-block-paragraph">Ervin believes compliance-by-design is quickly becoming <strong>the</strong> <strong>new industry standard</strong>, particularly for teams building enterprise-grade AI systems:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">For developers and ML engineers at Infobip, compliance-by-design means several practical things. It means every AI agent we build has a defined architecture where an orchestrator coordinates sub-agents, each with explicit scope, tools, and behavioral rules.</p>
</blockquote>



<p class="wp-block-paragraph">It also <strong>changes how engineering teams think about data</strong>. &#8220;It means our engineers think about data lineage and provenance from the moment they design a training pipeline, not because someone from legal asked them to, but because the architecture demands it,&#8221; Ervin points out.</p>



<p class="wp-block-paragraph">To support that approach, Infobip <strong>invested heavily in tooling and analytics infrastructure</strong> that now serves both operational and regulatory purposes, Ervin said:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our Insights and Analytics platform is our compliance infrastructure. When a regulator asks ‘show me how this AI system made this decision,’ we need to answer that question with structured evidence, not anecdotes.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="risk-assessment-depends-on-the-use-case">Risk assessment depends on the use case</span></h2>



<p class="wp-block-paragraph">Internally, the company approaches risk assessment through a framework closely aligned with the <strong>AI Act’s four-tier classification model</strong>: unacceptable, high, limited, and minimal risk. However, Ervin notes that Infobip applies this framework at the feature level rather than only at the system level:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">This is important because a platform like Infobip’s serves vastly different use cases. An AI gamification tool for lead generation on WhatsApp is a fundamentally different risk profile than an AI agent that handles authentication.</p>
</blockquote>



<p class="wp-block-paragraph">The company <strong>evaluates risk based on several factors</strong>, including the sensitivity of the data involved, the autonomy of the AI component, and the intended use case, Ervin explains:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our internal process follows a lifecycle approach. During identification, we map known and foreseeable risks, including risks from reasonably foreseeable misuse. During estimation, we assess probability and severity. During mitigation, we implement design controls, testing procedures, and human oversight.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>Monitoring continues after deployment</strong> through analytics infrastructure designed for drift detection, incident investigation, and performance tracking. For enterprise customers, risk assessment also becomes a collaborative process between Infobip and client compliance teams.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">A bank using our AI agents to automate customer support has different risk considerations than a retail brand using the same technology for product recommendations. The platform is the same; the risk profile is not.</p>
</blockquote>



<h2 class="wp-block-heading">August 2026 is approaching&#8230;</h2>



<p class="wp-block-paragraph">As August 2026 closes in, Ervin says the conversation has shifted:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The question is no longer whether to integrate compliance into product development. The question is whether you&#8217;ve built the infrastructure to do it at speed.</p>
</blockquote>
<p>The post <a href="https://shiftmag.dev/how-developers-should-build-ai-tools-so-the-eu-doesnt-lose-it-9482/">How Developers Should Build AI Tools &#8211; So The EU Doesn’t Lose IT</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Inside the AWS Hierarchy: Engineering Levels Explained</title>
		<link>https://shiftmag.dev/inside-the-aws-hierarchy-engineering-levels-explained-9174/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Fri, 17 Apr 2026 14:42:25 +0000</pubDate>
				<category><![CDATA[Career]]></category>
		<category><![CDATA[AWS]]></category>
		<category><![CDATA[software developer career]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=9174</guid>

					<description><![CDATA[<p>Decoding the AWS system is a roadmap for developers aiming for the top tiers of Big Tech. Here’s how to climb the ladder.</p>
<p>The post <a href="https://shiftmag.dev/inside-the-aws-hierarchy-engineering-levels-explained-9174/">Inside the AWS Hierarchy: Engineering Levels Explained</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/04/1-3.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/04/1-3.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/04/1-3-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/04/1-3-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/04/1-3-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">Welcome to the engineering hierarchy of one of the world’s biggest tech companies, with nearly 200K employees&#8230; and I’ll be your guide.</p>



<p class="wp-block-paragraph">Since Amazon <strong>doesn’t publicly publish an official breakdown of its career levels</strong>, I reviewed various sources such as <a href="https://dev.to/alexr/amazon-software-engineer-levels-roles-and-expectations-with-salary-1017" target="_blank" rel="noreferrer noopener">Dev.to articles</a> and <a href="https://www.levels.fyi/en-gb/companies/amazon/salaries/software-engineer?country=254" target="_blank" rel="noreferrer noopener">salary websites</a>.</p>



<p class="wp-block-paragraph">To start with, AWS is famous for a decentralized, high-ownership environment where <strong>engineers don’t just write code, they must run what they build</strong>. Understanding the AWS career ladder is essential for any developer looking to enter the ecosystem that powers over a third of the cloud.</p>



<h2 class="wp-block-heading"><span id="the-aws-leveling-system">The AWS leveling system</span></h2>



<p class="wp-block-paragraph">Amazon (and by extension AWS) uses a structured leveling system that spans from L1 to L12. However, most <strong>software engineers operate between L4 and L8</strong>, with higher levels reserved for a very small number of highly influential technical leaders.</p>



<p class="wp-block-paragraph">While some companies hire engineers at lower levels, AWS typically starts its professional software engineering track at L4.</p>



<h3 class="wp-block-heading"><span id="level-4-software-development-engineer-i-sde-i">Level 4: Software Development Engineer I (SDE I)</span></h3>



<p class="wp-block-paragraph">The SDE I position at Amazon serves as the foundational entry point, typically designed for recent <strong>college graduates or engineers with limited professional experience</strong>. At this stage, the primary focus is on building a robust technical baseline. You are responsible for hands-on coding and debugging within well-defined tasks, contributing to the development of small features that integrate into larger projects.</p>



<p class="wp-block-paragraph">While you are expected to deliver high-quality code, you aren&#8217;t expected to do everything yourself. AWS has a <strong>heavy emphasis on mentorship at this level</strong>. Because of that SDE I engineers receive significant guidance from seniors to help them navigate complex systems and understand specific development tools and practices. It is a period of incremental skills development.</p>



<h3 class="wp-block-heading"><span id="level-5-software-development-engineer-ii-sde-ii">Level 5: Software Development Engineer II (SDE II) </span></h3>



<p class="wp-block-paragraph">SDE II engineers are often the backbone of the company’s engineering organization. At this stage, it’s all about being <strong>fully self-sufficient</strong>. SDE IIs are expected to manage their own workloads with minimal supervision, prioritizing tasks effectively to deliver consistent, high-quality results.</p>



<p class="wp-block-paragraph">Beyond just executing tasks, SDE IIs begin to take <strong>ownership of larger systems and components.</strong> They are responsible for designing and implementing solutions that specifically meet Amazon’s high standards for scalability, performance, and reliability. This is also the level where your influence begins to expand beyond your code, you start acting as a mentor to L4 engineers and begin coordinating on cross-functional projects.</p>



<h3 class="wp-block-heading"><span id="level-6-senior-software-development-engineer-sde-iii">Level 6: Senior Software Development Engineer (SDE III) </span></h3>



<p class="wp-block-paragraph">The Senior SDE role is an advanced position reserved for experienced engineers who have demonstrated <strong>strong technical and leadership capabilities</strong>. At this level, the scope of responsibility expands beyond individual contributions to owning larger systems and leading complex projects.</p>



<p class="wp-block-paragraph">Senior SDEs are expected to design scalable architectures, make high-impact technical decisions, and <strong>guide the work of other engineers</strong> on their team and adjacent teams. Their influence is significant, though typically focused within a team or a group of closely related teams.</p>



<h3 class="wp-block-heading"><span id="level-7-8-principal-and-senior-principal-engineer">Level 7 &amp; 8: Principal and Senior Principal Engineer</span></h3>



<p class="wp-block-paragraph">Levels 7 and 8 represent the elite tier of engineering talent at AWS. As a Level 7 (Principal Engineer), you move into a <strong>strategic role</strong>, shaping the technical direction across multiple teams<strong> </strong>or an entire organization. They work closely with senior leadership and are responsible for solving complex, high-impact problems that affect large parts of the business.</p>



<p class="wp-block-paragraph">The Senior Principal Engineer <strong>(L8) sits at the pinnacle of technical innovation</strong>. These engineers define long-term technical vision for major areas of AWS, often influencing hundreds of engineers indirectly through architecture, standards, and strategic initiatives.</p>



<h3 class="wp-block-heading"><span id="level-10-distinguished-engineer-vp">Level 10: Distinguished Engineer / VP</span></h3>



<p class="wp-block-paragraph">Beyond Level 8, the roles become <strong>extremely rare</strong>. It&#8217;s reserved for a small group of engineers with company-wide or industry-level impact. These roles focus on setting long-term technical direction, solving the most complex architectural challenges, and influencing Amazon’s strategy at the highest level.</p>



<p class="wp-block-paragraph">Level 10 is reserved for <strong>world-renowned visionaries and thought leaders</strong> who have a remarkable track record of technical excellence. They are responsible for identifying future technology trends and anticipating market shifts years before they happen. They set the architectural principles and technical standards that position Amazon as a continued leader in the industry. As mentors to the company’s highest technical leaders, they foster a culture of innovation that ensures AWS remains at the cutting edge of what is technologically possible.</p>



<h2 class="wp-block-heading"><span id="what-makes-aws-levels-different">What makes AWS levels different?</span></h2>



<h3 class="wp-block-heading"><span id="1-leadership-principles-lps-as-a-metric">1. Leadership Principles (LPs) as a metric</span></h3>



<p class="wp-block-paragraph">Technical ability alone is not enough for promotion. Engineers are evaluated against <a href="https://www.amazon.jobs/content/en/our-workplace/leadership-principles" target="_blank" rel="noreferrer noopener">Amazon’s Leadership Principles</a>, which are used as a framework to assess impact, decision-making, ownership, and long-term thinking. As engineers progress through levels, they are expected to demonstrate these principles with <strong>increasing scope and consistency.</strong> Promotion depends on how well your work maps to multiple principles, not just one.</p>



<h3 class="wp-block-heading">2. The power of the &#8220;Doc&#8221;</h3>



<p class="wp-block-paragraph">AWS is a famously &#8220;silent&#8221; company. They don&#8217;t use PowerPoints, they use 6-page memos. <strong>Your ability to move from L5 to L6 depends heavily on your writing</strong>. Can you argue your architectural choices in a structured, data-driven document? If you can&#8217;t write, you can&#8217;t lead at AWS.</p>



<h3 class="wp-block-heading"><span id="3-total-compensation-tc-structure">3. Total Compensation (TC) structure</span></h3>



<p class="wp-block-paragraph">AWS compensation is uniquely structured compared to Google or Meta. While they have recently increased base salary caps, a large portion of your wealth comes from <strong>RSUs (Stock)</strong> with a 4-year vesting schedule:</p>



<ul class="wp-block-list">
<li><strong>Year 1:</strong> 5%</li>



<li><strong>Year 2:</strong> 15%</li>



<li><strong>Year 3:</strong> 40%</li>



<li><strong>Year 4:</strong> 40%This &#8220;back-loaded&#8221; vesting is designed to reward those who stay and grow through the levels.</li>
</ul>



<h2 class="wp-block-heading"><span id="aws-salary-expectations-2026-estimates">AWS salary expectations (2026 estimates)</span></h2>



<p class="wp-block-paragraph">To provide an accurate picture of what these roles pay, we analyzed the latest 2026 data aggregates from <strong>Levels.fyi</strong> and <strong>6figr.com</strong>, which utilize verified salary stubs from engineers in major tech hubs. Compensation at AWS varies significantly depending on team, location, and negotiation, but follows a consistent structure of base salary, bonus, and stock (RSUs).</p>



<figure class="wp-block-table"><table class="has-fixed-layout"><thead><tr><td><strong>Level</strong></td><td><strong>Role</strong></td><td><strong>Estimated Total Comp (TC)</strong></td></tr></thead><tbody><tr><td><strong>L4</strong></td><td>SDE I</td><td>$150k &#8211; $245k</td></tr><tr><td><strong>L5</strong></td><td>SDE II</td><td>$220k &#8211; $320k+</td></tr><tr><td><strong>L6</strong></td><td>Senior SDE</td><td>$300k &#8211; $420k+</td></tr><tr><td><strong>L7</strong></td><td>Principal</td><td>$400k &#8211; $600k+</td></tr></tbody></table></figure>



<p class="wp-block-paragraph"><em><strong>Note:</strong> These figures reflect top-tier offers in high-cost US markets. For European hubs like Dublin, Luxembourg, or Berlin, expect a 15-25% reduction in base cash, though stock grants remain aggressive.</em></p>



<h2 class="wp-block-heading"><span id="is-the-climb-worth-it">Is the climb worth it?</span></h2>



<p class="wp-block-paragraph">The AWS ladder is demanding and often associated with a <strong>high-performance culture</strong>. Engineers who progress through it tend to develop strong ownership, system design skills, and operational discipline.</p>



<p class="wp-block-paragraph">While experiences vary by team, time at AWS is generally seen as a strong signal of technical capability and execution, particularly at senior levels. For those who align with its culture, the system offers a clear path for growth.</p>
<p>The post <a href="https://shiftmag.dev/inside-the-aws-hierarchy-engineering-levels-explained-9174/">Inside the AWS Hierarchy: Engineering Levels Explained</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>AI Won’t Replace Security Tools &#8211; It’s Helping Them Prioritize Biggest Threats</title>
		<link>https://shiftmag.dev/ai-wont-replace-security-tools-8760/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Fri, 20 Mar 2026 15:24:51 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Video]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[cybersecurity]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=8760</guid>

					<description><![CDATA[<p>Mackenzie Jackson, security researcher and advocate, told me that AI can’t catch the bugs, but it knows which ones actually matter and provides the context teams need.</p>
<p>The post <a href="https://shiftmag.dev/ai-wont-replace-security-tools-8760/">AI Won’t Replace Security Tools &#8211; It’s Helping Them Prioritize Biggest Threats</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2026/03/Mackenzie-Jackson-2026.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2026/03/Mackenzie-Jackson-2026.png 1200w, https://shiftmag.dev/wp-content/uploads/2026/03/Mackenzie-Jackson-2026-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2026/03/Mackenzie-Jackson-2026-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2026/03/Mackenzie-Jackson-2026-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">For <strong>Mackenzie Jackson</strong> (Developer and Security Advocate, Aikido Security) modern security is a nonstop game of <em>whack-a-mole,</em> with alerts and vulnerabilities keeping teams busy putting out fires instead of preventing them.</p>



<p class="wp-block-paragraph">But that chaos of cybersecurity is familiar territory for him: he investigates attacks and helps teams turn those findings into actionable steps.</p>



<p class="wp-block-paragraph">But strip away the complexity, and <strong>his advice on security</strong> is surprisingly simple:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">One of the biggest areas for smaller teams to focus on is simply stopping the bleeding.</p>
</blockquote>



<p class="wp-block-paragraph">You don’t need a flawless system, <strong>you need to regain control</strong>, and by implementing proactive measures companies neutralize threats before they ever touch production. It’s not a complete solution, but it’s a necessary foundation.</p>



<h2 class="wp-block-heading"><span id="cybersecurity-rests-on-two-pillars-people-and-access">Cybersecurity rests on two pillars: people and access</span></h2>



<p class="wp-block-paragraph">From the outside, cybersecurity looks like a web of interconnected threats and technically, and it is. But when incidents are investigated, the story tends to collapse into something much more&#8230; human:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">When you actually investigate a breach, what happened? Well, someone was probably phished, their credentials stolen, and that gave access to a system.</p>
</blockquote>



<p class="wp-block-paragraph">From there, attackers escalate, finding additional credentials, uncovering secrets, moving laterally through systems. Despite all the layers of technical complexity, <strong>most breaches still come down to two variables</strong>: people and acces<strong>s.</strong> This doesn’t make security easy, but it does make it clearer.</p>



<h2 class="wp-block-heading">Brakes make race cars faster &#8211; and security works the same way</h2>



<p class="wp-block-paragraph">One of the oldest problems in cybersecurity is organizational: How do you convince leadership to invest in something that, ideally, prevents things from happening?</p>



<p class="wp-block-paragraph"><strong>Fear is the usual tactic</strong> so you talk about reputational damage, financial loss, worst-case scenarios. It works, but only to a point and that is why Jackson suggests a different framing:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Brakes make race cars go faster.</p>
</blockquote>



<p class="wp-block-paragraph">It’s a counterintuitive analogy, but an effective one: <strong>without brakes, speed becomes dangerous</strong>. With them, drivers can push harder, take sharper turns, and move faster with confidence. Security, in this sense is an enabler:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If we build security now, we can innovate faster… establish your brakes so that you can go faster with confidence.</p>
</blockquote>



<p class="wp-block-paragraph">The alternative, adding security later, under pressure from compliance or customer demands almost always slows teams down.</p>



<h2 class="wp-block-heading"><span id="security-tools-are-here-to-stay-but-ai-gives-them-context">Security tools are here to stay, but AI gives them context</span></h2>



<p class="wp-block-paragraph">The arrival of AI introduced a pattern: <strong>urgency first, understanding later. </strong></p>



<p class="wp-block-paragraph">After tools like GPT entered the mainstream, companies rushed to integrate AI into their security products. But much of that <strong>early adoption</strong>, Jackson suggests, <strong>was surface-level</strong>. The real value of AI lies elsewhere:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI is a terrible scanner… but it’s great at understanding context.</p>
</blockquote>



<p class="wp-block-paragraph">Traditional security tools are deterministic and that is why they answer yes-or-no questions. Is there a vulnerability? Does this code contain a known issue? <strong>AI, by contrast, is non-deterministic</strong>. It doesn’t always give the same answer twice and that makes it unreliable for detection, but powerful for interpretation:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you give it vulnerabilities and ask how severe this is, how exploitable it is that’s where AI becomes incredibly useful.</p>
</blockquote>



<p class="wp-block-paragraph">In other words, AI doesn’t replace security tools. It <strong>complements</strong> them, helping teams prioritize what actually matters.</p>



<h2 class="wp-block-heading"><span id="ai-doesn%e2%80%99t-make-attackers-smarter-it-makes-attacks-easier">AI doesn’t make attackers smarter, it makes attacks easier</span></h2>



<p class="wp-block-paragraph">So if AI isn’t fundamentally changing how attacks work, what is it changing? <strong>Scale</strong>.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">AI has given script kiddies superpowers.</p>
</blockquote>



<p class="wp-block-paragraph">This phrase captures the shift precisely: AI doesn’t necessarily make attackers more skilled, it makes attacks easier to execute, faster to launch, and accessible to a much larger pool of people. But the core mechanics of attacks remain the same:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It’s not moving the bar up… it’s changing the scale.</p>
</blockquote>



<p class="wp-block-paragraph">And that, perhaps, is the most important takeaway. <strong>Because if the nature of attacks hasn’t fundamentally changed, neither has the foundation of defense.</strong> Good security hygiene. Strong access control. Protecting the software development lifecycle, Jackson points out.</p>



<p class="wp-block-paragraph">The tools may evolve. The threats may accelerate. But the principles still hold.</p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="AI Is Breaking Cybersecurity" width="500" height="281" src="https://www.youtube.com/embed/uU4D8LDWRQI?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>
<p>The post <a href="https://shiftmag.dev/ai-wont-replace-security-tools-8760/">AI Won’t Replace Security Tools &#8211; It’s Helping Them Prioritize Biggest Threats</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>How to Build Competitive Advantage with Agentic AI</title>
		<link>https://shiftmag.dev/how-to-build-competitive-advantage-with-agentic-ai-without-burning-out-6724/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Wed, 29 Oct 2025 11:52:16 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[agentic AI]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[How to Web]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=6724</guid>

					<description><![CDATA[<p>After months of work, your AI agent can run tasks, create content, even make decisions. Exciting - but how do you use it safely and effectively in the real world?</p>
<p>The post <a href="https://shiftmag.dev/how-to-build-competitive-advantage-with-agentic-ai-without-burning-out-6724/">How to Build Competitive Advantage with Agentic AI</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="720" src="https://shiftmag.dev/wp-content/uploads/2025/10/77.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2025/10/77.png 1200w, https://shiftmag.dev/wp-content/uploads/2025/10/77-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/10/77-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2025/10/77-768x461.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">Remembering Christmas 2016, when she received her first Alexa device, <strong>Gillian Armstrong</strong> was fascinated by the idea of a technology that no longer needed humans to understand it &#8211; but instead had to understand us.</p>



<p class="wp-block-paragraph">With this story, the Senior Solutions Architect opened her session at <a href="https://www.howtoweb.co/" target="_blank" rel="noreferrer noopener">How To Web</a>, illustrating just how far artificial intelligence has come in less than a decade. From chatbots and FAQs to generative and now agentic AI, technology is clearly evolving &#8211; <strong>from reactive tools to autonomous problem-solvers</strong>.</p>



<p class="wp-block-paragraph">To illustrate this evolution, Armstrong introduced &#8220;Dan,&#8221; a character based on many industry clients she has worked with:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Dan loves technology. Five years ago, he came back from a conference saying how we need to use AI in everything. Two years ago, it was generative AI in everything. And earlier this year, he stared saying what really matters now is agentic AI.</p>
</blockquote>



<p class="wp-block-paragraph">Through Dan’s story, Armstrong walked through the stages of adoption and the <strong>pitfalls that come with rushing to embrace the latest AI trend </strong>without thinking about the actual business problem.</p>



<h2 class="wp-block-heading"><span id="4-principles-for-building-agentic-ai-systems">4 principles for building agentic AI systems</span></h2>



<p class="wp-block-paragraph">Armstrong framed her talk around four guiding principles that any business should follow when considering AI systems that can act autonomously.</p>



<h3 class="wp-block-heading"><span id="1-understand-your-models">1. Understand your models</span></h3>



<p class="wp-block-paragraph">Not every use case requires generative AI. Many problems can still be solved with<strong> simpler and cheaper tools</strong>, Gillian points out:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If your chatbot is really simple, you don’t actually need to move to a generative AI chatbot.</p>
</blockquote>



<p class="wp-block-paragraph">Different models vary in cost, speed, and suitability. Choosing the wrong one can <strong>increase complexity </strong>without improving outcomes.</p>



<h3 class="wp-block-heading"><span id="2-balance-your-risks">2. Balance your risks</span></h3>



<p class="wp-block-paragraph">Generative and agentic systems come with new challenges like unpredictable responses, bias, and even manipulation attempts from users. That’s why <strong>evaluation frameworks and guardrails</strong> must be<strong> </strong>built in from the start:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">You need to build your systems thinking about responsibility and safety and security upfront.</p>
</blockquote>



<p class="wp-block-paragraph">Armstrong highlighted the importance of internal testing, human-in-the-loop setups for early deployments, and guardrails that can intercept off-topic or harmful interactions.</p>



<h3 class="wp-block-heading"><span id="3-your-fundamentals-matter">3. Your fundamentals matter</span></h3>



<p class="wp-block-paragraph">Agentic AI relies on tools, databases, APIs, document repositories to actually perform tasks. Without those business systems being accessible and modular, AI agents remain just &#8220;fancy FAQ pages&#8221;:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Those tools are gonna make your agent so much more powerful… but remember, it’s going to be able to do things. So don’t give it too much access to everything.</p>
</blockquote>



<p class="wp-block-paragraph">Armstrong stressed the need for <strong>modular business components, exposed through APIs</strong>, as the foundation for any scalable AI strategy.</p>



<h3 class="wp-block-heading"><span id="4-embrace-agility">4. Embrace agility</span></h3>



<p class="wp-block-paragraph">The AI journey is not about tearing down existing systems, but<strong> evolving them step by step</strong>. Businesses should integrate new technologies gradually, keeping systems flexible for future updates:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Fit the technology to your business problem. Don’t try to find the problems to solve with the technology just because it’s new and shiny.</p>
</blockquote>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2025/10/Gillian-Armstrong-htw-202557.png?x94846" alt="" class="wp-image-6727" srcset="https://shiftmag.dev/wp-content/uploads/2025/10/Gillian-Armstrong-htw-202557.png 800w, https://shiftmag.dev/wp-content/uploads/2025/10/Gillian-Armstrong-htw-202557-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/10/Gillian-Armstrong-htw-202557-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></figure>



<h2 class="wp-block-heading"><span id="the-real-break-lies-in-moving-from-chatbots-to-true-ai-agents">The real break lies in moving from chatbots to true AI agents</span></h2>



<p class="wp-block-paragraph">Gillian revisited the history of customer service technology. Early websites with FAQ pages gave way to search functions, then to chatbots powered by natural language processing. Adding speech-to-text and text-to-speech made them more interactive, but still rigid. They required <strong>pre-defined intents and responses.</strong></p>



<p class="wp-block-paragraph">Generative AI improved flexibility, but at the cost of control. Large language models can generate natural responses, yet they also introduce risks. That’s why evaluation, guardrails, and<strong> careful system design are essential.</strong></p>



<p class="wp-block-paragraph">Still, Gillian argued, generative chatbots are ultimately just a smarter version of the FAQ. The real shift happens with <strong>AI agents</strong> &#8211; systems that not only understand language, but can reason, make decisions, and act on them.</p>



<p class="wp-block-paragraph">Take the example of an<strong> insurance claims agent.</strong> Instead of simply answering questions, the AI recognizes when a customer actually wants to file a claim. It can ask for missing details, access the policy database, and submit the claim itself through connected tools:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Every single one of those thoughts is a call to the model. And that will start to add latency and cost. You need to be aware of that when you’re building an agent.</p>
</blockquote>



<p class="wp-block-paragraph">Because of this complexity and risk, Gillian recommended modularizing agents. Instead of building one super-agent with access to everything, companies should design <strong>multiple specialized agents</strong>, each with a narrow goal and limited set of tools:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Not only do we need to modularize our business functionality, we need to modularize our agents. We can reuse them in different ways. We can keep our risk low.</p>
</blockquote>



<h2 class="wp-block-heading">Strong foundations &#8211; smarter AI</h2>



<p class="wp-block-paragraph">In closing, Gillian reminded the audience that AI adoption should be a<strong> journey of evolution, not revolution. </strong>Each stage builds on the previous one, and the best results come when businesses align technology with real problems &#8211; not the other way around:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Make sure you’re building your systems now so that they’re open and agile, and you can bring these things in.</p>
</blockquote>



<p class="wp-block-paragraph">For businesses wondering how to take the next step, her advice was clear: focus on strong foundations, balance risk with innovation, and design with agility in mind. That way, agentic AI systems can move from buzzword to genuine competitive advantage.</p>



<p class="wp-block-paragraph"></p>
<p>The post <a href="https://shiftmag.dev/how-to-build-competitive-advantage-with-agentic-ai-without-burning-out-6724/">How to Build Competitive Advantage with Agentic AI</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>You Built an AI Agent &#8211; But How Do You Price It?</title>
		<link>https://shiftmag.dev/you-built-an-ai-agent-but-how-do-you-price-it-6379/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Thu, 09 Oct 2025 13:43:08 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Emanuel Martonca]]></category>
		<category><![CDATA[How to Web]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=6379</guid>

					<description><![CDATA[<p>You finally built that AI agent. It writes code, drafts emails, maybe even runs tasks on its own. It’s powerful, useful - and ready to ship. But then reality hits: how do you actually price something like this?</p>
<p>The post <a href="https://shiftmag.dev/you-built-an-ai-agent-but-how-do-you-price-it-6379/">You Built an AI Agent &#8211; But How Do You Price It?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202575.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202575.png 800w, https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202575-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202575-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></figure>


<p class="wp-block-paragraph">For years, SaaS companies cruised on easy per-seat pricing and almost-free scaling. Enter AI: every query burns power, every model costs cash, and suddenly <strong>startups are in a pricing puzzle</strong>.</p>



<p class="wp-block-paragraph">In his talk <em>Pricing for AI Agents</em> at the <a href="https://www.howtoweb.co/" target="_blank" rel="noreferrer noopener">How to Web Conference 2025 in Bucharest</a>, Emanuel Martonca (Founder, Pricing Strategist at Soft Fight) dives into <strong>why traditional SaaS pricing no longer works</strong> &#8211; and what it takes to build sustainable business models in the AI era.</p>



<h2 class="wp-block-heading">Let&#8217;s start with an example</h2>



<p class="wp-block-paragraph">Emanuel opens his talk with a simple story: </p>



<p class="wp-block-paragraph">Imagine you’re an angel investor having lunch with a founder who’s built an AI platform that <strong>helps large companies map their employees’ skills</strong>. </p>



<p class="wp-block-paragraph">The founder explains that the tool lets sales teams quickly find experts in niche technologies across the organization, making it easier to sell IT services.</p>



<p class="wp-block-paragraph">In a company of 10.000 people, sales representatives are often far removed from the engineers doing the actual work. So, when a client in New York asks about a specific technology, the salesperson might <strong>have no idea whether anyone in the company has that expertise</strong> &#8211; or even where to find them.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The founder claims his AI solves this problem in days, not months, and points out the lucrative potential. After all, some companies currently pay almost $400.000 annually for software solving the same problem.</p>
</blockquote>



<p class="wp-block-paragraph">However, Emanuel warns &#8211; there are<strong> couple of critical considerations</strong> when thinking about AI pricing.</p>



<h2 class="wp-block-heading"><span id="think-saas-think-small-think-ai-think-big-and-expensive">Think SaaS, think small. Think AI, think big (and expensive)</span></h2>



<p class="wp-block-paragraph">AI is fundamentally different from traditional SaaS, explains Martonca. While SaaS benefits from near-zero marginal costs for additional users and high gross margins, <strong>AI is computationally expensive</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">A single AI query can consume <strong>ten times more energy than a Google search</strong>. Such costs must be considered in pricing, along with other factors like marketing, positioning, differentiation, and risk</p>
</blockquote>



<p class="wp-block-paragraph">Unlike SaaS, where the main concern might be looking like a glorified spreadsheet, AI introduces far more complex risks. </p>



<p class="wp-block-paragraph">Traditional SaaS frameworks and mental models <strong>don’t translate to AI startups</strong> &#8211; they require a different approach. In particular, common SaaS seat-based subscription models often fail in the AI context.</p>



<p class="wp-block-paragraph">As Martonca highlights, <strong>AI frequently replaces the very people you might charge for</strong>, making seat-based pricing impractical.</p>



<p class="wp-block-paragraph">Moreover, many AI projects, proofs of concept, pilots, or experiments never reach production:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Every AI pilot that doesn’t go to production represents lost revenue for software vendors, and AI accelerates development, reducing the need for large teams &#8211; further impacting legacy software revenues.</p>
</blockquote>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202576-1.png?x94846" alt="" class="wp-image-6466" srcset="https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202576-1.png 800w, https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202576-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/10/emmanuel-martonca-202576-1-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></figure>



<h2 class="wp-block-heading"><span id="price-the-problem-not-the-technology">Price the problem, not the technology!</span></h2>



<p class="wp-block-paragraph">Currently, there is <strong>no standard model for AI pricing.</strong> </p>



<p class="wp-block-paragraph">Unlike SaaS, where &#8220;good-better-best&#8221; packages and per-seat subscriptions were well-established, AI pricing is complex and still experimental:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><strong>You can price by input, output, outcome, or performance</strong>. The choice depends heavily on the problem being solved and the client’s perceived value, rather than purely on technological complexity.</p>
</blockquote>



<p class="wp-block-paragraph">Many founders get caught up in explaining how their AI works so they talk about the models, the architecture, the agents, but clients care most about solving a business problem. </p>



<p class="wp-block-paragraph">A central lesson is to <strong>price the problem, not the technology</strong>, Emanuel points out.</p>



<p class="wp-block-paragraph">In the skills-matching example, instead of charging for the software or the AI engine, the vendor could charge for each successful match of employee to project. This approach shifts risk to the vendor, but<strong> </strong>aligns price with the value delivered to the client.</p>



<h2 class="wp-block-heading"><span id="companies-used-to-be-product-or-service-focused-not-anymore">Companies used to be product- or service-focused. Not anymore.</span></h2>



<p class="wp-block-paragraph">Emanuel also highlights the <strong>blurring line between products and services in AI</strong>. Traditionally, companies were either product-focused or service-focused. AI challenges this distinction.</p>



<p class="wp-block-paragraph">OpenAI, for example, sells<strong> </strong>consulting services alongside its technology platform. Delivering outcomes and real business results has become the primary source of value, not just providing access to software.</p>



<p class="wp-block-paragraph"><strong>AI budgets also differ</strong> <strong>from traditional IT budgets</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">SaaS historically took money from IT departments. AI often taps into HR or services budgets, which are significantly larger.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="always-start-with-the-customer-and-their-problem">Always start with the customer and their problem</span></h2>



<p class="wp-block-paragraph">For both startups and established companies, Emanuel’s advice is clear: start with the customer and the problem they need solved. <strong>Identify what they value and what they’re willing to pay for</strong> &#8211; only then design a solution and assess its economic viability.</p>



<p class="wp-block-paragraph">Most AI vendors currently use a <strong>hybrid model</strong>: a flat base price for platform access, some included usage, and additional charges based on usage or tokens. It’s a pragmatic &#8211; if temporary &#8211; solution in an environment full of unknowns. Yet the fundamental principles of pricing still apply:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Understand the value delivered, choose the right metric for your model, and price according to the problem solved, not just the technology deployed.</p>
</blockquote>



<p class="wp-block-paragraph">This is important beacuse getting <strong>pricing wrong can be fatal</strong>. Companies that adapt their models to reflect value and outcomes, rather than legacy SaaS logic, will be best positioned to succeed in this new era.</p>
<p>The post <a href="https://shiftmag.dev/you-built-an-ai-agent-but-how-do-you-price-it-6379/">You Built an AI Agent &#8211; But How Do You Price It?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>The Future of Dev Tools is Autonomous, Engineers Will Become Fleet Generals</title>
		<link>https://shiftmag.dev/deveeloper-tools-ai-software-engineering-5299/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Thu, 22 May 2025 12:39:48 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AI agents]]></category>
		<category><![CDATA[Codeium]]></category>
		<category><![CDATA[Daytona]]></category>
		<category><![CDATA[developer tools]]></category>
		<category><![CDATA[Ivan Burazin]]></category>
		<category><![CDATA[Jesse Robbins]]></category>
		<category><![CDATA[Kenneth Auchenberg]]></category>
		<category><![CDATA[Orca]]></category>
		<category><![CDATA[Peter Zakin]]></category>
		<category><![CDATA[Shift Conference Miami]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=5299</guid>

					<description><![CDATA[<p>We're all going from being software engineers writing code to model operators—code composers. </p>
<p>The post <a href="https://shiftmag.dev/deveeloper-tools-ai-software-engineering-5299/">The Future of Dev Tools is Autonomous, Engineers Will Become Fleet Generals</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="720" src="https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202582.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202582.png 1200w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202582-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202582-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202582-768x461.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">The development environment is undergoing a radical shift at the intersection of software engineering and AI. No longer just about writing clean code, developer experience today is about<strong> crafting systems that collaborate with humans and increasingly with AI agents. </strong></p>



<p class="wp-block-paragraph">This was the central theme of a panel conversation <span style="margin: 0px;padding: 0px"><em>titled &#8220;Investing in Dev Tools in the Age of A</em>I&#8221; that featured <strong>Jesse Robbins</strong> (Heavybit), <strong>Peter Zakin</strong> (Codeium),<strong> Kenneth Auchenberg</strong> (AlleyCorp), and <strong>Ivan Burazin</strong> (Daytona) </span>at the <a href="https://shift.infobip.com/us/" target="_blank" rel="noreferrer noopener">Infobip Shift Miami conference</a>.</p>



<p class="wp-block-paragraph">Speakers first reflected on how developer experience has evolved—<strong>from desktop to mobile, from static tools to dynamic, collaborative environments</strong>. But as Jesse put it, we&#8217;re now entering a new phase where delegation is the new automation.</p>



<p class="wp-block-paragraph">Traditionally, improving developer experience meant offering excellent documentation, strong community support, clear APIs, and plenty of example code. <strong>But today, that&#8217;s not enough</strong>, Jesse points out: </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If you&#8217;re building software now, you&#8217;re not just designing for humans. <strong>You&#8217;re designing for agents</strong>, too. And they need the same things—documentation, context, and clarity of intent.</p>
</blockquote>



<p class="wp-block-paragraph">Jesse compared the rise of autonomous agents in dev workflows to a <strong>new kind of SEO, where developers optimize their tools for discoverability and cooperation with AI agents</strong>. Whether it&#8217;s delegating tasks, workflows, or entire design processes, success now depends on how well tools can communicate their purpose to humans and machines alike.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202581-1024x614.png?x94846" alt="" class="wp-image-5317" srcset="https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202581-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202581-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202581-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202581.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading"><span id="engineers-as-fleet-generals">Engineers as Fleet Generals</span></h2>



<p class="wp-block-paragraph">This shift is <strong>redefining what it means to be a developer</strong>. Kenneth described it in striking terms:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We&#8217;re all going from being software engineers writing code to <strong>model operators—code composers.</strong> It&#8217;s like being an art director, hovering over the shoulder of your agents.</p>
</blockquote>



<p class="wp-block-paragraph">He likened the future of development to managing a &#8220;fleet&#8221; of AI workers, where <strong>engineers must learn to orchestrate, debug, and direct multiple agents</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Every engineer is becoming a fleet general. You&#8217;re not just an IC anymore—managing autonomous contributors.</p>
</blockquote>



<p class="wp-block-paragraph">That may sound intimidating, but Peter argued that <strong>humans will continue to play a central, even irreplaceable role</strong>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">What remains when AI removes all the toil—the boring work? Humans are still responsible for the labor. We&#8217;re the backstop. We&#8217;re the audit log. And that&#8217;s not going away.</p>
</blockquote>



<p class="wp-block-paragraph">As the panelists agreed, there&#8217;s an emerging class of problems—ethics, oversight, accountability—<strong>that only humans can solve.</strong> Responsibility remains a human job even in a world run by autonomous agents.</p>



<h2 class="wp-block-heading"><span id="cursor-windsurf-and-the-agent-race">Cursor, Windsurf, and the Agent Race</span></h2>



<p class="wp-block-paragraph">The latter part of the discussion focused on recent AI-native tools that are transforming the developer landscape—Cursor and Windsurf. Cursor, an enhanced version of VS Code utilizing AI agents, is now <strong>valued at $9 billion. </strong></p>



<p class="wp-block-paragraph">Windsurf, a similar tool, was just acquired by OpenAI for $3 billion. These figures raised a <strong>provocative question </strong>often heard in investor conversations: <em>&#8220;What if AWS or Microsoft builds this?&#8221;</em></p>



<p class="wp-block-paragraph">Kenneth, who was part of the original 12-person VS Code team, had a candid response:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Cursor and Windsurf aren&#8217;t really in the business of building a code editor. They&#8217;re using the<strong> VS Code base as a shipping vehicle</strong>. The real innovation is the agent.</p>
</blockquote>



<p class="wp-block-paragraph">Building an editor like VS Code from scratch is a massive endeavor, one only a few tech giants could undertake. But the opportunity now lies in <strong>building the best agent experience on top of that infrastructure</strong>, Kenneth points out:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We&#8217;re moving toward an agentic future where<strong> everyone will have agents on their engineering team</strong>. That&#8217;s the business. That&#8217;s the value.</p>
</blockquote>



<p class="wp-block-paragraph">This paradigm shift means the next battle in dev tools isn&#8217;t about IDEs—it&#8217;s about <strong>who builds the most effective co-pilot.</strong></p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="614" src="https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202580-1024x614.png?x94846" alt="" class="wp-image-5316" srcset="https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202580-1024x614.png 1024w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202580-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202580-768x461.png 768w, https://shiftmag.dev/wp-content/uploads/2025/05/infobip-shift-miami-202580.png 1200w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<h2 class="wp-block-heading"><span id="open-source-as-the-foundation">Open Source as the Foundation</span></h2>



<p class="wp-block-paragraph">Jesse pointed to the importance of the <strong>open-source ecosystem in enabling this transition.</strong> He&#8217;s an investor in Continue, an open-source plugin integrating VS Code and JetBrains tools to bring AI into developers&#8217; everyday workflows:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Because of this open-source ecosystem, I started writing code again. It felt joyful. This moment in time makes that possible. You get prompted, and you learn. <br></p>



<p class="wp-block-paragraph">It&#8217;s what I remember loving about development. Experiencing <strong>joy in collaborating with tools</strong> instead of fighting them may be the most important change of all.</p>
</blockquote>



<p class="wp-block-paragraph">The developer landscape is undergoing a seismic shift, driven not only by breakthroughs in AI <strong>but also by developers&#8217; changing roles</strong>. Whether through tools like Cursor and Windsurf or evolving team dynamics that blend engineering with product thinking, the panelists painted a future where developers are not just builders but strategic decision-makers.</p>



<p class="wp-block-paragraph">In an era where AI is both collaborator and competitor, the key challenge remains: <strong>staying adaptable, curious, and aligned with long-term value</strong>, regardless of whether you&#8217;re coding the next billion-dollar product or redefining what it means to build software.</p>



<p class="wp-block-paragraph"></p>
<p>The post <a href="https://shiftmag.dev/deveeloper-tools-ai-software-engineering-5299/">The Future of Dev Tools is Autonomous, Engineers Will Become Fleet Generals</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>How We&#8217;ve Built An MCP Server For Messaging</title>
		<link>https://shiftmag.dev/how-infobips-mcp-enables-true-agentic-ai-5220/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Wed, 07 May 2025 10:53:12 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[AI agents]]></category>
		<category><![CDATA[MCP]]></category>
		<category><![CDATA[Shift Conference]]></category>
		<category><![CDATA[Shift Miami 2025]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=5220</guid>

					<description><![CDATA[<p>Infobip’s MCP is redefining how AI agents interact with real-world systems - enabling them to take meaningful actions with nothing more than a prompt.</p>
<p>The post <a href="https://shiftmag.dev/how-infobips-mcp-enables-true-agentic-ai-5220/">How We&#8217;ve Built An MCP Server For Messaging</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2025/05/emanuel-lacic-2025-infobip-shift-miami.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2025/05/emanuel-lacic-2025-infobip-shift-miami.png 800w, https://shiftmag.dev/wp-content/uploads/2025/05/emanuel-lacic-2025-infobip-shift-miami-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2025/05/emanuel-lacic-2025-infobip-shift-miami-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></figure>


<p class="wp-block-paragraph">At this year&#8217;s <a href="https://shift.infobip.com/us/">Shift conference in Miami</a>, <strong>Emanuel Lacic</strong> (Principal Engineer, Infobip) provided a glimpse into the future of agentic AI, where large language models do more than just generate text &#8211; they can actively shape the real world.</p>



<p class="wp-block-paragraph">His talk centered around Infobip’s integration of the Model Context Protocol (MCP), <strong>a new open standard that enables AI agents to directly access and execute remote API calls</strong> — including Infobip’s own CPaaS tools like SMS, Viber, and WhatsApp — without glue code or manual integration. MCP is quickly emerging as a major trend in the tech industry, with many companies jumping on board. Let&#8217;s look at how Infobip plans to integrate its CPaaS tool channels in the near future to drive automation and improve usability.</p>



<h2 class="wp-block-heading"><span id="enabling-ai-agents-to-do-more-than-just-talk">Enabling AI agents to do more than just talk</span></h2>



<p class="wp-block-paragraph">Rather than simply diving into the buzz around generative AI and autonomous agents, Emanuel began by challenging the status quo. Instead of focusing on the conversational abilities of large language models, he posed a more ambitious question: &#8220;<strong>How can we make agents not only talk, but actually do something meaningful?</strong>&#8220;</p>



<p class="wp-block-paragraph">Traditionally, implementing actions required developers to <strong>manually stitch together APIs</strong>,<strong> write middleware</strong>, and <strong>carefully orchestrate how LLMs interacted with external tools</strong>. However, as Emanuel explained, this approach doesn&#8217;t scale, especially when every API has its own specifications and quirks.</p>



<p class="wp-block-paragraph"><strong>Infobip’s answer is to remove this friction entirely</strong>.</p>



<h2 class="wp-block-heading">Infobip&#8217;s CPaaS is now agent-friendly</h2>



<p class="wp-block-paragraph">Infobip’s CPaaS platform is known for enabling messaging across SMS, Viber, WhatsApp, and other channels. Until now, integrating these services into an AI pipeline required traditional API work: reading documentation, generating JSON payloads, and writing manual code. <strong>With MCP, LLMs can now do all of that themselves</strong>, Emanuel explained:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We wanted to <strong>eliminate the need for glue code entirely</strong>. Developers shouldn&#8217;t have to write a single line of custom logic just to send a message or trigger an action. With a prompt and an agent, it should simply work.</p>
</blockquote>



<p class="wp-block-paragraph">Infobip has launched its own <a href="https://mcp.infobip.com/">MCP endpoint</a> &#8211; currently in beta &#8211; that opens up its communication tools to agentic AI. Developers can now instruct a language model to send messages, retrieve account data, or manage users using a simple prompt. <strong>The agent handles method discovery, request formatting, and real-time execution</strong>. Emanuel explained:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Our APIs are now <strong>machine-readable</strong> <strong>and model-understandable</strong>.That means agents can independently explore what our platform can do — and use it without any human intervention.</p>



<p class="wp-block-paragraph"></p>
</blockquote>



<h2 class="wp-block-heading"><span id="how-does-this-actually-play-out-in-practice">How does this actually play out in practice?</span></h2>



<p class="wp-block-paragraph">To illustrate what’s already possible, Emanuel shared two examples.</p>



<p class="wp-block-paragraph">In a real customer support scenario, a company might use an AI pipeline to analyze customer sentiment. If a negative message is detected, <strong>the system automatically drafts and sends an apology via WhatsApp</strong>. Previously, developers had to manually integrate sentiment analysis with Infobip’s messaging API. Now, with MCP, an agent can manage the entire flow &#8211; from interpreting feedback to sending the message &#8211; all from a single prompt.</p>



<p class="wp-block-paragraph">The second example was a bit sweeter: imagine launching a new chocolate product and wanting to send a Viber campaign with an image and message. All the agent needs is a prompt containing the product description, image, and recipient details. From there, <strong>it composes the message, identifies the appropriate API call, and sends it</strong>, even intelligently splitting the content into text and image components when necessary.</p>



<h2 class="wp-block-heading"><span id="the-future-is-prompt-driven">The future is prompt-driven</span></h2>



<p class="wp-block-paragraph">The key takeaway? </p>



<p class="wp-block-paragraph"><strong>MCP marks a fundamental shift in how we build AI applications</strong>. Instead of manually integrating APIs, developers can simply describe the desired outcome, and let the agent handle the rest.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">It’s all about removing the glue code. You don’t need to code anything. Just use generative AI, and it should do all the work for you.</p>
</blockquote>



<p class="wp-block-paragraph">The Model Connector Protocol represents <strong>a step toward making AI agents more capable, reliable, and adaptable in real-world systems</strong>. By shifting the burden of integration from developers to the agents themselves, MCP opens the door to new possibilities for automation and intelligent behavior.</p>



<p class="wp-block-paragraph">While the technology is still evolving and early in adoption, its potential impact is clear &#8211; especially for platforms like Infobip that are already central to communication workflows. As more developers experiment with MCP, its role in shaping the future of AI infrastructure will become increasingly important.</p>



<p class="wp-block-paragraph"><strong>Explore the <a href="https://github.com/infobip/mcp" target="_blank" rel="noreferrer noopener">Infobip MCP servers on GitHub</a>, or build your own MCP server using the open-source <a href="https://github.com/infobip/infobip-openapi-mcp" target="_blank" rel="noreferrer noopener">Infobip OpenAPI MCP framework</a>. </strong></p>
<p>The post <a href="https://shiftmag.dev/how-infobips-mcp-enables-true-agentic-ai-5220/">How We&#8217;ve Built An MCP Server For Messaging</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Tejas Kumar: The future of AI isn’t LLMs, but affordable small language models</title>
		<link>https://shiftmag.dev/tejas-kumar-the-future-of-ai-isnt-llms-but-affordable-small-language-models-4318/</link>
		
		<dc:creator><![CDATA[Marin Pavelić]]></dc:creator>
		<pubDate>Tue, 08 Oct 2024 13:02:38 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[Shift Conference]]></category>
		<category><![CDATA[Shift Zadar 2024]]></category>
		<category><![CDATA[Tejas Kumar]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=4318</guid>

					<description><![CDATA[<p>Are you tired of the AI hype? Let’s see what it can really do.</p>
<p>The post <a href="https://shiftmag.dev/tejas-kumar-the-future-of-ai-isnt-llms-but-affordable-small-language-models-4318/">Tejas Kumar: The future of AI isn’t LLMs, but affordable small language models</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202469.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202469.png 800w, https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202469-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202469-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></figure>


<p class="has-text-align-left wp-block-paragraph"><strong>Tejas Kumar</strong>, an AI DevRel Engineer at DataStax, took the stage at the Infobip Shift conference with a no-hype, straight-to-the-point talk on AI.</p>



<p class="has-text-align-left wp-block-paragraph">He broke down what AI engineering looks like today, sharing techniques for cutting costs, avoiding hallucinations, and what’s going to be key for building the next wave of AI systems.</p>



<h1 class="wp-block-heading"><span id="rag-solves-the-top-3-ai-limitations">RAG solves the top 3 AI limitations</span></h1>



<p class="wp-block-paragraph">The main limitations developers face today when working with AI are <strong>hallucinations, knowledge cutoffs, and finite context windows</strong>. Tejas believes that these three &#8220;flies&#8221; can be swatted in one strike using a technique called <strong>Retrieval-Augmented Generation (RAG)</strong>, which combines pre-trained language models with a real-time data retrieval system:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>With RAG, you fetch data from an authoritative source and use it to enhance or alter the generated text from an LLM. This data reaches the LLM through prompt engineering.</em></p>
</blockquote>



<p class="wp-block-paragraph">Tejas demonstrated how RAG works with a simple example. Kumar illustrated the RAG process with just a few clicks: <strong>he inputs a webpage into an embedding model, which then numerically encodes the data.</strong></p>



<p class="wp-block-paragraph">This model performs a similarity search,<strong> pulling relevant information from the database to answer the user&#8217;s question.</strong> This process ensures that responses are based on the most up-to-date information, effectively eliminating hallucinations common in LLMs like GPT.</p>



<h1 class="wp-block-heading">Chatbots are boring &#8211; AI should feel real</h1>



<p class="wp-block-paragraph">AI chatbots are everywhere today, but Tejas believes they&#8217;re mostly boring. <strong>They serve a purpose, but that purpose is very narrowly defined.</strong> That&#8217;s why Tejas offers an example of how a chatbot can be used more broadly-like searching Netflix.</p>



<p class="wp-block-paragraph">Tejas entered &#8220;movies with a strong female lead&#8221; into Netflix&#8217;s search system, which traditionally might return incorrect or no results. However, if a search system uses semantic AI in the background—understanding the meaning of the user&#8217;s query rather than just keywords &#8211; <strong>the user experience can be significantly enhanced:</strong></p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>With semantic search, we improve search results and generate interactive user interfaces that understand user intent on demand.</em></p>
</blockquote>



<p class="wp-block-paragraph">Tejas illustrated how DataStax developed a tool for semantic search that not only delivers accurate results for such queries<strong> but can generate an interactive user interface (UI) on demand</strong>. This means that by typing &#8220;movies with a strong female lead,&#8221; Netflix could present relevant movie posters and trailers. This kind of interactive UI represents the future of AI, where developers can use models like Langflow to integrate AI into applications without disrupting the user experience, Tejas emphasized:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">As developers, we have a responsibility to our users. We must build AI experiences beyond simple chatbots and deliver real, purposeful interactions.</p>
</blockquote>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202470-1.png?x94846" alt="" class="wp-image-4330" srcset="https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202470-1.png 800w, https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202470-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202470-1-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /><figcaption class="wp-element-caption">Filip Popović/Infobip Shift</figcaption></figure>



<h1 class="wp-block-heading"><span id="ssms-instead-of-llms">SSMs instead of LLMs?</span></h1>



<p class="wp-block-paragraph">Looking ahead, Tejas sees a <strong>shift from general-purpose LLMs to small specialized models (SSMs)</strong>, which is his (unofficial) term for AI systems tailored to specific tasks:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>What if, instead of models like GPT-4 with 600 billion parameters, we had a smaller model with 7 billion specialized parameters? That&#8217;s the future, and that&#8217;s where we should invest.</em></p>
</blockquote>



<p class="wp-block-paragraph">Tejas believes companies will turn to <strong>smaller models focused on individual needs</strong>. That way developers will drastically cut costs while maintaining good product performance.</p>



<h1 class="wp-block-heading"><span id="building-responsible-ai-must-come-first">Building Responsible AI Must Come First</span></h1>



<p class="wp-block-paragraph">AI must be developed ethically, and one of the key things to watch out for is what Tejas calls &#8220;authority bias&#8221; &#8211; where <strong>users assume that results generated by AI are always correct</strong> simply because they come from an authoritative-sounding source:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>We need to be transparent about the data used to train LLMs. AI should be able to say, &#8220;Hey, this data might be wrong.&#8221;</em></p>
</blockquote>



<p class="wp-block-paragraph">The future of AI is in creating tools that allow models to recognize the limits of their capabilities. When AI can&#8217;t provide an answer, <strong>it should be able to use external tools or API</strong>s to retrieve the necessary information to ensure accuracy.</p>



<p class="wp-block-paragraph">In conclusion, Tejas encourages developers to think beyond simple chatbots because he believes the future of AI is tied to <strong>combining the power of LLMs with specialized models and dynamic interfaces that enhance user experiences.</strong></p>



<h1 class="wp-block-heading">AI won&#8217;t replace developers, but some skills will disappear</h1>



<p class="wp-block-paragraph">Tejas could also be heard further on the panel &#8220;AI-Powered Development Tools: Enhancing or Replacing Human Developers?&#8221; where he was joined by <strong>Simi Olabisi</strong>, an AI expert from Microsoft, and the discussion was moderated by our executive editor <strong>Antonija Bilić Arar</strong> on the ShiftMag stage!</p>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="800" height="480" src="https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202468-1.png?x94846" alt="" class="wp-image-4324" srcset="https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202468-1.png 800w, https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202468-1-300x180.png 300w, https://shiftmag.dev/wp-content/uploads/2024/09/Tejas-Kumar-shift-202468-1-768x461.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></figure>



<p class="wp-block-paragraph">AI will do the opposite of what people expect. <strong>It won&#8217;t replace developers</strong>; it will make them better at their job, says Simi:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>The tools we&#8217;re building at Microsoft are designed to handle repetitive tasks, allowing developers to focus on more complex and creative activities.</em></p>
</blockquote>



<p class="wp-block-paragraph">This brings us to the question of juniors and how they will learn<strong>. Simi believes they won&#8217;t need to spend time mastering basic tasks:</strong></p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>Just like floppy disks became obsolete, some fundamental skills may become less important to master, but that doesn&#8217;t mean they&#8217;ll skip important lessons. They&#8217;ll face challenging tasks early in their careers.</em></p>
</blockquote>



<p class="wp-block-paragraph"><strong>Think about this as an evolution from a paintbrush to a camera. </strong>Tejas pointed out that basic tools of human creativity are still necessary to solve 70 to 80% of coding tasks, but human oversight and creativity remain essential. Simi concluded that we&#8217;re not facing any dramatic change within the next five years. Tools will advance, and AI will continue to enhance our abilities, <strong>but developers remain a key part of the entire process.</strong></p>
<p>The post <a href="https://shiftmag.dev/tejas-kumar-the-future-of-ai-isnt-llms-but-affordable-small-language-models-4318/">Tejas Kumar: The future of AI isn’t LLMs, but affordable small language models</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 

Served from: shiftmag.dev @ 2026-06-04 00:32:28 by W3 Total Cache
-->