<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Tena Šojer Keser, Author at ShiftMag</title>
	<atom:link href="https://shiftmag.dev/author/tena/feed/" rel="self" type="application/rss+xml" />
	<link>https://shiftmag.dev/author/tena/</link>
	<description>Insightful engineering content &#38; community</description>
	<lastBuildDate>Mon, 16 Dec 2024 10:41:15 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shiftmag.dev/wp-content/uploads/2024/08/cropped-ShiftMag-favicon-32x32.png</url>
	<title>Tena Šojer Keser, Author at ShiftMag</title>
	<link>https://shiftmag.dev/author/tena/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Network API: Achieving Simplicity on the Other Side of Complexity</title>
		<link>https://shiftmag.dev/network-api-simplicity-4593/</link>
		
		<dc:creator><![CDATA[Tena Šojer Keser]]></dc:creator>
		<pubDate>Thu, 05 Dec 2024 14:58:53 +0000</pubDate>
				<category><![CDATA[Network API]]></category>
		<category><![CDATA[Camara project]]></category>
		<category><![CDATA[infobip]]></category>
		<category><![CDATA[network API]]></category>
		<category><![CDATA[telco]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=4593</guid>

					<description><![CDATA[<p>The conversation about Network APIs either gets too technical or too ethereal, promising telco magic dust to developers.</p>
<p>The post <a href="https://shiftmag.dev/network-api-simplicity-4593/">Network API: Achieving Simplicity on the Other Side of Complexity</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img fetchpriority="high" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2024/12/network-api.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2024/12/network-api.png 1200w, https://shiftmag.dev/wp-content/uploads/2024/12/network-api-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2024/12/network-api-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/12/network-api-768x403.png 768w" sizes="(max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">Both approaches tend to be pretty vague. So it&#8217;s pretty rare that representatives of different telecoms—like Orange, Telefónica, Deutsche Telekom—share a stage and discuss not only the actual <strong>use cases of network APIs</strong> but also their limitations and the work that needs to be done to make them useful for developers.  </p>



<p class="wp-block-paragraph"><strong>Cedric Gonin (Orange), Fernando Guillot Gimeno (</strong>Telefónica Innovación Digital<strong>), and Noel Wirzius (</strong>Deutsche Telekom AG) got pretty real(istic) onstage at <strong><a href="https://shift.infobip.com/">Infobip Shift Conference</a></strong>, where they were joined by  Nokia&#8217;s <strong>Shkumbin Hamiti and Infobip&#8217;s Matija Razem. </strong>  </p>



<h2 class="wp-block-heading"><span id="what-are-the-use-cases-for-network-apis%c2%a0%c2%a0"><strong>What are the use cases for Network APIs?</strong>  </span></h2>



<p class="wp-block-paragraph">The first issue on the table was – what do network APIs actually provide?&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">Gonin explained that network APIs <strong>give developers access to certain user information that telcos use</strong> to identify their customers, like ISDN, MSIDN, SIM status, and other user information. This information is what you need to confirm that the device in question is really the one requesting information without sending information via SMS.  </p>



<p class="wp-block-paragraph">It adds an additional layer of security against fraud and regular usage—after all, users can and do change numbers.  </p>



<p class="wp-block-paragraph">Gimeno stressed that dynamic data is especially interesting to developers. Take, for example, <strong>API Device location—</strong>it has two uses, he explains. One is<strong> verification,</strong> verifying info that you already know—if you&#8217;re in front of the ATM, the bank can see if you are really around the ATM and the one getting the money. If you&#8217;re away, it can stop the transaction automatically.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">The other use is<strong> geofencing, </strong>meaning developers can set an area where they want an activity to occur. That way, a shop can offer a customer something when they are in the vicinity, or you could get a notification when your kids arrive where they are going.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">Of course, some of this can be achieved through a GPS signal, but GPS can be tampered with when it&#8217;s not coming from multiple antennas. Wirzius notes that getting the location from your telco without using your phone&#8217;s GPS is not only more accurate but also saves batteries. So it&#8217;s not just about inventing new use cases with Network APIs—it&#8217;s also about <strong>upgrading old ones using different technology.</strong>  </p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe title="Network APIs – Why Should You Care?" width="500" height="281" src="https://www.youtube.com/embed/v2-wsawNurI?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p class="wp-block-paragraph">Use cases are great, but ease of use should be greater&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">The <em>what</em> is important when it comes to Network APIs, everyone agrees, but the <em>how</em> holds as much weight. Quality of service is a part of mission-critical capabilities, and part of that is <strong>simplifying API usage and documentation.   </strong></p>



<p class="wp-block-paragraph">The telcos must embark on this journey together if they want NetworkAPIs to reach their potential. They noted that simplicity does not come naturally to them.   </p>



<p class="wp-block-paragraph">After all, telcos rely on standards, as there is no other way to ensure interoperability. They have a natural tendency to introduce new standards when they face a problem, and that is somewhat at odds with their current mission of removing complexity from Network APIs.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">This is also where partnerships like the one with <strong><a href="https://www.infobip.com/network-apis" target="_blank" rel="noreferrer noopener">Infobip</a></strong> help by removing complexity for the end users – the developers. Let&#8217;s say you want to fly a drone over country borders. You cannot do this unless you can guarantee you&#8217;ll be able to operate it at all times, requiring you to have agreements with each telco that covers the area you want to fly it. <strong><a href="https://www.infobip.com/network-apis" target="_blank" rel="noreferrer noopener">Intermediaries like Infobip</a></strong> allow you to do this through a single API. They negotiate and make agreements with telcos all over Europe and worldwide so that developers don&#8217;t have to.   </p>



<h3 class="wp-block-heading"><span id="in-the-words-of-steve-balmer-developers-developers-developers"><strong>In the words of Steve Balmer: Developers, developers, developers&nbsp;</strong>&nbsp;</span></h3>



<p class="wp-block-paragraph">It&#8217;s also important to make things available to developers in a way they are used to, Hamiti stressed: &#8220;Linux foundation project <a href="https://camaraproject.org/" target="_blank" rel="noreferrer noopener">Camara</a> is one of the key efforts in that direction. Contributors help to scale it, but making it truly global requires close collaboration – between telcos and developers, but also amongst telcos themselves.&#8221; </p>



<p class="wp-block-paragraph">The first steps for developers, Gonin noted, would be to bring new ideas:&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">&#8220;We are trying to do the same as with the first smartphones. We provide support for new ideas and create a platform for them. We can help bring new use cases to market and make them successful. We need to provide a full range to end-users to make Network APIs happen.&#8221;</p>



<p class="wp-block-paragraph">Gimeno agrees:&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">&#8220;The center point is developers. I worked for many years for Microsoft. I always remember Steve Balmer standing up in front of a huge crowd shouting, &#8220;Developers, developers , developers!&#8221;. Everything we are doing as integrators and telcos is making it easy for devs to do what they do.&#8221;&nbsp;&nbsp;</p>



<p class="wp-block-paragraph"><span style="box-sizing: border-box; margin: 0px; padding: 0px;">He reminds us that it&#8217;s a new</span> initiative—with 44 telcos participating, covering 22 countries. Yet if NetworkAPIs are global one day, telcos need to stop competing in their area and work together so developers don&#8217;t have to worry about what part of the country their users are in and what telco they are using.  </p>
<p>The post <a href="https://shiftmag.dev/network-api-simplicity-4593/">Network API: Achieving Simplicity on the Other Side of Complexity</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>The Magic and Reality of AI: What can Generative AI actually do?</title>
		<link>https://shiftmag.dev/what-can-generative-ai-do-3267/</link>
		
		<dc:creator><![CDATA[Tena Šojer Keser]]></dc:creator>
		<pubDate>Tue, 28 May 2024 11:08:53 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Christine Spang]]></category>
		<category><![CDATA[Shift Conference]]></category>
		<category><![CDATA[Shift Miami 2024]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=3267</guid>

					<description><![CDATA[<p>AI is just about using computers to generate leverage - do more with less.</p>
<p>The post <a href="https://shiftmag.dev/what-can-generative-ai-do-3267/">The Magic and Reality of AI: What can Generative AI actually do?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img decoding="async" width="2100" height="1402" src="https://shiftmag.dev/wp-content/uploads/2024/05/53701015957_60ee6c0a76_o-scaled.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/53701015957_60ee6c0a76_o-scaled.jpg 2100w, https://shiftmag.dev/wp-content/uploads/2024/05/53701015957_60ee6c0a76_o-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2024/05/53701015957_60ee6c0a76_o-1024x684.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/53701015957_60ee6c0a76_o-768x513.jpg 768w" sizes="(max-width: 2100px) 100vw, 2100px" /></figure>


<p class="wp-block-paragraph">Many things pass for AI, and sometimes it&#8217;s hard to put them under the common denominator, but Christine Spang still gives it a shot with a simple equation in her <a href="https://shift.infobip.com/us/">Shift Miami talk</a>:&nbsp;<em>Chatbots are cool, but what else can generative AI do</em>?</p>



<p class="has-text-align-center wp-block-paragraph"><strong>Leverage = having a higher impact with a smaller input</strong></p>



<p class="has-text-align-center wp-block-paragraph"><strong>AI = using computers to generate leverage</strong></p>



<p class="wp-block-paragraph"><strong>Computing</strong> itself, argues Christine,<strong> is about giving more leverage to individuals or groups,</strong> and the rise of LLMs has driven AI magic into new sets of use cases.&nbsp;</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We used to carry water to the village; now we have tap water. We invented language, and then systems for storing information. We keep making better ways to use the data and information we have today &#8211; AI is our latest attempt at that.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="chatbots-knowledge-bases-coding-assistants"><strong>Chatbots, knowledge bases, coding assistants</strong></span></h2>



<p class="wp-block-paragraph">Today, the most notable use cases are user-facing chatbots, knowledge bases, and coding assistants (or, as often happens, some combination of the three).</p>



<p class="wp-block-paragraph">Chatbots have come a long way from their initial instances and can now boast <strong>great UI and conversational intelligence.&nbsp;</strong></p>



<p class="wp-block-paragraph">Christine argues that coding assistance<strong> (or copilots) supercharges our coding powers</strong>, ensuring enhanced productivity and efficiency in the dev cycle.&nbsp;</p>



<p class="wp-block-paragraph">AI-powered knowledge bases give us<strong> access to the right information at the time when we need it</strong>, not when a customer service agent is available or can schedule a call.</p>



<p class="wp-block-paragraph">As a good example of that (and the benefits that AI brings), Christine mentioned her company&#8217;s own chatbot, <a href="https://nylas.com/blog/pr/nylas-launches-its-new-generative-ai-assist-chatbot/">Nylas Assist</a>, a chat user interface for their docs. Launched in August 2023, <strong>it has reduced the number of raised tickets by 25%</strong>, even though the user base grew by 30% &#8211; that&#8217;s precisely the leverage she&#8217;s talking about</p>



<h2 class="wp-block-heading"><span id="language-is-messy-data-should-not-be">Language is messy; data should not be</span></h2>



<p class="wp-block-paragraph">All three use cases are pretty useful &#8211; but is this the peak AI we&#8217;re experiencing? We&#8217;ve probably all guessed that it&#8217;s not. At the moment, Christine notes, <strong>we have generalized datasets, which only allow us to get generalized actions.&nbsp;</strong></p>



<p class="wp-block-paragraph">Human language, on the other hand, is messy, full of nuances, and context-dependent. Communication is the bottleneck where most relevant information and context pass through, she states, and communication happens over many different channels, like messaging apps, email, voice, and&nbsp;social media, as well asynchronously.</p>



<p class="wp-block-paragraph">Current LLMs are trained on the entire scrapable content available online. That&#8217;s a lot of text but not necessarily a lot of (right) context. LLMS trained on social media, Christine exemplifies, don&#8217;t necessarily have the right context for business.&nbsp;</p>



<h2 class="wp-block-heading"><span id="customization-is-the-next-step-forward">Customization is the next step forward</span></h2>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="683" src="https://shiftmag.dev/wp-content/uploads/2024/05/DSC07694-Enhanced-NR-1024x683.jpg?x94846" alt="" class="wp-image-3280" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/DSC07694-Enhanced-NR-1024x683.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/DSC07694-Enhanced-NR-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2024/05/DSC07694-Enhanced-NR-768x512.jpg 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">Communication data is a haystack, she stresses, it&#8217;s not valuable as an unstructured pile. </p>



<p class="wp-block-paragraph">Take, for example, email, which is Nylas&#8217;s bread–and–butter data. It is particularly messy, with plain text, images, links, and formatting. Passing that data to a model the right way is a challenge. Everything needs to be pre-processed in a specific way, <strong>extracting not just information but also information order.</strong> It needs to be structured to have value and the right context to move from generalized outputs.&nbsp;</p>



<p class="wp-block-paragraph">Context is exactly how Christine sees generative AI evolving and generating even more leverage: When we access the context of a dataset rather than just the dataset itself, we can customize models and get customized actions instead of generalized ones.</p>
<p>The post <a href="https://shiftmag.dev/what-can-generative-ai-do-3267/">The Magic and Reality of AI: What can Generative AI actually do?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Want to build a more accurate Copilot with fewer hallucinations? Move from prompting to fine-tuning.</title>
		<link>https://shiftmag.dev/build-a-more-accurate-copilot-with-fewer-hallucinations-3256/</link>
		
		<dc:creator><![CDATA[Tena Šojer Keser]]></dc:creator>
		<pubDate>Tue, 07 May 2024 13:12:32 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[Copilot]]></category>
		<category><![CDATA[Emanuel Lacic]]></category>
		<category><![CDATA[large language models]]></category>
		<category><![CDATA[Shift Conference]]></category>
		<category><![CDATA[Shift Miami 2024]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=3256</guid>

					<description><![CDATA[<p>Is prompting enough? Emanuel shares exploration of his team and what they learned regarding prompting strategies, fine-tuning, and model size.</p>
<p>The post <a href="https://shiftmag.dev/build-a-more-accurate-copilot-with-fewer-hallucinations-3256/">Want to build a more accurate Copilot with fewer hallucinations? Move from prompting to fine-tuning.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="2100" height="1402" src="https://shiftmag.dev/wp-content/uploads/2024/05/53701014927_a82c426152_o-scaled.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/53701014927_a82c426152_o-scaled.jpg 2100w, https://shiftmag.dev/wp-content/uploads/2024/05/53701014927_a82c426152_o-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2024/05/53701014927_a82c426152_o-1024x684.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/53701014927_a82c426152_o-768x513.jpg 768w" sizes="auto, (max-width: 2100px) 100vw, 2100px" /></figure>


<p class="wp-block-paragraph">Is prompting enough? Emanuel Lacić asked this question on the stage of the <a href="https://shift.infobip.com/us/" target="_blank" rel="noreferrer noopener">Shift Conference in Miami </a>as he explored the process of creating a Copilot for a UI-based chatbot builder.  </p>



<p class="wp-block-paragraph">The chatbot builder in question, <a href="https://www.infobip.com/docs/answers/generative-ai/answers-copilot#:~:text=Answers%20copilot%20is%20a%20Generative,an%20outline%20of%20the%20design." target="_blank" rel="noreferrer noopener">Answers Copilot</a>, is a GenAI feature tha<strong>t enables end users to design a chatbot based on their natural language input</strong>. GenAI creates an outline of the design of how the chatbot should behave, automating the chatbot building process to a degree, and the end user then customizes it to meet their requirement. &nbsp;</p>



<h2 class="wp-block-heading"><span id="starting-with-prompting">Starting with prompting</span></h2>



<p class="wp-block-paragraph">The initial process relied on prompting: Emanuel and his team described what the underlying code looked like, had Open<s> </s>AI generate the code blocks representing visual elements, and then plugged it in to have it rendered in the UI. Preferably with as few hallucinations (i.e., generated code that leads to an error when rendering), and as predictable output as possible. &nbsp;</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="575" src="https://shiftmag.dev/wp-content/uploads/2024/05/image-2-1024x575.png?x94846" alt="" class="wp-image-3259" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/image-2-1024x575.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/image-2-300x169.png 300w, https://shiftmag.dev/wp-content/uploads/2024/05/image-2-768x432.png 768w, https://shiftmag.dev/wp-content/uploads/2024/05/image-2.png 1381w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">They tested different prompt engineering strategies with<strong> Microsoft’s API for GPT-3.5 Turbo</strong>. By testing different techniques ranging from zero-shot to few-shot prompting with domain-specific instructions, they managed to <strong>lower the percentage of hallucinations to 12.63%</strong> on average. &nbsp;Accuracy was measured using HitRate – the number of times where the generated code blacked matched to a 100% of what was expected &#8211; which peaked at 2.13%.</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="277" src="https://shiftmag.dev/wp-content/uploads/2024/05/image-1024x277.png?x94846" alt="" class="wp-image-3257" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/image-1024x277.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/image-300x81.png 300w, https://shiftmag.dev/wp-content/uploads/2024/05/image-768x208.png 768w, https://shiftmag.dev/wp-content/uploads/2024/05/image.png 1377w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">Having created the Copilot using different prompting strategies, it was time to answer Emanuel’s titular question: Is prompting enough? The team decided to test the hypothesis that <strong>LLMs with context-specific data might yield a lower percentage of hallucinations and higher accuracy</strong> (i.e., by measuring the HitRate and turning to fine-tuning.&nbsp;</p>



<h2 class="wp-block-heading"><span id="bigger-is-not-always-better">Bigger is not always better&nbsp;</span></h2>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="684" src="https://shiftmag.dev/wp-content/uploads/2024/05/53701912011_be5443f9fc_o-1024x684.jpg?x94846" alt="" class="wp-image-3263" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/53701912011_be5443f9fc_o-1024x684.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/53701912011_be5443f9fc_o-300x200.jpg 300w, https://shiftmag.dev/wp-content/uploads/2024/05/53701912011_be5443f9fc_o-768x513.jpg 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">As end users can task the Answers Copilot with creating a chatbot for a variety of use cases, the task of fine-tuning it required the team to know what input users might provide, as well as what is the desired output. Since real-world data was not available, <strong>GenAI was put to the task of synthetically creating some. &nbsp;</strong></p>



<p class="wp-block-paragraph">The data was then used to fine-tune LLMs of various sizes: <a href="https://learn.microsoft.com/en-us/azure/ai-services/openai/tutorials/fine-tune">OpenAI GPT-3.5 Turbo</a> (large), <a href="https://arxiv.org/pdf/2310.06825.pdf">Mistral 7B Instruct </a>(mid), <a href="https://arxiv.org/pdf/2302.13971.pdf" target="_blank" rel="noreferrer noopener">LLaMa 3B</a> (small), and <a href="https://arxiv.org/pdf/2310.06694.pdf" target="_blank" rel="noreferrer noopener">Sheared LLaMa 1.3B</a> (tiny). In addition to training the models with relevant data, the team used <a href="https://arxiv.org/abs/2106.09685" target="_blank" rel="noreferrer noopener">LoRA</a> to fine-tune visual element generation.  </p>



<p class="wp-block-paragraph">The fine-tuning process did yield the desired results: LLMs trained on relevant data had a significantly lower number of hallucinations, with <strong>0.04% as the lowest achieved hallucination rate</strong>. The accuracy, on the other hand, also improved significantly, where the <strong>HitRate climbed up to 26.72%.&nbsp;</strong></p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="254" src="https://shiftmag.dev/wp-content/uploads/2024/05/image-1-1024x254.png?x94846" alt="" class="wp-image-3258" srcset="https://shiftmag.dev/wp-content/uploads/2024/05/image-1-1024x254.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/05/image-1-300x75.png 300w, https://shiftmag.dev/wp-content/uploads/2024/05/image-1-768x191.png 768w, https://shiftmag.dev/wp-content/uploads/2024/05/image-1.png 1377w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">Interestingly, Emanuel notes the best performing models were Sheared LlaMA (in terms of hallucinations) and Mistral 7b Instruct (when it came to HitRate):&nbsp;</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><em>Sometimes you don’t need the largest, best performing LLM. But the only way to know which one performs best is to experiment – you can’t know beforehand.</em>&nbsp;</p>
</blockquote>



<h2 class="wp-block-heading"><span id="what%e2%80%99s-next">What’s next?&nbsp;</span></h2>



<p class="wp-block-paragraph">There are always ways to polish Copilots, with <strong>user feedback being the logical next step</strong>. To that end, he showed the <a href="https://arxiv.org/abs/2402.01306" target="_blank" rel="noreferrer noopener">KTO method</a> (Kahneman-Tversky Optimization): As it requires only a binary signal (desirable/undesirable outcome), the<strong> user feedback data is more abundant, cheaper, and faster to collect</strong> than data based on user preference between two different outputs, which is used in other popular methods like Reinforcement Learning. KTO is also a good choice when there is a marked imbalance between the number of desirable and undesirable examples.&nbsp;</p>



<p class="wp-block-paragraph">To take user feedback a step further, a multiarmed bandit algorithm can be used, as Emanuel demonstrated, to determine <strong>which of the LLMs produces the most favorable results </strong>while running in production and, consequently, <strong>which LLM to choose in an automatic way. &nbsp;</strong></p>



<figure class="wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div class="wp-block-embed__wrapper">
<iframe loading="lazy" title="Emanuel Lacić: Is Prompting Enough? The Process of Making a Copilot for UI-based Chatbot Builders" width="500" height="281" src="https://www.youtube.com/embed/eEaKR4uatwE?feature=oembed" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>
</div></figure>



<p class="wp-block-paragraph">You can find <a href="http://elacic.me/documents/talks/2024_04_Shift_Copilot.pdf" target="_blank" rel="noreferrer noopener">Emanuel&#8217;s slides</a> here or find out more about his work on his <a href="http://elacic.me/" target="_blank" rel="noreferrer noopener">personal website</a>.&nbsp;</p>



<p class="wp-block-paragraph"></p>
<p>The post <a href="https://shiftmag.dev/build-a-more-accurate-copilot-with-fewer-hallucinations-3256/">Want to build a more accurate Copilot with fewer hallucinations? Move from prompting to fine-tuning.</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Is DevOps just a conspiracy theory?</title>
		<link>https://shiftmag.dev/what-is-devops-3236/</link>
		
		<dc:creator><![CDATA[Tena Šojer Keser]]></dc:creator>
		<pubDate>Tue, 30 Apr 2024 11:48:31 +0000</pubDate>
				<category><![CDATA[DevOps]]></category>
		<category><![CDATA[Event]]></category>
		<category><![CDATA[Baruch Sadogursky]]></category>
		<category><![CDATA[Shift Conference]]></category>
		<category><![CDATA[Shift Miami 2024]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=3236</guid>

					<description><![CDATA[<p>What is DevOps, anyway? It's been around for 15 years, and has grown into an industry - generating countless tools and quite a revenue. And yet, many people can't answer the question with certainty. </p>
<p>The post <a href="https://shiftmag.dev/what-is-devops-3236/">Is DevOps just a conspiracy theory?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2023/11/Baruch-Sadogursky.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2023/11/Baruch-Sadogursky.png 1200w, https://shiftmag.dev/wp-content/uploads/2023/11/Baruch-Sadogursky-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2023/11/Baruch-Sadogursky-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2023/11/Baruch-Sadogursky-768x403.png 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">There is something to unpack here, says <strong>Baruch Sadogursky</strong> as he takes the stage at the <a href="https://shift.infobip.com/us/">Miami Shift Conference</a>. A conspiracy theory, even. </p>



<p class="wp-block-paragraph">And here it is:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">&#8220;DevOps is a conspiracy by Ops people to make developers work harder.&#8221;</p>
</blockquote>



<p class="wp-block-paragraph">Jokes aside, <a href="https://www.linkedin.com/in/jbaruch/">Baruch</a> urges us to look into who teaches us about DevOps. A quick Google search along the lines of &#8220;What is DevOps&#8221; reveals that the answer to this question can be found on the websites of AWS (an Ops platform), Atlassian (a DevOps company), Gitlab (Offering DevOps in the box), Synopsis ( Observability platform). Is it a coincidence that all sources that teach us about DevOps are (dev)ops or DevOps-adjacent companies? &nbsp;</p>



<p class="wp-block-paragraph">Why do they want to sell DevOps ideas to us so badly?<strong> Is it about the revenue</strong> that the industry generates? A DevOps salary is almost double that of a system administrator (with not that great of a difference in the job description); there are almost 20k open positions for DevOps engineers in us alone, and let&#8217;s not forget about the certification business behind it.&nbsp;</p>



<p class="wp-block-paragraph">One could easily make the connection between money and the push for DevOps – but only if we forget to go back in time to when DeOps first started. When it was first conceived and gained momentum, t<strong>he money was just not there</strong>, Baruch notes.&nbsp;</p>



<h2 class="wp-block-heading"><span id="this-looks-familiar">This looks familiar</span></h2>



<div class="wp-block-cover"><span aria-hidden="true" class="wp-block-cover__background has-background-dim"></span><img loading="lazy" decoding="async" width="1024" height="580" class="wp-block-cover__image-background wp-image-3240" alt="" src="https://shiftmag.dev/wp-content/uploads/2024/04/Devops-toolchain.svg_.png?x94846" data-object-fit="cover" srcset="https://shiftmag.dev/wp-content/uploads/2024/04/Devops-toolchain.svg_.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/04/Devops-toolchain.svg_-300x170.png 300w, https://shiftmag.dev/wp-content/uploads/2024/04/Devops-toolchain.svg_-768x435.png 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /><div class="wp-block-cover__inner-container is-layout-flow wp-block-cover-is-layout-flow">
<p class="wp-block-paragraph"></p>
</div></div>



<p class="wp-block-paragraph">If we look at the DevOps cycle, it seems pretty familiar – it is basically an <strong>agile development cycle, with an Ops cycle plastered on there</strong>, and suddenly it&#8217;s supposed to all be dev work! Even when we look at the main aspects of DevOps work, we hear Ops concepts: deploying code, going from code commit to running code in production, unplanned outages, and degraded services.</p>



<p class="wp-block-paragraph">Baruch protests that none of them feature in the basic postulates of dev work: <strong>Developers do new features, refactor, and&nbsp;fix bugs</strong>; that&#8217;s the job. The code passes the tests; it works, and the job is done. What happens in production is beyond the scope of work, so there is no room for these ops concepts there, right?</p>



<p class="wp-block-paragraph">Well, it&#8217;s not that simple. If we take a look at a software craftsman definition of done Baruch provided, the list is much longer: </p>



<p class="wp-block-paragraph">The project is done once the dev understands what needs to be done; the code is simple, readable, and easy to deploy; non-functional requirements are met; no tech debt was created; tests pass &amp; QA is happy with the code; team lead, PO and client are all satisfied. That&#8217;s a much longer list, and what stands out is&nbsp;<em>quality.</em></p>



<p class="wp-block-paragraph">Baruch leads with another difficult question&nbsp;here: <strong>What is quality</strong>,&nbsp;that&nbsp;we&#8217;re so concerned about?</p>



<h2 class="wp-block-heading"><span id="testing-in-production-caring-about-what-happens-in-production">Testing in production = caring about what happens in production</span></h2>



<p class="wp-block-paragraph"><strong>Quality is an ever-evolving concept.</strong> It&#8217;s certainly not what it was ten years ago, as Baruch illustrates with an example of his old bank website, which allowed you to check your balances and transactions but nothing more complex. It was nice to have but not critical, so when the website was out for 12 hours for update &amp; maintenance to ensure quality of service, that was not a big deal.</p>



<p class="wp-block-paragraph">That era is gone – we expect more of our software now. In terms of banking, we do it via mobile app and expect everything to work at all reasonable times, and to work quickly and efficiently. Code quality alone is not enough to cater to that.</p>



<p class="wp-block-paragraph">Let&#8217;s just look at the<strong> size of the global datasphere – 175ZB.</strong> There is more and more data in more and more companies, which makes testing harder. Since we have so much data in production that we cannot, or at least it&#8217;s not worth replicating in staging. So, we test in production:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">And if we&#8217;re testing in production, and being in production is what is needed to ensure the software quality that&#8217;s so important to us as developers, that means we now care what happens in production. </p>



<p class="wp-block-paragraph"></p>



<p class="wp-block-paragraph">And if we care about what happens in production, that means we have to be there when we throw untested software on unsuspecting customers in production and observe the blast radius from our bunker. We have to be there in the middle of the night with the ops people because this is actually the first time we get to test our software.</p>
</blockquote>



<h2 class="wp-block-heading"><span id="the-new-scope-of-work">The new scope of work</span></h2>



<p class="wp-block-paragraph">This new turn of events means that there are some more items on the developers&#8217; to-do list:</p>



<ul class="wp-block-list">
<li>We understand how our software is going to be deployed</li>



<li>The build is reliable, repeatable &amp; fast</li>



<li>The code is stateless &amp; scalable</li>



<li>It starts fast and dies fast</li>



<li>It is observable</li>



<li>It supports feature flags</li>



<li>It&#8217;s backward &amp; forward-compatible</li>



<li>The code emits event streams</li>
</ul>



<p class="wp-block-paragraph">This is what you need to ensure the quality of your software in production, says Baruch, so starting now, you care about this stuff:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">10 years in, developers are still not excited about DevOps. But still, we have to be there – because we care about what we&#8217;re doing. So how do we go from <img src="https://s.w.org/images/core/emoji/17.0.2/72x72/1f641.png" alt="🙁" class="wp-smiley" style="height: 1em; max-height: 1em;" /> to :). </p>
</blockquote>



<p class="wp-block-paragraph">So, is DevOps a plot by Ops people? For sure, Baruch says, but it&#8217;s also <strong>an evolution</strong> and<strong> a means to an end</strong>. The end being <strong>quality, new features, lean software and security</strong>. DevOps just delivers what every business needs.</p>



<h2 class="wp-block-heading"><span id="devops-engineers-are-rainbow-farting-unicorns">DevOps engineers are rainbow-farting unicorns</span></h2>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="768" src="https://shiftmag.dev/wp-content/uploads/2024/04/IMG_20240423_151102-1024x768.jpg?x94846" alt="" class="wp-image-3242" srcset="https://shiftmag.dev/wp-content/uploads/2024/04/IMG_20240423_151102-1024x768.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2024/04/IMG_20240423_151102-300x225.jpg 300w, https://shiftmag.dev/wp-content/uploads/2024/04/IMG_20240423_151102-768x576.jpg 768w" sizes="auto, (max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph">So now that you&#8217;re thrown into DevOps, you better get to know it. If we look at the traditional DevOps Venn diagram, we can see that DevOps is an intersection between Devs, QA, and Ops. So, are you supposed to master all those disciplines? If that thought terrifies you, Baruch has some good news: </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">That person does not exist. It is a rainbow-farting unicorn, completely made up. DevOps is not about one person; it&#8217;s about collaboration.&nbsp;</p>
</blockquote>



<p class="wp-block-paragraph">Instead, he suggests there are <strong><em>T-shaped people</em></strong>: People who know their job (coding) well but also&nbsp;understand&nbsp;what the Ops and QA part of the house means. This doesn&#8217;t mean developers have to become QA and Ops; they&nbsp;just take an interest in what they do.&nbsp;</p>



<p class="wp-block-paragraph">That&#8217;s not so bad, is it? Or, in Baruch&#8217;s own words:&nbsp;</p>



<p class="wp-block-paragraph">&#8220;DevOps is fine.&#8221;</p>
<p>The post <a href="https://shiftmag.dev/what-is-devops-3236/">Is DevOps just a conspiracy theory?</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 

Served from: shiftmag.dev @ 2026-06-22 07:57:12 by W3 Total Cache
-->