<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Zeljko Svedic, Author at ShiftMag</title>
	<atom:link href="https://shiftmag.dev/author/zeljkosvedic/feed/" rel="self" type="application/rss+xml" />
	<link>https://shiftmag.dev/author/zeljkosvedic/</link>
	<description>Insightful engineering content &#38; community</description>
	<lastBuildDate>Thu, 29 Aug 2024 13:36:38 +0000</lastBuildDate>
	<language>en-GB</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=7.0</generator>

<image>
	<url>https://shiftmag.dev/wp-content/uploads/2024/08/cropped-ShiftMag-favicon-32x32.png</url>
	<title>Zeljko Svedic, Author at ShiftMag</title>
	<link>https://shiftmag.dev/author/zeljkosvedic/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Two rules of AI business and startups that ignore them</title>
		<link>https://shiftmag.dev/two-rules-of-ai-business-and-startups-that-ignore-them-4109/</link>
		
		<dc:creator><![CDATA[Zeljko Svedic]]></dc:creator>
		<pubDate>Thu, 29 Aug 2024 13:02:25 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[AI]]></category>
		<category><![CDATA[AI business]]></category>
		<category><![CDATA[AI product]]></category>
		<category><![CDATA[AI startups]]></category>
		<category><![CDATA[large language models]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=4109</guid>

					<description><![CDATA[<p>Majority of AI entrepreneurs and engineers don’t pay attention to them, maybe because these rules show why their AI project will fail.</p>
<p>The post <a href="https://shiftmag.dev/two-rules-of-ai-business-and-startups-that-ignore-them-4109/">Two rules of AI business and startups that ignore them</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img fetchpriority="high" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2024/08/SOTA.png?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2024/08/SOTA.png 1200w, https://shiftmag.dev/wp-content/uploads/2024/08/SOTA-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2024/08/SOTA-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/08/SOTA-768x403.png 768w" sizes="(max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">These rules are not new, and they are not mine; I stole them from <a href="https://en.wikipedia.org/wiki/Andrew_Ng" target="_blank" rel="noreferrer noopener">Andrew Ng</a> and <a href="https://www.ben-evans.com/" target="_blank" rel="noreferrer noopener">Benedict Evans</a>, two men with a huge following. </p>



<h2 class="wp-block-heading"><span id="ai%e2%80%99s-law-of-diminishing-returns">AI’s Law of diminishing returns</span></h2>



<p class="wp-block-paragraph">To paraphrase Andrew’s words from <a href="https://www.coursera.org/specializations/deep-learning?msockid=354e180f1a3b6e2e10a40cbc1b8a6f0a">Coursera’s Deep Learning Specialization course</a>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">The effort to half an AI system&#8217;s error rate is similar, regardless of the starting error rate.&nbsp;</p>
</blockquote>



<p class="wp-block-paragraph">This is not very intuitive. If an AI system passes 90% of test cases and errors on 10%, then you are 90% done, right? Fix the remaining 10% of errors, and you will have 100% accuracy? Absolutely not.</p>



<p class="wp-block-paragraph">If it took you six months to halve the error rate from 20% to 10%, it will take you approximately another six months to halve 10% to 5%. And another six months to halve 5% to 2.5%. Ad infinitum. You will never achieve a 0% error rate on a real-world AI system. For an illustrative example, see this typical chart of error rate vs the number of training samples:</p>



<figure class="wp-block-image size-large"><img decoding="async" width="1024" height="538" src="https://shiftmag.dev/wp-content/uploads/2024/08/LLM-error-rate-1024x538.png?x94846" alt="" class="wp-image-4111" srcset="https://shiftmag.dev/wp-content/uploads/2024/08/LLM-error-rate-1024x538.png 1024w, https://shiftmag.dev/wp-content/uploads/2024/08/LLM-error-rate-300x158.png 300w, https://shiftmag.dev/wp-content/uploads/2024/08/LLM-error-rate-768x403.png 768w, https://shiftmag.dev/wp-content/uploads/2024/08/LLM-error-rate.png 1200w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>



<p class="wp-block-paragraph"><em>Credits: Zeljko Svedic</em></p>



<p class="wp-block-paragraph">Notice that later in the training process, the training set size increases exponentially with each error rate halving, and <strong>the error rate never reaches zero</strong>. Sure, you will get more efficient with acquiring training data (e.g., by using low-quality sources or synthetic data). Still, it is hard to believe that acquiring 10X more data is going to be much easier than acquiring the initial set.&nbsp;</p>



<p class="wp-block-paragraph">This rule becomes more intuitive when dissecting what an AI system error rate represents: uncovered real-world special cases. There are an infinite number of them. For example, one of the easiest machine learning (ML) tasks is <strong>classifying images of dogs and cats</strong>. It is an introductory task with <a href="https://wtfleming.github.io/blog/pytorch-cats-vs-dogs-part-3/" target="_blank" rel="noreferrer noopener">online tutorials that get 99% accuracy</a>. But solving the last 1% is incredibly hard. For example, is the <a href="https://s.abcnews.com/images/Lifestyle/HT-cat-dog-02-jef-161110_16x9_992.jpg" target="_blank" rel="noreferrer noopener">creature in this image</a> a dog or a cat?</p>



<p class="wp-block-paragraph">It is <a href="https://www.atchoumthecat.com/" target="_blank" rel="noreferrer noopener">Atchoum, the cat</a>, who rose to fame because half of the humans recognized him as a dog. The human accuracy on dog/cat classification within 30 seconds is <a href="https://www.microsoft.com/en-us/research/wp-content/uploads/2007/10/CCS2007.pdf">99.6%</a>. A dog/cat classifier with less than a 0.4% error rate would be superhuman. But it is possible. A training set with hundreds of thousands of strange-looking dogs and cats would teach a neural network to focus just on details encoded in dog or cat chromosomes (e.g. <a href="https://en.wikipedia.org/wiki/Cat_senses#Sight">cat eyes</a>).</p>



<p class="wp-block-paragraph">However, building such a dataset is orders of magnitude more complex than a tutorial with 99% accuracy. Other problems lurk in that 1% error rate: photos that are too dark, photos in low resolution, photo compression artifacts, photo post-processing by modern smartphones (adding of non-existing details), dogs and cats with medical conditions, etc. The problem space is infinite. This is still considered a solved ML problem, though, because a <strong>1% error rate is low enough for all practical purposes.&nbsp;</strong></p>



<p class="wp-block-paragraph">But for some problems, even a 0.01% error rate is not satisfactory, for example: full-self driving (FSD). Elon Musk said in a <a href="https://fortune.com/2015/12/21/elon-musk-interview/?xid=yahoo_fortune#:~:text=%E2%80%9CWe%E2%80%99re%20going%20to%20end%20up%20with%20complete%20autonomy%2C%20and%20I%20think%20we%20will%20have%20complete%20autonomy%20in%20approximately%20two%20years.%E2%80%9D">2015 article with Forbes</a>:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We’re going to end up with complete autonomy, and I think we will have complete autonomy in approximately two years.</p>
</blockquote>



<p class="wp-block-paragraph">Tesla was so confident in that prediction that they started <a href="https://en.wikipedia.org/wiki/Tesla_Autopilot#History" target="_blank" rel="noreferrer noopener">selling a full self-driving add-on package in 2016</a>, and they weren’t the only ones. <a href="https://en.wikipedia.org/wiki/Kyle_Vogt" target="_blank" rel="noreferrer noopener">Kyle Vogt</a>, CEO of Cruise, wrote a piece called <a href="https://medium.com/cruise/how-we-built-the-first-real-self-driving-car-really-bd17b0dbda55" target="_blank" rel="noreferrer noopener">How we built the first real self-driving car (really)</a> in 2017, in which <a href="https://medium.com/cruise/how-we-built-the-first-real-self-driving-car-really-bd17b0dbda55#:~:text=the%20most%20critical%20requirement%20for%20deployment%20at%20scale%20is%20actually%20the%20ability%20to%20manufacture%20the%20cars%20that%20run%20that%20software">he claimed</a>:</p>



<p class="wp-block-paragraph"><em>“&#8230;the most critical requirement for deployment at scale is actually the ability to manufacture the cars that run that software”</em></p>



<p class="wp-block-paragraph">So, the software and the working prototype are done; they just need to mass-produce “100,000 vehicles per year.”&nbsp;</p>



<p class="wp-block-paragraph">Fast forward to 2024. Elon Musk&#8217;s predictions for autonomous Tesla vehicles deserved a <a href="https://en.wikipedia.org/wiki/Criticism_of_Tesla,_Inc.#Musk's_promises" target="_blank" rel="noreferrer noopener">lengthy Wikipedia table, mostly in red</a>:&nbsp;</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXc2IyKAam_bocLaKBZ5nziCHufwEBeM9W9VmNYySzr2GayicGYxRsFpMOqFznvxwUI_EP3BdfY4sHmlv2t1QCAEGSXEz2s3IO_jjEZU16Yp99D9ttXsZqHVjowRwmzLmW8d6CdX8BIYTFoj8JXduOYO4C1_?key=qkGfQMxmVtI818WaLgV3Aw" alt=""/></figure>



<p class="wp-block-paragraph">Credits: Wikipedia</p>



<p class="wp-block-paragraph">What about Kyle Vogt? In October of 2023, Cruise’s car <a href="https://archive.ph/yQMOS" target="_blank" rel="noreferrer noopener">dragged a pedestrian for 20 feet</a>, after which California’s DMV suspended Cruise’s self-driving taxi license. Kyle <a href="https://archive.ph/SEeP3" target="_blank" rel="noreferrer noopener">“resigned” as CEO</a> in November 2023.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">Don’t misunderstand me—I believe autonomous cars will have a significant market share, probably in the next decade. The failed predictions above illustrate what happens when <strong>entrepreneurs don’t respect the AI law of diminishing returns.</strong> </p>



<p class="wp-block-paragraph">Elon and Kyle probably saw a demo of a full-self-driving car that could drive on its own on a sunny day on a marked road. Sure, a safety driver needed to intervene sometimes, but that was only 1% of the drive time. It is easy to conclude that “autonomous driving is a solved problem,” as Elon said in 2016. </p>



<p class="wp-block-paragraph">Notice how ML scientists and engineers didn’t make such bombastic claims. They were aware of many edge cases, some of which are described in crash reports. Edge cases include:</p>



<ul class="wp-block-list">
<li>A<a href="https://www.wsj.com/articles/video-shows-final-seconds-before-fatal-uber-self-driving-car-crash-1521673182" target="_blank" rel="noreferrer noopener"> pedestrian crossing a two-lane avenue with a bicycle at night and without lights</a> (2018 Uber crash).</li>



<li>The<a href="https://en.wikipedia.org/wiki/List_of_Tesla_Autopilot_crashes#Mountain_View,_California,_USA_(March_23,_2018)"> lane’s white lines diverging before a barrier</a> (2018, Tesla Model X crash).</li>



<li>A<a href="https://www.washingtonpost.com/technology/interactive/2023/tesla-autopilot-crash-analysis/"> white trailer truck against the white sky</a> (2019 crash, Tesla Model 3 decapitated the driver and continued driving for 40 seconds).</li>



<li>A vehicle kicking a pedestrian under an FSD car (2023 Cruise incident mentioned above).</li>
</ul>



<p class="wp-block-paragraph">Why so many companies promised a drastic reduction in self-driving error rates in such a short time without having a completely new ML architecture is an open question. Scaling laws for convolutional neural networks have been known for some time, and the new transformer architecture obeys a similar scaling law.&nbsp;</p>



<h2 class="wp-block-heading"><span id="ai%e2%80%99s-product-vs-feature-rule">AI’s Product vs Feature rule</span></h2>



<p class="wp-block-paragraph">When is an AI system a good stand-alone product, and when is it just a feature? In the words of Benedict Evans from <a href="https://another-podcast.simplecast.com/episodes/ai-summer-rzubFrA4" target="_blank" rel="noreferrer noopener">The AI Summer</a> podcast: “Is this feature or a product? Well, <strong>if you can’t guarantee it is right</strong>, it’s a feature. It needs to be wrapped in something that manages or controls expectations.” I love that statement. The “it is right” part can be broken down using the error rate:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If your AI system has a higher error rate than target users, you have an AI feature in an existing workflow, not a stand-alone AI product.</p>
</blockquote>



<p class="wp-block-paragraph">This rule is more intuitive than the law of diminishing returns. If target users are better at a task, they will not like stand-alone AI system results. They could still use AI to save them effort and time, but they will want to <em>review</em> and <em>edit</em> AI output. If AI completely fails at a task, humans will use <em>the old workflow and the old software</em> to finish the task.</p>



<p class="wp-block-paragraph">Let&#8217;s take MidJourney, for example, which generates whole images based on a text prompt. When I used it for a hobby project last year, satisfying artistic images appeared instantly, like magic. But then I spent hours fixing those creepy hands:</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXcnFrI68QKxjoIwDwyYsIEttVNjCoNIX7cfKfQppjp0i8ahHu1XL1iDPReK8XXKuiargkOWcr0rKaSpTSyuR2S_SHQfk7XefYely-tnSa79r8TOAAnzxeLAlClOyO4lyYGBa4fKAICZZTYSDeOc0sIVHJos?key=qkGfQMxmVtI818WaLgV3Aw" alt=""/></figure>



<p class="wp-block-paragraph">Credits: CC0 via <a href="https://neural.love/ai-art-generator/1ed5b225-76ef-66f4-b430-cbab93677b76/hand-shaking-hand">neural.love</a>&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">Each time MidJourney created a new image, one of the hands had strange artifacts. Finally, it generated an image with two normal hands—but then it destroyed the ears. The problem was less with wrong details and more with bad UI, which didn’t allow correction of the AI’s mistakes.</p>



<p class="wp-block-paragraph">Adobe’s approach is different—it treats <a href="https://blog.adobe.com/en/publish/2023/05/23/future-of-photoshop-powered-by-adobe-firefly" target="_blank" rel="noreferrer noopener">generative AI as just one feature</a> in its product suite. You use an existing tool, select an area, and then do a generative fill:</p>



<p class="wp-block-paragraph"><img decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfz1gz_voz1VJpnjvSYhiPV1ThH5PrJmo2AORrdOv9Lt0s83iYo2-k7QabtzcGpZopKt_-lfOpAGC9tf08RWrG0LhOqs5U3jK42F9AWkPo2yFjAmobH7BofcBG9URMDJXKTgZdt6bFZoBhxpqLZPQ7hoX4Z?key=qkGfQMxmVtI818WaLgV3Aw" width="315" height="178"><img loading="lazy" decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXd0fuRaZ1Z9s2Nb4LxSgLZlxOLlBvth7TjKQI_RiLOSGttE4lWMao5ppUIy-nhguf1udR3hY55KiclvABwNX8Cch5LnaVBO3lwA3oC8CDuRuU37-PrhRRCY5g2GtUQUSa4606smSgHUwhQFUVf7JvX1ZYNi?key=qkGfQMxmVtI818WaLgV3Aw" width="280" height="178"><br>Credits: <a href="https://news.adobe.com/news/news-details/2023/Adobe-Unveils-Future-of-Creative-Cloud-with-Generative-AI-as-a-Creative-Co-Pilot-in-Photoshop-default.aspx/default.aspx">Adobe press release</a></p>



<p class="wp-block-paragraph">You can use it for the smallest of tasks, like removing cigarette butts from grass in a wedding photoshoot. If you dislike AI grass, no problem—revert to the old designer joy of manually cloning grass. Also, Adobe Illustrator has<a href="https://www.theverge.com/2024/7/23/24204231/adobe-photoshop-illustrator-generative-ai-firefly-vector-features"> generative Vector AI</a> that generates vector shapes you can edit to your liking.</p>



<p class="wp-block-paragraph">MidJourney makes more impressive demos, but Adobe’s approach is more useful to professional designers. That doesn’t mean MidJourney doesn’t make sense as a product; its target users are the ones who don’t care about details. For example, last Christmas, I got the following greetings image over WhatsApp:</p>



<p class="wp-block-paragraph"><img loading="lazy" decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfx4KDbQOwxyuU_9gwupCnZ_XYyqNbi4Qzld9xLSyi8-V47xOXBVOhtTu-KtdwygTNPnr9TgqIqInO3C5kIXrGk0vxMjy13O7nBdYSrf3NaO7BgjrOytbgQMLss7ZIev85Xnq7W9Y1zFuN3fzNy9SXzgBfP?key=qkGfQMxmVtI818WaLgV3Aw" width="421" height="497"><br>Credits: Zeljko Svedic</p>



<p class="wp-block-paragraph">Did you notice baby Jesus&#8217; hands and eyes? Take another look:</p>



<p class="wp-block-paragraph"><img loading="lazy" decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXeneOxawFxlDXPURQN_fxRZrlyW-PsKhMPPDqp2F8aoioC2zzngMiM09_uoJlRwCKwMEqqEyUdO0-6ENGkLrCxO0DX82AHLMUhvZAmIUwV2-mCau3dn4gKUJfy6DLSbCD7l1oQ2xSgedQ_Nl1vR5Tzjt-8l?key=qkGfQMxmVtI818WaLgV3Aw" width="342" height="240"><br>Credits: Zeljko Svedic</p>



<p class="wp-block-paragraph">That would never pass with a designer, but that is not the point. There is a whole army of users who don’t care about image composition and details; they just want images that go with their content. In other words, MidJourney is not a replacement for Adobe’s Creative Suite—it is a replacement for stock photo libraries like Shutterstock and Getty Images. And judging by the recent popularity of AI-generated images on social media and the web, people like artsy MidJourney images more than stock photos.</p>



<p class="wp-block-paragraph">Low-hanging fruit in stand-alone AI products are use cases where a high error rate doesn’t matter or is still better than the human error rate. An unfortunate example is guided missiles; in the Gulf War,<a href="https://insidedefense.com/inside-navy/navy-says-fewer-60-tomahawks-were-successful-gulf-war"> the accuracy of Tomahawk missiles was less than 60%</a>. But the army was happy to buy Tomahawks because they were still much more accurate than older alternatives, as <a href="https://www.cna.org/our-media/indepth/2021/02/wrong-war-right-weapons">fewer than 1 in 14 unguided bombs hit their targets</a>.</p>



<h2 class="wp-block-heading"><span id="evaluating-startups-based-on-the-above-rules">Evaluating startups based on the above rules</span></h2>



<p class="wp-block-paragraph">The great thing is that error rates are measurable, so the above rules give a framework to judge an AI startup quickly. Below is a simple startup example.</p>



<p class="wp-block-paragraph">Devin AI made quite a splash in <a href="https://x.com/cognition_labs/status/1767548763134964000">March of 2024 with a video demo of developer AI</a> that can create fully working software projects. From the announcement Devin was “evaluated on the <a href="https://www.swebench.com/" target="_blank" rel="noreferrer noopener">SWE-Bench</a>” (relevant benchmark), and “correctly resolves 13.86% of the issues unassisted, far exceeding the previous state-of-the-art model performance of 1.96% unassisted.” So, the current state-of-the-art (SOTA) has a 98% error rate, and they claim to have an 86% error rate. Even if that claim is valid (it wasn’t independently verified), why do their promo videos show success after success? It turns out that the video examples were <a href="https://80.lv/articles/first-ai-software-engineer-creators-are-accused-of-lying/">cherry-picked, the task description was changed, and Devin took hours to complete</a>.</p>



<p class="wp-block-paragraph">In my opinion, Microsoft took the right approach with GitHub Copilot. Although <strong>LLMs work surprisingly well for coding</strong>, they still make a ton of mistakes and don’t make sense as a stand-alone product. Copilot is a feature integrated into popular IDEs that pops up with suggestions when they are likely to help. You can review, edit, or improve on each suggestion.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">Again, don’t get me wrong. I think coding SOTA will drastically improve over the next few years, and one day, <strong>AI will be able to solve 80% of GitHub issues</strong>. Devin AI is still far away from that day, although the company has<a href="https://fortune.com/2024/03/31/cognition-labs-ai-startup-seeks-2-billion-valuation-investor-frenzy-warnings-bubble/" target="_blank" rel="noreferrer noopener"> a valuation of $2 billion in 2024</a>.</p>



<p class="wp-block-paragraph">More formally, the framework for evaluation is:</p>



<ol class="wp-block-list">
<li>Find a <em>relevant benchmark</em> for a specific AI use case.&nbsp;</li>



<li>Find the current <em>state-of-the-art (SOTA) error rate</em> and <em>human error rate</em> on that benchmark.</li>



<li>Is the SOTA better or comparable to the human error rate?
<ol class="wp-block-list">
<li>If yes (unlikely): Great, the problem is solved, and you can create a stand-alone AI product by reproducing SOTA results.</li>



<li>If no (likely): Check if there is a niche customer segment that is more tolerant of errors. If yes, you can still have a niche stand-alone product. If you can’t find such a niche, go to the next step.</li>
</ol>
</li>



<li>You can’t release a stand-alone AI product. Wait for SOTA to get better, pour money into research, or go to the next step.</li>



<li>Think about how to integrate AI as a feature into the existing product. Make it easy for users to detect and correct AI’s mistakes. Then, measure AI’s return on investment:<br><br>AI_ROI = Effort_saved_by_accurate_AI_responses / Effort_lost_on_checking_and_modifying_AI_responses<br><br>If too much user time is spent checking and correcting AI errors (AI_ROI &lt;= 1), you don’t even have a feature.<br></li>
</ol>



<p class="wp-block-paragraph">Or, to summarize everything discussed here in one sentence:</p>



<p class="wp-block-paragraph"><img loading="lazy" decoding="async" src="https://lh7-rt.googleusercontent.com/docsz/AD_4nXfRoAPSKKycPMv7z4mWx-DgZW9MjVyqCyL0z69kGNmxalaB6ziD-xwt36610TWwx4Xc54UkzU7VuYFENbmZTbn8kPiNEx3Y14mqWxbh2d50OT-1RG-yFbO9wTBnxiI7jqodDdNiNm5Q2MTclmt-ehvH09Mj?key=qkGfQMxmVtI818WaLgV3Aw" width="333" height="419"><br>Credits: Zeljko Svedic</p>



<p class="wp-block-paragraph">Every innovative AI use case will eventually become a feature or a product once the error rates allow it. If you want to make it happen faster, become a researcher. </p>



<p class="wp-block-paragraph"><a href="https://openai.com/index/team-update-january/">OpenAI’s early employees</a> spent seven years on AI research before overnight success with ChatGPT. Ilya Sutskever, OpenAI’s chief scientist, still didn’t want to release ChatGPT 3.5 because <a href="https://www.businessinsider.com/chatgpt-was-inaccurate-boring-when-it-launched-openai-cofounder-2023-10#:~:text=OpenAI%27s%20chief%20scientist%20admitted%20that%20he%20didn%27t%20think,was%20taken%20by%20surprise%20by%20its%20explosive%20popularity.">he was afraid it hallucinated too much</a>. Science takes time.</p>



<p class="wp-block-paragraph"><em>If you found this article useful, please share.</em></p>
<p>The post <a href="https://shiftmag.dev/two-rules-of-ai-business-and-startups-that-ignore-them-4109/">Two rules of AI business and startups that ignore them</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>After 10 years, our remote-only company had its first in-person meeting</title>
		<link>https://shiftmag.dev/remote-only-company-testdome-976/</link>
		
		<dc:creator><![CDATA[Zeljko Svedic]]></dc:creator>
		<pubDate>Thu, 20 Jul 2023 14:45:42 +0000</pubDate>
				<category><![CDATA[Productivity]]></category>
		<category><![CDATA[Sponsored]]></category>
		<category><![CDATA[remote]]></category>
		<category><![CDATA[sponsored]]></category>
		<category><![CDATA[testdome]]></category>
		<guid isPermaLink="false">https://shiftmag.dev/?p=976</guid>

					<description><![CDATA[<p>This is how we went from a startup of 4 to a company of 20 people, all working remotely. </p>
<p>The post <a href="https://shiftmag.dev/remote-only-company-testdome-976/">After 10 years, our remote-only company had its first in-person meeting</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></description>
										<content:encoded><![CDATA[<figure class="wp-block-post-featured-image"><img loading="lazy" decoding="async" width="1200" height="630" src="https://shiftmag.dev/wp-content/uploads/2023/07/TestDome.jpg?x94846" class="attachment-post-thumbnail size-post-thumbnail wp-post-image" alt="" style="object-fit:cover;" srcset="https://shiftmag.dev/wp-content/uploads/2023/07/TestDome.jpg 1200w, https://shiftmag.dev/wp-content/uploads/2023/07/TestDome-300x158.jpg 300w, https://shiftmag.dev/wp-content/uploads/2023/07/TestDome-1024x538.jpg 1024w, https://shiftmag.dev/wp-content/uploads/2023/07/TestDome-768x403.jpg 768w" sizes="auto, (max-width: 1200px) 100vw, 1200px" /></figure>


<p class="wp-block-paragraph">Two years of Covid-19 did more for remote work than decades of business conferences and all of Dilbert comics. Both lovers and haters of remote work were forced to close their offices and work from their pajamas. But, as lockdowns ended, haters were quick to demand going back. Offices are &#8220;more productive&#8221; and &#8220;build the team spirit,&#8221; or at least those are the claims. The same experience—working from home—was perceived quite differently by the two camps. </p>



<p class="wp-block-paragraph">And then, there is a third camp—the ones who <em>didn&#8217;t notice</em> the lockdown in their workday. The ones without a strong opinion because they accepted remote work, with its pros and cons, long ago. Current online debates make them smile and remember the younger days when they started their remote journey. </p>



<p class="wp-block-paragraph">Our pre-employment testing company, <a href="https://www.testdome.com/">TestDome</a>, is <strong>firmly in the third camp</strong>. We went from a startup of 4 to a company of 20 people, <strong>all working remotely</strong>. This year we met each other for the first time to celebrate 10 years of doing business. Before I share funny anecdotes from that meeting, let me explain how we got here and what we learned about remote work.</p>



<h2 class="wp-block-heading"><span id="our-remote-experience">Our remote experience</span></h2>



<p class="wp-block-paragraph">We started remote work almost accidentally. The founding team was all from Croatia, but one developer wanted to work from a Croatian island, another from his home city, and I wanted to be flexible to travel around the world. So, in 2013 we decided to use Assembla for tickets and code hosting, Google Docs for writing documents, and email and Skype for communication. But the tools are less important than the mindset. We learned that: </p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><strong>Remote work requires <em>asynchronous communication</em></strong>, in contrast to the office, where colleagues being a few steps away facilitates synchronous communication.</p>
</blockquote>



<p class="wp-block-paragraph">Many companies use Slack or Discord chat rooms to simulate a feeling of constant online &#8220;togetherness.&#8221; <strong>But that never worked for us.</strong> You never know if the other party on chat will answer your question in 30 seconds or 30 minutes. We tried many tools, but:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">We never found chat software that <em>accurately</em> shows if the other person is at their computer, away, or in &#8220;do not disturb&#8221; mode because of an ongoing call.</p>
</blockquote>



<p class="wp-block-paragraph"><strong>If you count on people to manually change their status every time they go to the toilet, good luck.</strong> And requiring people to be constantly attentive to every message in Slack, be it important or funny, is a mass killer of company productivity. Instead, we adopted one style of communication that works for chat, email, and ticket mentions:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><strong>Provide full context when asking or requesting something</strong>, and get back to that task once the other person answers. </p>
</blockquote>



<p class="wp-block-paragraph">For example, when a coworker asked about his post idea over chat, he provided full context:</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh5.googleusercontent.com/uMZZO28fDpPtYnXo3IJYK7zfe8llNHbRABtUzHuRsf4E5t91rX3CXqTxhrFyey5MpUkj7Eah65IQvxD2RbtOgx7JbB3PDbudl8zUn4RdzmhDoM8lxF-3oQ2KvuzbjHfCLbnB_sM6lQmG56tmfrssUdU" alt=""/></figure>



<p class="wp-block-paragraph">Written communication is the king of remote work. <strong>You need to write clearly and concisely in tickets, wikis, documents, emails, etc. </strong>Everything is written down because:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph"><strong>The mortal sin of remote work is <em>blocking other colleagues</em>.</strong> The easiest way to block others is <em>not to write something important</em>, and then they need to wait until you respond in your time zone.</p>
</blockquote>



<p class="wp-block-paragraph">That explains the disagreement at the beginning of the article. If you are a structured person who likes to work in the zone, without distractions, and then write things down, then you will probably like remote work. If, on the other hand, you like impromptu conversations, fluent structure, being able to walk to a colleague and ask them a question, or dislike writing—then you will probably hate remote work. In general, it seems that:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Remote work is more suited to <em>introverts</em>, while office work is more suited to extroverts.</p>
</blockquote>



<p class="wp-block-paragraph">With all that wisdom, we were able to grow from 4 people in the beginning to 20 people today. We have remote employees in Brazil, Costa Rica, Morocco, Portugal, Finland, Croatia, Poland, Azerbaijan, India, China, and Australia. We found most of them using a lesser-known job board called <a href="https://weworkremotely.com/">We Work Remotely</a>. Only remote jobs are allowed there.</p>



<p class="wp-block-paragraph">FUN FACT:<strong> We Work Remotely </strong>started as a <a href="https://signalvnoise.com/posts/3671-our-new-job-board-weworkremotelycom">side project of 37 signals</a> and featured in their popular book <a href="https://basecamp.com/books/remote">REMOTE: Office not required</a>. I highly recommend this book; it explains many of the ideas mentioned here.</p>



<p class="wp-block-paragraph">The location of either companies or candidates is irrelevant on We Work Remotely, which led to one funny situation. A candidate called Bruno decided to try remote work. He had a bad experience working for local Croatian companies and decided to work for a foreign company. He applied via We Work Remotely and solved the pre-interview screening test on TestDome. When it was time for the interview, he was surprised that his interviewer spoke Croatian, not English. But he thought, &#8220;It is a big international company; they assigned someone from the region to interview me.&#8221; Only in the end, he realized that TestDome founders live in the same country he tried to avoid—Croatia!</p>



<p class="wp-block-paragraph">Employees aside, there is another essential ingredient:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">To be a remote-only company, you need to have <em>remote-only customers</em>.</p>
</blockquote>



<p class="wp-block-paragraph">If your customers demand physical meetings or want to come to your office, <strong>no remote book will help</strong>. Fortunately, TestDome is a prepaid SaaS solution, and most new customers come via SEO (Search Engine Optimization). Typically, our customer is a CTO or a project lead responsible for technical interviews. They get too many unqualified candidates; today, everybody claims to be a skilled programmer. When they google for e.g., &#8220;angular coding test,&#8221; TestDome’s free test shows up on top:</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh5.googleusercontent.com/pLrMYCmGcpDtSNP4O5Yooilg4gQWot7ZZtgJ6WIQiKxpNGUAJSRsUfb_2w86BMgjhIpBu9vmL6Dq4ZR7IBPbTkZDVtAXtEpwgxkQkRdKLwi5ceiBYdbp5C9z0dvQV8d7zL-oHY1KdMRXruW47DGz5Ns" alt=""/></figure>



<p class="wp-block-paragraph">If they like our free questions, they can sign-up and make a purchase with a credit card; <strong>everything is self-service</strong>. We have large customers we have never talked with or exchanged emails with, and honestly, they probably don&#8217;t want to get any emails or calls from us.&nbsp;</p>



<p class="wp-block-paragraph">In other words, <strong>it took an effort to have remote-only customers</strong>. We give 30% of our questions for free to have attractive landing pages and demonstrate value. We give developer certificates for free. Onboarding, global CDN, and responsive pages are all there for a smooth customer experience. If your product requires on-site support, remote is hard.&nbsp;&nbsp;</p>



<p class="wp-block-paragraph">It is not all milk and honey in the land of remote work. <strong>The most significant drawbacks we noticed are</strong>:</p>



<ul class="wp-block-list">
<li>The lack of team spirit.</li>



<li>General loneliness you can feel if you don&#8217;t leave your house for a day.</li>



<li>Sometimes you just want to make small talk.</li>
</ul>



<p class="wp-block-paragraph"><strong>We tried to address that with the following:</strong></p>



<ul class="wp-block-list">
<li>Daily team standup calls—There are separate calls for content and product teams, and they are in the middle of the workday to be convenient for all time zones.</li>



<li>Paid coworking membership—Choose any local coworking location you like, and we will pay for it. I worked from coworking places in Oxford, Berlin, Thailand, and Croatia.</li>



<li>Virtual team buildings—2-3 times a year, we have a virtual team building where the entire company plays silly team games.</li>
</ul>



<p class="wp-block-paragraph">Daily calls show what others in the team are working on. A coworking place surrounds you with people in a similar situation where you can hang out for coffee or lunch. Games on virtual team building are needed for people to relax and stop talking about business. Classics we played over the years are:</p>



<ul class="wp-block-list">
<li><strong>Two truths and a lie</strong>—each employee tells three personal things, and others need to guess which one is a lie. As it is often in business, the best liar wins.&nbsp;&nbsp;&nbsp;&nbsp;</li>



<li><a href="https://www.geoguessr.com/"><strong>Geoguessr</strong></a>—pin the exact location on a world map based on just the Google Street view. Quizmaster usually takes one location from each city our employees live in, so everybody has a fair chance, and we get to see what our home cities look like.</li>



<li><strong>Company silly events quiz</strong>—e.g., who did a code commit at 3AM, who replied to a customer with an unchanged template email, who crashed the server on Friday afternoon. Use events that happened some time ago; otherwise, not everybody will laugh.</li>
</ul>



<p class="wp-block-paragraph">Still, building team spirit and getting to know your colleagues is a challenge for a remote company. That leads us to the next topic, our first in-person team building.</p>



<h2 class="wp-block-heading"><span id="first-in-person-meeting">First in-person meeting</span></h2>



<p class="wp-block-paragraph">The first question was, where in the world to meet? Since our employees live from Brazil to Australia, going near a big international airport made sense. <strong>Rome</strong> won over Istanbul and Barcelona, both because of better flight connections (all flights lead to Rome) and because everyone wanted to visit the The Eternal City. </p>



<p class="wp-block-paragraph">In retrospect, <strong>we should&#8217;ve checked with EU visa officers</strong>. It turns out that, for some nationalities, getting EU business visas is quite hard. Applicants that are young, unmarried, without real estate, and without children (&#8220;strong ties&#8221; criteria) are flatly rejected, regardless of the company guarantee. We later learned that the unwritten rule of international team building is:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">If US-based, organize international team buildings in Mexico. If EU-based, organize in Turkey. Getting US or EU visas for international employees is a royal pain.</p>
</blockquote>



<p class="wp-block-paragraph">Unfortunately, our employees from Morocco and India were denied visas and couldn&#8217;t come. But that highlighted one enormous benefit of remote work:</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="wp-block-paragraph">Remote companies get access to an enormous talent pool, and remote workers get opportunities they couldn&#8217;t get otherwise.</p>
</blockquote>



<p class="wp-block-paragraph">It is easy to forget how divided the world is if you have a Western passport. Our employees both have education and years of experience in IT, but couldn&#8217;t get an EU business visitor visa. Ridiculous.</p>



<p class="wp-block-paragraph">We booked a hotel in Rome that has a meeting space and is close to all attractions. Visas were stamped, plane tickets received, and we flew across eight time zones to meet for four days, starting May 22nd.</p>



<p class="wp-block-paragraph"><strong>Needless to say, everybody was quite excited.</strong></p>



<p class="wp-block-paragraph">When everybody showed up on Monday, the first surprise came. International employees only saw the founders (Mario, Josip, and me) in a video call, where our heads occupied the same number of pixels. In reality, let&#8217;s just say we don&#8217;t occupy the same volume:</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh3.googleusercontent.com/mnXgkTHuTd1ZZYVDkt7eY_lRcWkNZg8gE6ZpPyWbpquHoptks5Vp4Y2E0IEhYOWVo105R1QKTp3fnYr-9jBj4F7OaJbly__AudJIwyzrPQ_ckNIFGEii_NRD2psJ_yrIhvso2j9mePCdPQ6KE242RZY" alt=""/></figure>



<p class="wp-block-paragraph">During the next four team-building days, another small surprise was that most of us were quiet introverts. I said &#8220;small surprise&#8221; because we kind of expected that—IT and remote work seem to select quiet <strong>introverts</strong>, and our company was both.&nbsp;</p>



<p class="wp-block-paragraph">Anticipating that people may be shy because they are meeting for the first time, we packed the next three days with social activities: Rome treasure hunt, pasta cooking class, walking tour, and a TestDome history presentation with a selection of funny events that happened in the last 10 years. While we were doing that, new funny anecdotes kept happening.</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh5.googleusercontent.com/0yYxMkI8CBPu7IoRUNRP3iLVjfd-Yam2Cab9v1cCxpEQ4qVD6pTrefmNfC2jOT3tGLtrdzUgP2BGPvLGXXRwdyisHCF9yv-_ARWxKliQBQDcZ8DOtXLSUjnI_Ve9H-CYgm1t0g-Bmr9ZtBlq9UVuYNk" alt=""/></figure>



<p class="wp-block-paragraph">For example, I was able, on my first try, to find the worst pizza restaurant in Rome. Pizza Romana is supposed to have a stiffer dough, but this one was both hard as a rock and small—a perfect frisbee!</p>



<p class="wp-block-paragraph">We learned that a senior team member has been rebuilding his house for 10 years now and still has problems with construction licenses and building statics. Coincidentally, he was also the first TestDome software architect.</p>



<p class="wp-block-paragraph">A senior on the content team impressed everybody by delivering an engaging one-hour presentation with just five slides. His strategy was simple—he mentioned how ChatGPT would change our questions in the future, and suddenly everybody was participating in the heated discussion.&nbsp;</p>



<figure class="wp-block-image"><img decoding="async" src="https://lh4.googleusercontent.com/6_iqz57DraoSYa-m7Nama2MTLPOZTnMFRF0hQHahJ18-GFe2m2CowxqlOUYlNNpSscOpAO1W5WDhUE3enRmTpDUSUaXSinWOsbhfSLnro319R6z0TYCfQR_mfKgWrKuul67LzDVpWEqoAqJATLxWstQ" alt=""/></figure>



<p class="wp-block-paragraph">We joked that a coworker from Brazil liked the team building so much that he decided to stay in Europe—and yes, he did use the travel opportunity of the team building to move from Brazil to Portugal. We are afraid to organize the next team building too soon.</p>



<h2 class="wp-block-heading"><span id="office-not-found-what-next">Office not found: what next?</span></h2>



<p class="wp-block-paragraph">We planned to have in-person team buildings on anniversaries like 10, 15, or 20 years of TestDome. Now everybody wants to have it every two or three years. Now <strong>people find it easier to ask or request something from a colleague after they have spent a few days with them</strong>. All communication flows easier, although it is still over emails and mentions. I will give it to the haters of remote work; there is something to the in-person connection.&nbsp;</p>



<p class="wp-block-paragraph">But I don&#8217;t see how we could transition to the physical office. Our employees are spread around the world, I don&#8217;t know how I could ever attract such a team in the small city of Zagreb. <strong>Search for talent</strong> was the initial reason we became a remote company. We became a local poster company for remote work by accident. But even if we could attract talent locally, requiring everyone to crawl through a rush hour to be in one office from 9 to 5 now seems like an inefficient use of people&#8217;s time. <strong>Remote work, with occasional in-person team building, is still our preferred mode of operation.&nbsp;</strong></p>
<p>The post <a href="https://shiftmag.dev/remote-only-company-testdome-976/">After 10 years, our remote-only company had its first in-person meeting</a> appeared first on <a href="https://shiftmag.dev">ShiftMag</a>.</p>
]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/?utm_source=w3tc&utm_medium=footer_comment&utm_campaign=free_plugin

Page Caching using Disk: Enhanced 

Served from: shiftmag.dev @ 2026-05-24 18:52:12 by W3 Total Cache
-->