Time-to-Wow Is Getting Shorter for Data-Driven Software Development – Here's How You Achieve It

Time-to-Wow Is Getting Shorter for Data-Driven Software Development – Here’s How You Achieve It

Joseph Sibony reading time: 5 minutes

December 7, 2020

Today we’re going to talk about data and how expectations around data usage have changed. It’s a well-known fact that the time-to-value expectations (which not long ago were considered unrealistic) became the standard.

Let me start with a quote from one of my favorite cult movies – The Princess Bride. No, I will not go with Inigo Montoya’s “you killed my father, prepare to die” or any of the other unforgettable ones. I will quote Miracle Max, the miracle maker, who said: “Don’t rush me, sonny. You rush a miracle man; you get rotten miracles. You got money?”

This was the motto behind a lot of the data analysis work done over the years regarding extracting value from data. The idea is that data analysts and data scientists are working their ‘magic’ or ‘miracles’ and that it takes time. And yes, it also costs a lot of money – whether we’re talking about setting up hefty data warehouses or hiring data scientists and analysts.

A lot has changed in the world of data analysis since those ‘miracle’ days.

Data usage evolved rapidly in the past few years. The approach transformed from merely understanding past events into predicting what will happen.

This was enhanced by several processes, among which are:

Billions of IoT devices sending data from their sensors.
Massive improvements in big data technologies, especially elastic cloud-based big data technologies.
Evolution of Data Science tools and libraries like SciPy and TensorFlow, making machine learning much simpler to implement.
A wealth of BI tools, where data can be analyzed in a speedy way and with less code-writing time and queries.

This means that where once there was a strong DBA team organizing the databases and data warehouses and enabling access to them – today, there are more flexible data structures with:

Data lakes to which a large variety of the data is poured
Large data warehouses giving great analytic power.

In today’s world of data warehouses like Redshift, BigQuery, and Snowflake, organizations get a lot of elasticity and can rapidly go from zero to a fully-fledged data warehouse pretty fast.

So the miracle women and men of data analytics and data science can now work miracles much faster.

And once organizations realize the potential of utilizing more and more data, they get ‘hooked.’ I see many companies simply throwing a plethora of data into their lakes and warehouses, enabling more teams within the organizations to access them.

And that’s only natural. As organizations realize that they sold more products because of optimization done for marketing and sales purposes, they want to implement more data-driven decisions in other areas, such as customer success, operations, purchasing, HR, and so on.

Time to Value, Time to Wow!

Everyone is riding the same wave of data-driven value, even your competitors.

It is crucial to shorten the excess time you spend on activities that are not actual data analytics and act upon the data.

Furthermore, since we’re all consenting adults deriving value from our data, the more time you actually spend on it instead of being distracted by other factors, the more chances you have to get not just value – but a wow.

And by wow, I mean an unexpected, expectation-exceeding value.

Here’s an example for a wow value:

Data scientists at a certain game studio discover that not only can they correlate between specific features and the probability that users will purchase in-game cosmetics, but they also find ways to match distinct clusters of users with particular classes or colors of in-game cosmetics.

You can imagine just how easy they can optimize their conversions…

Now that your organization has massive amounts of data sitting in your data repositories, and you have the right people and tools to turn this data into value, some of the things that can slow you down are security, compliance, and privacy.

Since the data probably contains sensitive information, there are inherent risks involved, such as data leaks and different compliance frameworks which you need to align with and report according to. Plus, you need to be aware of the PII (Personal Identifiable Information) data: Where it is, who has access to it, and who actually accesses it.

For example, let’s say that our business is a massive multiplayer game. We’re getting a lot of data from our game servers (as well as from other sources, such as enrichment services, web analytics, etc.). We’d like to get a set of features that will predict which players will spend the most money on premium cosmetic items to target them with a discount coupon.

We’d like to spend most of our efforts as a business on analyzing the data and creating the prediction algorithms, thus minimizing the time spent on other things. We’d like to sort out things like data access, security, and compliance quickly and simply, and—should we need to introduce software changes to our MMO—we don’t want them to take the edge out of our operation by being too slow.

So, for example, if you’re using a Snowflake warehouse for the analysis of the data pulled from the data lake and from other sources, you want to make sure you follow Snowflake’s Security guidelines and immediately identify sensitive data which is being retrieved as part of this project, and that you can build data access audit reports for different compliance regulations without disrupting the ‘Wow creation.’

To make use of the data we analyzed, we now need to make adjustments to our software. There need to be quick and agile development cycles. For such rapid iterations, especially in large codebases, it would be wise to make sure you reduce the build time so you can take advantage of the data analytics. This can be done by distributing the build cycles across your compute power.

Software Deployment Speed Is of the Essence

It is not just security you need to watch out for as collateral damage to all that data crunching.

Most companies cannot afford to be dealing with slow software processes that prevent them from reaching data on time.

Quick and agile development cycles are necessary, but with all that data (which means a large codebase), compilation time can be long, making that time-to-wow longer. And it’s just time wasted waiting around instead of making good use of that data. Luckily, technology assists in that aspect as well. We use distributed processing technology, which harnesses the power of other computers’ CPUs to reduce build time.

I tell you if talking about magic – that’s another trick up our sleeves that, honestly, whoever deals with data analysis should at least be aware of.

Once you can do true agile data-driven value creation and match it with fast software deployments, you can supply your customers with the value they deserve – the wow!

Ben Herzberg is Satori’s Chief Scientist

Joseph Sibony

Joseph Sibony, Incredibuild's Senior Content Manager, has spent his life surrounded by technology. From hardware to software and everything in between. He has worked in data science, cyber security, and has written extensively about the intersection of technology and society.

Cookie	Duration	Description
ARRAffinity	session	ARRAffinity cookie is set by Azure app service, and allows the service to choose the right instance established by a user to deliver subsequent requests made by that user.
ARRAffinitySameSite	session	This cookie is set by Windows Azure cloud, and is used for load balancing to make sure the visitor page requests are routed to the same server in any browsing session.
cf_use_ob	past	Cloudflare sets this cookie to improve page load times and to disallow any security restrictions based on the visitor's IP address.
cookielawinfo-checbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-advertisement	1 year	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Cookie	Duration	Description
__cf_bm	30 minutes	This cookie, set by Cloudflare, is used to support Cloudflare Bot Management.
bcookie	2 years	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognize browser ID.
bscookie	2 years	LinkedIn sets this cookie to store performed actions on the website.
lang	session	LinkedIn sets this cookie to remember a user's language setting.
lidc	1 day	LinkedIn sets the lidc cookie to facilitate data center selection.
UserMatchHistory	1 month	LinkedIn sets this cookie for LinkedIn Ads ID syncing.

Cookie	Duration	Description
_gat	1 minute	This cookie is installed by Google Universal Analytics to restrain request rate and thus limit the collection of data on high traffic sites.
_uetsid	1 day	Bing Ads sets this cookie to engage with a user that has previously visited the website.
_uetvid	1 year 24 days	Bing Ads sets this cookie to engage with a user that has previously visited the website.

Cookie	Duration	Description
_ga	2 years	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognize unique visitors.
_gat_UA-8508435-1	1 minute	A variation of the _gat cookie set by Google Analytics and Google Tag Manager to allow website owners to track visitor behaviour and measure site performance. The pattern element in the name contains the unique identity number of the account or website it relates to.
_gcl_au	3 months	Provided by Google Tag Manager to experiment advertisement efficiency of websites using their services.
_gid	1 day	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_hjAbsoluteSessionInProgress	30 minutes	Hotjar sets this cookie to detect the first pageview session of a user. This is a True/False flag set by the cookie.
_hjFirstSeen	30 minutes	Hotjar sets this cookie to identify a new user’s first session. It stores a true/false value, indicating whether it was the first time Hotjar saw this user.
_hjIncludedInPageviewSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's pageview limit.
_hjIncludedInSessionSample	2 minutes	Hotjar sets this cookie to know whether a user is included in the data sampling defined by the site's daily session limit.
_hjTLDTest	session	To determine the most generic cookie path that has to be used instead of the page hostname, Hotjar sets the _hjTLDTest cookie to store different URL substring alternatives until it fails.
CONSENT	2 years	YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data.
MR	7 days	This cookie, set by Bing, is used to collect user information for analytics purposes.
utm_campaign	2 months	Google Ad Services sets this cookie to store session campaign value if present.
utm_content	2 months	This cookie is used for storing the session content value if present.
utm_source	2 months	This cookie is used to record from where the visitor came to the website orginally. This information is used by the website operator to know the efficiency of their marketing.
utm_term	2 months	This cookie is used to record from where the visitor came to the website orginally. This information is used by the website operator to know the efficiency of their marketing.
vuid	2 years	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.

Cookie	Duration	Description
_fbp	3 months	This cookie is set by Facebook to display advertisements when either on Facebook or on a digital platform powered by Facebook advertising, after visiting the website.
_mkto_trk	2 years	This cookie, provided by Marketo, has information (such as a unique user ID) that is used to track the user's site usage. The cookies set by Marketo are readable only by Marketo.
fr	3 months	Facebook sets this cookie to show relevant advertisements to users by tracking user behaviour across the web, on sites that have Facebook pixel or Facebook social plugin.
IDE	1 year 24 days	Google DoubleClick IDE cookies are used to store information about how the user uses the website to present them with relevant ads and according to the user profile.
MUID	1 year 24 days	Bing sets this cookie to recognize unique web browsers visiting Microsoft sites. This cookie is used for advertising, site analytics, and other operations.
personalization_id	2 years	Twitter sets this cookie to integrate and share features for social media and also store information about how the user uses the website, for tracking and targeting.
test_cookie	15 minutes	The test_cookie is set by doubleclick.net and is used to determine if the user's browser supports cookies.
utm_medium	2 months	This cookie is used to record from where the visitor came to the website orginally. This information is used by the website operator to know the efficiency of their marketing.
VISITOR_INFO1_LIVE	5 months 27 days	A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface.
YSC	session	YSC cookie is set by Youtube and is used to track the views of embedded videos on Youtube pages.
yt-remote-connected-devices	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt-remote-device-id	never	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
yt.innertube::nextId	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen.

Time-to-Wow Is Getting Shorter for Data-Driven Software Development – Here’s How You Achieve It

Time to Value, Time to Wow!

Software Deployment Speed Is of the Essence

Joseph Sibony

Table of Contents

Shorten your builds

Related Posts

5 minutes What is Platform Engineering?

5 minutes How Industry 4.0 is leading the way for another industrial revolution

5 minutes 9 Top Programming Languages for 2022 (And Why Devs Love Them)

Cookie	Duration	Description
_hjSession_2537450	30 minutes	No description
_hjSessionUser_2537450	1 year	No description
AnalyticsSyncHistory	1 month	No description
BIGipServersn-mch-v2-80	session	No description
BIGipServersn02web-nginx-app_https	session	No description
ib_last_referrer	2 months	No description
incap_ses_1319_2167377	session	No description
li_gc	2 years	No description
muc_ads	2 years	No description
nlbi_2167377	session	No description
original_req_url	past	No description
referrer66_00f	1 month	No description
visid_incap_2167377	1 year	No description
visitorId	1 year	No description